Query 037565
Match_columns 218
No_of_seqs 194 out of 1625
Neff 7.7
Searched_HMMs 46136
Date Fri Mar 29 13:34:32 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/037565.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/037565hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03146 aspartyl protease fam 100.0 7.6E-42 1.7E-46 310.0 20.6 205 9-216 4-208 (431)
2 KOG1339 Aspartyl protease [Pos 100.0 6.6E-28 1.4E-32 216.9 13.9 126 84-215 40-166 (398)
3 PF14543 TAXi_N: Xylanase inhi 99.9 7.3E-27 1.6E-31 186.3 11.9 117 91-217 1-118 (164)
4 cd06096 Plasmepsin_5 Plasmepsi 99.9 5.7E-26 1.2E-30 199.1 13.6 123 89-216 2-125 (326)
5 PTZ00165 aspartyl protease; Pr 99.9 1.7E-24 3.7E-29 198.6 16.0 111 79-214 109-221 (482)
6 cd05470 pepsin_retropepsin_lik 99.9 5.2E-24 1.1E-28 157.9 12.3 95 93-215 1-96 (109)
7 cd05490 Cathepsin_D2 Cathepsin 99.9 4.6E-23 1E-27 180.4 13.1 99 86-214 2-104 (325)
8 cd05478 pepsin_A Pepsin A, asp 99.9 1.5E-22 3.2E-27 176.7 13.9 105 83-215 3-107 (317)
9 cd05477 gastricsin Gastricsins 99.9 5.2E-22 1.1E-26 173.2 13.1 97 88-214 1-99 (318)
10 cd06097 Aspergillopepsin_like 99.9 5.7E-22 1.2E-26 169.9 12.6 98 91-214 1-98 (278)
11 cd06098 phytepsin Phytepsin, a 99.9 9.8E-22 2.1E-26 171.7 13.3 102 83-214 3-107 (317)
12 cd05485 Cathepsin_D_like Cathe 99.9 7.9E-22 1.7E-26 173.2 12.6 103 82-214 3-109 (329)
13 cd05486 Cathespin_E Cathepsin 99.9 1.1E-21 2.3E-26 171.2 11.4 94 91-214 1-96 (316)
14 cd05488 Proteinase_A_fungi Fun 99.9 4.5E-21 9.8E-26 167.6 13.4 101 84-214 4-106 (320)
15 PTZ00147 plasmepsin-1; Provisi 99.9 9.3E-21 2E-25 172.8 15.8 102 81-213 130-233 (453)
16 PTZ00013 plasmepsin 4 (PM4); P 99.8 5.3E-20 1.2E-24 167.7 15.3 102 80-212 128-231 (450)
17 cd05487 renin_like Renin stimu 99.8 3.1E-20 6.7E-25 162.8 12.7 98 85-213 3-104 (326)
18 cd05475 nucellin_like Nucellin 99.8 3.7E-20 8.1E-25 158.6 11.5 84 89-214 1-85 (273)
19 cd05473 beta_secretase_like Be 99.8 3.9E-19 8.5E-24 158.0 12.1 78 89-192 2-79 (364)
20 cd05471 pepsin_like Pepsin-lik 99.8 1.5E-18 3.1E-23 147.6 12.7 96 91-214 1-98 (283)
21 cd05472 cnd41_like Chloroplast 99.8 4.6E-19 1E-23 153.3 9.5 78 90-216 1-78 (299)
22 cd05489 xylanase_inhibitor_I_l 99.7 4.9E-18 1.1E-22 151.3 10.4 101 97-213 2-117 (362)
23 PF00026 Asp: Eukaryotic aspar 99.7 2.6E-17 5.6E-22 142.3 8.5 96 90-213 1-97 (317)
24 cd05476 pepsin_A_like_plant Ch 99.7 1.8E-16 3.8E-21 135.1 9.3 74 90-214 1-74 (265)
25 cd05474 SAP_like SAPs, pepsin- 99.5 5.7E-14 1.2E-18 120.8 10.3 68 90-212 2-69 (295)
26 cd05483 retropepsin_like_bacte 97.0 0.0025 5.4E-08 45.0 6.5 79 89-209 1-79 (96)
27 TIGR02281 clan_AA_DTGA clan AA 92.8 1.1 2.3E-05 33.7 8.5 84 87-211 8-91 (121)
28 cd05484 retropepsin_like_LTR_2 91.5 0.26 5.6E-06 34.8 3.5 27 91-119 1-27 (91)
29 PF13650 Asp_protease_2: Aspar 88.0 3.8 8.2E-05 27.9 7.2 23 94-118 2-24 (90)
30 PF13975 gag-asp_proteas: gag- 86.9 1.4 3E-05 29.9 4.3 32 87-120 5-36 (72)
31 PF00077 RVP: Retroviral aspar 80.6 2.9 6.2E-05 29.6 3.9 27 92-120 7-33 (100)
32 cd05479 RP_DDI RP_DDI; retrope 77.8 4.3 9.4E-05 30.4 4.3 31 87-119 13-43 (124)
33 COG3577 Predicted aspartyl pro 74.4 13 0.00028 30.8 6.4 72 86-192 101-172 (215)
34 cd05482 HIV_retropepsin_like R 71.5 6.2 0.00014 28.0 3.5 24 94-119 2-25 (87)
35 cd06095 RP_RTVL_H_like Retrope 65.1 9.1 0.0002 26.6 3.3 23 95-119 3-25 (86)
36 PF12384 Peptidase_A2B: Ty3 tr 48.5 26 0.00056 28.1 3.6 28 90-119 34-61 (177)
37 PF09668 Asp_protease: Asparty 35.3 78 0.0017 24.0 4.3 30 88-119 22-51 (124)
38 cd05475 nucellin_like Nucellin 34.6 43 0.00093 28.2 3.2 32 89-120 157-194 (273)
39 TIGR03698 clan_AA_DTGF clan AA 33.6 58 0.0013 23.6 3.3 24 93-116 2-29 (107)
40 PF05585 DUF1758: Putative pep 29.5 31 0.00068 26.8 1.4 19 101-119 10-28 (164)
41 cd06097 Aspergillopepsin_like 28.1 44 0.00096 28.2 2.2 33 88-120 176-215 (278)
42 PF11645 PDDEXK_5: PD-(D/E)XK 27.1 41 0.00089 26.2 1.6 31 90-121 21-51 (149)
43 cd00303 retropepsin_like Retro 27.0 73 0.0016 19.9 2.8 20 95-116 3-22 (92)
44 PF07172 GRP: Glycine rich pro 26.4 33 0.00072 24.7 0.9 21 1-22 1-21 (95)
45 cd06098 phytepsin Phytepsin, a 26.3 54 0.0012 28.3 2.4 31 89-119 188-226 (317)
46 cd06096 Plasmepsin_5 Plasmepsi 25.9 53 0.0012 28.5 2.4 31 89-119 208-247 (326)
47 cd05472 cnd41_like Chloroplast 24.6 56 0.0012 27.8 2.2 31 89-119 146-187 (299)
48 cd05471 pepsin_like Pepsin-lik 24.6 47 0.001 27.5 1.7 34 88-121 179-220 (283)
49 PLN03146 aspartyl protease fam 23.6 90 0.0019 28.5 3.5 16 105-120 309-324 (431)
50 PF03419 Peptidase_U4: Sporula 23.1 1.2E+02 0.0025 26.2 3.9 28 87-114 154-181 (293)
51 cd06094 RP_Saci_like RP_Saci_l 22.5 2.9E+02 0.0064 19.6 6.2 20 102-121 8-27 (89)
52 TIGR02854 spore_II_GA sigma-E 20.6 1.3E+02 0.0029 25.9 3.8 28 87-114 155-182 (288)
53 PF07496 zf-CW: CW-type Zinc F 20.5 54 0.0012 20.4 1.0 9 115-123 2-10 (50)
54 PF00026 Asp: Eukaryotic aspar 20.1 96 0.0021 26.2 2.8 28 93-120 185-216 (317)
55 PF01705 CX: CX module; Inter 20.1 84 0.0018 20.5 1.9 36 165-212 15-50 (61)
No 1
>PLN03146 aspartyl protease family protein; Provisional
Probab=100.00 E-value=7.6e-42 Score=309.97 Aligned_cols=205 Identities=51% Similarity=0.929 Sum_probs=172.7
Q ss_pred HHHHHHHHhccccceecCCceeEEEEecCCCCCCCCCCCCCCchHHHHHHHhhhhhhhccccCCCcCCCCCcccceecCC
Q 037565 9 AICITATLQFVFPFVESNNGCVTADLIHRDSPLSPLYSPSETNSERFHKAYLRSISRLNNHFNPKAASPEALQAPMVSSG 88 (218)
Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~l~L~h~~s~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~ 88 (218)
++++++.-++.++..+++.++++++|+|++++++|++.++....++++++++++.+|.+ ++.++......+..++...+
T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~l~h~~~~~sp~~~~~~~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~~~~~~~~~ 82 (431)
T PLN03146 4 LLALCLFSFSELSAAEAPKGGFTVDLIHRDSPKSPFYNPSETPSQRLRNAFRRSISRVN-HFRPTDASPNDPQSDLISNG 82 (431)
T ss_pred hHHHHHHHHhhhhhccccCCceEEEEEeCCCCCCCCCCCCCChhHHHHHHHHHHHHHHH-HHhhccccCCccccCcccCC
Confidence 44455544565666677888999999999999999988877778999999999999988 77543222234455566778
Q ss_pred ceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565 89 GEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ 168 (218)
Q Consensus 89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~ 168 (218)
++|+++|.||||||++.|++||||+++||||.+|..|+.|.++.|||++|+||+.++|.++.|..++....|... +.|.
T Consensus 83 ~~Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~C~~C~~C~~~~~~~fdps~SST~~~~~C~s~~C~~~~~~~~c~~~-~~c~ 161 (431)
T PLN03146 83 GEYLMNISIGTPPVPILAIADTGSDLIWTQCKPCDDCYKQVSPLFDPKKSSTYKDVSCDSSQCQALGNQASCSDE-NTCT 161 (431)
T ss_pred ccEEEEEEcCCCCceEEEEECCCCCcceEcCCCCcccccCCCCcccCCCCCCCcccCCCCcccccCCCCCCCCCC-CCCe
Confidence 999999999999999999999999999999999999998889999999999999999999999887766567654 4699
Q ss_pred eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCCC
Q 037565 169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGTF 216 (218)
Q Consensus 169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~f 216 (218)
|.+.|+||+.+.|++++|+|+|++..+. .+.++++.|||++.+.|.|
T Consensus 162 y~i~Ygdgs~~~G~l~~Dtltlg~~~~~-~~~v~~~~FGc~~~~~g~f 208 (431)
T PLN03146 162 YSYSYGDGSFTKGNLAVETLTIGSTSGR-PVSFPGIVFGCGHNNGGTF 208 (431)
T ss_pred eEEEeCCCCceeeEEEEEEEEeccCCCC-cceeCCEEEeCCCCCCCCc
Confidence 9999999998889999999999875332 2578999999999887766
No 2
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.95 E-value=6.6e-28 Score=216.87 Aligned_cols=126 Identities=43% Similarity=0.855 Sum_probs=110.9
Q ss_pred eecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC-CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCC
Q 037565 84 MVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY-KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDT 162 (218)
Q Consensus 84 ~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~-~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~ 162 (218)
..+.+++|+++|.||||||.|.|++||||+++||+|.+|. .|+.+.++.|||++|+||+.+.|.++.|...... |..
T Consensus 40 ~~~~~~~Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c~~~C~~~~~~~f~p~~SSt~~~~~c~~~~c~~~~~~--~~~ 117 (398)
T KOG1339|consen 40 SSYSSGEYYGNISIGTPPQSFTVVLDTGSDLLWVPCAPCSSACYSQHNPIFDPSASSTYKSVGCSSPRCKSLPQS--CSP 117 (398)
T ss_pred ccccccccEEEEecCCCCeeeEEEEeCCCCceeeccccccccccccCCCccCccccccccccCCCCccccccccC--ccc
Confidence 3457889999999999999999999999999999999999 8987666679999999999999999999988754 443
Q ss_pred CCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCC
Q 037565 163 SRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGT 215 (218)
Q Consensus 163 ~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~ 215 (218)
. +.|.|.+.|+||+.++|.|++|+|+|++.+ .+.+++++|||+..+.|.
T Consensus 118 ~-~~C~y~i~Ygd~~~~~G~l~~Dtv~~~~~~---~~~~~~~~FGc~~~~~g~ 166 (398)
T KOG1339|consen 118 N-SSCPYSIQYGDGSSTSGYLATDTVTFGGTT---SLPVPNQTFGCGTNNPGS 166 (398)
T ss_pred C-CcCceEEEeCCCCceeEEEEEEEEEEcccc---ccccccEEEEeeecCccc
Confidence 3 689999999998888899999999999743 157788999999998763
No 3
>PF14543 TAXi_N: Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=99.94 E-value=7.3e-27 Score=186.34 Aligned_cols=117 Identities=37% Similarity=0.755 Sum_probs=95.0
Q ss_pred EEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCC-CCCCCCCCccee
Q 037565 91 YFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKD-RRCDTSRNTCQF 169 (218)
Q Consensus 91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~-~~C~~~~~~c~~ 169 (218)
|+++|.||||+|++.|+|||||+++|++| ..+.|+|.+|+||+.++|.++.|...++. ..|...++.|.|
T Consensus 1 Y~~~~~iGtP~~~~~lvvDtgs~l~W~~C---------~~~~f~~~~Sst~~~v~C~s~~C~~~~~~~~~~~~~~~~C~y 71 (164)
T PF14543_consen 1 YYVSVSIGTPPQPFSLVVDTGSDLTWVQC---------PDPPFDPSKSSTYRPVPCSSPQCSSAPSFCPCCCCSNNSCPY 71 (164)
T ss_dssp EEEEEECTCTTEEEEEEEETT-SSEEEET-------------STT-TTSSBEC-BTTSHHHHHCTSSBTCCTCESSEEEE
T ss_pred CEEEEEeCCCCceEEEEEECCCCceEEcC---------CCcccCCccCCcccccCCCCcchhhcccccccCCCCcCcccc
Confidence 89999999999999999999999999999 36889999999999999999999988753 233332368999
Q ss_pred EEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCCCC
Q 037565 170 YYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGTFD 217 (218)
Q Consensus 170 ~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~f~ 217 (218)
.+.|+|++.++|++++|+|+++..++. ...+++++|||++.+.|.|.
T Consensus 72 ~~~y~~~s~~~G~l~~D~~~~~~~~~~-~~~~~~~~FGC~~~~~g~~~ 118 (164)
T PF14543_consen 72 SQSYGDGSSSSGFLASDTLTFGSSSGG-SNSVPDFIFGCATSNSGLFY 118 (164)
T ss_dssp EEEETTTEEEEEEEEEEEEEEEEESSS-SEEEEEEEEEEE-GGGTSST
T ss_pred eeecCCCccccCceEEEEEEecCCCCC-CceeeeEEEEeeeccccCCc
Confidence 999999999999999999999876432 25789999999999887663
No 4
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=99.94 E-value=5.7e-26 Score=199.12 Aligned_cols=123 Identities=27% Similarity=0.478 Sum_probs=102.9
Q ss_pred ceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565 89 GEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ 168 (218)
Q Consensus 89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~ 168 (218)
++|+++|.||||+|++.|++||||+++||+|.+|..|..+.++.|||++|+|++.+.|.+..|.. ...|.+ +.|.
T Consensus 2 ~~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~~~~y~~~~Sst~~~~~C~~~~c~~---~~~~~~--~~~~ 76 (326)
T cd06096 2 AYYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHMEPPYNLNNSITSSILYCDCNKCCY---CLSCLN--NKCE 76 (326)
T ss_pred ceEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCCCCCcCcccccccccccCCCccccc---cCcCCC--CcCc
Confidence 58999999999999999999999999999999999998777899999999999999999999943 224554 5799
Q ss_pred eEEEcCCCceEEEEEEEEEEEecCCCCC-CCcccCCeEEEeEEcCCCCC
Q 037565 169 FYYSYADGSYTTGNVGTEVFVISSKTNP-DQGTFPNIVYGCGNNNTGTF 216 (218)
Q Consensus 169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~-~~~~~~~~~FGc~~~~~g~f 216 (218)
|.+.|++|+.+.|.+++|+|+|++.... .+....++.|||+..+.+.|
T Consensus 77 ~~i~Y~~gs~~~G~~~~D~v~lg~~~~~~~~~~~~~~~fg~~~~~~~~~ 125 (326)
T cd06096 77 YSISYSEGSSISGFYFSDFVSFESYLNSNSEKESFKKIFGCHTHETNLF 125 (326)
T ss_pred EEEEECCCCceeeEEEEEEEEeccCCCCccccccccEEeccCccccCcc
Confidence 9999999998889999999999874210 00112368899998776644
No 5
>PTZ00165 aspartyl protease; Provisional
Probab=99.92 E-value=1.7e-24 Score=198.61 Aligned_cols=111 Identities=25% Similarity=0.405 Sum_probs=94.7
Q ss_pred CcccceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCC
Q 037565 79 ALQAPMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEK 156 (218)
Q Consensus 79 ~~~~~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~ 156 (218)
..++..++.+.+|+++|.||||||+|.|++||||+++||+|..|. .| +.++.|||++|+||+.+.+..
T Consensus 109 ~~~~l~n~~d~~Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~~~~C--~~~~~yd~s~SSTy~~~~~~~-------- 178 (482)
T PTZ00165 109 LQQDLLNFHNSQYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECKSGGC--APHRKFDPKKSSTYTKLKLGD-------- 178 (482)
T ss_pred cceecccccCCeEEEEEEeCCCCceEEEEEeCCCCCEEEEchhcCcccc--cccCCCCccccCCcEecCCCC--------
Confidence 345667899999999999999999999999999999999999997 47 667899999999999852110
Q ss_pred CCCCCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 157 DRRCDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 157 ~~~C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
....+.+.|++|+.. |.+++|+|+|++ +.++++.|||++.+.+
T Consensus 179 --------~~~~~~i~YGsGs~~-G~l~~DtV~ig~------l~i~~q~FG~a~~~s~ 221 (482)
T PTZ00165 179 --------ESAETYIQYGTGECV-LALGKDTVKIGG------LKVKHQSIGLAIEESL 221 (482)
T ss_pred --------ccceEEEEeCCCcEE-EEEEEEEEEECC------EEEccEEEEEEEeccc
Confidence 122567999999876 999999999987 6899999999998654
No 6
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=99.91 E-value=5.2e-24 Score=157.91 Aligned_cols=95 Identities=28% Similarity=0.543 Sum_probs=84.3
Q ss_pred EEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCcc-CCCCCCcceeccCCCccccCCCCCCCCCCCCCcceeEE
Q 037565 93 VNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIF-DPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQFYY 171 (218)
Q Consensus 93 ~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f-~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~~~ 171 (218)
++|.||||||++.|+|||||+++||+|..|..|..+.++.| +|.+|++++. ..|.|.+
T Consensus 1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~~~~~~~~~~sst~~~---------------------~~~~~~~ 59 (109)
T cd05470 1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYSHSSYDDPSASSTYSD---------------------NGCTFSI 59 (109)
T ss_pred CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCcccccccccCCcCCCCCCCC---------------------CCcEEEE
Confidence 47999999999999999999999999999987765666777 9999999876 3699999
Q ss_pred EcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCC
Q 037565 172 SYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGT 215 (218)
Q Consensus 172 ~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~ 215 (218)
.|++|+.. |.++.|+|+|++ ..++++.|||++.+.+.
T Consensus 60 ~Y~~g~~~-g~~~~D~v~ig~------~~~~~~~fg~~~~~~~~ 96 (109)
T cd05470 60 TYGTGSLS-GGLSTDTVSIGD------IEVVGQAFGCATDEPGA 96 (109)
T ss_pred EeCCCeEE-EEEEEEEEEECC------EEECCEEEEEEEecCCc
Confidence 99999765 999999999987 57999999999988664
No 7
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=99.90 E-value=4.6e-23 Score=180.38 Aligned_cols=99 Identities=24% Similarity=0.430 Sum_probs=88.7
Q ss_pred cCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC----CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCC
Q 037565 86 SSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY----KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCD 161 (218)
Q Consensus 86 ~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~----~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~ 161 (218)
+.+.+|+++|.||||||++.|++||||+++||+|..|. .| ..++.|||++|+||+.
T Consensus 2 ~~~~~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~C--~~~~~y~~~~SsT~~~------------------ 61 (325)
T cd05490 2 YMDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIAC--WLHHKYNSSKSSTYVK------------------ 61 (325)
T ss_pred CcCCEEEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCccc--cCcCcCCcccCcceee------------------
Confidence 56889999999999999999999999999999999997 46 4567899999999987
Q ss_pred CCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 162 TSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 162 ~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
..|.|.+.|++|+. .|.+++|+|+|++ ..++++.|||++.+.|
T Consensus 62 ---~~~~~~i~Yg~G~~-~G~~~~D~v~~g~------~~~~~~~Fg~~~~~~~ 104 (325)
T cd05490 62 ---NGTEFAIQYGSGSL-SGYLSQDTVSIGG------LQVEGQLFGEAVKQPG 104 (325)
T ss_pred ---CCcEEEEEECCcEE-EEEEeeeEEEECC------EEEcCEEEEEEeeccC
Confidence 35899999999985 6999999999987 5899999999987654
No 8
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=99.89 E-value=1.5e-22 Score=176.71 Aligned_cols=105 Identities=28% Similarity=0.401 Sum_probs=92.0
Q ss_pred ceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCC
Q 037565 83 PMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDT 162 (218)
Q Consensus 83 ~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~ 162 (218)
..++.+.+|+++|.||||+|++.|++||||+++||+|..|..|.-+.++.|||++|+|++..
T Consensus 3 l~n~~~~~Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~~C~~~~c~~~~~f~~~~Sst~~~~------------------ 64 (317)
T cd05478 3 LTNYLDMEYYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQACSNHNRFNPRQSSTYQST------------------ 64 (317)
T ss_pred cccccCCEEEEEEEeCCCCcEEEEEEeCCCccEEEecCCCCcccccccCcCCCCCCcceeeC------------------
Confidence 34667899999999999999999999999999999999998432256789999999999874
Q ss_pred CCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCC
Q 037565 163 SRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGT 215 (218)
Q Consensus 163 ~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~ 215 (218)
.|.|.+.|++|+. .|.++.|+|+|++ +.++++.|||++.+.+.
T Consensus 65 ---~~~~~~~yg~gs~-~G~~~~D~v~ig~------~~i~~~~fg~~~~~~~~ 107 (317)
T cd05478 65 ---GQPLSIQYGTGSM-TGILGYDTVQVGG------ISDTNQIFGLSETEPGS 107 (317)
T ss_pred ---CcEEEEEECCceE-EEEEeeeEEEECC------EEECCEEEEEEEecCcc
Confidence 5789999999985 6999999999987 68999999999877654
No 9
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=99.88 E-value=5.2e-22 Score=173.23 Aligned_cols=97 Identities=31% Similarity=0.535 Sum_probs=87.5
Q ss_pred CceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCC
Q 037565 88 GGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRN 165 (218)
Q Consensus 88 ~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~ 165 (218)
|.+|+++|.||||||++.|++||||+++||+|..|. .| ..++.|||++|+||+. .
T Consensus 1 ~~~y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C--~~~~~f~~~~SsT~~~---------------------~ 57 (318)
T cd05477 1 DMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQAC--TNHTKFNPSQSSTYST---------------------N 57 (318)
T ss_pred CcEEEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccc--cccCCCCcccCCCceE---------------------C
Confidence 468999999999999999999999999999999998 57 4578999999999987 3
Q ss_pred cceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 166 TCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 166 ~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
.|.|.+.|++|+. .|.+++|+|+|++ +.++++.|||++...+
T Consensus 58 ~~~~~~~Yg~Gs~-~G~~~~D~i~~g~------~~i~~~~Fg~~~~~~~ 99 (318)
T cd05477 58 GETFSLQYGSGSL-TGIFGYDTVTVQG------IIITNQEFGLSETEPG 99 (318)
T ss_pred CcEEEEEECCcEE-EEEEEeeEEEECC------EEEcCEEEEEEEeccc
Confidence 6899999999986 5999999999987 6889999999988654
No 10
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=99.87 E-value=5.7e-22 Score=169.93 Aligned_cols=98 Identities=28% Similarity=0.368 Sum_probs=89.0
Q ss_pred EEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcceeE
Q 037565 91 YFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQFY 170 (218)
Q Consensus 91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~~ 170 (218)
|+++|.||||||++.|++||||+++||+|..|..|..+.++.||+++|+|++.. ..|.|.
T Consensus 1 Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~~~~y~~~~Sst~~~~--------------------~~~~~~ 60 (278)
T cd06097 1 YLTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGGHKLYDPSKSSTAKLL--------------------PGATWS 60 (278)
T ss_pred CeeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhccCCcCCCccCccceec--------------------CCcEEE
Confidence 899999999999999999999999999999999887667788999999999875 368999
Q ss_pred EEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 171 YSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 171 ~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
+.|++|+...|.+++|+|+|++ ..++++.|||++...+
T Consensus 61 i~Y~~G~~~~G~~~~D~v~ig~------~~~~~~~fg~~~~~~~ 98 (278)
T cd06097 61 ISYGDGSSASGIVYTDTVSIGG------VEVPNQAIELATAVSA 98 (278)
T ss_pred EEeCCCCeEEEEEEEEEEEECC------EEECCeEEEEEeecCc
Confidence 9999998778999999999987 5789999999987654
No 11
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=99.87 E-value=9.8e-22 Score=171.70 Aligned_cols=102 Identities=25% Similarity=0.382 Sum_probs=90.0
Q ss_pred ceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC---CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCC
Q 037565 83 PMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY---KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRR 159 (218)
Q Consensus 83 ~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~---~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~ 159 (218)
..++.+.+|+++|.||||||++.|++||||+++||+|..|. .| ..++.|||++|+||+..
T Consensus 3 l~n~~~~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C--~~~~~y~~~~SsT~~~~--------------- 65 (317)
T cd06098 3 LKNYLDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIAC--YFHSKYKSSKSSTYKKN--------------- 65 (317)
T ss_pred ccccCCCEEEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccc--cccCcCCcccCCCcccC---------------
Confidence 34678899999999999999999999999999999999996 68 45688999999999873
Q ss_pred CCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 160 CDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 160 C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
...+.+.|++|+. .|.+++|+|+|++ ..++++.|||++.+.+
T Consensus 66 ------~~~~~i~Yg~G~~-~G~~~~D~v~ig~------~~v~~~~f~~~~~~~~ 107 (317)
T cd06098 66 ------GTSASIQYGTGSI-SGFFSQDSVTVGD------LVVKNQVFIEATKEPG 107 (317)
T ss_pred ------CCEEEEEcCCceE-EEEEEeeEEEECC------EEECCEEEEEEEecCC
Confidence 4678999999986 5999999999987 5899999999987543
No 12
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=99.87 E-value=7.9e-22 Score=173.16 Aligned_cols=103 Identities=26% Similarity=0.421 Sum_probs=91.2
Q ss_pred cceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC----CCCCCCCCccCCCCCCcceeccCCCccccCCCCC
Q 037565 82 APMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY----KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKD 157 (218)
Q Consensus 82 ~~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~----~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~ 157 (218)
...++.+.+|+++|.||||+|++.|++||||+++||+|..|. .| ..++.|||++|+|++.
T Consensus 3 ~~~n~~~~~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c--~~~~~y~~~~Sst~~~-------------- 66 (329)
T cd05485 3 PLSNYMDAQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIAC--LLHNKYDSTKSSTYKK-------------- 66 (329)
T ss_pred cceeccCCeEEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCccc--cCCCeECCcCCCCeEE--------------
Confidence 345788999999999999999999999999999999999997 46 3467899999999987
Q ss_pred CCCCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 158 RRCDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 158 ~~C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
..|.|.+.|++|+. .|.+++|+|+|++ +.++++.|||+.++.|
T Consensus 67 -------~~~~~~i~Y~~g~~-~G~~~~D~v~ig~------~~~~~~~fg~~~~~~~ 109 (329)
T cd05485 67 -------NGTEFAIQYGSGSL-SGFLSTDTVSVGG------VSVKGQTFAEAINEPG 109 (329)
T ss_pred -------CCeEEEEEECCceE-EEEEecCcEEECC------EEECCEEEEEEEecCC
Confidence 36899999999985 6999999999987 5789999999987654
No 13
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=99.86 E-value=1.1e-21 Score=171.19 Aligned_cols=94 Identities=34% Similarity=0.516 Sum_probs=84.8
Q ss_pred EEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565 91 YFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ 168 (218)
Q Consensus 91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~ 168 (218)
|+++|.||||||++.|++||||+++||+|..|. .| ..++.|||++|+||+. ..|.
T Consensus 1 Y~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C--~~~~~y~~~~SsT~~~---------------------~~~~ 57 (316)
T cd05486 1 YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQAC--TKHNRFQPSESSTYVS---------------------NGEA 57 (316)
T ss_pred CeEEEEECCCCcEEEEEEcCCCccEEEecCCCCCccc--CccceECCCCCccccc---------------------CCcE
Confidence 899999999999999999999999999999997 57 5678999999999987 3689
Q ss_pred eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
|.+.|++|+. .|.+++|+|+|++ +.++++.|||+..+.+
T Consensus 58 ~~i~Yg~g~~-~G~~~~D~v~ig~------~~~~~~~fg~~~~~~~ 96 (316)
T cd05486 58 FSIQYGTGSL-TGIIGIDQVTVEG------ITVQNQQFAESVSEPG 96 (316)
T ss_pred EEEEeCCcEE-EEEeeecEEEECC------EEEcCEEEEEeeccCc
Confidence 9999999975 6999999999987 6899999999977654
No 14
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=99.86 E-value=4.5e-21 Score=167.63 Aligned_cols=101 Identities=28% Similarity=0.442 Sum_probs=90.0
Q ss_pred eecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCC
Q 037565 84 MVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCD 161 (218)
Q Consensus 84 ~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~ 161 (218)
.++.+.+|+++|.||||+|++.|++||||+++||+|..|. .| ..++.|+|++|+|++.
T Consensus 4 ~n~~~~~Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C--~~~~~y~~~~Sst~~~------------------ 63 (320)
T cd05488 4 TNYLNAQYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIAC--FLHSKYDSSASSTYKA------------------ 63 (320)
T ss_pred cccCCCEEEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCccc--CCcceECCCCCcceee------------------
Confidence 3567889999999999999999999999999999999997 57 4567999999999986
Q ss_pred CCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 162 TSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 162 ~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
+.|.|.+.|++|+. .|.+++|+|+|++ +.++++.|||++...|
T Consensus 64 ---~~~~~~~~y~~g~~-~G~~~~D~v~ig~------~~~~~~~f~~a~~~~g 106 (320)
T cd05488 64 ---NGTEFKIQYGSGSL-EGFVSQDTLSIGD------LTIKKQDFAEATSEPG 106 (320)
T ss_pred ---CCCEEEEEECCceE-EEEEEEeEEEECC------EEECCEEEEEEecCCC
Confidence 46899999999985 6999999999987 6889999999987655
No 15
>PTZ00147 plasmepsin-1; Provisional
Probab=99.86 E-value=9.3e-21 Score=172.82 Aligned_cols=102 Identities=16% Similarity=0.244 Sum_probs=88.6
Q ss_pred ccceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCCC
Q 037565 81 QAPMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDR 158 (218)
Q Consensus 81 ~~~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~ 158 (218)
.+..++.+.+|+++|.||||||++.|++||||+++||+|..|. .| +.++.|||++|+||+..
T Consensus 130 v~L~n~~n~~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C--~~~~~yd~s~SsT~~~~-------------- 193 (453)
T PTZ00147 130 VELKDLANVMSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGC--ETKNLYDSSKSKTYEKD-------------- 193 (453)
T ss_pred eeccccCCCEEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccc--cCCCccCCccCcceEEC--------------
Confidence 3444778899999999999999999999999999999999997 56 56789999999999873
Q ss_pred CCCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCC
Q 037565 159 RCDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNT 213 (218)
Q Consensus 159 ~C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~ 213 (218)
.+.|.+.|++|+. .|.+++|+|+|++ +.++ ..|||+....
T Consensus 194 -------~~~f~i~Yg~Gsv-sG~~~~DtVtiG~------~~v~-~qF~~~~~~~ 233 (453)
T PTZ00147 194 -------GTKVEMNYVSGTV-SGFFSKDLVTIGN------LSVP-YKFIEVTDTN 233 (453)
T ss_pred -------CCEEEEEeCCCCE-EEEEEEEEEEECC------EEEE-EEEEEEEecc
Confidence 6899999999975 5999999999987 5676 5799987643
No 16
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=99.84 E-value=5.3e-20 Score=167.66 Aligned_cols=102 Identities=15% Similarity=0.268 Sum_probs=88.2
Q ss_pred cccceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCC
Q 037565 80 LQAPMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKD 157 (218)
Q Consensus 80 ~~~~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~ 157 (218)
..+..++.+.+|+++|.||||+|++.|++||||+++||+|..|. .| +.++.|||++|+|++..
T Consensus 128 ~~~l~d~~n~~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C--~~~~~yd~s~SsT~~~~------------- 192 (450)
T PTZ00013 128 VIELDDVANIMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGC--SIKNLYDSSKSKSYEKD------------- 192 (450)
T ss_pred ceeeeccCCCEEEEEEEECCCCeEEEEEEeCCCCceEEecccCCcccc--ccCCCccCccCcccccC-------------
Confidence 34445778889999999999999999999999999999999997 57 56789999999999873
Q ss_pred CCCCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcC
Q 037565 158 RRCDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNN 212 (218)
Q Consensus 158 ~~C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~ 212 (218)
.|.|.+.|++|+. .|.+++|+|+|++ +.++ ..|||+...
T Consensus 193 --------~~~~~i~YG~Gsv-~G~~~~Dtv~iG~------~~~~-~~f~~~~~~ 231 (450)
T PTZ00013 193 --------GTKVDITYGSGTV-KGFFSKDLVTLGH------LSMP-YKFIEVTDT 231 (450)
T ss_pred --------CcEEEEEECCceE-EEEEEEEEEEECC------EEEc-cEEEEEEec
Confidence 6899999999984 6999999999987 5666 578888654
No 17
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=99.83 E-value=3.1e-20 Score=162.75 Aligned_cols=98 Identities=24% Similarity=0.447 Sum_probs=85.5
Q ss_pred ecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCC----CCCCCCCccCCCCCCcceeccCCCccccCCCCCCCC
Q 037565 85 VSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYK----CYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRC 160 (218)
Q Consensus 85 ~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~----C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C 160 (218)
++.+.+|+++|.||||+|++.|++||||+++||+|..|.. | ..++.|||++|+||+.
T Consensus 3 ~~~~~~y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c--~~~~~y~~~~SsT~~~----------------- 63 (326)
T cd05487 3 NYLDTQYYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTAC--VTHNLYDASDSSTYKE----------------- 63 (326)
T ss_pred ccCCCeEEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchhh--cccCcCCCCCCeeeeE-----------------
Confidence 5678899999999999999999999999999999888864 6 4567899999999987
Q ss_pred CCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCC
Q 037565 161 DTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNT 213 (218)
Q Consensus 161 ~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~ 213 (218)
..|.|.+.|++|+ +.|.+++|+|+|++ +.+ .+.|||..+..
T Consensus 64 ----~~~~~~~~Yg~g~-~~G~~~~D~v~~g~------~~~-~~~fg~~~~~~ 104 (326)
T cd05487 64 ----NGTEFTIHYASGT-VKGFLSQDIVTVGG------IPV-TQMFGEVTALP 104 (326)
T ss_pred ----CCEEEEEEeCCce-EEEEEeeeEEEECC------EEe-eEEEEEEEecc
Confidence 3689999999998 56999999999986 455 47899998753
No 18
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=99.83 E-value=3.7e-20 Score=158.56 Aligned_cols=84 Identities=39% Similarity=0.850 Sum_probs=72.4
Q ss_pred ceEEEEEEeCCCceeEEEEEeCCCCceeEecC-CCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcc
Q 037565 89 GEYFVNISIGTPRVELIGIADTASDLTWIQCK-PCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTC 167 (218)
Q Consensus 89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~-~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c 167 (218)
++|+++|.||||||++.|++||||+++||+|. +|..| .|
T Consensus 1 ~~Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c----------------------------------------~c 40 (273)
T cd05475 1 GYYYVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGC----------------------------------------QC 40 (273)
T ss_pred CceEEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCC----------------------------------------cC
Confidence 57999999999999999999999999999994 66555 27
Q ss_pred eeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 168 QFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 168 ~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
.|.+.|+||+.++|.+++|+|+|+..++. ..++++.|||+..+.+
T Consensus 41 ~~~i~Ygd~~~~~G~~~~D~v~~~~~~~~--~~~~~~~Fgc~~~~~~ 85 (273)
T cd05475 41 DYEIEYADGGSSMGVLVTDIFSLKLTNGS--RAKPRIAFGCGYDQQG 85 (273)
T ss_pred ccEeEeCCCCceEEEEEEEEEEEeecCCC--cccCCEEEEeeeccCC
Confidence 89999998888889999999999765442 5788999999987654
No 19
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=99.80 E-value=3.9e-19 Score=158.04 Aligned_cols=78 Identities=27% Similarity=0.403 Sum_probs=70.7
Q ss_pred ceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565 89 GEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ 168 (218)
Q Consensus 89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~ 168 (218)
.+|+++|.||||+|++.|+|||||+++||+|.+|. +.++.|||++|+||+.. .|.
T Consensus 2 ~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~~~----~~~~~f~~~~SsT~~~~---------------------~~~ 56 (364)
T cd05473 2 QGYYIEMLIGTPPQKLNILVDTGSSNFAVAAAPHP----FIHTYFHRELSSTYRDL---------------------GKG 56 (364)
T ss_pred CceEEEEEecCCCceEEEEEecCCcceEEEcCCCc----cccccCCchhCcCcccC---------------------Cce
Confidence 46999999999999999999999999999998874 34678999999999884 689
Q ss_pred eEEEcCCCceEEEEEEEEEEEecC
Q 037565 169 FYYSYADGSYTTGNVGTEVFVISS 192 (218)
Q Consensus 169 ~~~~Y~dgs~~~G~l~~D~~~~~~ 192 (218)
|.+.|++|+. .|.+++|+|+|++
T Consensus 57 ~~i~Yg~Gs~-~G~~~~D~v~ig~ 79 (364)
T cd05473 57 VTVPYTQGSW-EGELGTDLVSIPK 79 (364)
T ss_pred EEEEECcceE-EEEEEEEEEEECC
Confidence 9999999986 5999999999985
No 20
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=99.79 E-value=1.5e-18 Score=147.58 Aligned_cols=96 Identities=40% Similarity=0.756 Sum_probs=82.0
Q ss_pred EEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCc--cCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565 91 YFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPI--FDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ 168 (218)
Q Consensus 91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~--f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~ 168 (218)
|+++|.||+|+|++.|++||||+++||+|..|..|..+.... |++..|+++.. ..|.
T Consensus 1 Y~~~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~~~~~~~~~~~~~~~~~s~~~~~---------------------~~~~ 59 (283)
T cd05471 1 YYGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKHPRFKYDSSKSSTYKD---------------------TGCT 59 (283)
T ss_pred CEEEEEECCCCcEEEEEEeCCCCCEEEecCCCCccccccCCCCccCccCCceeec---------------------CCCE
Confidence 789999999999999999999999999999999875554444 67777766544 4799
Q ss_pred eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
|.+.|++|+.. |.+++|+|+|++ ..++++.|||++...+
T Consensus 60 ~~~~Y~~g~~~-g~~~~D~v~~~~------~~~~~~~fg~~~~~~~ 98 (283)
T cd05471 60 FSITYGDGSVT-GGLGTDTVTIGG------LTIPNQTFGCATSESG 98 (283)
T ss_pred EEEEECCCeEE-EEEEEeEEEECC------EEEeceEEEEEeccCC
Confidence 99999998765 999999999997 4689999999998653
No 21
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=99.78 E-value=4.6e-19 Score=153.33 Aligned_cols=78 Identities=47% Similarity=1.032 Sum_probs=69.1
Q ss_pred eEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCccee
Q 037565 90 EYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQF 169 (218)
Q Consensus 90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~ 169 (218)
+|+++|.||||||++.|++||||+++||+|.+| |.|
T Consensus 1 ~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~c~~c--------------------------------------------~~~ 36 (299)
T cd05472 1 EYVVTVGLGTPARDQTVIVDTGSDLTWVQCQPC--------------------------------------------CLY 36 (299)
T ss_pred CeEEEEecCCCCcceEEEecCCCCcccccCCCC--------------------------------------------Cee
Confidence 699999999999999999999999999988654 368
Q ss_pred EEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCCC
Q 037565 170 YYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGTF 216 (218)
Q Consensus 170 ~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~f 216 (218)
.+.|++|+.++|.+++|+|+|++. ..++++.|||+..+.+.|
T Consensus 37 ~i~Yg~Gs~~~G~~~~D~v~ig~~-----~~~~~~~Fg~~~~~~~~~ 78 (299)
T cd05472 37 QVSYGDGSYTTGDLATDTLTLGSS-----DVVPGFAFGCGHDNEGLF 78 (299)
T ss_pred eeEeCCCceEEEEEEEEEEEeCCC-----CccCCEEEECCccCCCcc
Confidence 999999998889999999999872 268999999999876644
No 22
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=99.75 E-value=4.9e-18 Score=151.25 Aligned_cols=101 Identities=24% Similarity=0.395 Sum_probs=81.7
Q ss_pred eCCCcee-EEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCC-----------CCCCCCC
Q 037565 97 IGTPRVE-LIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKD-----------RRCDTSR 164 (218)
Q Consensus 97 iGtP~q~-~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~-----------~~C~~~~ 164 (218)
+|||-.+ +.|++||||+++||||.+ .+|+||+.++|.++.|+..+.+ ..|.+
T Consensus 2 ~~~~~~~~~~~~~DTGS~l~WvqC~~--------------~~sst~~~~~C~s~~C~~~~~~~~~~~~~~~~~~~c~~-- 65 (362)
T cd05489 2 TITPLKGAVPLVLDLAGPLLWSTCDA--------------GHSSTYQTVPCSSSVCSLANRYHCPGTCGGAPGPGCGN-- 65 (362)
T ss_pred cccCccCCeeEEEECCCCceeeeCCC--------------CCcCCCCccCcCChhhccccccCCCccccCCCCCCCCC--
Confidence 5888888 999999999999999975 4688999999999999876532 24654
Q ss_pred CcceeEEE-cCCCceEEEEEEEEEEEecCCCCCC--CcccCCeEEEeEEcCC
Q 037565 165 NTCQFYYS-YADGSYTTGNVGTEVFVISSKTNPD--QGTFPNIVYGCGNNNT 213 (218)
Q Consensus 165 ~~c~~~~~-Y~dgs~~~G~l~~D~~~~~~~~~~~--~~~~~~~~FGc~~~~~ 213 (218)
+.|.|... |++|+.++|+|++|+|+|+..++.. ..+++++.|||++++.
T Consensus 66 ~~C~y~~~~y~~gs~t~G~l~~Dtl~~~~~~g~~~~~~~~~~~~FGC~~~~~ 117 (362)
T cd05489 66 NTCTAHPYNPVTGECATGDLTQDVLSANTTDGSNPLLVVIFNFVFSCAPSLL 117 (362)
T ss_pred CcCeeEccccccCcEeeEEEEEEEEEecccCCCCcccceeCCEEEEcCCccc
Confidence 46999765 8899899999999999998654421 1368999999998753
No 23
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=99.71 E-value=2.6e-17 Score=142.33 Aligned_cols=96 Identities=32% Similarity=0.601 Sum_probs=83.8
Q ss_pred eEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCC-CCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565 90 EYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKC-YKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ 168 (218)
Q Consensus 90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C-~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~ 168 (218)
+|+++|.||||+|++.|++||||+.+||++..|..| .-.....|++.+|+|++.. .+.
T Consensus 1 ~Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~---------------------~~~ 59 (317)
T PF00026_consen 1 QYYINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQ---------------------GKP 59 (317)
T ss_dssp EEEEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEE---------------------EEE
T ss_pred CeEEEEEECCCCeEEEEEEecccceeeeceeccccccccccccccccccccccccc---------------------eee
Confidence 699999999999999999999999999998888765 2245678999999999884 578
Q ss_pred eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCC
Q 037565 169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNT 213 (218)
Q Consensus 169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~ 213 (218)
+.+.|++|+ .+|.+++|+|.|++ +.++++.||.+....
T Consensus 60 ~~~~y~~g~-~~G~~~~D~v~ig~------~~~~~~~f~~~~~~~ 97 (317)
T PF00026_consen 60 FSISYGDGS-VSGNLVSDTVSIGG------LTIPNQTFGLADSYS 97 (317)
T ss_dssp EEEEETTEE-EEEEEEEEEEEETT------EEEEEEEEEEEEEEE
T ss_pred eeeeccCcc-cccccccceEeeee------ccccccceecccccc
Confidence 999999999 66999999999987 678899999998843
No 24
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=99.67 E-value=1.8e-16 Score=135.14 Aligned_cols=74 Identities=50% Similarity=0.972 Sum_probs=65.4
Q ss_pred eEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCccee
Q 037565 90 EYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQF 169 (218)
Q Consensus 90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~ 169 (218)
+|+++|.||||||++.|++||||+++||+| |.|
T Consensus 1 ~Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~-----------------------------------------------~~~ 33 (265)
T cd05476 1 EYLVTLSIGTPPQPFSLIVDTGSDLTWTQC-----------------------------------------------CSY 33 (265)
T ss_pred CeEEEEecCCCCcceEEEecCCCCCEEEcC-----------------------------------------------Cce
Confidence 699999999999999999999999999986 257
Q ss_pred EEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565 170 YYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG 214 (218)
Q Consensus 170 ~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g 214 (218)
.+.|+||+..+|.+++|+|+|++.. ..++++.|||+.++.+
T Consensus 34 ~~~Y~dg~~~~G~~~~D~v~~g~~~----~~~~~~~Fg~~~~~~~ 74 (265)
T cd05476 34 EYSYGDGSSTSGVLATETFTFGDSS----VSVPNVAFGCGTDNEG 74 (265)
T ss_pred EeEeCCCceeeeeEEEEEEEecCCC----CccCCEEEEecccccC
Confidence 8899999888999999999999721 1689999999998753
No 25
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=99.52 E-value=5.7e-14 Score=120.82 Aligned_cols=68 Identities=31% Similarity=0.572 Sum_probs=61.5
Q ss_pred eEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCccee
Q 037565 90 EYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQF 169 (218)
Q Consensus 90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~ 169 (218)
.|+++|.||||+|++.|++||||+++||+ .|
T Consensus 2 ~Y~~~i~iGtp~q~~~v~~DTgS~~~wv~-------------------------------------------------~~ 32 (295)
T cd05474 2 YYSAELSVGTPPQKVTVLLDTGSSDLWVP-------------------------------------------------DF 32 (295)
T ss_pred eEEEEEEECCCCcEEEEEEeCCCCcceee-------------------------------------------------ee
Confidence 59999999999999999999999999996 25
Q ss_pred EEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcC
Q 037565 170 YYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNN 212 (218)
Q Consensus 170 ~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~ 212 (218)
.+.|++|+.+.|.+++|+|+|++ ..++++.|||+++.
T Consensus 33 ~~~Y~~g~~~~G~~~~D~v~~g~------~~~~~~~fg~~~~~ 69 (295)
T cd05474 33 SISYGDGTSASGTWGTDTVSIGG------ATVKNLQFAVANST 69 (295)
T ss_pred EEEeccCCcEEEEEEEEEEEECC------eEecceEEEEEecC
Confidence 77899988788999999999987 47889999999863
No 26
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=97.05 E-value=0.0025 Score=44.98 Aligned_cols=79 Identities=18% Similarity=0.148 Sum_probs=52.0
Q ss_pred ceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565 89 GEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ 168 (218)
Q Consensus 89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~ 168 (218)
+.|++++.|+ .+++.+++|||++.+|+.-.-...+ .. + .. ....
T Consensus 1 ~~~~v~v~i~--~~~~~~llDTGa~~s~i~~~~~~~l-----~~--~-----~~----------------------~~~~ 44 (96)
T cd05483 1 GHFVVPVTIN--GQPVRFLLDTGASTTVISEELAERL-----GL--P-----LT----------------------LGGK 44 (96)
T ss_pred CcEEEEEEEC--CEEEEEEEECCCCcEEcCHHHHHHc-----CC--C-----cc----------------------CCCc
Confidence 3589999999 6999999999999999864211111 00 0 00 1234
Q ss_pred eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeE
Q 037565 169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCG 209 (218)
Q Consensus 169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~ 209 (218)
..+...+|.........+.|++++ ..+.++.+.-.
T Consensus 45 ~~~~~~~G~~~~~~~~~~~i~ig~------~~~~~~~~~v~ 79 (96)
T cd05483 45 VTVQTANGRVRAARVRLDSLQIGG------ITLRNVPAVVL 79 (96)
T ss_pred EEEEecCCCccceEEEcceEEECC------cEEeccEEEEe
Confidence 566777777666666688899987 45666555444
No 27
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=92.82 E-value=1.1 Score=33.70 Aligned_cols=84 Identities=11% Similarity=0.090 Sum_probs=52.9
Q ss_pred CCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCc
Q 037565 87 SGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNT 166 (218)
Q Consensus 87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~ 166 (218)
.+|.|++++.|. .+++.+++|||++.+-+.-.--..- ..++.. ..
T Consensus 8 ~~g~~~v~~~In--G~~~~flVDTGAs~t~is~~~A~~L------gl~~~~---------------------------~~ 52 (121)
T TIGR02281 8 GDGHFYATGRVN--GRNVRFLVDTGATSVALNEEDAQRL------GLDLNR---------------------------LG 52 (121)
T ss_pred CCCeEEEEEEEC--CEEEEEEEECCCCcEEcCHHHHHHc------CCCccc---------------------------CC
Confidence 578899999997 5799999999999987753210000 011110 01
Q ss_pred ceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEc
Q 037565 167 CQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNN 211 (218)
Q Consensus 167 c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~ 211 (218)
-...+.=..|........-|.+.+|+ ..+.++.+.....
T Consensus 53 ~~~~~~ta~G~~~~~~~~l~~l~iG~------~~~~nv~~~v~~~ 91 (121)
T TIGR02281 53 YTVTVSTANGQIKAARVTLDRVAIGG------IVVNDVDAMVAEG 91 (121)
T ss_pred ceEEEEeCCCcEEEEEEEeCEEEECC------EEEeCcEEEEeCC
Confidence 12233334555554566888999987 5777887776643
No 28
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=91.46 E-value=0.26 Score=34.81 Aligned_cols=27 Identities=26% Similarity=0.097 Sum_probs=23.5
Q ss_pred EEEEEEeCCCceeEEEEEeCCCCceeEec
Q 037565 91 YFVNISIGTPRVELIGIADTASDLTWIQC 119 (218)
Q Consensus 91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c 119 (218)
|++++.|+ .+++.+++||||+..++.-
T Consensus 1 ~~~~~~In--g~~i~~lvDTGA~~svis~ 27 (91)
T cd05484 1 KTVTLLVN--GKPLKFQLDTGSAITVISE 27 (91)
T ss_pred CEEEEEEC--CEEEEEEEcCCcceEEeCH
Confidence 57888888 5899999999999999864
No 29
>PF13650 Asp_protease_2: Aspartyl protease
Probab=88.00 E-value=3.8 Score=27.93 Aligned_cols=23 Identities=13% Similarity=0.205 Sum_probs=18.7
Q ss_pred EEEeCCCceeEEEEEeCCCCceeEe
Q 037565 94 NISIGTPRVELIGIADTASDLTWIQ 118 (218)
Q Consensus 94 ~i~iGtP~q~~~~~~DTGS~~~Wv~ 118 (218)
++.|+ .+++.+++|||++...+.
T Consensus 2 ~v~vn--g~~~~~liDTGa~~~~i~ 24 (90)
T PF13650_consen 2 PVKVN--GKPVRFLIDTGASISVIS 24 (90)
T ss_pred EEEEC--CEEEEEEEcCCCCcEEEC
Confidence 56666 589999999999987764
No 30
>PF13975 gag-asp_proteas: gag-polyprotein putative aspartyl protease
Probab=86.94 E-value=1.4 Score=29.85 Aligned_cols=32 Identities=28% Similarity=0.474 Sum_probs=27.4
Q ss_pred CCceEEEEEEeCCCceeEEEEEeCCCCceeEecC
Q 037565 87 SGGEYFVNISIGTPRVELIGIADTASDLTWIQCK 120 (218)
Q Consensus 87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~ 120 (218)
..+.+++.+.||. +.+.+++|||++...+.-.
T Consensus 5 ~~g~~~v~~~I~g--~~~~alvDtGat~~fis~~ 36 (72)
T PF13975_consen 5 DPGLMYVPVSIGG--VQVKALVDTGATHNFISES 36 (72)
T ss_pred cCCEEEEEEEECC--EEEEEEEeCCCcceecCHH
Confidence 4578999999995 9999999999999887654
No 31
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=80.56 E-value=2.9 Score=29.65 Aligned_cols=27 Identities=22% Similarity=0.458 Sum_probs=21.3
Q ss_pred EEEEEeCCCceeEEEEEeCCCCceeEecC
Q 037565 92 FVNISIGTPRVELIGIADTASDLTWIQCK 120 (218)
Q Consensus 92 ~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~ 120 (218)
+.+|.+. .+++.+++|||++..-++-.
T Consensus 7 ~i~v~i~--g~~i~~LlDTGA~vsiI~~~ 33 (100)
T PF00077_consen 7 YITVKIN--GKKIKALLDTGADVSIISEK 33 (100)
T ss_dssp EEEEEET--TEEEEEEEETTBSSEEESSG
T ss_pred eEEEeEC--CEEEEEEEecCCCcceeccc
Confidence 4566666 47999999999999888643
No 32
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=77.78 E-value=4.3 Score=30.37 Aligned_cols=31 Identities=16% Similarity=0.305 Sum_probs=26.1
Q ss_pred CCceEEEEEEeCCCceeEEEEEeCCCCceeEec
Q 037565 87 SGGEYFVNISIGTPRVELIGIADTASDLTWIQC 119 (218)
Q Consensus 87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c 119 (218)
....+++++.|+ .+++.+++|||++..++.-
T Consensus 13 ~~~~~~v~~~In--g~~~~~LvDTGAs~s~Is~ 43 (124)
T cd05479 13 KVPMLYINVEIN--GVPVKAFVDSGAQMTIMSK 43 (124)
T ss_pred eeeEEEEEEEEC--CEEEEEEEeCCCceEEeCH
Confidence 345689999998 5889999999999998854
No 33
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=74.43 E-value=13 Score=30.79 Aligned_cols=72 Identities=15% Similarity=0.083 Sum_probs=50.2
Q ss_pred cCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCC
Q 037565 86 SSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRN 165 (218)
Q Consensus 86 ~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~ 165 (218)
..+|-|.++..|- .|++..++|||-+.+-++-..- ..--||... .
T Consensus 101 ~~~GHF~a~~~VN--Gk~v~fLVDTGATsVal~~~dA------~RlGid~~~---------------------------l 145 (215)
T COG3577 101 SRDGHFEANGRVN--GKKVDFLVDTGATSVALNEEDA------RRLGIDLNS---------------------------L 145 (215)
T ss_pred cCCCcEEEEEEEC--CEEEEEEEecCcceeecCHHHH------HHhCCCccc---------------------------c
Confidence 4688899999987 6999999999999887753210 011233332 1
Q ss_pred cceeEEEcCCCceEEEEEEEEEEEecC
Q 037565 166 TCQFYYSYADGSYTTGNVGTEVFVISS 192 (218)
Q Consensus 166 ~c~~~~~Y~dgs~~~G~l~~D~~~~~~ 192 (218)
.-++.+.-.+|....-.+--|.|.||+
T Consensus 146 ~y~~~v~TANG~~~AA~V~Ld~v~IG~ 172 (215)
T COG3577 146 DYTITVSTANGRARAAPVTLDRVQIGG 172 (215)
T ss_pred CCceEEEccCCccccceEEeeeEEEcc
Confidence 235556667787766778889999987
No 34
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=71.55 E-value=6.2 Score=27.98 Aligned_cols=24 Identities=25% Similarity=0.329 Sum_probs=19.7
Q ss_pred EEEeCCCceeEEEEEeCCCCceeEec
Q 037565 94 NISIGTPRVELIGIADTASDLTWIQC 119 (218)
Q Consensus 94 ~i~iGtP~q~~~~~~DTGS~~~Wv~c 119 (218)
++.|+ .|.+.+++|||.+++-+.-
T Consensus 2 ~~~i~--g~~~~~llDTGAd~Tvi~~ 25 (87)
T cd05482 2 TLYIN--GKLFEGLLDTGADVSIIAE 25 (87)
T ss_pred EEEEC--CEEEEEEEccCCCCeEEcc
Confidence 35566 6999999999999998863
No 35
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=65.08 E-value=9.1 Score=26.57 Aligned_cols=23 Identities=17% Similarity=0.342 Sum_probs=18.4
Q ss_pred EEeCCCceeEEEEEeCCCCceeEec
Q 037565 95 ISIGTPRVELIGIADTASDLTWIQC 119 (218)
Q Consensus 95 i~iGtP~q~~~~~~DTGS~~~Wv~c 119 (218)
+.|. .+++.+++|||.+.+-+.-
T Consensus 3 v~In--G~~~~fLvDTGA~~tii~~ 25 (86)
T cd06095 3 ITVE--GVPIVFLVDTGATHSVLKS 25 (86)
T ss_pred EEEC--CEEEEEEEECCCCeEEECH
Confidence 4444 5899999999999998853
No 36
>PF12384 Peptidase_A2B: Ty3 transposon peptidase; InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=48.50 E-value=26 Score=28.09 Aligned_cols=28 Identities=14% Similarity=0.233 Sum_probs=20.9
Q ss_pred eEEEEEEeCCCceeEEEEEeCCCCceeEec
Q 037565 90 EYFVNISIGTPRVELIGIADTASDLTWIQC 119 (218)
Q Consensus 90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c 119 (218)
.+.+.+. .-..++.++|||||....+..
T Consensus 34 T~~v~l~--~~~t~i~vLfDSGSPTSfIr~ 61 (177)
T PF12384_consen 34 TAIVQLN--CKGTPIKVLFDSGSPTSFIRS 61 (177)
T ss_pred EEEEEEe--ecCcEEEEEEeCCCccceeeh
Confidence 3444444 446899999999999988764
No 37
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=35.25 E-value=78 Score=23.96 Aligned_cols=30 Identities=20% Similarity=0.245 Sum_probs=21.1
Q ss_pred CceEEEEEEeCCCceeEEEEEeCCCCceeEec
Q 037565 88 GGEYFVNISIGTPRVELIGIADTASDLTWIQC 119 (218)
Q Consensus 88 ~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c 119 (218)
...+|+++.|. .+++.+.+|||...+-+.-
T Consensus 22 v~mLyI~~~in--g~~vkA~VDtGAQ~tims~ 51 (124)
T PF09668_consen 22 VSMLYINCKIN--GVPVKAFVDTGAQSTIMSK 51 (124)
T ss_dssp ----EEEEEET--TEEEEEEEETT-SS-EEEH
T ss_pred cceEEEEEEEC--CEEEEEEEeCCCCccccCH
Confidence 44689999999 5999999999998877753
No 38
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=34.61 E-value=43 Score=28.24 Aligned_cols=32 Identities=16% Similarity=0.182 Sum_probs=22.8
Q ss_pred ceEEEE---EEeCC---CceeEEEEEeCCCCceeEecC
Q 037565 89 GEYFVN---ISIGT---PRVELIGIADTASDLTWIQCK 120 (218)
Q Consensus 89 ~~Y~~~---i~iGt---P~q~~~~~~DTGS~~~Wv~c~ 120 (218)
..|.++ |.||. +.....+++|||+.++.+|-.
T Consensus 157 ~~y~v~l~~i~vg~~~~~~~~~~~ivDTGTt~t~lp~~ 194 (273)
T cd05475 157 KHYSPGPASLLFNGQPTGGKGLEVVFDSGSSYTYFNAQ 194 (273)
T ss_pred CeEEEeEeEEEECCEECcCCCceEEEECCCceEEcCCc
Confidence 466655 57763 233467999999999999854
No 39
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=33.63 E-value=58 Score=23.65 Aligned_cols=24 Identities=17% Similarity=0.382 Sum_probs=18.2
Q ss_pred EEEEeCCCc----eeEEEEEeCCCCcee
Q 037565 93 VNISIGTPR----VELIGIADTASDLTW 116 (218)
Q Consensus 93 ~~i~iGtP~----q~~~~~~DTGS~~~W 116 (218)
+++.|..|. -++.+++|||.+..-
T Consensus 2 ~~v~~~~p~~~~~~~v~~LVDTGat~~~ 29 (107)
T TIGR03698 2 LDVELSNPKNPEFMEVRALVDTGFSGFL 29 (107)
T ss_pred EEEEEeCCCCCCceEEEEEEECCCCeEE
Confidence 567787772 368999999998653
No 40
>PF05585 DUF1758: Putative peptidase (DUF1758); InterPro: IPR008737 This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases.
Probab=29.54 E-value=31 Score=26.84 Aligned_cols=19 Identities=26% Similarity=0.364 Sum_probs=15.8
Q ss_pred ceeEEEEEeCCCCceeEec
Q 037565 101 RVELIGIADTASDLTWIQC 119 (218)
Q Consensus 101 ~q~~~~~~DTGS~~~Wv~c 119 (218)
.++..++||+||+..++.-
T Consensus 10 ~~~~~~LlDsGSq~SfIt~ 28 (164)
T PF05585_consen 10 QVEARALLDSGSQRSFITE 28 (164)
T ss_pred EEEEEEEEecCCchhHHhH
Confidence 4677899999999988854
No 41
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=28.14 E-value=44 Score=28.16 Aligned_cols=33 Identities=24% Similarity=0.364 Sum_probs=22.7
Q ss_pred CceEEEE---EEeCC----CceeEEEEEeCCCCceeEecC
Q 037565 88 GGEYFVN---ISIGT----PRVELIGIADTASDLTWIQCK 120 (218)
Q Consensus 88 ~~~Y~~~---i~iGt----P~q~~~~~~DTGS~~~Wv~c~ 120 (218)
...|.++ |.||. ......+++|||+.++++|-.
T Consensus 176 ~~~w~v~l~~i~v~~~~~~~~~~~~~iiDSGTs~~~lP~~ 215 (278)
T cd06097 176 SGFWQFTSTSYTVGGDAPWSRSGFSAIADTGTTLILLPDA 215 (278)
T ss_pred CcEEEEEEeeEEECCcceeecCCceEEeecCCchhcCCHH
Confidence 3455544 35653 235678999999999999753
No 42
>PF11645 PDDEXK_5: PD-(D/E)XK endonuclease; InterPro: IPR021671 This family are putative endonuclease proteins which are restricted to Synechocystis. ; PDB: 2OST_D.
Probab=27.14 E-value=41 Score=26.24 Aligned_cols=31 Identities=29% Similarity=0.289 Sum_probs=25.2
Q ss_pred eEEEEEEeCCCceeEEEEEeCCCCceeEecCC
Q 037565 90 EYFVNISIGTPRVELIGIADTASDLTWIQCKP 121 (218)
Q Consensus 90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~ 121 (218)
.|.+.+.+|- .+++.|++|++..+.-|||+.
T Consensus 21 GY~V~~P~gD-n~~YDLV~d~eg~L~RIQvKT 51 (149)
T PF11645_consen 21 GYSVSIPFGD-NLKYDLVFDKEGILWRIQVKT 51 (149)
T ss_dssp T-EEEEESST-TSS-SEEEEETTEEEEEEEEE
T ss_pred CcEEEeecCC-CCCcCEEEecCCcEEEEEEee
Confidence 4888888887 679999999999999999974
No 43
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=27.02 E-value=73 Score=19.92 Aligned_cols=20 Identities=15% Similarity=0.315 Sum_probs=14.6
Q ss_pred EEeCCCceeEEEEEeCCCCcee
Q 037565 95 ISIGTPRVELIGIADTASDLTW 116 (218)
Q Consensus 95 i~iGtP~q~~~~~~DTGS~~~W 116 (218)
+.++. ..+.+++|||+...-
T Consensus 3 ~~~~~--~~~~~liDtgs~~~~ 22 (92)
T cd00303 3 GKING--VPVRALVDSGASVNF 22 (92)
T ss_pred EEECC--EEEEEEEcCCCcccc
Confidence 44443 788999999987643
No 44
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=26.40 E-value=33 Score=24.72 Aligned_cols=21 Identities=14% Similarity=0.207 Sum_probs=9.9
Q ss_pred ChhhHHHHHHHHHHHHhccccc
Q 037565 1 MAALISLLAICITATLQFVFPF 22 (218)
Q Consensus 1 M~~~~~~~~~~~~~~~~~~~~~ 22 (218)
|++= .+|+++++|+.+++++.
T Consensus 1 MaSK-~~llL~l~LA~lLlisS 21 (95)
T PF07172_consen 1 MASK-AFLLLGLLLAALLLISS 21 (95)
T ss_pred Cchh-HHHHHHHHHHHHHHHHh
Confidence 5522 34555555554444443
No 45
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=26.29 E-value=54 Score=28.33 Aligned_cols=31 Identities=23% Similarity=0.204 Sum_probs=21.4
Q ss_pred ceEEEE---EEeCCC-----ceeEEEEEeCCCCceeEec
Q 037565 89 GEYFVN---ISIGTP-----RVELIGIADTASDLTWIQC 119 (218)
Q Consensus 89 ~~Y~~~---i~iGtP-----~q~~~~~~DTGS~~~Wv~c 119 (218)
+.|.++ |.||.. .+...+++|||++++++|-
T Consensus 188 ~~w~v~l~~i~v~g~~~~~~~~~~~aivDTGTs~~~lP~ 226 (317)
T cd06098 188 GYWQFEMGDVLIGGKSTGFCAGGCAAIADSGTSLLAGPT 226 (317)
T ss_pred cEEEEEeCeEEECCEEeeecCCCcEEEEecCCcceeCCH
Confidence 445554 567642 2346799999999999874
No 46
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=25.86 E-value=53 Score=28.47 Aligned_cols=31 Identities=23% Similarity=0.229 Sum_probs=22.0
Q ss_pred ceEEEE---EEeCCC------ceeEEEEEeCCCCceeEec
Q 037565 89 GEYFVN---ISIGTP------RVELIGIADTASDLTWIQC 119 (218)
Q Consensus 89 ~~Y~~~---i~iGtP------~q~~~~~~DTGS~~~Wv~c 119 (218)
..|.++ |.||.. .....+++|||+.+++++-
T Consensus 208 ~~y~v~l~~i~vg~~~~~~~~~~~~~aivDSGTs~~~lp~ 247 (326)
T cd06096 208 YYYYVKLEGLSVYGTTSNSGNTKGLGMLVDSGSTLSHFPE 247 (326)
T ss_pred ceEEEEEEEEEEcccccceecccCCCEEEeCCCCcccCCH
Confidence 456555 467743 2445789999999999975
No 47
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=24.64 E-value=56 Score=27.81 Aligned_cols=31 Identities=29% Similarity=0.287 Sum_probs=21.2
Q ss_pred ceEEEE---EEeCCCc--------eeEEEEEeCCCCceeEec
Q 037565 89 GEYFVN---ISIGTPR--------VELIGIADTASDLTWIQC 119 (218)
Q Consensus 89 ~~Y~~~---i~iGtP~--------q~~~~~~DTGS~~~Wv~c 119 (218)
..|.++ |.||.-. ....+++|||+.+++++-
T Consensus 146 ~~y~v~l~~i~vg~~~~~~~~~~~~~~~~ivDSGTt~~~lp~ 187 (299)
T cd05472 146 TFYYVGLTGISVGGRRLPIPPASFGAGGVIIDSGTVITRLPP 187 (299)
T ss_pred CeEEEeeEEEEECCEECCCCccccCCCCeEEeCCCcceecCH
Confidence 456655 4676421 223689999999999974
No 48
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=24.59 E-value=47 Score=27.50 Aligned_cols=34 Identities=24% Similarity=0.297 Sum_probs=24.3
Q ss_pred CceEEEE---EEeCC-----CceeEEEEEeCCCCceeEecCC
Q 037565 88 GGEYFVN---ISIGT-----PRVELIGIADTASDLTWIQCKP 121 (218)
Q Consensus 88 ~~~Y~~~---i~iGt-----P~q~~~~~~DTGS~~~Wv~c~~ 121 (218)
...|.+. |.+|. ......+++|||+..+|++-.-
T Consensus 179 ~~~~~v~l~~i~v~~~~~~~~~~~~~~iiDsGt~~~~lp~~~ 220 (283)
T cd05471 179 PGYWQVPLDGISVGGKSVISSSGGGGAIVDSGTSLIYLPSSV 220 (283)
T ss_pred CCEEEEEeCeEEECCceeeecCCCcEEEEecCCCCEeCCHHH
Confidence 4556554 45664 2456899999999999998643
No 49
>PLN03146 aspartyl protease family protein; Provisional
Probab=23.61 E-value=90 Score=28.54 Aligned_cols=16 Identities=25% Similarity=0.275 Sum_probs=13.9
Q ss_pred EEEEeCCCCceeEecC
Q 037565 105 IGIADTASDLTWIQCK 120 (218)
Q Consensus 105 ~~~~DTGS~~~Wv~c~ 120 (218)
.++||||+.+++++-.
T Consensus 309 ~~iiDSGTt~t~Lp~~ 324 (431)
T PLN03146 309 NIIIDSGTTLTLLPSD 324 (431)
T ss_pred cEEEeCCccceecCHH
Confidence 6899999999999753
No 50
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=23.07 E-value=1.2e+02 Score=26.16 Aligned_cols=28 Identities=36% Similarity=0.493 Sum_probs=24.0
Q ss_pred CCceEEEEEEeCCCceeEEEEEeCCCCc
Q 037565 87 SGGEYFVNISIGTPRVELIGIADTASDL 114 (218)
Q Consensus 87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~ 114 (218)
.+..|-+++.+|.-..+++..+|||..+
T Consensus 154 ~~~~~~v~i~~~~~~~~~~allDTGN~L 181 (293)
T PF03419_consen 154 KDYLYPVTIEIGGKKIELKALLDTGNQL 181 (293)
T ss_pred CcEEEEEEEEECCEEEEEEEEEECCCcc
Confidence 3556899999998888899999999877
No 51
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=22.51 E-value=2.9e+02 Score=19.64 Aligned_cols=20 Identities=15% Similarity=0.174 Sum_probs=15.8
Q ss_pred eeEEEEEeCCCCceeEecCC
Q 037565 102 VELIGIADTASDLTWIQCKP 121 (218)
Q Consensus 102 q~~~~~~DTGS~~~Wv~c~~ 121 (218)
.....++|||+...-+|...
T Consensus 8 s~~~fLVDTGA~vSviP~~~ 27 (89)
T cd06094 8 SGLRFLVDTGAAVSVLPASS 27 (89)
T ss_pred CCcEEEEeCCCceEeecccc
Confidence 35688999999999887543
No 52
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=20.57 E-value=1.3e+02 Score=25.92 Aligned_cols=28 Identities=25% Similarity=0.408 Sum_probs=24.3
Q ss_pred CCceEEEEEEeCCCceeEEEEEeCCCCc
Q 037565 87 SGGEYFVNISIGTPRVELIGIADTASDL 114 (218)
Q Consensus 87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~ 114 (218)
.+..|-+++.++.-..+++..+|||..+
T Consensus 155 ~~~~~~v~i~~~g~~~~~~alvDTGN~L 182 (288)
T TIGR02854 155 DKQIYELEICLDGKKVTIKGFLDTGNQL 182 (288)
T ss_pred hceEEEEEEEECCEEEEEEEEEecCCcc
Confidence 3567999999998888899999999877
No 53
>PF07496 zf-CW: CW-type Zinc Finger; InterPro: IPR011124 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a CW-type zinc finger motif, named for its conserved cysteine and tryptophan residues. It is predicted to be a highly specialised mononuclear four-cysteine (C4) zinc finger that plays a role in DNA binding and/or promoting protein-protein interactions in complicated eukaryotic processes including chromatin methylation status and early embryonic development. Weak homology to members of IPR001965 from INTERPRO further evidences these predictions. The domain is found exclusively in vertebrates, vertebrate-infecting parasites and higher plants []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2RR4_A 2E61_A 2L7P_A.
Probab=20.46 E-value=54 Score=20.43 Aligned_cols=9 Identities=44% Similarity=1.553 Sum_probs=5.9
Q ss_pred eeEecCCCC
Q 037565 115 TWIQCKPCY 123 (218)
Q Consensus 115 ~Wv~c~~C~ 123 (218)
.||||..|.
T Consensus 2 ~WVQCd~C~ 10 (50)
T PF07496_consen 2 YWVQCDSCL 10 (50)
T ss_dssp EEEE-TTT-
T ss_pred eEEECCCCC
Confidence 599998775
No 54
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=20.11 E-value=96 Score=26.18 Aligned_cols=28 Identities=25% Similarity=0.296 Sum_probs=20.2
Q ss_pred EEEEeCCC----ceeEEEEEeCCCCceeEecC
Q 037565 93 VNISIGTP----RVELIGIADTASDLTWIQCK 120 (218)
Q Consensus 93 ~~i~iGtP----~q~~~~~~DTGS~~~Wv~c~ 120 (218)
..|.+|.= .....+++|||+..+|++-.
T Consensus 185 ~~i~i~~~~~~~~~~~~~~~Dtgt~~i~lp~~ 216 (317)
T PF00026_consen 185 DSISIGGESVFSSSGQQAILDTGTSYIYLPRS 216 (317)
T ss_dssp EEEEETTEEEEEEEEEEEEEETTBSSEEEEHH
T ss_pred ccccccccccccccceeeecccccccccccch
Confidence 33566642 23468999999999999864
No 55
>PF01705 CX: CX module; InterPro: IPR002619 This domain has no known function. It is found in several Caenorhabditis elegans proteins. The domain contains 6 conserved cysteines that probably form three disulphide bridges.
Probab=20.08 E-value=84 Score=20.52 Aligned_cols=36 Identities=19% Similarity=0.358 Sum_probs=24.1
Q ss_pred CcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcC
Q 037565 165 NTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNN 212 (218)
Q Consensus 165 ~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~ 212 (218)
..|.|.+...|+... .+.|.+. ...+.+.|||...+
T Consensus 15 ~~C~y~i~~~d~el~-------~v~F~nG-----t~p~si~fgC~~~~ 50 (61)
T PF01705_consen 15 NICEYQISEDDWELG-------NVTFPNG-----TRPKSIFFGCPSYE 50 (61)
T ss_pred eEEEEECCCCCcccc-------ceEccCC-----CcceeEEEeCCCCC
Confidence 469999987665432 3455442 36789999998653
Done!