Query psy14862
Match_columns 263
No_of_seqs 212 out of 1464
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 22:43:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14862.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14862hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1542|consensus 100.0 5.2E-52 1.1E-56 375.5 12.3 180 75-260 67-329 (372)
2 PTZ00203 cathepsin L protease; 100.0 3.1E-47 6.8E-52 354.2 15.8 145 74-221 33-178 (348)
3 PTZ00021 falcipain-2; Provisio 100.0 1.4E-45 3E-50 354.0 16.4 181 74-261 164-434 (489)
4 PTZ00200 cysteine proteinase; 100.0 7.4E-44 1.6E-48 340.3 16.7 182 69-259 116-398 (448)
5 KOG1543|consensus 100.0 5.1E-38 1.1E-42 290.2 13.3 166 83-259 30-281 (325)
6 smart00645 Pept_C1 Papain fami 99.9 2.7E-26 5.9E-31 194.5 7.4 89 168-261 1-134 (174)
7 cd02621 Peptidase_C1A_Cathepsi 99.9 2.1E-25 4.5E-30 198.2 7.1 53 168-221 1-63 (243)
8 cd02698 Peptidase_C1A_Cathepsi 99.9 2.9E-25 6.2E-30 197.1 7.0 53 168-221 1-62 (239)
9 cd02248 Peptidase_C1A Peptidas 99.9 1.9E-24 4.2E-29 186.5 7.4 88 169-261 1-172 (210)
10 cd02620 Peptidase_C1A_Cathepsi 99.9 2.8E-24 6.1E-29 190.4 6.0 52 169-221 1-58 (236)
11 PF00112 Peptidase_C1: Papain 99.9 7.1E-23 1.5E-27 176.6 3.9 89 168-261 1-179 (219)
12 PTZ00364 dipeptidyl-peptidase 99.9 2.5E-22 5.4E-27 195.6 6.8 56 165-221 202-269 (548)
13 PTZ00049 cathepsin C-like prot 99.8 1.1E-21 2.4E-26 193.8 7.1 56 165-221 378-447 (693)
14 cd02619 Peptidase_C1 C1 Peptid 99.8 1.4E-19 3E-24 156.3 6.0 49 171-221 1-51 (223)
15 PF08246 Inhibitor_I29: Cathep 99.7 5.6E-18 1.2E-22 118.5 7.1 58 79-136 1-58 (58)
16 KOG1544|consensus 99.7 2.3E-17 5E-22 149.2 1.8 110 108-221 151-265 (470)
17 PTZ00462 Serine-repeat antigen 99.6 1.8E-16 3.8E-21 161.3 5.0 41 180-221 544-584 (1004)
18 smart00848 Inhibitor_I29 Cathe 99.6 1.2E-15 2.5E-20 105.8 5.6 57 79-135 1-57 (57)
19 COG4870 Cysteine protease [Pos 99.2 6.8E-12 1.5E-16 115.8 1.9 41 166-207 97-137 (372)
20 cd00585 Peptidase_C1B Peptidas 98.1 8.3E-06 1.8E-10 78.6 6.9 40 181-221 55-95 (437)
21 PF08127 Propeptide_C1: Peptid 95.5 0.012 2.7E-07 38.1 2.6 31 108-140 4-34 (41)
22 PF03051 Peptidase_C1_2: Pepti 93.0 0.07 1.5E-06 51.8 2.7 39 182-221 57-96 (438)
23 PF05543 Peptidase_C47: Stapho 70.2 3.9 8.5E-05 34.8 2.6 36 186-221 18-61 (175)
24 cd00042 CY Substituted updates 64.5 9 0.00019 28.9 3.5 22 28-50 80-101 (105)
25 COG3579 PepC Aminopeptidase C 63.7 6.1 0.00013 37.3 2.7 22 182-204 59-80 (444)
26 smart00043 CY Cystatin-like do 44.7 12 0.00027 28.3 1.3 15 32-46 85-99 (107)
27 PHA02088 hypothetical protein 29.0 22 0.00049 27.6 0.5 43 22-90 6-48 (125)
28 KOG2735|consensus 26.6 38 0.00083 32.6 1.6 27 184-212 368-394 (466)
29 PF00031 Cystatin: Cystatin do 25.5 37 0.00081 24.8 1.1 17 26-42 78-94 (94)
30 KOG3554|consensus 24.4 2.5E+02 0.0054 27.9 6.6 19 75-93 292-310 (693)
31 PF13529 Peptidase_C39_2: Pept 24.3 56 0.0012 24.9 2.0 14 246-259 122-135 (144)
32 KOG4128|consensus 24.2 24 0.00052 33.3 -0.2 40 181-221 63-103 (457)
33 KOG4702|consensus 20.8 2E+02 0.0044 20.9 4.0 32 77-109 29-60 (77)
No 1
>KOG1542|consensus
Probab=100.00 E-value=5.2e-52 Score=375.55 Aligned_cols=180 Identities=37% Similarity=0.632 Sum_probs=155.6
Q ss_pred HHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCchhhhhhcccc
Q psy14862 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKST 154 (263)
Q Consensus 75 ~~~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~~~~~~~~~~ 154 (263)
..++|..|+.+|+|+|.+.+|..+|+.||++|+..++++++.+.++..+|+|+|||||+|||++++++.+.... +..
T Consensus 67 ~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~---~~~ 143 (372)
T KOG1542|consen 67 LEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGS---KLP 143 (372)
T ss_pred hHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhccccccc---cCc
Confidence 47799999999999999999999999999999999999999987899999999999999999999988765311 011
Q ss_pred CcccccCCCCCCCCCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc-------------
Q psy14862 155 FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP------------- 221 (263)
Q Consensus 155 ~~~~~~~~~~~~~lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL------------- 221 (263)
.............+|++||||++|+||| |||||+||||||||+||+||++++|++|++++||||||
T Consensus 144 ~~~~~~~~~~~~~lP~~fDWR~kgaVTp-VKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d~gC~GG 222 (372)
T KOG1542|consen 144 GDAAEAPIEPGESLPESFDWRDKGAVTP-VKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCDNGCNGG 222 (372)
T ss_pred cccccCcCCCCCCCCcccchhccCCccc-cccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcCCcCCCC
Confidence 1111111133567999999999999999 99999999999999999999999999999999999999
Q ss_pred -------------------------c---------------------------------------------ccCCCCCCc
Q psy14862 222 -------------------------K---------------------------------------------THGRFYKGG 231 (263)
Q Consensus 222 -------------------------K---------------------------------------------~~f~~Y~~G 231 (263)
| ..||+|++|
T Consensus 223 l~~nA~~~~~~~gGL~~E~dYPY~g~~~~~C~~~~~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~YrgG 302 (372)
T KOG1542|consen 223 LMDNAFKYIKKAGGLEKEKDYPYTGKKGNQCHFDKSKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFYRGG 302 (372)
T ss_pred ChhHHHHHHHHhCCccccccCCccccCCCccccchhhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEchHHHHHhccc
Confidence 0 789999999
Q ss_pred cccCCCCcCCCCCCCCCeEEEEEEEecCC
Q psy14862 232 VMNLPHMLCSKGPYSLNHAVLNVGYDNES 260 (263)
Q Consensus 232 V~~~~~c~~~~~~~~~nHaVliVGYG~~~ 260 (263)
|+.+..-.|+ +..+||||||||||...
T Consensus 303 V~~P~~~~Cs--~~~~~HaVLlvGyG~~g 329 (372)
T KOG1542|consen 303 VSCPSKYICS--PKLLNHAVLLVGYGSSG 329 (372)
T ss_pred ccCCCcccCC--ccccCceEEEEeecCCC
Confidence 9999444447 77799999999999875
No 2
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=3.1e-47 Score=354.16 Aligned_cols=145 Identities=28% Similarity=0.519 Sum_probs=118.5
Q ss_pred HHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCch-hhhhhcc
Q psy14862 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDW-EQIENLK 152 (263)
Q Consensus 74 ~~~~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~-~~~~~~~ 152 (263)
.+..+|++|+++|+|.|.+.+|+.+|+.||++|+++|++||+++ .+|++|+|+|+|||+|||.+++++... ...+ ..
T Consensus 33 ~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~-~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~~~-~~ 110 (348)
T PTZ00203 33 PAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARN-PHARFGITKFFDLSEAEFAARYLNGAAYFAAA-KQ 110 (348)
T ss_pred HHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccC-CCeEEeccccccCCHHHHHHHhcCCCcccccc-cc
Confidence 36678999999999999998899999999999999999999875 699999999999999999987653211 1000 00
Q ss_pred ccCcccccCCCCCCCCCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc
Q psy14862 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP 221 (263)
Q Consensus 153 ~~~~~~~~~~~~~~~lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL 221 (263)
..............++|++||||++|+|+| |||||.||||||||++++||++++|++|+++.||+|||
T Consensus 111 ~~~~~~~~~~~~~~~lP~~~DWR~~g~Vtp-VkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqL 178 (348)
T PTZ00203 111 HAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQL 178 (348)
T ss_pred cccccccccccccccCCCCCcCCcCCCCCC-ccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHH
Confidence 000111111112236899999999999999 99999999999999999999999999999999999999
No 3
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=1.4e-45 Score=354.03 Aligned_cols=181 Identities=34% Similarity=0.514 Sum_probs=146.9
Q ss_pred HHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCchh-hhhhcc
Q psy14862 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWE-QIENLK 152 (263)
Q Consensus 74 ~~~~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~~-~~~~~~ 152 (263)
+....|++|+.+|+|+|.+.+|+.+|+.||++|+++|++||+.++.+|++|+|+|+|||.|||++++++.... ......
T Consensus 164 e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~~~~ 243 (489)
T PTZ00021 164 ENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKSNGK 243 (489)
T ss_pred HHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhccccccccccccc
Confidence 3557899999999999999999999999999999999999987667999999999999999999887764321 000000
Q ss_pred ccC--ccc----c-cCCCCCCCCCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc----
Q psy14862 153 STF--ETY----S-FNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP---- 221 (263)
Q Consensus 153 ~~~--~~~----~-~~~~~~~~lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL---- 221 (263)
... ... . .........|++||||++|+||| |||||.||||||||++++||++++|++|+++.||+|||
T Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~~P~s~DWR~~g~Vtp-VKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLVDCs 322 (489)
T PTZ00021 244 KSPRVINYDDVIKKYKPKDATFDHAKYDWRLHNGVTP-VKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCS 322 (489)
T ss_pred cccccccccccccccccccccCCccccccccCCCCCC-cccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHhhhc
Confidence 000 000 0 00001112499999999999999 99999999999999999999999999999999999999
Q ss_pred --------------------------------------------------------------c----------------c
Q psy14862 222 --------------------------------------------------------------K----------------T 223 (263)
Q Consensus 222 --------------------------------------------------------------K----------------~ 223 (263)
| .
T Consensus 323 ~~n~GC~GG~~~~Af~yi~~~gGl~tE~~YPY~~~~~~~C~~~~~~~~~~i~~y~~i~~~~lk~al~~~GPVsv~i~a~~ 402 (489)
T PTZ00021 323 FKNNGCYGGLIPNAFEDMIELGGLCSEDDYPYVSDTPELCNIDRCKEKYKIKSYVSIPEDKFKEAIRFLGPISVSIAVSD 402 (489)
T ss_pred cCCCCCCCcchHhhhhhhhhccccCcccccCccCCCCCccccccccccceeeeEEEecHHHHHHHHHhcCCeEEEEEeec
Confidence 0 5
Q ss_pred cCCCCCCccccCCCCcCCCCCCCCCeEEEEEEEecCCC
Q psy14862 224 HGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNEST 261 (263)
Q Consensus 224 ~f~~Y~~GV~~~~~c~~~~~~~~~nHaVliVGYG~~~g 261 (263)
+|++|++|||+++ | ...+||||+|||||.++|
T Consensus 403 ~f~~YkgGIy~~~-C-----~~~~nHAVlIVGYG~e~~ 434 (489)
T PTZ00021 403 DFAFYKGGIFDGE-C-----GEEPNHAVILVGYGMEEI 434 (489)
T ss_pred ccccCCCCcCCCC-C-----CCccceEEEEEEecCcCC
Confidence 8999999999874 7 445899999999998753
No 4
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=7.4e-44 Score=340.30 Aligned_cols=182 Identities=30% Similarity=0.421 Sum_probs=146.0
Q ss_pred chhHhHHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCchhhh
Q psy14862 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQI 148 (263)
Q Consensus 69 l~~~~~~~~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~~~~ 148 (263)
...+.++..+|++|+++|+|.|.+.+|+.+|+.||++|+++|++||.. .+|++|+|+|+|||+|||.+++++...+..
T Consensus 116 ~~~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~--~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~ 193 (448)
T PTZ00200 116 PKLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGD--EPYSKEINKFSDLTEEEFRKLFPVIKVPPK 193 (448)
T ss_pred ccchHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCc--CCeEEeccccccCCHHHHHHHhccCCCccc
Confidence 334455888999999999999999999999999999999999999964 589999999999999999987765432100
Q ss_pred hhc--cc--------cCccc----ccC-----CC--CCCCCCcccccccCCCCCCcccCCC-CCchhHHHHhhhHHHHHH
Q psy14862 149 ENL--KS--------TFETY----SFN-----SS--NSYGLAESINYKDKGKVLPKVQDQH-LCGSCWAHSAVACLESAY 206 (263)
Q Consensus 149 ~~~--~~--------~~~~~----~~~-----~~--~~~~lP~s~DWR~~g~Vtp~VknQg-~CGSCWAFat~~~iEs~~ 206 (263)
... .. ....+ +.. .. ....+|++||||+.|+|+| ||||| .||||||||++++||+++
T Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtp-VkdQG~~CGSCWAFat~~aiEs~~ 272 (448)
T PTZ00200 194 SNSTSHNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTK-VKDQGLNCGSCWAFSSVGSVESLY 272 (448)
T ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCccCCCCCCCCC-cccCCCccchHHHHhHHHHHHHHH
Confidence 000 00 00000 000 00 0123699999999999999 99999 999999999999999999
Q ss_pred HHHcCcceecCCCCc----------------------------------------------------------------c
Q psy14862 207 AIKHNELIELSKQPP----------------------------------------------------------------K 222 (263)
Q Consensus 207 ~I~tg~~~~LSeQqL----------------------------------------------------------------K 222 (263)
+|++++++.||+||| +
T Consensus 273 ~i~~~~~~~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~~~Gi~~e~~YPY~~~~~~C~~~~~~~~~i~~y~~~~~~~~l~ 352 (448)
T PTZ00200 273 KIYRDKSVDLSEQELVNCDTKSQGCSGGYPDTALEYVKNKGLSSSSDVPYLAKDGKCVVSSTKKVYIDSYLVAKGKDVLN 352 (448)
T ss_pred HHhcCCCeecCHHHHhhccCccCCCCCCcHHHHHHHHhhcCccccccCCCCCCCCCCcCCCCCeeEecceEecCHHHHHH
Confidence 999999999999999 0
Q ss_pred ---------------ccCCCCCCccccCCCCcCCCCCCCCCeEEEEEEEecC
Q psy14862 223 ---------------THGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259 (263)
Q Consensus 223 ---------------~~f~~Y~~GV~~~~~c~~~~~~~~~nHaVliVGYG~~ 259 (263)
.+|++|++|||+++ | +..+||||+|||||.+
T Consensus 353 ~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~-C-----~~~~nHaV~lVGyG~d 398 (448)
T PTZ00200 353 KSLVISPTVVYIAVSRELLKYKSGVYNGE-C-----GKSLNHAVLLVGEGYD 398 (448)
T ss_pred HHHhcCCEEEEeecccccccCCCCccccc-c-----CCCCcEEEEEEEeccc
Confidence 68999999999874 7 3458999999999964
No 5
>KOG1543|consensus
Probab=100.00 E-value=5.1e-38 Score=290.19 Aligned_cols=166 Identities=33% Similarity=0.515 Sum_probs=139.5
Q ss_pred HHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCchhhhhhccccCcccccCC
Q psy14862 83 VREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNS 162 (263)
Q Consensus 83 ~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (263)
+.+|.+.|.+..|...|+.+|.+|++.|+.||.....+|++|+|+|+|++.+||++.+.+..++.. ...... ..
T Consensus 30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~-----~~~~~~-~~ 103 (325)
T KOG1543|consen 30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTGKKPPEI-----KRDKFT-EK 103 (325)
T ss_pred hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhccccCccc-----cccccc-cc
Confidence 677777887778999999999999999999999856899999999999999999998777654421 001111 11
Q ss_pred CCCCCCCcccccccCC-CCCCcccCCCCCchhHHHHhhhHHHHHHHHHcC-cceecCCCCc-------------------
Q psy14862 163 SNSYGLAESINYKDKG-KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN-ELIELSKQPP------------------- 221 (263)
Q Consensus 163 ~~~~~lP~s~DWR~~g-~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg-~~~~LSeQqL------------------- 221 (263)
....++|++||||++| .++| |||||.||||||||++++||++++|++| .+++||+|||
T Consensus 104 ~~~~~~p~s~DwR~~~~~~~~-vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~GC~GG~~~~A 182 (325)
T KOG1543|consen 104 LDGDDLPDSFDWRDKGAVTPP-VKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCGECGDGCNGGEPKNA 182 (325)
T ss_pred cchhhCCCCccccccCCcCCC-cCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccCCCCCCcCCCCHHHH
Confidence 2245799999999996 5666 9999999999999999999999999999 8999999999
Q ss_pred -----------------------------------------------------------------cccCCCCCCccccCC
Q psy14862 222 -----------------------------------------------------------------KTHGRFYKGGVMNLP 236 (263)
Q Consensus 222 -----------------------------------------------------------------K~~f~~Y~~GV~~~~ 236 (263)
..+|++|++|||.++
T Consensus 183 ~~yi~~~G~~t~~~~Ypy~~~~~~C~~~~~~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~ 262 (325)
T KOG1543|consen 183 FKYIKKNGGVTECENYPYIGKDGTCKSNKKDKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEE 262 (325)
T ss_pred HHHHHHhCCCCCCcCCCCcCCCCCccCCCccceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCC
Confidence 078999999999999
Q ss_pred CCcCCCCCCCCCeEEEEEEEecC
Q psy14862 237 HMLCSKGPYSLNHAVLNVGYDNE 259 (263)
Q Consensus 237 ~c~~~~~~~~~nHaVliVGYG~~ 259 (263)
.|.+ ..++|||+|||||+.
T Consensus 263 ~~~~----~~~~Hav~iVGyG~~ 281 (325)
T KOG1543|consen 263 KGDD----KEGDHAVLIVGYGTG 281 (325)
T ss_pred CCCC----CCCCceEEEEEEcCC
Confidence 8822 269999999999993
No 6
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.93 E-value=2.7e-26 Score=194.54 Aligned_cols=89 Identities=47% Similarity=0.809 Sum_probs=81.1
Q ss_pred CCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc-----------------------c--
Q psy14862 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP-----------------------K-- 222 (263)
Q Consensus 168 lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL-----------------------K-- 222 (263)
||++||||+.++++| |+|||.||+|||||++++||++++|++++.+.||+|+| +
T Consensus 1 lP~~~D~R~~~~~~~-v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~~ 79 (174)
T smart00645 1 LPESFDWRKKGAVTP-VKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKKN 79 (174)
T ss_pred CCCcCcccccCCCCc-cccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHHc
Confidence 699999999999999 99999999999999999999999999999999999999 0
Q ss_pred -------------------ccCCCCCCccccCCCCcCCCCCCCCCeEEEEEEEecC-CC
Q psy14862 223 -------------------THGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE-ST 261 (263)
Q Consensus 223 -------------------~~f~~Y~~GV~~~~~c~~~~~~~~~nHaVliVGYG~~-~g 261 (263)
.+|++|++|||+.+.| . ...++|+|+|||||.+ +|
T Consensus 80 ~Gi~~e~~~PY~~~~~~~~~~f~~Y~~Gi~~~~~~--~--~~~~~Hav~ivGyg~~~~g 134 (174)
T smart00645 80 GGLETESCYPYTGSVAIDASDFQFYKSGIYDHPGC--G--SGTLDHAVLIVGYGTEENG 134 (174)
T ss_pred CCcccccccCcccEEEEEcccccCCcCeEECCCCC--C--CCcccEEEEEEEEeecCCC
Confidence 5799999999998767 4 3447999999999987 55
No 7
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.92 E-value=2.1e-25 Score=198.19 Aligned_cols=53 Identities=38% Similarity=0.528 Sum_probs=50.9
Q ss_pred CCcccccccCC----CCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCc------ceecCCCCc
Q psy14862 168 LAESINYKDKG----KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE------LIELSKQPP 221 (263)
Q Consensus 168 lP~s~DWR~~g----~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~------~~~LSeQqL 221 (263)
||++||||+.+ +|+| |+|||.||||||||++++||++++|++++ .+.||+|||
T Consensus 1 lP~~fDwr~~~~~~~~v~~-v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l 63 (243)
T cd02621 1 LPKSFDWGDVNNGFNYVSP-VRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHV 63 (243)
T ss_pred CCCcccccccCCCCccccc-CCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHh
Confidence 69999999998 9999 99999999999999999999999999876 789999999
No 8
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.91 E-value=2.9e-25 Score=197.08 Aligned_cols=53 Identities=34% Similarity=0.641 Sum_probs=49.5
Q ss_pred CCcccccccCC---CCCCcccCCC---CCchhHHHHhhhHHHHHHHHHcC---cceecCCCCc
Q psy14862 168 LAESINYKDKG---KVLPKVQDQH---LCGSCWAHSAVACLESAYAIKHN---ELIELSKQPP 221 (263)
Q Consensus 168 lP~s~DWR~~g---~Vtp~VknQg---~CGSCWAFat~~~iEs~~~I~tg---~~~~LSeQqL 221 (263)
||++||||+.+ +|+| ||||| .||||||||++++||++++|+++ ..+.||+|||
T Consensus 1 lP~~~Dwr~~~~~~~v~~-vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~l 62 (239)
T cd02698 1 LPKSWDWRNVNGVNYVSP-TRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVV 62 (239)
T ss_pred CCCCcccccCCCCcccCc-cccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHH
Confidence 69999999988 9999 99998 89999999999999999999875 3678999999
No 9
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.90 E-value=1.9e-24 Score=186.49 Aligned_cols=88 Identities=52% Similarity=0.862 Sum_probs=80.8
Q ss_pred CcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc---------------------------
Q psy14862 169 AESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP--------------------------- 221 (263)
Q Consensus 169 P~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL--------------------------- 221 (263)
|++||||+.+.++| |+|||.||+|||||+++++|++++|+++..+.||+|+|
T Consensus 1 P~~~d~r~~~~~~~-v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~~G 79 (210)
T cd02248 1 PESVDWREKGAVTP-VKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNGG 79 (210)
T ss_pred CCcccCCcCCCCCC-CccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHHCC
Confidence 88999999999999 99999999999999999999999999999999999999
Q ss_pred -----------------------------------------c----------------ccCCCCCCccccCCCCcCCCCC
Q psy14862 222 -----------------------------------------K----------------THGRFYKGGVMNLPHMLCSKGP 244 (263)
Q Consensus 222 -----------------------------------------K----------------~~f~~Y~~GV~~~~~c~~~~~~ 244 (263)
| ++|+.|++|||..+.| + .
T Consensus 80 i~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~--~--~ 155 (210)
T cd02248 80 LASESDYPYTGKDGTCKYNSSKVGAKITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCC--S--N 155 (210)
T ss_pred cCccccCCccCCCCCccCCCCcccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCC--C--C
Confidence 0 6999999999999866 3 4
Q ss_pred CCCCeEEEEEEEecCCC
Q psy14862 245 YSLNHAVLNVGYDNEST 261 (263)
Q Consensus 245 ~~~nHaVliVGYG~~~g 261 (263)
...+|||+|||||.++|
T Consensus 156 ~~~~Hav~iVGy~~~~~ 172 (210)
T cd02248 156 TNLNHAVLLVGYGTENG 172 (210)
T ss_pred CcCCEEEEEEEEeecCC
Confidence 56799999999998764
No 10
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.90 E-value=2.8e-24 Score=190.41 Aligned_cols=52 Identities=38% Similarity=0.603 Sum_probs=48.2
Q ss_pred CcccccccC--CCC--CCcccCCCCCchhHHHHhhhHHHHHHHHHcC--cceecCCCCc
Q psy14862 169 AESINYKDK--GKV--LPKVQDQHLCGSCWAHSAVACLESAYAIKHN--ELIELSKQPP 221 (263)
Q Consensus 169 P~s~DWR~~--g~V--tp~VknQg~CGSCWAFat~~~iEs~~~I~tg--~~~~LSeQqL 221 (263)
|++||||++ +++ +| |+|||.||||||||++++||++++|+++ +.+.||+|||
T Consensus 1 p~~~DwR~~~~~~~~v~~-v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~l 58 (236)
T cd02620 1 PESFDAREKWPNCISIGE-IRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDL 58 (236)
T ss_pred CCcccchhhCCCCCCccc-cCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHH
Confidence 899999997 554 59 9999999999999999999999999998 7889999999
No 11
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.86 E-value=7.1e-23 Score=176.59 Aligned_cols=89 Identities=46% Similarity=0.747 Sum_probs=77.2
Q ss_pred CCcccccccC-CCCCCcccCCCCCchhHHHHhhhHHHHHHHHHc-CcceecCCCCc------------------------
Q psy14862 168 LAESINYKDK-GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKH-NELIELSKQPP------------------------ 221 (263)
Q Consensus 168 lP~s~DWR~~-g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~t-g~~~~LSeQqL------------------------ 221 (263)
||++||||+. +.++| |+|||.||+|||||+++++|++++++. +..+.||+|+|
T Consensus 1 lP~~~D~r~~~~~~~~-v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~~ 79 (219)
T PF00112_consen 1 LPKSFDWRDKGGRITP-VRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYIK 79 (219)
T ss_dssp STSSEEGGGTTTCSG----BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHHH
T ss_pred CCCCEecccCCCCcCc-cccCCcccccccchhccceeccccccccccccccccccccccccccccccccCcccccceeec
Confidence 7999999998 48999 999999999999999999999999999 78999999999
Q ss_pred -----------------------------------------------c----------------c-cCCCCCCccccCCC
Q psy14862 222 -----------------------------------------------K----------------T-HGRFYKGGVMNLPH 237 (263)
Q Consensus 222 -----------------------------------------------K----------------~-~f~~Y~~GV~~~~~ 237 (263)
| . +|+.|++|||..+.
T Consensus 80 ~~~Gi~~e~~~pY~~~~~~~c~~~~~~~~~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~ 159 (219)
T PF00112_consen 80 NNNGIVTEEDYPYNGNENPTCKSKKSNSYYVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPD 159 (219)
T ss_dssp HHTSBEBTTTS--SSSSSCSSCHSGGGEEEBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTS
T ss_pred ccCcccccccccccccccccccccccccccccccccccccccchhHHHHHHhhCceeeeeeeccccccccccceeeeccc
Confidence 0 5 69999999999997
Q ss_pred CcCCCCCCCCCeEEEEEEEecCCC
Q psy14862 238 MLCSKGPYSLNHAVLNVGYDNEST 261 (263)
Q Consensus 238 c~~~~~~~~~nHaVliVGYG~~~g 261 (263)
| . ....+|||+|||||.+.|
T Consensus 160 ~--~--~~~~~Hav~iVGy~~~~~ 179 (219)
T PF00112_consen 160 C--S--NESGGHAVLIVGYDDENG 179 (219)
T ss_dssp S--S--SSSEEEEEEEEEEEEETT
T ss_pred c--c--cccccccccccccccccc
Confidence 7 4 467899999999998854
No 12
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.86 E-value=2.5e-22 Score=195.64 Aligned_cols=56 Identities=16% Similarity=0.196 Sum_probs=51.7
Q ss_pred CCCCCcccccccCC---CCCCcccCCCC---CchhHHHHhhhHHHHHHHHHcC------cceecCCCCc
Q psy14862 165 SYGLAESINYKDKG---KVLPKVQDQHL---CGSCWAHSAVACLESAYAIKHN------ELIELSKQPP 221 (263)
Q Consensus 165 ~~~lP~s~DWR~~g---~Vtp~VknQg~---CGSCWAFat~~~iEs~~~I~tg------~~~~LSeQqL 221 (263)
..++|++||||++| +|+| |||||. ||||||||++++||++++|+++ +.+.||+|||
T Consensus 202 ~~~LP~sfDWR~~gg~~~Vtp-VrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqL 269 (548)
T PTZ00364 202 GDPPPAAWSWGDVGGASFLPA-APPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHV 269 (548)
T ss_pred ccCCCCccccCcCCCCccCCC-CcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHH
Confidence 35799999999997 7999 999999 9999999999999999999984 4688999999
No 13
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.85 E-value=1.1e-21 Score=193.75 Aligned_cols=56 Identities=25% Similarity=0.351 Sum_probs=49.2
Q ss_pred CCCCCcccccccC----CCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCc-----c-----eecCCCCc
Q psy14862 165 SYGLAESINYKDK----GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE-----L-----IELSKQPP 221 (263)
Q Consensus 165 ~~~lP~s~DWR~~----g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~-----~-----~~LSeQqL 221 (263)
..+||++||||+. ++|+| |+|||.||||||||++++||++++|++++ . ..||+|||
T Consensus 378 ~~~LP~sfDWRd~~~~~~~vtp-VkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqL 447 (693)
T PTZ00049 378 IDELPKNFTWGDPFNNNTREYD-VTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTV 447 (693)
T ss_pred cccCCCCEecCcCCCCCCcccC-CCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHh
Confidence 3579999999985 68999 99999999999999999999999999743 1 26899998
No 14
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.78 E-value=1.4e-19 Score=156.27 Aligned_cols=49 Identities=41% Similarity=0.639 Sum_probs=47.0
Q ss_pred ccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcC--cceecCCCCc
Q psy14862 171 SINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN--ELIELSKQPP 221 (263)
Q Consensus 171 s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg--~~~~LSeQqL 221 (263)
++|||+.+ ++| |+|||.||+|||||+++++|++++++++ +.+.||+|+|
T Consensus 1 ~~d~r~~~-~~~-v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l 51 (223)
T cd02619 1 SVDLRPLR-LTP-VKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYL 51 (223)
T ss_pred CCcchhcC-CCC-cccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHH
Confidence 48999998 999 9999999999999999999999999998 8899999999
No 15
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.74 E-value=5.6e-18 Score=118.54 Aligned_cols=58 Identities=40% Similarity=0.734 Sum_probs=51.8
Q ss_pred HHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHH
Q psy14862 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136 (263)
Q Consensus 79 F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf 136 (263)
|++|+++|+|.|.+.+|+.+|+.+|++|++.|++||+.+..+|++|+|+|||||.+||
T Consensus 1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf 58 (58)
T PF08246_consen 1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF 58 (58)
T ss_dssp HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence 8899999999999999999999999999999999997666899999999999999998
No 16
>KOG1544|consensus
Probab=99.65 E-value=2.3e-17 Score=149.16 Aligned_cols=110 Identities=18% Similarity=0.055 Sum_probs=81.4
Q ss_pred HHHHHhccCCCCceEEEe-ccCCCCChHHHhcccCCCchhhhhhccccCcccccCCCCCCCCCcccccccC--CCCCCcc
Q psy14862 108 KTIDYYTKHEQGTATYGV-NRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK--GKVLPKV 184 (263)
Q Consensus 108 ~~I~~~N~~~~~s~~~gi-N~FsDlT~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~s~DWR~~--g~Vtp~V 184 (263)
.+|++.|+.. .+|+++- .+|-.||.+.=.+..+|..++... ......+........+||+.||.|++ +.+.| |
T Consensus 151 d~iE~in~G~-YgW~A~NYSaFWGmtL~DGiKyRLGTL~Ps~s--v~nMNEi~~~l~p~~~LPE~F~As~KWp~liH~-p 226 (470)
T KOG1544|consen 151 DMIEAINQGN-YGWQAGNYSAFWGMTLDDGIKYRLGTLRPSSS--VMNMNEIYTVLNPGEVLPEAFEASEKWPNLIHE-P 226 (470)
T ss_pred HHHHHHhcCC-ccccccchhhhhcccccccceeeecccCchhh--hhhHHhHhhccCcccccchhhhhhhcCCccccC-c
Confidence 3678888664 6777754 489999998877766676654331 11111111111223679999999998 88999 9
Q ss_pred cCCCCCchhHHHHhhhHHHHHHHHHcCc--ceecCCCCc
Q psy14862 185 QDQHLCGSCWAHSAVACLESAYAIKHNE--LIELSKQPP 221 (263)
Q Consensus 185 knQg~CGSCWAFat~~~iEs~~~I~tg~--~~~LSeQqL 221 (263)
-|||+||++|||||+++..++++|.+.. ...||+|||
T Consensus 227 lDQgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnL 265 (470)
T KOG1544|consen 227 LDQGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNL 265 (470)
T ss_pred cccCCcccceeeeeehhccceeEEeeccccccccChHHh
Confidence 9999999999999999999999998743 467999999
No 17
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.63 E-value=1.8e-16 Score=161.32 Aligned_cols=41 Identities=22% Similarity=0.289 Sum_probs=39.5
Q ss_pred CCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc
Q psy14862 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP 221 (263)
Q Consensus 180 Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL 221 (263)
..| |+|||.||||||||+++++|++++|++++.+.||+|||
T Consensus 544 ~i~-VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqL 584 (1004)
T PTZ00462 544 KIQ-IEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYI 584 (1004)
T ss_pred CCC-cccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHH
Confidence 578 99999999999999999999999999999999999999
No 18
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=99.60 E-value=1.2e-15 Score=105.82 Aligned_cols=57 Identities=40% Similarity=0.738 Sum_probs=54.0
Q ss_pred HHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHH
Q psy14862 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSE 135 (263)
Q Consensus 79 F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eE 135 (263)
|++|+.+|+|.|.+.+|...|+.+|++|++.|+.||+.+..+|++|+|+|||||++|
T Consensus 1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE 57 (57)
T smart00848 1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE 57 (57)
T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence 688999999999999999999999999999999999876689999999999999986
No 19
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.18 E-value=6.8e-12 Score=115.78 Aligned_cols=41 Identities=41% Similarity=0.664 Sum_probs=38.1
Q ss_pred CCCCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHH
Q psy14862 166 YGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA 207 (263)
Q Consensus 166 ~~lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~ 207 (263)
..+|+.||||+.|-|+| ||+||.||+||||++++++|+...
T Consensus 97 ~s~~~~fd~r~~g~vs~-v~dQg~~Gscwaf~t~~sles~l~ 137 (372)
T COG4870 97 ASLPSYFDRRDEGKVSP-VKDQGSGGSCWAFATTRSLESYLN 137 (372)
T ss_pred ccchhheeeeccCCccc-ccccCcccceEeeeehhhhhheec
Confidence 34899999999999999 999999999999999999998654
No 20
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=98.06 E-value=8.3e-06 Score=78.56 Aligned_cols=40 Identities=28% Similarity=0.490 Sum_probs=35.0
Q ss_pred CCcccCCCCCchhHHHHhhhHHHHHHHHH-cCcceecCCCCc
Q psy14862 181 LPKVQDQHLCGSCWAHSAVACLESAYAIK-HNELIELSKQPP 221 (263)
Q Consensus 181 tp~VknQg~CGSCWAFat~~~iEs~~~I~-tg~~~~LSeQqL 221 (263)
.| |+||+.-|-||.||+..+++.....+ +.+.+.||+--|
T Consensus 55 ~~-vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~~~felSq~Yl 95 (437)
T cd00585 55 EP-VTNQKSSGRCWLFAALNVLRHQFMKKLNLKEFEFSQSYL 95 (437)
T ss_pred CC-cccCCCCchhHHHHCHHHHHHHHHHHcCCCCEEeCcHHH
Confidence 38 99999999999999999999988875 456788998877
No 21
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=95.55 E-value=0.012 Score=38.13 Aligned_cols=31 Identities=26% Similarity=0.288 Sum_probs=21.6
Q ss_pred HHHHHhccCCCCceEEEeccCCCCChHHHhccc
Q psy14862 108 KTIDYYTKHEQGTATYGVNRFADMTDSEFNHGL 140 (263)
Q Consensus 108 ~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~ 140 (263)
++|+..|+. +.+|++|.| |.+.|.+.+++++
T Consensus 4 e~I~~IN~~-~~tWkAG~N-F~~~~~~~ik~Ll 34 (41)
T PF08127_consen 4 EFIDYINSK-NTTWKAGRN-FENTSIEYIKRLL 34 (41)
T ss_dssp HHHHHHHHC-T-SEEE-----SSB-HHHHHHCS
T ss_pred HHHHHHHcC-CCcccCCCC-CCCCCHHHHHHHc
Confidence 578999998 589999999 8999999998864
No 22
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=93.01 E-value=0.07 Score=51.75 Aligned_cols=39 Identities=33% Similarity=0.521 Sum_probs=28.0
Q ss_pred CcccCCCCCchhHHHHhhhHHHHHHHHHcC-cceecCCCCc
Q psy14862 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHN-ELIELSKQPP 221 (263)
Q Consensus 182 p~VknQg~CGSCWAFat~~~iEs~~~I~tg-~~~~LSeQqL 221 (263)
| |+||+.-|-||.||+..+++..+..+.+ +...||+--|
T Consensus 57 ~-vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~~felSq~Yl 96 (438)
T PF03051_consen 57 P-VTNQKSSGRCWLFAALNVLRHEIMKKLNLKDFELSQNYL 96 (438)
T ss_dssp S---B--BSSTHHHHHHHHHHHHHHHHHCT-SS--B-HHHH
T ss_pred C-CCCCCCCCCcchhhchHHHHHHHHHHcCCCceEeechHH
Confidence 8 9999999999999999999999888876 6677777665
No 23
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=70.20 E-value=3.9 Score=34.81 Aligned_cols=36 Identities=17% Similarity=0.083 Sum_probs=21.5
Q ss_pred CCCCCchhHHHHhhhHHHHHHHHH--------cCcceecCCCCc
Q psy14862 186 DQHLCGSCWAHSAVACLESAYAIK--------HNELIELSKQPP 221 (263)
Q Consensus 186 nQg~CGSCWAFat~~~iEs~~~I~--------tg~~~~LSeQqL 221 (263)
.||.=+=|-+|+.++.|-...-.+ ..--..+|+|||
T Consensus 18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~~yPn~s~~~l 61 (175)
T PF05543_consen 18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRYLYPNVSEEQL 61 (175)
T ss_dssp --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHHHSTTS-CCCH
T ss_pred ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHHHCCCCCHHHH
Confidence 468778899999999887653211 111356899999
No 24
>cd00042 CY Substituted updates: Jan 30, 2002
Probab=64.55 E-value=9 Score=28.93 Aligned_cols=22 Identities=18% Similarity=0.351 Sum_probs=17.1
Q ss_pred hccceeEeeecccccccchhhhh
Q psy14862 28 ASGCVHTYLGWHPRWTGRVHNLI 50 (263)
Q Consensus 28 ~~~~~~~~~~~~~~w~~~~~~l~ 50 (263)
...-.|.++||..||++. ..++
T Consensus 80 ~~~~~C~~~V~~~pw~~~-~~l~ 101 (105)
T cd00042 80 GKKKFCTAKVWEKPWENF-KELL 101 (105)
T ss_pred CCCEEEEEEEEecCCCCc-eeee
Confidence 456789999999999977 4444
No 25
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=63.66 E-value=6.1 Score=37.25 Aligned_cols=22 Identities=27% Similarity=0.679 Sum_probs=19.4
Q ss_pred CcccCCCCCchhHHHHhhhHHHH
Q psy14862 182 PKVQDQHLCGSCWAHSAVACLES 204 (263)
Q Consensus 182 p~VknQg~CGSCWAFat~~~iEs 204 (263)
| |-||..-|-||.||+...+--
T Consensus 59 ~-vtNQk~SGRCWmFAAlNtfRh 80 (444)
T COG3579 59 K-VTNQKQSGRCWMFAALNTFRH 80 (444)
T ss_pred c-cccccccceehHHHHHHHHHH
Confidence 7 999999999999999887643
No 26
>smart00043 CY Cystatin-like domain. Cystatins are a family of cysteine protease inhibitors that occur mainly as single domain proteins. However some extracellular proteins such as kininogen, His-rich glycoprotein and fetuin also contain these domains.
Probab=44.68 E-value=12 Score=28.27 Aligned_cols=15 Identities=20% Similarity=0.603 Sum_probs=13.2
Q ss_pred eeEeeecccccccch
Q psy14862 32 VHTYLGWHPRWTGRV 46 (263)
Q Consensus 32 ~~~~~~~~~~w~~~~ 46 (263)
.|.+.||..||+...
T Consensus 85 ~C~~~V~~~pw~~~~ 99 (107)
T smart00043 85 FCTAKVWEKPWENKI 99 (107)
T ss_pred EEEEEEEecCCCCcc
Confidence 799999999999773
No 27
>PHA02088 hypothetical protein
Probab=29.02 E-value=22 Score=27.56 Aligned_cols=43 Identities=9% Similarity=0.288 Sum_probs=30.3
Q ss_pred eechhhhccceeEeeecccccccchhhhhhhccCCCCCCccccccccchhHhHHHHHHHHHHHHhCCcc
Q psy14862 22 QYEAEWASGCVHTYLGWHPRWTGRVHNLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQY 90 (263)
Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~w~~~~~~l~~~~~~~~~~~~~~~~~~~l~~~~~~~~~F~~f~~~y~K~Y 90 (263)
..-...+.-..|-|+.||..|..+-. .-...-..|.++|+-+.
T Consensus 6 fi~~~~~~liac~~s~wd~~w~~k~~--------------------------~e~~~~n~wl~ky~~sm 48 (125)
T PHA02088 6 FIAGLVLGLIACCFSTWDSRWRKKAR--------------------------NEQAKINTWLAKYGISM 48 (125)
T ss_pred EeHHHHHHHHHhhhHHhhHHHHHHHH--------------------------HHHHHHHHHHHHhceEE
Confidence 34445666678999999999998831 12345678999998543
No 28
>KOG2735|consensus
Probab=26.59 E-value=38 Score=32.60 Aligned_cols=27 Identities=26% Similarity=0.456 Sum_probs=22.4
Q ss_pred ccCCCCCchhHHHHhhhHHHHHHHHHcCc
Q psy14862 184 VQDQHLCGSCWAHSAVACLESAYAIKHNE 212 (263)
Q Consensus 184 VknQg~CGSCWAFat~~~iEs~~~I~tg~ 212 (263)
+|--| --||.|.++.++|...|||.|.
T Consensus 368 ~KrvG--~qcWv~~aI~~~El~IciKfg~ 394 (466)
T KOG2735|consen 368 CKRVG--MQCWVFLAICALELLICIKFGS 394 (466)
T ss_pred hhhcc--hhHHHHHHHHHHHhhhheeeCC
Confidence 44444 2699999999999999999986
No 29
>PF00031 Cystatin: Cystatin domain; InterPro: IPR000010 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The cystatins are cysteine proteinase inhibitors belonging to MEROPS inhibitor family I25, clan IH [, , ]. They mainly inhibit peptidases belonging to peptidase families C1 (papain family) and C13 (legumain family). The cystatin family includes: The Type 1 cystatins, which are intracellular cystatins that are present in the cytosol of many cell types, but can also appear in body fluids at significant concentrations. They are single-chain polypeptides of about 100 residues, which have neither disulphide bonds nor carbohydrate side chains. The Type 2 cystatins, which are mainly extracellular secreted polypeptides synthesised with a 19-28 residue signal peptide. They are broadly distributed and found in most body fluids. The Type 3 cystatins, which are multidomain proteins. The mammalian representatives of this group are the kininogens. There are three different kininogens in mammals: H- (high molecular mass, IPR002395 from INTERPRO) and L- (low molecular mass) kininogen which are found in a number of species, and T-kininogen that is found only in rat. Unclassified cystatins. These are cystatin-like proteins found in a range of organisms: plant phytocystatins, fetuin in mammals, insect cystatins and a puff adder venom cystatin which inhibits metalloproteases of the MEROPS peptidase family M12 (astacin/adamalysin). Also a number of the cystatins-like proteins have been shown to be devoid of inhibitory activity. All true cystatins inhibit cysteine peptidases of the papain family (MEROPS peptidase family C1), and some also inhibit legumain family enzymes (MEROPS peptidase family C13). These peptidases play key roles in physiological processes, such as intracellular protein degradation (cathepsins B, H and L), are pivotal in the remodelling of bone (cathepsin K), and may be important in the control of antigen presentation (cathepsin S, mammalian legumain). Moreover, the activities of such peptidases are increased in pathophysiological conditions, such as cancer metastasis and inflammation. Additionally, such peptidases are essential for several pathogenic parasites and bacteria. Thus in animals cystatins not only have capacity to regulate normal body processes and perhaps cause disease when down-regulated, but in other organisms may also participate in defence against biotic and abiotic stress. ; GO: 0004869 cysteine-type endopeptidase inhibitor activity; PDB: 3L0R_B 2W9P_K 2W9Q_A 3S67_A 3QRD_D 1R4C_G 3GAX_A 1TIJ_B 1G96_A 3NX0_A ....
Probab=25.52 E-value=37 Score=24.83 Aligned_cols=17 Identities=18% Similarity=0.401 Sum_probs=13.9
Q ss_pred hhhccceeEeeeccccc
Q psy14862 26 EWASGCVHTYLGWHPRW 42 (263)
Q Consensus 26 ~~~~~~~~~~~~~~~~w 42 (263)
.....-.|.+.||.+||
T Consensus 78 ~~~~~~~C~~~v~~~pW 94 (94)
T PF00031_consen 78 EQPWTKFCKFTVWERPW 94 (94)
T ss_dssp TTSSEEEEEEEEEEECG
T ss_pred cCCceeeEEEEEEECCC
Confidence 35566789999999999
No 30
>KOG3554|consensus
Probab=24.44 E-value=2.5e+02 Score=27.88 Aligned_cols=19 Identities=16% Similarity=0.522 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHhCCccCCh
Q psy14862 75 HGNQFKDFVREYERQYDSD 93 (263)
Q Consensus 75 ~~~~F~~f~~~y~K~Y~s~ 93 (263)
-..+|++=+.+|||.+.+.
T Consensus 292 EanLFEeALeKyGKDFndI 310 (693)
T KOG3554|consen 292 EANLFEEALEKYGKDFNDI 310 (693)
T ss_pred hhHHHHHHHHHhcccHHHH
Confidence 4578999999999998664
No 31
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=24.27 E-value=56 Score=24.92 Aligned_cols=14 Identities=36% Similarity=0.622 Sum_probs=11.1
Q ss_pred CCCeEEEEEEEecC
Q psy14862 246 SLNHAVLNVGYDNE 259 (263)
Q Consensus 246 ~~nHaVliVGYG~~ 259 (263)
...|.|+|+||...
T Consensus 122 ~~~H~vvi~Gy~~~ 135 (144)
T PF13529_consen 122 YGGHYVVIIGYDED 135 (144)
T ss_dssp TTEEEEEEEEE-SS
T ss_pred cCCEEEEEEEEeCC
Confidence 46999999999874
No 32
>KOG4128|consensus
Probab=24.24 E-value=24 Score=33.33 Aligned_cols=40 Identities=30% Similarity=0.451 Sum_probs=26.7
Q ss_pred CCcccCCCCCchhHHHHhhhHHHHHHHHHcC-cceecCCCCc
Q psy14862 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHN-ELIELSKQPP 221 (263)
Q Consensus 181 tp~VknQg~CGSCWAFat~~~iEs~~~I~tg-~~~~LSeQqL 221 (263)
+| |-||.+-|-||.|+.+..+---+..+-+ +-..||...|
T Consensus 63 ~p-vtnqkssGrcWift~ln~lrl~~~~kLnl~eFElSqayL 103 (457)
T KOG4128|consen 63 QP-VTNQKSSGRCWIFTGLNLLRLEMDRKLNLPEFELSQAYL 103 (457)
T ss_pred cc-cccCcCCCceEEEechhHHHHHHHhcCCcchhhhhhHHH
Confidence 58 9999999999999998765432222222 2345666655
No 33
>KOG4702|consensus
Probab=20.78 E-value=2e+02 Score=20.86 Aligned_cols=32 Identities=31% Similarity=0.619 Sum_probs=22.9
Q ss_pred HHHHHHHHHhCCccCChHHHHHHHHHHHHHHHH
Q psy14862 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKT 109 (263)
Q Consensus 77 ~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~ 109 (263)
..|++|+..|.+.-..+ |...|.+-|.+-++.
T Consensus 29 e~Fee~v~~~krel~pp-e~~~~~EE~~~~lRe 60 (77)
T KOG4702|consen 29 EIFEEFVRGYKRELSPP-EATKRKEEYENFLRE 60 (77)
T ss_pred HHHHHHHHhccccCCCh-HHHhhHHHHHHHHHH
Confidence 47999999999876554 666676666655443
Done!