Query 044448
Match_columns 308
No_of_seqs 254 out of 1578
Neff 8.0
Searched_HMMs 46136
Date Fri Mar 29 03:23:59 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/044448.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/044448hhsearch_cdd -cpu 12 -v 0
No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
1  KOG1542 Cysteine proteinase Ca 100.0 1.5E-82 3.2E-87  569.2  25.7  284     9-307    65-369 (372)
2  PTZ00203 cathepsin L protease; 100.0 8.7E-76 1.9E-80  547.1  31.7  285     8-306    31-337 (348)
3  PTZ00021 falcipain-2; Provisio 100.0   4E-75 8.7E-80  558.0  30.5  292     8-308   162-487 (489)
4  PTZ00200 cysteine proteinase;  100.0 6.3E-75 1.4E-79  554.8  31.5  292     6-308   117-444 (448)
5  KOG1543 Cysteine proteinase Ca 100.0 7.9E-67 1.7E-71  483.5  27.9  270    19-308    30-323 (325)
6  cd02698 Peptidase_C1A_Cathepsi 100.0 1.3E-58 2.8E-63  413.7  22.4  207    92-307     1-236 (239)
7  cd02621 Peptidase_C1A_Cathepsi 100.0 4.2E-58   9E-63  411.5  22.4  207    92-306     1-239 (243)
8  cd02248 Peptidase_C1A Peptidas 100.0 1.5E-57 3.3E-62  398.5  22.8  203    93-307     1-210 (210)
9  cd02620 Peptidase_C1A_Cathepsi 100.0 2.8E-57 6.1E-62  404.3  20.7  201    93-305     1-234 (236)
10 PF00112 Peptidase_C1: Papain   100.0 1.6E-55 3.5E-60  386.9  14.9  208    92-308     1-219 (219)
11 PTZ00049 cathepsin C-like prot 100.0   2E-54 4.4E-59  423.7  22.9  209    90-307   379-674 (693)
12 PTZ00364 dipeptidyl-peptidase  100.0 1.3E-53 2.8E-58  413.9  22.3  203    91-305   204-455 (548)
13 smart00645 Pept_C1 Papain fami 100.0 4.5E-49 9.7E-54  335.8  17.6  165    92-304     1-170 (174)
14 cd02619 Peptidase_C1 C1 Peptid 100.0 1.8E-46 3.8E-51  330.2  20.5  191    95-292     1-213 (223)
15 PTZ00462 Serine-repeat antigen 100.0 3.8E-45 8.1E-50  367.7  21.2  200   104-308   544-780 (1004)
16 KOG1544 Predicted cysteine pro 100.0 6.9E-42 1.5E-46  303.5   5.8  247    50-305   170-456 (470)
17 COG4870 Cysteine protease [Pos 100.0 8.9E-30 1.9E-34  231.5   7.3  194    91-292    98-314 (372)
18 cd00585 Peptidase_C1B Peptidas  99.9 1.1E-24 2.4E-29  207.7  15.3  180   105-291    55-399 (437)
19 PF03051 Peptidase_C1_2: Pepti   99.7 6.9E-17 1.5E-21  154.5  16.6  179   105-290    56-399 (438)
20 PF08246 Inhibitor_I29: Cathep   99.4   4E-13 8.8E-18   93.4   5.8   45     15-59      1-58 (58)
21 smart00848 Inhibitor_I29 Cathe  99.1 6.7E-11 1.4E-15   81.7   3.8   44     15-58      1-57 (57)
22 COG3579 PepC Aminopeptidase C   99.0 5.1E-09 1.1E-13   95.0  10.8   78   210-289   296-400 (444)
23 KOG4128 Bleomycin hydrolases a  97.9 1.2E-05 2.7E-10   73.2   4.6   73   105-178    63-168 (457)
24 PF13529 Peptidase_C39_2: Pept   96.9   0.011 2.5E-07   47.2   9.8   56   209-276    87-144 (144)
25 PF05543 Peptidase_C47: Stapho   94.0    0.34 7.3E-06   40.9   7.9  118   109-277    18-145 (175)
26 PF14399 Transpep_BrtH: NlpC/p   91.2    0.57 1.2E-05   43.2   6.5   55   211-274    78-133 (317)
27 COG4990 Uncharacterized protei  85.0     2.6 5.6E-05   35.8   5.7   52   204-277   116-168 (195)
28 PF09778 Guanylate_cyc_2: Guan   83.1       5 0.00011   35.1   7.0   51   210-260   112-172 (212)
29 cd02549 Peptidase_C39A A sub-f  60.9      20 0.00044   28.2   5.1   33   214-258    70-103 (141)
30 PF12385 Peptidase_C70: Papain   42.7      62  0.0013   27.1   5.1   38   210-260    97-135 (166)
31 PF11567 PfUIS3: Plasmodium fa   34.3      10 0.00022   28.2  -0.6   31     31-61     19-49 (101)
32 PF05391 Lsm_interact: Lsm int   32.4      34 0.00074   18.4   1.4   12     53-64      9-20 (21)
33 KOG4702 Uncharacterized conser  26.3 1.7E+02  0.0036   20.9   4.3   33     12-45     28-60 (77)
34 PF01640 Peptidase_C10: Peptid   25.2 2.7E+02  0.0059   23.7   6.6   49   212-287   141-192 (192)
35 KOG4621 Uncharacterized conser  25.2 1.7E+02  0.0037   23.6   4.8   51   210-260    58-123 (167)
36 PF08664 YcbB: YcbB domain; I    23.1 1.5E+02  0.0033   24.0   4.3   56      9-65    40-104 (134)
37 cd00044 CysPc Calpains, domain  21.0 1.1E+02  0.0024   28.2   3.6   40   248-289   235-300 (315)
38 PF07351 DUF1480: Protein of u   20.3 1.2E+02  0.0026   22.0   2.7   23   239-261     28-50 (80)
No 1
>KOG1542 consensus Cysteine proteinase Cathepsin F [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=1.5e-82 Score=569.19 Aligned_cols=284 Identities=36% Similarity=0.669 Sum_probs=250.3
Q ss_pred hHHHHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHHHHhhhcCCCCCCCCCCC
Q 044448 9 GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPPPTDHPH 75 (308)
Q Consensus 9 ~~~~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eEf~~~~~~~~~~~~~~~~ 75 (308)
..+.+.|..|+.+|+|+|.+.+|...|+.||++|+.. +|+|+|||||+|||++++++.+......+.
T Consensus 65 l~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~~~~ 144 (372)
T KOG1542|consen 65 LGLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGSKLPG 144 (372)
T ss_pred cchHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhccccccccCcc
Confidence 4568899999999999999999999999999999987 899999999999999999876653111110
Q ss_pred CCCCCccccCCCCCCCCCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC-CCCCC
Q 044448 76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCA 153 (308)
Q Consensus 76 ~~~~~~~~~~~~~~~~lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~-~~gC~ 153 (308)
.. .. .. ......||++||||++|.||||||||+ |||||||+|+++|+++.|++|++++||||||+||+. ++||+
T Consensus 145 ~~-~~-~~--~~~~~~lP~~fDWR~kgaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d~gC~ 220 (372)
T KOG1542|consen 145 DA-AE-AP--IEPGESLPESFDWRDKGAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCDNGCN 220 (372)
T ss_pred cc-cc-Cc--CCCCCCCCcccchhccCCccccccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcCCcCC
Confidence 00 00 11 112236999999999999999999999 999999999999999999999999999999999999 99999
Q ss_pred CCcHHHHHHHHHHcCCCCCCCCcCCCCCCCC-CCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEEEecCc
Q 044448 154 KNFLENAFEYIRQYQRLASECVYPYQGRQDY-YCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDATW 231 (308)
Q Consensus 154 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~-~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~i~~~~ 231 (308)
||.+.+|++|+++.+|+..|.+|||++ ..+ .|. .+ .....+.|++|..++ .||++|.+.|. +|||+|+|++..
T Consensus 221 GGl~~nA~~~~~~~gGL~~E~dYPY~g-~~~~~C~--~~-~~~~~v~I~~f~~l~-~nE~~ia~wLv~~GPi~vgiNa~~ 295 (372)
T KOG1542|consen 221 GGLMDNAFKYIKKAGGLEKEKDYPYTG-KKGNQCH--FD-KSKIVVSIKDFSMLS-NNEDQIAAWLVTFGPLSVGINAKP 295 (372)
T ss_pred CCChhHHHHHHHHhCCccccccCCccc-cCCCccc--cc-hhhceEEEeccEecC-CCHHHHHHHHHhcCCeEEEEchHH
Confidence 999999999988888999999999999 777 999 66 567889999999998 69999999888 699999999779
Q ss_pred ccccCCceEeC--C-CCCC-CCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccccc
Q 044448 232 FNFYHGGVFTG--P-CGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307 (308)
Q Consensus 232 f~~y~~Giy~~--~-c~~~-~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~yp 307 (308)
+|+|++||+.+ . |+.. ++|||+|||||.+. - .++|||||||||++|||+||+|+.||. |.|||+++++-+
T Consensus 296 mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g--~-~~PYWIVKNSWG~~WGE~GY~~l~RG~---N~CGi~~mvss~ 369 (372)
T KOG1542|consen 296 MQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSG--Y-EKPYWIVKNSWGTSWGEKGYYKLCRGS---NACGIADMVSSA 369 (372)
T ss_pred HHHhcccccCCCcccCCccccCceEEEEeecCCC--C-CCceEEEECCccccccccceEEEeccc---cccccccchhhh
Confidence 99999999988 3 9875 99999999999973 2 589999999999999999999999997 789999998754
No 2
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=8.7e-76 Score=547.06 Aligned_cols=285 Identities=34% Similarity=0.635 Sum_probs=238.4
Q ss_pred hhHHHHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH------------hcCCCCCCCCHHHHHhhhcC-CCCCCCCCC
Q 044448 8 TGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG-YKPPPTDHP 74 (308)
Q Consensus 8 ~~~~~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~------------~g~N~fsDlt~eEf~~~~~~-~~~~~~~~~ 74 (308)
+..+..+|++||++|+|.|.+.+|+.+|++||++|+++ ||+|+|+|||+|||.+++++ .........
T Consensus 31 ~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~~~~~ 110 (348)
T PTZ00203 31 GTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAARYLNGAAYFAAAKQ 110 (348)
T ss_pred ccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccCCCeEEeccccccCCHHHHHHHhcCCCcccccccc
Confidence 46788899999999999999988999999999999998 89999999999999987763 221110100
Q ss_pred CCCCCCccccCCCCCCCCCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC-CCCC
Q 044448 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGC 152 (308)
Q Consensus 75 ~~~~~~~~~~~~~~~~~lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~-~~gC 152 (308)
... .. +........+||++||||++|+|+||||||. |||||||++++||++++|++++++.||+|+|+||+. +.||
T Consensus 111 ~~~-~~-~~~~~~~~~~lP~~~DWR~~g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqLvdC~~~~~GC 188 (348)
T PTZ00203 111 HAG-QH-YRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGC 188 (348)
T ss_pred ccc-cc-ccccccccccCCCCCcCCcCCCCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHHHhccCCCCCC
Confidence 000 00 1111111125899999999999999999999 999999999999999999999999999999999998 7899
Q ss_pred CCCcHHHHHHHHHHc--CCCCCCCCcCCCCCCCC---CCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEE
Q 044448 153 AKNFLENAFEYIRQY--QRLASECVYPYQGRQDY---YCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVA 226 (308)
Q Consensus 153 ~GG~~~~a~~~~~~~--~Gi~~e~~yPY~~~~~~---~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~ 226 (308)
+||++..||+|+.++ +|+++|++|||.+ .++ .|. ........+++.+|..++. ++++|+.+|+ +|||+|+
T Consensus 189 ~GG~~~~a~~yi~~~~~ggi~~e~~YPY~~-~~~~~~~C~--~~~~~~~~~~i~~~~~i~~-~e~~~~~~l~~~GPv~v~ 264 (348)
T PTZ00203 189 GGGLMLQAFEWVLRNMNGTVFTEKSYPYVS-GNGDVPECS--NSSELAPGARIDGYVSMES-SERVMAAWLAKNGPISIA 264 (348)
T ss_pred CCCCHHHHHHHHHHhcCCCCCccccCCCcc-CCCCCCcCC--CCcccccceEecceeecCc-CHHHHHHHHHhCCCEEEE
Confidence 999999999999764 5789999999998 655 687 4312234567889998874 7889999998 5999999
Q ss_pred EecCcccccCCceEeCCCCC-CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccc
Q 044448 227 IDATWFNFYHGGVFTGPCGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAA 305 (308)
Q Consensus 227 i~~~~f~~y~~Giy~~~c~~-~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~ 305 (308)
|++.+|++|++|||+. |.. .++|||+|||||.+ + |++|||||||||++|||+|||||+|+. |.|||++.++
T Consensus 265 i~a~~f~~Y~~GIy~~-c~~~~~nHaVliVGYG~~---~-g~~YWiikNSWG~~WGe~GY~ri~rg~---n~Cgi~~~~~ 336 (348)
T PTZ00203 265 VDASSFMSYHSGVLTS-CIGEQLNHGVLLVGYNMT---G-EVPYWVIKNSWGEDWGEKGYVRVTMGV---NACLLTGYPV 336 (348)
T ss_pred EEhhhhcCccCceeec-cCCCCCCeEEEEEEEecC---C-CceEEEEEcCCCCCcCcCceEEEEcCC---CcccccceEE
Confidence 9988999999999985 864 58999999999987 5 889999999999999999999999986 7899998775
Q ss_pred c
Q 044448 306 Y 306 (308)
Q Consensus 306 y 306 (308)
.
T Consensus 337 ~ 337 (348)
T PTZ00203 337 S 337 (348)
T ss_pred E
Confidence 4
No 3
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=4e-75 Score=558.02 Aligned_cols=292 Identities=33% Similarity=0.587 Sum_probs=244.7
Q ss_pred hhHHHHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHHHHhhhcCCCCCC--CC
Q 044448 8 TGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPPP--TD 72 (308)
Q Consensus 8 ~~~~~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eEf~~~~~~~~~~~--~~ 72 (308)
+.+....|++||++|+|+|.+.+|+.+|+.||++|+++ +|+|+|+|||.|||++++++..... ..
T Consensus 162 n~e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~~ 241 (489)
T PTZ00021 162 NLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKSN 241 (489)
T ss_pred ChHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhccccccccccc
Confidence 35566889999999999999998999999999999998 8999999999999999887654211 00
Q ss_pred -CCCCCCCCccc-----cCCCCCCCCCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhh
Q 044448 73 -HPHSNRSNWFK-----NLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVD 145 (308)
Q Consensus 73 -~~~~~~~~~~~-----~~~~~~~~lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~d 145 (308)
........ +. ..+.....+|++||||+.|.|+||||||. |||||||++++||++++|+++.++.||+|+|+|
T Consensus 242 ~~~~~~~~~-~~~~~~~~~~~~~~~~P~s~DWR~~g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLVD 320 (489)
T PTZ00021 242 GKKSPRVIN-YDDVIKKYKPKDATFDHAKYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVD 320 (489)
T ss_pred ccccccccc-ccccccccccccccCCccccccccCCCCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHhh
Confidence 00000000 00 00001112499999999999999999999 999999999999999999999999999999999
Q ss_pred cCC-CCCCCCCcHHHHHHHHHHcCCCCCCCCcCCCCCC-CCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHh-cCC
Q 044448 146 CST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ-DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQP 222 (308)
Q Consensus 146 c~~-~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~-~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gP 222 (308)
|+. +.||+||++..|+.|+.+++||++|++|||.+ . .+.|. .. .....++|.+|..++ +++|+++|+ .||
T Consensus 321 Cs~~n~GC~GG~~~~Af~yi~~~gGl~tE~~YPY~~-~~~~~C~--~~-~~~~~~~i~~y~~i~---~~~lk~al~~~GP 393 (489)
T PTZ00021 321 CSFKNNGCYGGLIPNAFEDMIELGGLCSEDDYPYVS-DTPELCN--ID-RCKEKYKIKSYVSIP---EDKFKEAIRFLGP 393 (489)
T ss_pred hccCCCCCCCcchHhhhhhhhhccccCcccccCccC-CCCCccc--cc-cccccceeeeEEEec---HHHHHHHHHhcCC
Confidence 998 88999999999999998877999999999998 6 47898 44 344568899999987 578999998 599
Q ss_pred eEEEEecC-cccccCCceEeCCCCCCCCeEEEEEEeCCcCC-------CCCCCCeEEEecCCCCCcCCCceEEEEeCCCC
Q 044448 223 VSVAIDAT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTE-------AEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294 (308)
Q Consensus 223 V~v~i~~~-~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~-------~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~ 294 (308)
|+|+|++. +|++|++|||+.+|+..++|||+|||||++.. .. +.+|||||||||++|||+|||||+|+.++
T Consensus 394 Vsv~i~a~~~f~~YkgGIy~~~C~~~~nHAVlIVGYG~e~~~~~~~~~~~-~~~YWIVKNSWGt~WGE~GY~rI~r~~~g 472 (489)
T PTZ00021 394 ISVSIAVSDDFAFYKGGIFDGECGEEPNHAVILVGYGMEEIYNSDTKKME-KRYYYIIKNSWGESWGEKGFIRIETDENG 472 (489)
T ss_pred eEEEEEeecccccCCCCcCCCCCCCccceEEEEEEecCcCCcccccccCC-CCCEEEEECCCCCCcccCeEEEEEcCCCC
Confidence 99999998 99999999999889888999999999997521 01 35799999999999999999999999754
Q ss_pred -CCCccccccccccC
Q 044448 295 -SGLCNIAANAAYPL 308 (308)
Q Consensus 295 -~~~Cgi~~~~~yp~ 308 (308)
.|+|||++.++||+
T Consensus 473 ~~n~CGI~t~a~yP~ 487 (489)
T PTZ00021 473 LMKTCSLGTEAYVPL 487 (489)
T ss_pred CCCCCCCcccceeEe
Confidence 47999999999995
No 4
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=6.3e-75 Score=554.77 Aligned_cols=292 Identities=32% Similarity=0.569 Sum_probs=243.6
Q ss_pred CChhHHHHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-----------hcCCCCCCCCHHHHHhhhcCCCCCCCCC-
Q 044448 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-----------LRLNKFADLTREKFLASYTGYKPPPTDH- 73 (308)
Q Consensus 6 ~~~~~~~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-----------~g~N~fsDlt~eEf~~~~~~~~~~~~~~- 73 (308)
....++..+|++|+++|+|.|.+.+|+.+|+.||++|+++ +|+|+|+|||+|||.+++++.+.+....
T Consensus 117 ~~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~ 196 (448)
T PTZ00200 117 KLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNS 196 (448)
T ss_pred cchHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCcCCeEEeccccccCCHHHHHHHhccCCCcccccc
Confidence 3346677899999999999999999999999999999998 8999999999999999877654321100
Q ss_pred --CCC-------CCCCcccc---------CCC-C-CCCCCceeecCCCCCCCccCCCC-C-CchHHHHHHHHHHHHHHHh
Q 044448 74 --PHS-------NRSNWFKN---------LNS-S-KMSFYDSIDWNERGAVTPVKDQG-S-YCCWAFTAVATVEGLNKIR 131 (308)
Q Consensus 74 --~~~-------~~~~~~~~---------~~~-~-~~~lP~~~Dwr~~g~v~pv~dQg-~-gsCwAfa~~~~~e~~~~i~ 131 (308)
... .... +.. ..+ . ...+|++||||+.|.|+|||||| . |||||||+++++|++++|+
T Consensus 197 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~ 275 (448)
T PTZ00200 197 TSHNNDFKARHVSNPT-YLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIY 275 (448)
T ss_pred cccccccccccccccc-cccccccccccccccccccccCCCCccCCCCCCCCCcccCCCccchHHHHhHHHHHHHHHHHh
Confidence 000 0000 100 000 0 01269999999999999999999 9 9999999999999999999
Q ss_pred cCCcccCCHHHHhhcCC-CCCCCCCcHHHHHHHHHHcCCCCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCCC
Q 044448 132 TGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210 (308)
Q Consensus 132 ~~~~~~lS~q~l~dc~~-~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~ 210 (308)
++..+.||+|+|+||+. +.||+||++..|++|++++ ||++|++|||.+ ..+.|. .. ....+.|.+|..++ .
T Consensus 276 ~~~~~~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~~~-Gi~~e~~YPY~~-~~~~C~--~~--~~~~~~i~~y~~~~--~ 347 (448)
T PTZ00200 276 RDKSVDLSEQELVNCDTKSQGCSGGYPDTALEYVKNK-GLSSSSDVPYLA-KDGKCV--VS--STKKVYIDSYLVAK--G 347 (448)
T ss_pred cCCCeecCHHHHhhccCccCCCCCCcHHHHHHHHhhc-CccccccCCCCC-CCCCCc--CC--CCCeeEecceEecC--H
Confidence 99999999999999998 8899999999999999877 999999999999 889998 54 23456788888765 4
Q ss_pred HHHHHHHHhcCCeEEEEecC-cccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEE
Q 044448 211 EEGLQDVVSRQPVSVAIDAT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289 (308)
Q Consensus 211 ~~~lk~~l~~gPV~v~i~~~-~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~ 289 (308)
.+.|+++|.+|||+|+|+++ +|+.|++|||+++|+..++|||+|||||.+.+ + |.+|||||||||++|||+|||||+
T Consensus 348 ~~~l~~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~C~~~~nHaV~lVGyG~d~~-~-g~~YWIIkNSWG~~WGe~GY~ri~ 425 (448)
T PTZ00200 348 KDVLNKSLVISPTVVYIAVSRELLKYKSGVYNGECGKSLNHAVLLVGEGYDEK-T-KKRYWIIKNSWGTDWGENGYMRLE 425 (448)
T ss_pred HHHHHHHHhcCCEEEEeecccccccCCCCccccccCCCCcEEEEEEEecccCC-C-CCceEEEEcCCCCCcccCeeEEEE
Confidence 56677777789999999998 99999999999889877999999999996421 5 789999999999999999999999
Q ss_pred eCCCCCCCccccccccccC
Q 044448 290 RGVGGSGLCNIAANAAYPL 308 (308)
Q Consensus 290 ~~~~~~~~Cgi~~~~~yp~ 308 (308)
|+..+.|.|||++.+.||+
T Consensus 426 r~~~g~n~CGI~~~~~~P~ 444 (448)
T PTZ00200 426 RTNEGTDKCGILTVGLTPV 444 (448)
T ss_pred eCCCCCCcCCccccceeeE
Confidence 9742248899999999995
No 5
>KOG1543 consensus Cysteine proteinase Cathepsin L [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=7.9e-67 Score=483.50 Aligned_cols=270 Identities=43% Similarity=0.771 Sum_probs=234.2
Q ss_pred HHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHHHHhhhcCCCCCCCCCCCCCCCCccccC
Q 044448 19 MVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL 85 (308)
Q Consensus 19 ~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (308)
+.+|.+.|.+..|...|+.+|.+|++. +|+|+|+|++.+||+..+.+++++.... .. +...
T Consensus 30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~~~-----~~-~~~~ 103 (325)
T KOG1543|consen 30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTGKKPPEIKR-----DK-FTEK 103 (325)
T ss_pred hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhccccCccccc-----cc-cccc
Confidence 567788887778999999999999766 8999999999999999988776543311 11 1111
Q ss_pred CCCCCCCCceeecCCCC-CCCccCCCCC-CchHHHHHHHHHHHHHHHhcC-CcccCCHHHHhhcCC--CCCCCCCcHHHH
Q 044448 86 NSSKMSFYDSIDWNERG-AVTPVKDQGS-YCCWAFTAVATVEGLNKIRTG-QLVTRSKHQLVDCST--LNGCAKNFLENA 160 (308)
Q Consensus 86 ~~~~~~lP~~~Dwr~~g-~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~-~~~~lS~q~l~dc~~--~~gC~GG~~~~a 160 (308)
....++|++||||++| .++||||||. |||||||++++||++++|+++ .++.||+|+|+||+. +.||.||.+..|
T Consensus 104 -~~~~~~p~s~DwR~~~~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~GC~GG~~~~A 182 (325)
T KOG1543|consen 104 -LDGDDLPDSFDWRDKGAVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCGECGDGCNGGEPKNA 182 (325)
T ss_pred -cchhhCCCCccccccCCcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccCCCCCCcCCCCHHHH
Confidence 1112699999999996 5556999999 999999999999999999999 899999999999998 889999999999
Q ss_pred HHHHHHcCCCCC-CCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEEEecC-cccccCC
Q 044448 161 FEYIRQYQRLAS-ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDAT-WFNFYHG 237 (308)
Q Consensus 161 ~~~~~~~~Gi~~-e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~i~~~-~f~~y~~ 237 (308)
++|+.++ |+++ +++|||.+ ..+.|. .+ .....+.+.++..++. ++++|+++|+ +|||+|+|++. +|+.|++
T Consensus 183 ~~yi~~~-G~~t~~~~Ypy~~-~~~~C~--~~-~~~~~~~~~~~~~~~~-~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~ 256 (325)
T KOG1543|consen 183 FKYIKKN-GGVTECENYPYIG-KDGTCK--SN-KKDKTVTIKGFYNVPA-NEEAIAEAVAKNGPVSVAIDAYEDFSLYKG 256 (325)
T ss_pred HHHHHHh-CCCCCCcCCCCcC-CCCCcc--CC-CccceeEeeeeeecCc-CHHHHHHHHHhcCCeEEEEeehhhhhhccC
Confidence 9999999 6666 99999999 999999 65 3367788889998885 5999999999 59999999999 9999999
Q ss_pred ceEeCC-CCC-CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCccccccccc-cC
Q 044448 238 GVFTGP-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAY-PL 308 (308)
Q Consensus 238 Giy~~~-c~~-~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~y-p~ 308 (308)
|||.++ |.. .++|||+|||||+ . + +.+|||||||||+.|||+|||||.|+. +.|+|++.++| |+
T Consensus 257 GVy~~~~~~~~~~~Hav~iVGyG~-~--~-~~~YWivkNSWG~~WGe~Gy~ri~r~~---~~~~I~~~~~~~p~ 323 (325)
T KOG1543|consen 257 GVYAEEKGDDKEGDHAVLIVGYGT-G--D-GVDYWIVKNSWGTDWGEKGYFRIARGV---NKCGIASEASYGPI 323 (325)
T ss_pred ceEeCCCCCCCCCCceEEEEEEcC-C--C-CceeEEEEcCCCCCcccCceEEEecCC---CchhhhcccccCCC
Confidence 999998 444 5999999999999 3 5 889999999999999999999999998 67999999998 64
No 6
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00 E-value=1.3e-58 Score=413.69 Aligned_cols=207 Identities=25% Similarity=0.448 Sum_probs=180.8
Q ss_pred CCceeecCCCC---CCCccCCCC---C-CchHHHHHHHHHHHHHHHhcC---CcccCCHHHHhhcCCCCCCCCCcHHHHH
Q 044448 92 FYDSIDWNERG---AVTPVKDQG---S-YCCWAFTAVATVEGLNKIRTG---QLVTRSKHQLVDCSTLNGCAKNFLENAF 161 (308)
Q Consensus 92 lP~~~Dwr~~g---~v~pv~dQg---~-gsCwAfa~~~~~e~~~~i~~~---~~~~lS~q~l~dc~~~~gC~GG~~~~a~ 161 (308)
||++||||+.+ +|+|||||| . |||||||++++||++++|+++ ..+.||+|+|+||+.+.||+||++..|+
T Consensus 1 lP~~~Dwr~~~~~~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~~~gC~GG~~~~a~ 80 (239)
T cd02698 1 LPKSWDWRNVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAGGGSCHGGDPGGVY 80 (239)
T ss_pred CCCCcccccCCCCcccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCCCCCccCcCHHHHH
Confidence 69999999987 999999998 8 999999999999999999875 3578999999999988899999999999
Q ss_pred HHHHHcCCCCCCCCcCCCCCCCCCCcccccC--------------CCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEE
Q 044448 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSS--------------ASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVA 226 (308)
Q Consensus 162 ~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~--------------~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~ 226 (308)
+|++++ |+++|++|||.. ....|. ... .....+++.+|..++ ++++|+++|. +|||+|+
T Consensus 81 ~~~~~~-Gl~~e~~yPY~~-~~~~C~--~~~~~~~c~~~~~c~~~~~~~~~~i~~~~~~~--~~~~i~~~l~~~GPV~v~ 154 (239)
T cd02698 81 EYAHKH-GIPDETCNPYQA-KDGECN--PFNRCGTCNPFGECFAIKNYTLYFVSDYGSVS--GRDKMMAEIYARGPISCG 154 (239)
T ss_pred HHHHHc-CcCCCCeeCCcC-CCCCCc--CCCCCCCcccCcccccccccceEEeeeceecC--CHHHHHHHHHHcCCEEEE
Confidence 999987 999999999998 666665 210 012346778887775 5788999887 6999999
Q ss_pred EecC-cccccCCceEeCC-CCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCC--CCCccccc
Q 044448 227 IDAT-WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAA 302 (308)
Q Consensus 227 i~~~-~f~~y~~Giy~~~-c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~--~~~Cgi~~ 302 (308)
|.+. +|+.|++|||+.+ |...++|||+|||||++. + +++|||||||||++|||+|||||+|+... .|+|||++
T Consensus 155 i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~--~-g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~~i~~ 231 (239)
T cd02698 155 IMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDE--N-GVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEE 231 (239)
T ss_pred EEecccccccCCeEEccCCCCCcCCeEEEEEEEEecC--C-CCEEEEEEcCCCcccCcCceEEEEccCCccccccccccc
Confidence 9999 9999999999887 556689999999999874 5 78999999999999999999999999721 47899999
Q ss_pred ccccc
Q 044448 303 NAAYP 307 (308)
Q Consensus 303 ~~~yp 307 (308)
.+.|+
T Consensus 232 ~~~~~ 236 (239)
T cd02698 232 DCAWA 236 (239)
T ss_pred ceEEE
Confidence 99886
No 7
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00 E-value=4.2e-58 Score=411.53 Aligned_cols=207 Identities=29% Similarity=0.601 Sum_probs=177.8
Q ss_pred CCceeecCCCC----CCCccCCCCC-CchHHHHHHHHHHHHHHHhcCC------cccCCHHHHhhcCC-CCCCCCCcHHH
Q 044448 92 FYDSIDWNERG----AVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQ------LVTRSKHQLVDCST-LNGCAKNFLEN 159 (308)
Q Consensus 92 lP~~~Dwr~~g----~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~------~~~lS~q~l~dc~~-~~gC~GG~~~~ 159 (308)
||++||||+.+ +|+||||||. |||||||++++||++++|+++. .+.||+|+|+||+. +.||+||++..
T Consensus 1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~~~GC~GG~~~~ 80 (243)
T cd02621 1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQYSQGCDGGFPFL 80 (243)
T ss_pred CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCCCCCCCCCCHHH
Confidence 79999999988 9999999999 9999999999999999998876 68999999999998 88999999999
Q ss_pred HHHHHHHcCCCCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcC----CCCHHHHHHHHh-cCCeEEEEecC-ccc
Q 044448 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQ----PATEEGLQDVVS-RQPVSVAIDAT-WFN 233 (308)
Q Consensus 160 a~~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~----~~~~~~lk~~l~-~gPV~v~i~~~-~f~ 233 (308)
|++|+.++ ||++|++|||.....+.|. ........+++..|..+. ..++++||++|. +|||+|+|++. +|+
T Consensus 81 a~~~~~~~-Gi~~e~~yPY~~~~~~~C~--~~~~~~~~~~~~~~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~F~ 157 (243)
T cd02621 81 VGKFAEDF-GIVTEDYFPYTADDDRPCK--ASPSECRRYYFSDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFD 157 (243)
T ss_pred HHHHHHhc-CcCCCceeCCCCCCCCCCC--CCccccccccccceeEcccccccCCHHHHHHHHHHcCCEEEEEEeccccc
Confidence 99999887 9999999999862356788 442133444555555442 247899999998 59999999999 999
Q ss_pred ccCCceEeCC-----CCC---------CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcc
Q 044448 234 FYHGGVFTGP-----CGN---------TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCN 299 (308)
Q Consensus 234 ~y~~Giy~~~-----c~~---------~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cg 299 (308)
+|++|||+.+ |.. .++|||+|||||++.. + +.+|||||||||++|||+|||||+|+. |.||
T Consensus 158 ~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~-~-g~~YWiirNSWG~~WGe~Gy~~i~~~~---~~cg 232 (243)
T cd02621 158 FYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI-K-GEKYWIVKNSWGSSWGEKGYFKIRRGT---NECG 232 (243)
T ss_pred ccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC-C-CCcEEEEEcCCCCCCCcCCeEEEecCC---cccC
Confidence 9999999875 642 4799999999998631 3 789999999999999999999999986 7899
Q ss_pred ccccccc
Q 044448 300 IAANAAY 306 (308)
Q Consensus 300 i~~~~~y 306 (308)
|++.+++
T Consensus 233 i~~~~~~ 239 (243)
T cd02621 233 IESQAVF 239 (243)
T ss_pred cccceEe
Confidence 9999854
No 8
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00 E-value=1.5e-57 Score=398.55 Aligned_cols=203 Identities=53% Similarity=0.975 Sum_probs=187.6
Q ss_pred CceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC--CCCCCCCcHHHHHHHHHHcCC
Q 044448 93 YDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQR 169 (308)
Q Consensus 93 P~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~~~~~~G 169 (308)
|++||||+.+.++||+|||. |+|||||++++||++++++++..+.||+|+|++|.. +.+|.||.+..|++++.+. |
T Consensus 1 P~~~d~r~~~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~~-G 79 (210)
T cd02248 1 PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNG-G 79 (210)
T ss_pred CCcccCCcCCCCCCCccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHHC-C
Confidence 78999999999999999999 999999999999999999999889999999999997 6899999999999998877 9
Q ss_pred CCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHhc-CCeEEEEecC-cccccCCceEeCC-C-C
Q 044448 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT-WFNFYHGGVFTGP-C-G 245 (308)
Q Consensus 170 i~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~~-gPV~v~i~~~-~f~~y~~Giy~~~-c-~ 245 (308)
+++|++|||.. ....|. .. .....++|++|..++..++++||++|++ |||+++|.+. +|+.|++|||..+ | .
T Consensus 80 i~~e~~yPY~~-~~~~C~--~~-~~~~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~ 155 (210)
T cd02248 80 LASESDYPYTG-KDGTCK--YN-SSKVGAKITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCSN 155 (210)
T ss_pred cCccccCCccC-CCCCcc--CC-CCcccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCCC
Confidence 99999999998 888999 55 4467899999999987678999999995 9999999999 9999999999987 5 3
Q ss_pred CCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccccc
Q 044448 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307 (308)
Q Consensus 246 ~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~yp 307 (308)
..++|||+|||||++ . +.+|||||||||+.||++|||||+++. +.|||++.+.||
T Consensus 156 ~~~~Hav~iVGy~~~---~-~~~ywiv~NSWG~~WG~~Gy~~i~~~~---~~cgi~~~~~~~ 210 (210)
T cd02248 156 TNLNHAVLLVGYGTE---N-GVDYWIVKNSWGTSWGEKGYIRIARGS---NLCGIASYASYP 210 (210)
T ss_pred CcCCEEEEEEEEeec---C-CceEEEEEcCCCCccccCcEEEEEcCC---CccCceeeeecC
Confidence 568999999999998 4 889999999999999999999999996 689999999887
No 9
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00 E-value=2.8e-57 Score=404.32 Aligned_cols=201 Identities=28% Similarity=0.573 Sum_probs=172.5
Q ss_pred CceeecCCC--CCC--CccCCCCC-CchHHHHHHHHHHHHHHHhcC--CcccCCHHHHhhcCC--CCCCCCCcHHHHHHH
Q 044448 93 YDSIDWNER--GAV--TPVKDQGS-YCCWAFTAVATVEGLNKIRTG--QLVTRSKHQLVDCST--LNGCAKNFLENAFEY 163 (308)
Q Consensus 93 P~~~Dwr~~--g~v--~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~--~~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~ 163 (308)
|++||||++ +++ +||+|||. |||||||++++||+++.|+++ +.+.||+|+|+||+. +.||+||++..|++|
T Consensus 1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~ 80 (236)
T cd02620 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY 80 (236)
T ss_pred CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence 899999996 454 59999999 999999999999999999987 778999999999987 689999999999999
Q ss_pred HHHcCCCCCCCCcCCCCCCCCC------------------CcccccCC---CCccEEEeeeEEcCCCCHHHHHHHHh-cC
Q 044448 164 IRQYQRLASECVYPYQGRQDYY------------------CDWWRSSA---SGKYGAIRGYQYVQPATEEGLQDVVS-RQ 221 (308)
Q Consensus 164 ~~~~~Gi~~e~~yPY~~~~~~~------------------C~~~~~~~---~~~~~~i~~~~~v~~~~~~~lk~~l~-~g 221 (308)
++++ |+++|++|||.+ .... |. .... ....+++..+..+. .++++||.+|. +|
T Consensus 81 i~~~-G~~~e~~yPY~~-~~~~~~~~~~~~~~~~~~~~~~C~--~~~~~~~~~~~~~~~~~~~~~-~~~~~ik~~l~~~G 155 (236)
T cd02620 81 LTTT-GVVTGGCQPYTI-PPCGHHPEGPPPCCGTPYCTPKCQ--DGCEKTYEEDKHKGKSAYSVP-SDETDIMKEIMTNG 155 (236)
T ss_pred HHhc-CCCcCCEecCcC-CCCccCCCCCCCCCCCCCCCCCCC--cCCccccceeeeeecceeeeC-CHHHHHHHHHHHCC
Confidence 9987 999999999988 5432 33 2100 11234555666665 47899999998 59
Q ss_pred CeEEEEecC-cccccCCceEeCCCCC-CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcc
Q 044448 222 PVSVAIDAT-WFNFYHGGVFTGPCGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCN 299 (308)
Q Consensus 222 PV~v~i~~~-~f~~y~~Giy~~~c~~-~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cg 299 (308)
||+|+|.+. +|+.|++|||+.+|+. .++|||+|||||++ + +++|||||||||++|||+|||||+|+. |+||
T Consensus 156 Pv~v~i~~~~~f~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~---~-g~~YWivrNSWG~~WGe~Gy~ri~~~~---~~cg 228 (236)
T cd02620 156 PVQAAFTVYEDFLYYKSGVYQHTSGKQLGGHAVKIIGWGVE---N-GVPYWLAANSWGTDWGENGYFRILRGS---NECG 228 (236)
T ss_pred CeEEEEEechhhhhcCCcEEeecCCCCcCCeEEEEEEEecc---C-CeeEEEEEeCCCCCCCCCcEEEEEccC---cccc
Confidence 999999998 9999999999876654 47999999999987 5 889999999999999999999999986 7899
Q ss_pred cccccc
Q 044448 300 IAANAA 305 (308)
Q Consensus 300 i~~~~~ 305 (308)
|++.++
T Consensus 229 i~~~~~ 234 (236)
T cd02620 229 IESEVV 234 (236)
T ss_pred ccccee
Confidence 999875
No 10
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad. This group of proteins belongs to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity. Members of the papain family are widespread, found in baculovirus, eubacteria, yeast, and practically all protozoa, plants and mammals. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides are their ability to inhibit the activity of their cognate enzymes and the high selectivity that certain propeptides exhibit for inhibition of the peptidases from which they originate.
The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00 E-value=1.6e-55 Score=386.92 Aligned_cols=208 Identities=37% Similarity=0.757 Sum_probs=180.8
Q ss_pred CCceeecCCC-CCCCccCCCCC-CchHHHHHHHHHHHHHHHhc-CCcccCCHHHHhhcCC--CCCCCCCcHHHHHHHHHH
Q 044448 92 FYDSIDWNER-GAVTPVKDQGS-YCCWAFTAVATVEGLNKIRT-GQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQ 166 (308)
Q Consensus 92 lP~~~Dwr~~-g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~-~~~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~~~~ 166 (308)
||++||||+. +.++||+||+. |+|||||+++++|++++++. ...+.||+|+|++|.. +.+|+||++..|++++++
T Consensus 1 lP~~~D~r~~~~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~~~ 80 (219)
T PF00112_consen 1 LPKSFDWRDKGGRITPVRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYIKN 80 (219)
T ss_dssp STSSEEGGGTTTCSG---BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHHHH
T ss_pred CCCCEecccCCCCcCccccCCcccccccchhccceeccccccccccccccccccccccccccccccccCcccccceeecc
Confidence 7999999998 48999999999 99999999999999999999 7889999999999997 789999999999999998
Q ss_pred cCCCCCCCCcCCCCCCC-CCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHhc-CCeEEEEecC--cccccCCceEeC
Q 044448 167 YQRLASECVYPYQGRQD-YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT--WFNFYHGGVFTG 242 (308)
Q Consensus 167 ~~Gi~~e~~yPY~~~~~-~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~~-gPV~v~i~~~--~f~~y~~Giy~~ 242 (308)
+.|+++|++|||.. .. ..|. ........+++..|..+...++++||++|.+ |||+++|.+. +|+.|++|||..
T Consensus 81 ~~Gi~~e~~~pY~~-~~~~~c~--~~~~~~~~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~ 157 (219)
T PF00112_consen 81 NNGIVTEEDYPYNG-NENPTCK--SKKSNSYYVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDP 157 (219)
T ss_dssp HTSBEBTTTS--SS-SSSCSSC--HSGGGEEEBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECS
T ss_pred cCcccccccccccc-ccccccc--ccccccccccccccccccccchhHHHHHHhhCceeeeeeeccccccccccceeeec
Confidence 34999999999998 66 7898 4411212478999999987779999999996 9999999999 499999999998
Q ss_pred C-CCC-CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCccccccccccC
Q 044448 243 P-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308 (308)
Q Consensus 243 ~-c~~-~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~yp~ 308 (308)
+ |.. .++|||+|||||++ . +++|||||||||++||++|||||+|+.+ ++|||++.++||+
T Consensus 158 ~~~~~~~~~Hav~iVGy~~~---~-~~~~wiv~NSWG~~WG~~Gy~~i~~~~~--~~c~i~~~~~~~~ 219 (219)
T PF00112_consen 158 PDCSNESGGHAVLIVGYDDE---N-GKGYWIVKNSWGTDWGDNGYFRISYDYN--NECGIESQAVYPI 219 (219)
T ss_dssp TSSSSSSEEEEEEEEEEEEE---T-TEEEEEEE-SBTTTSTBTTEEEEESSSS--SGGGTTSSEEEEE
T ss_pred cccccccccccccccccccc---c-ceeeEeeehhhCCccCCCeEEEEeeCCC--CcCccCceeeecC
Confidence 6 764 68999999999998 4 8999999999999999999999999974 4899999999995
No 11
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00 E-value=2e-54 Score=423.72 Aligned_cols=209 Identities=20% Similarity=0.404 Sum_probs=176.1
Q ss_pred CCCCceeecCCC----CCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCC-----c-----ccCCHHHHhhcCC-CCCCC
Q 044448 90 MSFYDSIDWNER----GAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQ-----L-----VTRSKHQLVDCST-LNGCA 153 (308)
Q Consensus 90 ~~lP~~~Dwr~~----g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~-----~-----~~lS~q~l~dc~~-~~gC~ 153 (308)
.+||++||||+. +.++||+|||. |||||||++++||++++|++++ . ..||+|+|+||+. +.||+
T Consensus 379 ~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~nqGC~ 458 (693)
T PTZ00049 379 DELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYDQGCN 458 (693)
T ss_pred ccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCCCCCcC
Confidence 369999999984 67999999999 9999999999999999998643 1 2799999999998 89999
Q ss_pred CCcHHHHHHHHHHcCCCCCCCCcCCCCCCCCCCcccccCCC--------------------------------------C
Q 044448 154 KNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSAS--------------------------------------G 195 (308)
Q Consensus 154 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~--------------------------------------~ 195 (308)
||++..|++|+.++ ||++|++|||.+ ..+.|+ ..... .
T Consensus 459 GG~~~~A~kya~~~-GI~tEscYPY~a-~~g~C~--~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (693)
T PTZ00049 459 GGFPYLVSKMAKLQ-GIPLDKVFPYTA-TEQTCP--YQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEP 534 (693)
T ss_pred CCcHHHHHHHHHHC-CCCcCCccCCcC-CCCCCC--CCCCCccccccccccccccccccccccccccccccccccccccc
Confidence 99999999999887 999999999998 778886 32110 1
Q ss_pred ccEEEeeeEEcCC-------CCHHHHHHHHh-cCCeEEEEecC-cccccCCceEeCC-------CCC-------------
Q 044448 196 KYGAIRGYQYVQP-------ATEEGLQDVVS-RQPVSVAIDAT-WFNFYHGGVFTGP-------CGN------------- 246 (308)
Q Consensus 196 ~~~~i~~~~~v~~-------~~~~~lk~~l~-~gPV~v~i~~~-~f~~y~~Giy~~~-------c~~------------- 246 (308)
.++.+++|..+.. .++++|+.+|. +|||+|+|++. +|++|++|||+.+ |..
T Consensus 535 ~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G 614 (693)
T PTZ00049 535 ARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITG 614 (693)
T ss_pred cceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccc
Confidence 1234566666531 46889999998 59999999999 9999999999852 642
Q ss_pred --CCCeEEEEEEeCCcCCCCCC--CCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccccc
Q 044448 247 --TPNHGVTIVGYGTTTEAEGQ--QPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307 (308)
Q Consensus 247 --~~~Hav~iVGyg~~~~~~~g--~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~yp 307 (308)
.++|||+|||||.+.+ + | .+|||||||||+.||++|||||+|+. |.|||++.++|+
T Consensus 615 ~e~~NHAVlIVGwG~d~e-n-G~~~~YWIVRNSWGt~WGenGYfKI~RG~---N~CGIEs~a~~~ 674 (693)
T PTZ00049 615 WEKVNHAIVLVGWGEEEI-N-GKLYKYWIGRNSWGKNWGKEGYFKIIRGK---NFSGIESQSLFI 674 (693)
T ss_pred cccCceEEEEEEeccccC-C-CcccCEEEEECCCCCCcccCceEEEEcCC---CccCCccceeEE
Confidence 3699999999998531 3 5 37999999999999999999999997 789999999886
No 12
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00 E-value=1.3e-53 Score=413.92 Aligned_cols=203 Identities=22% Similarity=0.405 Sum_probs=172.8
Q ss_pred CCCceeecCCCC---CCCccCCCCC----CchHHHHHHHHHHHHHHHhcC------CcccCCHHHHhhcCC-CCCCCCCc
Q 044448 91 SFYDSIDWNERG---AVTPVKDQGS----YCCWAFTAVATVEGLNKIRTG------QLVTRSKHQLVDCST-LNGCAKNF 156 (308)
Q Consensus 91 ~lP~~~Dwr~~g---~v~pv~dQg~----gsCwAfa~~~~~e~~~~i~~~------~~~~lS~q~l~dc~~-~~gC~GG~ 156 (308)
+||++||||+.| +|+||||||. |||||||++++||++++|+++ ..+.||+|+|+||+. ++||+||+
T Consensus 204 ~LP~sfDWR~~gg~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCdGG~ 283 (548)
T PTZ00364 204 PPPAAWSWGDVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCAGGF 283 (548)
T ss_pred CCCCccccCcCCCCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCCCCc
Confidence 699999999987 7999999973 999999999999999999883 468899999999998 89999999
Q ss_pred HHHHHHHHHHcCCCCCCCCc--CCCCCCCC---CCcccccCCCCccEEEee------eEEcCCCCHHHHHHHHh-cCCeE
Q 044448 157 LENAFEYIRQYQRLASECVY--PYQGRQDY---YCDWWRSSASGKYGAIRG------YQYVQPATEEGLQDVVS-RQPVS 224 (308)
Q Consensus 157 ~~~a~~~~~~~~Gi~~e~~y--PY~~~~~~---~C~~~~~~~~~~~~~i~~------~~~v~~~~~~~lk~~l~-~gPV~ 224 (308)
+..|++|+.++ ||++|++| ||.+ .++ .|+ .. .....+.+++ |..+. .++++|+.+|+ +|||+
T Consensus 284 p~~A~~yi~~~-GI~tE~dY~~PY~~-~dg~~~~Ck--~~-~~~~~y~~~~~~~I~gyy~~~-~~e~~I~~eI~~~GPVs 357 (548)
T PTZ00364 284 PEEVGKFAETF-GILTTDSYYIPYDS-GDGVERACK--TR-RPSRRYYFTNYGPLGGYYGAV-TDPDEIIWEIYRHGPVP 357 (548)
T ss_pred HHHHHHHHHhC-CcccccccCCCCCC-CCCCCCCCC--CC-cccceeeeeeeEEecceeecC-CcHHHHHHHHHHcCCeE
Confidence 99999999877 99999999 9987 555 587 44 2333444444 33333 47889999998 59999
Q ss_pred EEEecC-cccccCCceEeC---------CC-----------CCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCC--CcC
Q 044448 225 VAIDAT-WFNFYHGGVFTG---------PC-----------GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT--NWD 281 (308)
Q Consensus 225 v~i~~~-~f~~y~~Giy~~---------~c-----------~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~--~WG 281 (308)
|+|++. +|+.|++|||.+ .| ...++|||+|||||.+. + |.+|||||||||+ +||
T Consensus 358 VaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de--~-G~~YWIVKNSWGt~~~WG 434 (548)
T PTZ00364 358 ASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDE--N-GGDYWLVLDPWGSRRSWC 434 (548)
T ss_pred EEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccC--C-CceEEEEECCCCCCCCcc
Confidence 999999 999999999752 12 13479999999999864 5 8899999999999 999
Q ss_pred CCceEEEEeCCCCCCCcccccccc
Q 044448 282 EGGSMRIFRGVGGSGLCNIAANAA 305 (308)
Q Consensus 282 e~Gy~~i~~~~~~~~~Cgi~~~~~ 305 (308)
|+|||||+|+. |.|||++.++
T Consensus 435 E~GYfRI~RG~---N~CGIes~~v 455 (548)
T PTZ00364 435 DGGTRKIARGV---NAYNIESEVV 455 (548)
T ss_pred cCCeEEEEcCC---Ccccccceee
Confidence 99999999997 7899999987
No 13
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00 E-value=4.5e-49 Score=335.81 Aligned_cols=165 Identities=53% Similarity=1.004 Sum_probs=146.5
Q ss_pred CCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC--CCCCCCCcHHHHHHHHHHcC
Q 044448 92 FYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQ 168 (308)
Q Consensus 92 lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~~~~~~ 168 (308)
||++||||+.++++||+||+. |+|||||++++||+++++++++.+.||+|+|++|.. +.+|+||++..|++|+.++.
T Consensus 1 lP~~~D~R~~~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~~~ 80 (174)
T smart00645 1 LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKKNG 80 (174)
T ss_pred CCCcCcccccCCCCccccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHHcC
Confidence 699999999999999999999 999999999999999999999899999999999997 56999999999999998766
Q ss_pred CCCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHhcCCeEEEEecCcccccCCceEeCC-CCCC
Q 044448 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHGGVFTGP-CGNT 247 (308)
Q Consensus 169 Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~~gPV~v~i~~~~f~~y~~Giy~~~-c~~~ 247 (308)
|+++|++|||.. ++.+.+.+|+.|++|||+.+ |...
T Consensus 81 Gi~~e~~~PY~~-------------------------------------------~~~~~~~~f~~Y~~Gi~~~~~~~~~ 117 (174)
T smart00645 81 GLETESCYPYTG-------------------------------------------SVAIDASDFQFYKSGIYDHPGCGSG 117 (174)
T ss_pred CcccccccCccc-------------------------------------------EEEEEcccccCCcCeEECCCCCCCC
Confidence 899999999831 44555448999999999985 8654
Q ss_pred -CCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCccccccc
Q 044448 248 -PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANA 304 (308)
Q Consensus 248 -~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~ 304 (308)
++|+|+|||||.+. + +++|||||||||+.|||+|||||+|+.. +.|||+...
T Consensus 118 ~~~Hav~ivGyg~~~--~-g~~yWii~NSwG~~WG~~G~~~i~~~~~--~~c~i~~~~ 170 (174)
T smart00645 118 TLDHAVLIVGYGTEE--N-GKDYWIVKNSWGTDWGENGYFRIARGKN--NECGIEASV 170 (174)
T ss_pred cccEEEEEEEEeecC--C-CeeEEEEECCCCCCcccCeEEEEEcCCC--CccCceeee
Confidence 79999999999862 4 8899999999999999999999999852 679995543
No 14
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00 E-value=1.8e-46 Score=330.15 Aligned_cols=191 Identities=27% Similarity=0.434 Sum_probs=165.5
Q ss_pred eeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcC--CcccCCHHHHhhcCC-C-----CCCCCCcHHHHHH-HH
Q 044448 95 SIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTG--QLVTRSKHQLVDCST-L-----NGCAKNFLENAFE-YI 164 (308)
Q Consensus 95 ~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~--~~~~lS~q~l~dc~~-~-----~gC~GG~~~~a~~-~~ 164 (308)
.+|||+.+ ++||+|||. |+|||||+++++|+++++++. +.+.||+|+|++|.. . .+|.||.+..++. ++
T Consensus 1 ~~d~r~~~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~~~~~ 79 (223)
T cd02619 1 SVDLRPLR-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSALLKLV 79 (223)
T ss_pred CCcchhcC-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHHHHHH
Confidence 47999998 999999999 999999999999999999987 889999999999998 2 5899999999998 77
Q ss_pred HHcCCCCCCCCcCCCCCCCCCCccccc---CCCCccEEEeeeEEcCCCCHHHHHHHHhc-CCeEEEEecC-cccccCCce
Q 044448 165 RQYQRLASECVYPYQGRQDYYCDWWRS---SASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT-WFNFYHGGV 239 (308)
Q Consensus 165 ~~~~Gi~~e~~yPY~~~~~~~C~~~~~---~~~~~~~~i~~~~~v~~~~~~~lk~~l~~-gPV~v~i~~~-~f~~y~~Gi 239 (308)
.++ ||++|++|||.. ....|. .. ......+++..|..+...++++||++|.+ |||+++|.+. .|..|++|+
T Consensus 80 ~~~-Gi~~e~~~Py~~-~~~~~~--~~~~~~~~~~~~~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~ 155 (223)
T cd02619 80 ALK-GIPPEEDYPYGA-ESDGEE--PKSEAALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGI 155 (223)
T ss_pred HHc-CCCccccCCCCC-CCCCCC--CCCccchhhcceeecceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCcc
Confidence 766 999999999998 666665 21 13345688999999987778999999995 9999999999 999999998
Q ss_pred Ee-----CC-CC-CCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCC
Q 044448 240 FT-----GP-CG-NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292 (308)
Q Consensus 240 y~-----~~-c~-~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~ 292 (308)
+. .. |. ..++|||+|||||++.. . +++|||||||||+.||++||+||+++.
T Consensus 156 ~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~-~-~~~~~i~~NSwG~~wg~~Gy~~i~~~~ 213 (223)
T cd02619 156 IYEEIVYLLYEDGDLGGHAVVIVGYDDNYV-E-GKGAFIVKNSWGTDWGDNGYGRISYED 213 (223)
T ss_pred ccccccccccCCCccCCeEEEEEeecCCCC-C-CCCEEEEEeCCCCccccCCEEEEehhh
Confidence 73 22 33 35899999999998742 3 679999999999999999999999984
No 15
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00 E-value=3.8e-45 Score=367.70 Aligned_cols=200 Identities=21% Similarity=0.367 Sum_probs=159.8
Q ss_pred CCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC---CCCCCCCc-HHHHHHHHHHcCCCCCCCCcCC
Q 044448 104 VTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNF-LENAFEYIRQYQRLASECVYPY 178 (308)
Q Consensus 104 v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~---~~gC~GG~-~~~a~~~~~~~~Gi~~e~~yPY 178 (308)
..||||||. |+|||||+++++|++++|+++..+.||+|+|+||+. +.||.||+ +..++.|+.+++||++|++|||
T Consensus 544 ~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgLptESdYPY 623 (1004)
T PTZ00462 544 KIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFLPADSNYLY 623 (1004)
T ss_pred CCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCCcccccCCC
Confidence 579999999 999999999999999999999999999999999986 57999997 5566799988866899999999
Q ss_pred CCC-CCCCCcccccCC-----------------CCccEEEeeeEEcCCC----C----HHHHHHHHhc-CCeEEEEecCc
Q 044448 179 QGR-QDYYCDWWRSSA-----------------SGKYGAIRGYQYVQPA----T----EEGLQDVVSR-QPVSVAIDATW 231 (308)
Q Consensus 179 ~~~-~~~~C~~~~~~~-----------------~~~~~~i~~~~~v~~~----~----~~~lk~~l~~-gPV~v~i~~~~ 231 (308)
... ..+.|+ .... ....+.+.+|..+... + +++|+++|++ |||+|+|++.+
T Consensus 624 t~k~~~g~Cp--~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAsd 701 (1004)
T PTZ00462 624 NYTKVGEDCP--DEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAEN 701 (1004)
T ss_pred ccCCCCCCCC--CCcccccccccccccccccccccceeeccceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEeeh
Confidence 741 456787 3210 0113345667666532 1 4688999995 99999999878
Q ss_pred cccc-CCceEeCC-CCC-CCCeEEEEEEeCCcCC--CCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCccccccccc
Q 044448 232 FNFY-HGGVFTGP-CGN-TPNHGVTIVGYGTTTE--AEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAY 306 (308)
Q Consensus 232 f~~y-~~Giy~~~-c~~-~~~Hav~iVGyg~~~~--~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~y 306 (308)
|+.| .+|||..+ |+. .++|||+|||||.+.. .. +++|||||||||+.|||+|||||.|.. .++|||+....+
T Consensus 702 f~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg~-gk~YWIVRNSWGt~WGEnGYFKI~r~g--~n~CGin~i~t~ 778 (1004)
T PTZ00462 702 VLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDEDE-KKSYWIVRNSWGKYWGDEGYFKVDMYG--PSHCEDNFIHSV 778 (1004)
T ss_pred HHhhhcCCccccCCCCCCcCCceEEEEEecccccccCC-CCceEEEEcCCCCCcCCCeEEEEEeCC--CCCCccchheee
Confidence 8888 48987665 985 5899999999997521 13 578999999999999999999999953 288999877665
Q ss_pred cC
Q 044448 307 PL 308 (308)
Q Consensus 307 p~ 308 (308)
|+
T Consensus 779 ~~ 780 (1004)
T PTZ00462 779 VI 780 (1004)
T ss_pred ee
Confidence 53
No 16
>KOG1544 consensus Predicted cysteine proteinase TIN-ag [General function prediction only]
Probab=100.00 E-value=6.9e-42 Score=303.55 Aligned_cols=247 Identities=24% Similarity=0.425 Sum_probs=189.6
Q ss_pred CCCCCCHHHHHhhhcCCCCCCCC-CCCCCCCCccccCCCCCCCCCceeecCCC--CCCCccCCCCC-CchHHHHHHHHHH
Q 044448 50 KFADLTREKFLASYTGYKPPPTD-HPHSNRSNWFKNLNSSKMSFYDSIDWNER--GAVTPVKDQGS-YCCWAFTAVATVE 125 (308)
Q Consensus 50 ~fsDlt~eEf~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~--g~v~pv~dQg~-gsCwAfa~~~~~e 125 (308)
+|..||.++=.+..+|..+|... ..|+++.. .. .+. .+||+.||-|++ +++.|+.|||+ ++.|||+++++..
T Consensus 170 aFWGmtL~DGiKyRLGTL~Ps~sv~nMNEi~~-~l-~p~--~~LPE~F~As~KWp~liH~plDQgnCa~SWafSTaavas 245 (470)
T KOG1544|consen 170 AFWGMTLDDGIKYRLGTLRPSSSVMNMNEIYT-VL-NPG--EVLPEAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVAS 245 (470)
T ss_pred hhhcccccccceeeecccCchhhhhhHHhHhh-cc-Ccc--cccchhhhhhhcCCccccCccccCCcccceeeeeehhcc
Confidence 78888888766666776665543 33432111 11 111 259999999986 89999999999 9999999999999
Q ss_pred HHHHHhcCC--cccCCHHHHhhcCC--CCCCCCCcHHHHHHHHHHcCCCCCCCCcCCCCC---CCCCCccc---------
Q 044448 126 GLNKIRTGQ--LVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASECVYPYQGR---QDYYCDWW--------- 189 (308)
Q Consensus 126 ~~~~i~~~~--~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~---~~~~C~~~--------- 189 (308)
.+++|.+.. ...||+|+|++|.. .+||.||..+.|+=|+.+. |++...+|||... ..+.|..-
T Consensus 246 DRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRKr-GvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkR 324 (470)
T KOG1544|consen 246 DRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRKR-GVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKR 324 (470)
T ss_pred ceeEEeeccccccccChHHhcchhhhhhccCccCcccchheeeecc-cccccccccccCCCCCCCCCceeeccccCcccc
Confidence 999998643 36799999999998 7899999999999999888 9999999999863 22334310
Q ss_pred -------ccC-CCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEEEecC-cccccCCceEeCCCC---------CCCCe
Q 044448 190 -------RSS-ASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDAT-WFNFYHGGVFTGPCG---------NTPNH 250 (308)
Q Consensus 190 -------~~~-~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~i~~~-~f~~y~~Giy~~~c~---------~~~~H 250 (308)
... .+...++++.=..|. .++++|++.|+ +|||-+.|.+. +|+.|++|||.+... ..+.|
T Consensus 325 qat~~CPn~~~~Sn~iyq~tPPYrVS-SnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtH 403 (470)
T KOG1544|consen 325 QATAHCPNSYVNSNDIYQVTPPYRVS-SNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTH 403 (470)
T ss_pred cccCcCCCcccccCceeeecCCeecc-CCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccc
Confidence 110 122344444444454 46777877777 79999999999 999999999987521 14889
Q ss_pred EEEEEEeCCcCCCCC-CCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccc
Q 044448 251 GVTIVGYGTTTEAEG-QQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAA 305 (308)
Q Consensus 251 av~iVGyg~~~~~~~-g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~ 305 (308)
+|.|.|||++..++| ..+|||..||||+.|||+|||||-|+. |.|-|+++.+
T Consensus 404 sVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGv---NecdIEsfvI 456 (470)
T KOG1544|consen 404 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGV---NECDIESFVI 456 (470)
T ss_pred eEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccc---cchhhhHhhh
Confidence 999999999864341 357999999999999999999999998 7799998754
No 17
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.96 E-value=8.9e-30 Score=231.45 Aligned_cols=194 Identities=24% Similarity=0.332 Sum_probs=134.6
Q ss_pred CCCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHH-----hhcCC--CC-CCCCCcHHHHH
Q 044448 91 SFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQL-----VDCST--LN-GCAKNFLENAF 161 (308)
Q Consensus 91 ~lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l-----~dc~~--~~-gC~GG~~~~a~ 161 (308)
.+|+.||||+.|.|+||||||. |+||||++++++|+.+.-.. ...+|+-.+ +-|.. .. --+||....+.
T Consensus 98 s~~~~fd~r~~g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~--~w~~s~~nm~~ll~~~ye~~fd~~~~d~g~~~m~~ 175 (372)
T COG4870 98 SLPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLESYLNPES--AWDFSENNMKNLLGVPYEKGFDYTSNDGGNADMSA 175 (372)
T ss_pred cchhheeeeccCCcccccccCcccceEeeeehhhhhheecccc--cccccccchhhhcCCCccccCCCccccCCcccccc
Confidence 4899999999999999999999 99999999999999885443 233444333 22322 11 12377777777
Q ss_pred HHHHHcCCCCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCC----CHHHHHHHHh-cCCeE--EEEecCcccc
Q 044448 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPA----TEEGLQDVVS-RQPVS--VAIDATWFNF 234 (308)
Q Consensus 162 ~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~----~~~~lk~~l~-~gPV~--v~i~~~~f~~ 234 (308)
.|+.+..|-+.+.+-||.. ....|. .. .....++.....++.. +...|++++. .|-++ +.|++..+..
T Consensus 176 a~l~e~sgpv~et~d~y~~-~s~~~~--~~--~p~~k~~~~~~~i~~~~~~LdnG~i~~~~~~yg~~s~~~~id~~~~~~ 250 (372)
T COG4870 176 AYLTEWSGPVYETDDPYSE-NSYFSP--TN--LPVTKHVQEAQIIPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLG 250 (372)
T ss_pred ccccccCCcchhhcCcccc-ccccCC--cC--CchhhccccceecccchhhhcccchHHHHhhhccccceeEEecccccc
Confidence 7888888999999999988 666666 32 1222233444444421 2334666666 36554 3355553333
Q ss_pred cCCceEeCCCCCCCCeEEEEEEeCCcCC-------CCCCCCeEEEecCCCCCcCCCceEEEEeCC
Q 044448 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTE-------AEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292 (308)
Q Consensus 235 y~~Giy~~~c~~~~~Hav~iVGyg~~~~-------~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~ 292 (308)
..-+.|..+.....+|||+||||++... +. |.+.||||||||+.||++|||||+...
T Consensus 251 ~~~~~~~~~s~~~~gHAv~iVGyDDs~~~n~~~~~~~-g~GAfiikNSWGt~wG~~GYfwisY~y 314 (372)
T COG4870 251 ICIPYPYVDSGENWGHAVLIVGYDDSFDINNFKYGPP-GDGAFIIKNSWGTNWGENGYFWISYYY 314 (372)
T ss_pred cccCCCCCCccccccceEEEEeccccccccccccCCC-CCceEEEECccccccccCceEEEEeee
Confidence 3344444333356899999999999653 33 678999999999999999999999875
No 18
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the bacterium Streptomyces verticillus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.92 E-value=1.1e-24 Score=207.74 Aligned_cols=180 Identities=18% Similarity=0.225 Sum_probs=129.1
Q ss_pred CccCCCCC-CchHHHHHHHHHHHHHHHh-cCCcccCCHHHHhh----------------cCC-------------CCCCC
Q 044448 105 TPVKDQGS-YCCWAFTAVATVEGLNKIR-TGQLVTRSKHQLVD----------------CST-------------LNGCA 153 (308)
Q Consensus 105 ~pv~dQg~-gsCwAfa~~~~~e~~~~i~-~~~~~~lS~q~l~d----------------c~~-------------~~gC~ 153 (308)
.||+||+. |.||.||+...|++.+..+ +...+.||+.++.- +.. ....+
T Consensus 55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~D 134 (437)
T cd00585 55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNLKEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQND 134 (437)
T ss_pred CCcccCCCCchhHHHHCHHHHHHHHHHHcCCCCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcCC
Confidence 48999999 9999999999999988774 45678999987754 221 34578
Q ss_pred CCcHHHHHHHHHHcCCCCCCCCcCCCCC--------------------------CCC-----------------------
Q 044448 154 KNFLENAFEYIRQYQRLASECVYPYQGR--------------------------QDY----------------------- 184 (308)
Q Consensus 154 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~--------------------------~~~----------------------- 184 (308)
||....+...+.+. |+++.+.||-+.. ..+
T Consensus 135 GGqw~m~~~li~KY-GvVPk~~~pet~~s~~t~~~n~~L~~kLr~~a~~lr~~~~~~~~~~~l~~~~~~~~~~iy~il~~ 213 (437)
T cd00585 135 GGQWDMLVNLIEKY-GLVPKSVMPESFNSENSRRLNYLLNRKLREDALELRKLVAKGASKEEIEAKKEEMLKEVYRILAI 213 (437)
T ss_pred CCchHHHHHHHHHc-CCCcccccCCCcCccchHHHHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999887 9999999984321 000
Q ss_pred ---CCcc---cc--cC---------------------------------CC--C---ccE-----------EEeeeEEcC
Q 044448 185 ---YCDW---WR--SS---------------------------------AS--G---KYG-----------AIRGYQYVQ 207 (308)
Q Consensus 185 ---~C~~---~~--~~---------------------------------~~--~---~~~-----------~i~~~~~v~ 207 (308)
.++. |. ++ +. . ..+ ....|..+|
T Consensus 214 ~lG~pP~~F~~~y~dkd~~~~~~~~~TP~~F~~~yv~~~~~dyV~l~~~p~~~~p~~~~y~ve~~~Nv~~g~~~~y~Nvp 293 (437)
T cd00585 214 ALGEPPEKFDWEYRDKDKKYHEIKELTPLEFYKKYVKFDLDDYVSLINDPRPDKPYNKLYTVEYLGNVVGGRPILYLNVP 293 (437)
T ss_pred HcCCCCceEEEEEEeCCCCeeeCCCcCHHHHHHHhcCCCccceEEEEeCCCCCCCCCceEEEecCCcccccccceEEecC
Confidence 0000 00 00 00 0 000 111222332
Q ss_pred CCCHHHHH----HHHhc-CCeEEEEecCcccccCCceEeCC----------------------CCCCCCeEEEEEEeCCc
Q 044448 208 PATEEGLQ----DVVSR-QPVSVAIDATWFNFYHGGVFTGP----------------------CGNTPNHGVTIVGYGTT 260 (308)
Q Consensus 208 ~~~~~~lk----~~l~~-gPV~v~i~~~~f~~y~~Giy~~~----------------------c~~~~~Hav~iVGyg~~ 260 (308)
++.|+ ++|.. +||.+++++..|+.|++||++.. |.+..+|||+|||||.+
T Consensus 294 ---~d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~D 370 (437)
T cd00585 294 ---MDVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDLD 370 (437)
T ss_pred ---HHHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEec
Confidence 56665 45565 69999999997779999999653 23457899999999987
Q ss_pred CCCCCCC-CeEEEecCCCCCcCCCceEEEEeC
Q 044448 261 TEAEGQQ-PYWLVKNRWGTNWDEGGSMRIFRG 291 (308)
Q Consensus 261 ~~~~~g~-~ywivkNSWG~~WGe~Gy~~i~~~ 291 (308)
. + |+ .||+||||||+.||++||++|+++
T Consensus 371 ~--~-g~p~yw~VkNSWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 371 E--D-GKPVKWKVENSWGEKVGKKGYFVMSDD 399 (437)
T ss_pred C--C-CCcceEEEEcccCCCCCCCcceehhHH
Confidence 5 5 65 699999999999999999999875
No 19
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belongs to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.74 E-value=6.9e-17 Score=154.48 Aligned_cols=179 Identities=21% Similarity=0.285 Sum_probs=107.9
Q ss_pred CccCCCCC-CchHHHHHHHHHHHHHHHhcC-CcccCCHHHHh----------------hcCC-------------CCCCC
Q 044448 105 TPVKDQGS-YCCWAFTAVATVEGLNKIRTG-QLVTRSKHQLV----------------DCST-------------LNGCA 153 (308)
Q Consensus 105 ~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~-~~~~lS~q~l~----------------dc~~-------------~~gC~ 153 (308)
.||.||+. |.||.||+..+++..+..+.+ ....||+.+|. ++.. ....+
T Consensus 56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~D 135 (438)
T PF03051_consen 56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNLKDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVSD 135 (438)
T ss_dssp -S--B--BSSTHHHHHHHHHHHHHHHHHCT-SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-S
T ss_pred CCCCCCCCCCCcchhhchHHHHHHHHHHcCCCceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCCC
Confidence 49999999 999999999999999988765 67899998864 3332 34578
Q ss_pred CCcHHHHHHHHHHcCCCCCCCCcCCCCC--------------------------CC------------------------
Q 044448 154 KNFLENAFEYIRQYQRLASECVYPYQGR--------------------------QD------------------------ 183 (308)
Q Consensus 154 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~--------------------------~~------------------------ 183 (308)
||....+...|++. ||++.+.||-+.. ..
T Consensus 136 GGqw~~~~nli~KY-GvVPk~~mpet~~s~~t~~~n~~l~~~Lr~~a~~LR~~~~~~~~~~~l~~~k~~~l~~iy~il~~ 214 (438)
T PF03051_consen 136 GGQWDMVVNLIKKY-GVVPKSVMPETFSSSNTSEMNEMLNTKLREYALELRKLVKAGKSEEELRKLKEEMLAEIYRILAI 214 (438)
T ss_dssp -B-HHHHHHHHHHH----BGGGSTTGCGCHBHHHHHHHHHHHHHHHHHHHHHHHHTTTTCHHHHHHHHHHHHHHHHHHHH
T ss_pred CCchHHHHHHHHHc-CcCcHhhCCCCCCCCChHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999887 9999999985431 00
Q ss_pred --CCCcc---cc--cC---------------------------------C-----CCccEEE-----------eeeEEcC
Q 044448 184 --YYCDW---WR--SS---------------------------------A-----SGKYGAI-----------RGYQYVQ 207 (308)
Q Consensus 184 --~~C~~---~~--~~---------------------------------~-----~~~~~~i-----------~~~~~v~ 207 (308)
|.++. |. .+ + -...+.+ ..|..+|
T Consensus 215 ~lG~PP~~F~~ey~dkd~~~~~~~~~TP~eF~~kyv~~~~ddyVsLin~P~~~~py~~~y~ve~~~Nv~~g~~~~ylNvp 294 (438)
T PF03051_consen 215 YLGEPPEKFTWEYRDKDKKYHRGKNYTPLEFYKKYVGFDLDDYVSLINDPRSHHPYNKLYTVEYLGNVVGGRPVRYLNVP 294 (438)
T ss_dssp HH---SSSEEEEEE-TTS-EEEEEEE-HHHHHHHCTTS-GGGEEEEE--T-TTS-TTCEEEETTTTSSTT-EEEEEEE--
T ss_pred HcCCCChheeEEEeccccccccccccCchhHHHHHhCCCCcceEEEeeCCCccCccceeEEEccCCCEECCcceeEeccC
Confidence 00000 00 00 0 0001110 1122332
Q ss_pred CCCHHHHHHH----HhcC-CeEEEEecCcccccCCceEeCCCC----------------------CCCCeEEEEEEeCCc
Q 044448 208 PATEEGLQDV----VSRQ-PVSVAIDATWFNFYHGGVFTGPCG----------------------NTPNHGVTIVGYGTT 260 (308)
Q Consensus 208 ~~~~~~lk~~----l~~g-PV~v~i~~~~f~~y~~Giy~~~c~----------------------~~~~Hav~iVGyg~~ 260 (308)
.+.|+++ |..| ||..+-++..+...+.||.+...- +..+|||+|||.+.+
T Consensus 295 ---id~lk~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D 371 (438)
T PF03051_consen 295 ---IDELKDAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLD 371 (438)
T ss_dssp ---HHHHHHHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-
T ss_pred ---HHHHHHHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEec
Confidence 5666554 4456 999999999555668898754310 136899999999987
Q ss_pred CCCCCCC-CeEEEecCCCCCcCCCceEEEEe
Q 044448 261 TEAEGQQ-PYWLVKNRWGTNWDEGGSMRIFR 290 (308)
Q Consensus 261 ~~~~~g~-~ywivkNSWG~~WGe~Gy~~i~~ 290 (308)
. + |+ .+|+|+||||+..|.+||+.|+.
T Consensus 372 ~--~-g~p~~wkVeNSWG~~~g~kGy~~msd 399 (438)
T PF03051_consen 372 E--D-GKPVRWKVENSWGTDNGDKGYFYMSD 399 (438)
T ss_dssp T--T-SSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred c--C-CCeeEEEEEcCCCCCCCCCcEEEECH
Confidence 5 5 65 59999999999999999999984
No 20
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.41 E-value=4e-13 Score=93.44 Aligned_cols=45 Identities=42% Similarity=0.735 Sum_probs=39.7
Q ss_pred HHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHHH
Q 044448 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKF 59 (308)
Q Consensus 15 f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eEf 59 (308)
|++|+++|+|.|.+.+|+.+|+.||++|++. +|||+|||||++||
T Consensus 1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf 58 (58)
T PF08246_consen 1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF 58 (58)
T ss_dssp HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence 8999999999999999999999999999998 99999999999998
No 21
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a s
Probab=99.10 E-value=6.7e-11 Score=81.73 Aligned_cols=44 Identities=48% Similarity=0.872 Sum_probs=41.4
Q ss_pred HHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHH
Q 044448 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREK 58 (308)
Q Consensus 15 f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eE 58 (308)
|++|+++|+|.|.+.+|...|+.+|.+|++. +|+|+|||||++|
T Consensus 1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE 57 (57)
T smart00848 1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE 57 (57)
T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence 6899999999999999999999999999987 8999999999876
No 22
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=98.96 E-value=5.1e-09 Score=95.02 Aligned_cols=78 Identities=19% Similarity=0.322 Sum_probs=57.5
Q ss_pred CHHHHHHHHh----cC-CeEEEEecCcccccCCceEeCC------------C---------C-CCCCeEEEEEEeCCcCC
Q 044448 210 TEEGLQDVVS----RQ-PVSVAIDATWFNFYHGGVFTGP------------C---------G-NTPNHGVTIVGYGTTTE 262 (308)
Q Consensus 210 ~~~~lk~~l~----~g-PV~v~i~~~~f~~y~~Giy~~~------------c---------~-~~~~Hav~iVGyg~~~~ 262 (308)
+.+.+|++.. .| ||-.+-++.-+..-+.||.+-. . + +-..|||+|.|.+.+.
T Consensus 296 ~me~lkkl~~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~- 374 (444)
T COG3579 296 DMERLKKLAIKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDE- 374 (444)
T ss_pred cHHHHHHHHHHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhcccccc-
Confidence 4677777533 35 8888888877777888876421 0 0 0256999999999876
Q ss_pred CCCCCCeEEEecCCCCCcCCCceEEEE
Q 044448 263 AEGQQPYWLVKNRWGTNWDEGGSMRIF 289 (308)
Q Consensus 263 ~~~g~~ywivkNSWG~~WGe~Gy~~i~ 289 (308)
+|..-=|.|.||||..=|.+|||-++
T Consensus 375 -~g~p~rwkVENSWG~d~G~~GyfvaS 400 (444)
T COG3579 375 -TGNPLRWKVENSWGKDVGKKGYFVAS 400 (444)
T ss_pred -CCCceeeEeecccccccCCCceEeeh
Confidence 42234699999999999999999876
No 23
>KOG4128 consensus Bleomycin hydrolases and aminopeptidases of cysteine protease family [Amino acid transport and metabolism]
Probab=97.91 E-value=1.2e-05 Score=73.18 Aligned_cols=73 Identities=18% Similarity=0.216 Sum_probs=52.4
Q ss_pred CccCCCCC-CchHHHHHHHHHHHHHHHhcC-CcccCCHHHHhh--------------------cCC-----------CCC
Q 044448 105 TPVKDQGS-YCCWAFTAVATVEGLNKIRTG-QLVTRSKHQLVD--------------------CST-----------LNG 151 (308)
Q Consensus 105 ~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~-~~~~lS~q~l~d--------------------c~~-----------~~g 151 (308)
+||.||.+ |-||.|+.+..+---...+-+ ....||..+|.- |.. +.-
T Consensus 63 ~pvtnqkssGrcWift~ln~lrl~~~~kLnl~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP~ 142 (457)
T KOG4128|consen 63 QPVTNQKSSGRCWIFTGLNLLRLEMDRKLNLPEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNPV 142 (457)
T ss_pred cccccCcCCCceEEEechhHHHHHHHhcCCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCCC
Confidence 69999999 999999999987554444332 346788876631 222 333
Q ss_pred CCCCcHHHHHHHHHHcCCCCCCCCcCC
Q 044448 152 CAKNFLENAFEYIRQYQRLASECVYPY 178 (308)
Q Consensus 152 C~GG~~~~a~~~~~~~~Gi~~e~~yPY 178 (308)
-+||.-..-.+.+++. |+.+..+||-
T Consensus 143 ~DGGqw~MfvNlVkKY-GviPKkcy~~ 168 (457)
T KOG4128|consen 143 PDGGQWQMFVNLVKKY-GVIPKKCYLH 168 (457)
T ss_pred CCCchHHHHHHHHHHh-CCCcHHhccc
Confidence 4688777777787776 9999999964
No 24
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=96.87 E-value=0.011 Score=47.15 Aligned_cols=56 Identities=25% Similarity=0.509 Sum_probs=34.5
Q ss_pred CCHHHHHHHHhcC-CeEEEEecC-cccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCC
Q 044448 209 ATEEGLQDVVSRQ-PVSVAIDAT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRW 276 (308)
Q Consensus 209 ~~~~~lk~~l~~g-PV~v~i~~~-~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSW 276 (308)
.+.+.|++.|.+| ||.+.+... .-. .+..+.. ....|.|+|+||+.+ + +++|-.+|
T Consensus 87 ~~~~~i~~~i~~G~Pvi~~~~~~~~~~--~~~~~~~---~~~~H~vvi~Gy~~~-----~--~~~v~DP~ 144 (144)
T PF13529_consen 87 ASFDDIKQEIDAGRPVIVSVNSGWRPP--NGDGYDG---TYGGHYVVIIGYDED-----G--YVYVNDPW 144 (144)
T ss_dssp S-HHHHHHHHHTT--EEEEEETTSS----TTEEEEE----TTEEEEEEEEE-SS-----E---EEEE-TT
T ss_pred CcHHHHHHHHHCCCcEEEEEEcccccC--CCCCcCC---CcCCEEEEEEEEeCC-----C--EEEEeCCC
Confidence 4679999999985 999998743 111 1112211 357999999999874 4 78887776
No 25
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belongs to the peptidase family C47 (staphopain family, clan CA). The type examples are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=93.97 E-value=0.34 Score=40.86 Aligned_cols=118 Identities=17% Similarity=0.273 Sum_probs=67.2
Q ss_pred CCCC-CchHHHHHHHHHHHHHH--------HhcCCcccCCHHHHhhcCCCCCCCCCcHHHHHHHHHHcCCCCCCCCcCCC
Q 044448 109 DQGS-YCCWAFTAVATVEGLNK--------IRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179 (308)
Q Consensus 109 dQg~-gsCwAfa~~~~~e~~~~--------i~~~~~~~lS~q~l~dc~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~ 179 (308)
.||. +=|-+||.+++|-.... |.+.-.+.+|+++|.+++. .+...++|.+.. |.... |
T Consensus 18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~~yPn~s~~~l~~~~~-------~~~~~i~y~ks~-g~~~~----~- 84 (175)
T PF05543_consen 18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRYLYPNVSEEQLKFTSL-------TPNQMIKYAKSQ-GRNPQ----Y- 84 (175)
T ss_dssp --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHHHSTTS-CCCHHH--B--------HHHHHHHHHHT-TEEEE----E-
T ss_pred ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHHHCCCCCHHHHhhcCC-------CHHHHHHHHHHc-Ccchh----H-
Confidence 4888 99999999998866421 1122246788888877642 466788887655 43210 0
Q ss_pred CCCCCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHhc-CCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeC
Q 044448 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYG 258 (308)
Q Consensus 180 ~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~~-gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg 258 (308)
.+ ..+ +.+++++.+.+ -|+.+..+... + ..+...+|||+||||-
T Consensus 85 ----------~n-------------~~~--s~~eV~~~~~~nk~i~i~~~~v~-----~-----~~~~~~gHAlavvGya 129 (175)
T PF05543_consen 85 ----------NN-------------RMP--SFDEVKKLIDNNKGIAILADRVE-----Q-----TNGPHAGHALAVVGYA 129 (175)
T ss_dssp ----------EC-------------S-----HHHHHHHHHTT-EEEEEEEETT-----S-----CTTB--EEEEEEEEEE
T ss_pred ----------hc-------------CCC--CHHHHHHHHHcCCCeEEEecccc-----c-----CCCCccceeEEEEeee
Confidence 00 011 47889998885 67777655311 1 1223579999999997
Q ss_pred CcCCCCCCCCeEEEecCCC
Q 044448 259 TTTEAEGQQPYWLVKNRWG 277 (308)
Q Consensus 259 ~~~~~~~g~~ywivkNSWG 277 (308)
.-. + |.++.++=|=|-
T Consensus 130 ~~~--~-g~~~y~~WNPW~ 145 (175)
T PF05543_consen 130 KPN--N-GQKTYYFWNPWW 145 (175)
T ss_dssp EET--T-SEEEEEEE-TT-
T ss_pred ecC--C-CCeEEEEeCCcc
Confidence 643 4 788999977774
No 26
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=91.19 E-value=0.57 Score=43.17 Aligned_cols=55 Identities=18% Similarity=0.422 Sum_probs=35.5
Q ss_pred HHHHHHHHhcC-CeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEec
Q 044448 211 EEGLQDVVSRQ-PVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274 (308)
Q Consensus 211 ~~~lk~~l~~g-PV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkN 274 (308)
.+.|+++|.+| ||.+.++.+.+ -|...-|. .....|.|+|+||+++ +..+.++-.
T Consensus 78 ~~~l~~~l~~g~pv~~~~D~~~l-py~~~~~~---~~~~~H~i~v~G~d~~-----~~~~~v~D~ 133 (317)
T PF14399_consen 78 WEELKEALDAGRPVIVWVDMYYL-PYRPNYYK---KHHADHYIVVYGYDEE-----EDVFYVSDP 133 (317)
T ss_pred HHHHHHHHhCCCceEEEeccccC-CCCccccc---cccCCcEEEEEEEeCC-----CCEEEEEcC
Confidence 45778888877 99999887522 22221111 1236899999999976 345666644
No 27
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=84.98 E-value=2.6 Score=35.80 Aligned_cols=52 Identities=17% Similarity=0.312 Sum_probs=36.7
Q ss_pred EEcCCCCHHHHHHHHhc-CCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCC
Q 044448 204 QYVQPATEEGLQDVVSR-QPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277 (308)
Q Consensus 204 ~~v~~~~~~~lk~~l~~-gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG 277 (308)
..++..++.+|+..|.+ .||.+-.-. |-. ..-|+|+|.||++. |+..-++||
T Consensus 116 ~d~tGksl~~ik~ql~kg~PV~iw~T~--~~~------------~s~H~v~itgyDk~--------n~yynDpyG 168 (195)
T COG4990 116 VDLTGKSLSDIKGQLLKGRPVVIWVTN--FHS------------YSIHSVLITGYDKY--------NIYYNDPYG 168 (195)
T ss_pred ccCcCCcHHHHHHHHhcCCcEEEEEec--ccc------------cceeeeEeeccccc--------ceEeccccc
Confidence 34566789999999997 599876543 211 34799999999874 355556664
No 28
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=83.12 E-value=5 Score=35.13 Aligned_cols=51 Identities=22% Similarity=0.452 Sum_probs=32.4
Q ss_pred CHHHHHHHHhc-CCeEEEEecCccc--ccCCceEeC---CC--C--CCCCeEEEEEEeCCc
Q 044448 210 TEEGLQDVVSR-QPVSVAIDATWFN--FYHGGVFTG---PC--G--NTPNHGVTIVGYGTT 260 (308)
Q Consensus 210 ~~~~lk~~l~~-gPV~v~i~~~~f~--~y~~Giy~~---~c--~--~~~~Hav~iVGyg~~ 260 (308)
+.++|..+|.. ||+.+-++..-.. .-+.-.... .| . .-.+|-|+|+||+.+
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~ 172 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAA 172 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCC
Confidence 58999999996 6777777776111 112222211 12 1 247899999999986
No 29
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=60.89 E-value=20 Score=28.23 Aligned_cols=33 Identities=21% Similarity=0.454 Sum_probs=23.6
Q ss_pred HHHHHhc-CCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeC
Q 044448 214 LQDVVSR-QPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYG 258 (308)
Q Consensus 214 lk~~l~~-gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg 258 (308)
+++.|.. -||.+.++.. .-....+|.|+|+||+
T Consensus 70 ~~~~l~~~~Pvi~~~~~~------------~~~~~~gH~vVv~g~~ 103 (141)
T cd02549 70 LLRQLAAGHPVIVSVNLG------------VSITPSGHAMVVIGYD 103 (141)
T ss_pred HHHHHHCCCeEEEEEecC------------cccCCCCeEEEEEEEc
Confidence 7778886 5998877641 0112468999999998
No 30
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, proteobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=42.68 E-value=62 Score=27.06 Aligned_cols=38 Identities=32% Similarity=0.439 Sum_probs=28.1
Q ss_pred CHHHHHHHHh-cCCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeCCc
Q 044448 210 TEEGLQDVVS-RQPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTT 260 (308)
Q Consensus 210 ~~~~lk~~l~-~gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~ 260 (308)
+.+.+...|. +||+-++... |-..-..|+++|.|-+.+
T Consensus 97 t~e~~~~LL~~yGPLwv~~~~-------------P~~~~~~H~~ViTGI~~d 135 (166)
T PF12385_consen 97 TAEGLANLLREYGPLWVAWEA-------------PGDSWVAHASVITGIDGD 135 (166)
T ss_pred CHHHHHHHHHHcCCeEEEecC-------------CCCcceeeEEEEEeecCC
Confidence 5789999999 5999988654 211224699999998765
No 31
>PF11567 PfUIS3: Plasmodium falciparum UIS3 membrane protein; InterPro: IPR021626 UIS3 is a membrane protein essential for sporozoite development in infected hepatocytes. This family covers residues 130-229 of the Plasmodium falciparum UIS3 protein, which is compact and has an all alpha-helical structure. PfUIS3(130-229) interacts with lipids, phospholipid liposomes, the human liver fatty acid-binding protein and with the lipid phosphatidylethanolamine. The interaction with liver fatty acid-binding protein provides the parasite with a method to import essential fatty acids/lipids during rapid growth phases of sporozoites []. ; PDB: 2VWA_C.
Probab=34.31 E-value=10 Score=28.19 Aligned_cols=31 Identities=26% Similarity=0.368 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCCCHHHHHh
Q 044448 31 EKEMRFKIFKKNHEFLRLNKFADLTREKFLA 61 (308)
Q Consensus 31 e~~~R~~iF~~N~~~~g~N~fsDlt~eEf~~ 61 (308)
--.+||.+|..|.+...--+|++||.+.-.-
T Consensus 19 vpiKrfN~F~Dn~rla~qhHF~~LSn~Qq~y 49 (101)
T PF11567_consen 19 VPIKRFNIFMDNARLAAQHHFSNLSNEQQKY 49 (101)
T ss_dssp --HHHHHHHHHHHHHHHHHHHHHS-HHHHHH
T ss_pred ccHHHHHHHHHHHHHHHHHHHHhcCcHHHHH
Confidence 4578999999999984445788888776544
No 32
>PF05391 Lsm_interact: Lsm interaction motif; InterPro: IPR008669 This short motif is found at the C terminus of Prp24 proteins and probably interacts with the Lsm proteins to promote U4/U6 formation [].
Probab=32.39 E-value=34 Score=18.41 Aligned_cols=12 Identities=8% Similarity=0.252 Sum_probs=9.6
Q ss_pred CCCHHHHHhhhc
Q 044448 53 DLTREKFLASYT 64 (308)
Q Consensus 53 Dlt~eEf~~~~~ 64 (308)
-+++++|+++++
T Consensus 9 p~SNddFrkmfl 20 (21)
T PF05391_consen 9 PKSNDDFRKMFL 20 (21)
T ss_pred ccchHHHHHHHc
Confidence 478899998876
No 33
>KOG4702 consensus Uncharacterized conserved protein [Function unknown]
Probab=26.33 E-value=1.7e+02 Score=20.94 Aligned_cols=33 Identities=12% Similarity=0.149 Sum_probs=24.5
Q ss_pred HHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH
Q 044448 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF 45 (308)
Q Consensus 12 ~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~ 45 (308)
-..|++|+..|++.-.++ |...|..-|.+-++.
T Consensus 28 pe~Fee~v~~~krel~pp-e~~~~~EE~~~~lRe 60 (77)
T KOG4702|consen 28 PEIFEEFVRGYKRELSPP-EATKRKEEYENFLRE 60 (77)
T ss_pred hHHHHHHHHhccccCCCh-HHHhhHHHHHHHHHH
Confidence 357999999999987665 666677666665553
No 34
>PF01640 Peptidase_C10: Peptidase C10 family classification.; InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belongs to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1) [, ]. The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site [, ]. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family [].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=25.24 E-value=2.7e+02 Score=23.66 Aligned_cols=49 Identities=24% Similarity=0.620 Sum_probs=28.8
Q ss_pred HHHHHHHhc-CCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcC--CCceEE
Q 044448 212 EGLQDVVSR-QPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD--EGGSMR 287 (308)
Q Consensus 212 ~~lk~~l~~-gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WG--e~Gy~~ 287 (308)
+.|+..|.+ .||.+.-.. . ..+||.+|=||.. ..|+-+ -|| || .+||++
T Consensus 141 ~~i~~el~~~rPV~~~g~~-----------~-----~~GHawViDGy~~-------~~~~H~--NwG--W~G~~nGyy~ 192 (192)
T PF01640_consen 141 DMIRNELDNGRPVLYSGNS-----------K-----SGGHAWVIDGYDS-------DGYFHC--NWG--WGGSSNGYYR 192 (192)
T ss_dssp HHHHHHHHTT--EEEEEEE-----------T-----TEEEEEEEEEEES-------SSEEEE--E-S--STTTT-EEEE
T ss_pred HHHHHHHHcCCCEEEEEec-----------C-----CCCeEEEEcCccC-------CCeEEE--eeC--ccCCCCCccC
Confidence 456777775 699754321 0 1299999999954 357766 455 54 568875
No 35
>KOG4621 consensus Uncharacterized conserved protein [Function unknown]
Probab=25.19 E-value=1.7e+02 Score=23.57 Aligned_cols=51 Identities=16% Similarity=0.284 Sum_probs=32.7
Q ss_pred CHHHHHHHHhcC-CeEEEEecC-----ccc--ccCCceEeCC-----CCC--CCCeEEEEEEeCCc
Q 044448 210 TEEGLQDVVSRQ-PVSVAIDAT-----WFN--FYHGGVFTGP-----CGN--TPNHGVTIVGYGTT 260 (308)
Q Consensus 210 ~~~~lk~~l~~g-PV~v~i~~~-----~f~--~y~~Giy~~~-----c~~--~~~Hav~iVGyg~~ 260 (308)
++.+|...|+.| -|++.+.-. ++- --+++.+.+. |.+ ..+|-|+|-||+-.
T Consensus 58 Si~dIqahLaqGnhiAIaLVdq~~Lhcdlceeplk~ccfspnghhcfcrtp~YqGHfiVi~GYd~a 123 (167)
T KOG4621|consen 58 SIHDIQAHLAQGNHIAIALVDQDKLHCDLCEEPLKSCCFSPNGHHCFCRTPCYQGHFIVICGYDAA 123 (167)
T ss_pred eHHHHHHHHhcCCeEEEEEecCCceehHHHHhHHHHhccCCCCccccccCCcccccEEEEeccccc
Confidence 478899999987 676655432 221 2244555432 333 37899999999875
No 36
>PF08664 YcbB: YcbB domain; InterPro: IPR013972 YcbB is a DNA-binding protein [].
Probab=23.10 E-value=1.5e+02 Score=23.98 Aligned_cols=56 Identities=16% Similarity=0.187 Sum_probs=39.0
Q ss_pred hHHHHHHHHHHHHh-------CCccCCHHHHHHHHHHHH--HHHHHhcCCCCCCCCHHHHHhhhcC
Q 044448 9 GNIAAKHEQWMVEF-------ARTYKDQAEKEMRFKIFK--KNHEFLRLNKFADLTREKFLASYTG 65 (308)
Q Consensus 9 ~~~~~~f~~~~~~~-------~k~Y~~~~e~~~R~~iF~--~N~~~~g~N~fsDlt~eEf~~~~~~ 65 (308)
-.+...|.+..... .|.=+. .|...|+.|++ .|+..||+.-|++-.-+|+...+-.
T Consensus 40 ~~Lk~~f~~~~~~~~~~~~~~~~e~Ka-~EQRIRRai~~al~nlAsLGl~Dy~N~~Fe~YA~~lFd 104 (134)
T PF08664_consen 40 PSLKEIFEELAQKKLASDEEIEKEKKA-IEQRIRRAIKQALTNLASLGLEDYSNPIFEEYASRLFD 104 (134)
T ss_pred CcHHHHHHHHHHhhccchhhhhHHHHH-HHHHHHHHHHHHHHHHHHhCCcccCChHHHHHHHHcCC
Confidence 34677788777666 333332 36677888884 6777799998888888887766543
No 37
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=20.98 E-value=1.1e+02 Score=28.21 Aligned_cols=40 Identities=20% Similarity=0.542 Sum_probs=0.0
Q ss_pred CCeEEEEEEeCCcCCCCCCCCeEEEecCCC------------CCc--------------CCCceEEEE
Q 044448 248 PNHGVTIVGYGTTTEAEGQQPYWLVKNRWG------------TNW--------------DEGGSMRIF 289 (308)
Q Consensus 248 ~~Hav~iVGyg~~~~~~~g~~ywivkNSWG------------~~W--------------Ge~Gy~~i~ 289 (308)
.+||=.|++.-.... . +.+...+||-|| +.| .++|-|+|+
T Consensus 235 ~~HaY~Vl~~~~~~~-~-~~~lv~lrNPWg~~~w~G~ws~~~~~w~~~~~~~~~~~~~~~~dG~Fwm~ 300 (315)
T cd00044 235 KGHAYSVLDVREVQE-E-GLRLLRLRNPWGVGEWWGGWSDDSSEWWVIDAERKKLLLSGKDDGEFWMS 300 (315)
T ss_pred cCcceEEeEEEEEcc-C-ceEEEEecCCccCCCccCCCCCCCchhccChHHHHHhcCCCCCCCEEEEE
No 38
>PF07351 DUF1480: Protein of unknown function (DUF1480); InterPro: IPR009950 This family consists of several hypothetical Enterobacterial proteins of around 80 residues in length. The function of this family is unknown.
Probab=20.34 E-value=1.2e+02 Score=22.04 Aligned_cols=23 Identities=22% Similarity=0.554 Sum_probs=18.1
Q ss_pred eEeCCCCCCCCeEEEEEEeCCcC
Q 044448 239 VFTGPCGNTPNHGVTIVGYGTTT 261 (308)
Q Consensus 239 iy~~~c~~~~~Hav~iVGyg~~~ 261 (308)
....||.++.+-+|-|=||+.+.
T Consensus 28 tlsIPCksdpdlcmQLDgWDe~T 50 (80)
T PF07351_consen 28 TLSIPCKSDPDLCMQLDGWDEHT 50 (80)
T ss_pred eEEeecCCChhheeEecccccCC
Confidence 34457988888999999998764
Done!