Query psy15348
Match_columns 298
No_of_seqs 175 out of 1300
Neff 7.8
Searched_HMMs 46136
Date Fri Aug 16 18:37:00 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15348.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15348hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1542|consensus 100.0 5.3E-55 1.1E-59 392.1 11.7 206 6-260 160-370 (372)
2 cd02621 Peptidase_C1A_Cathepsi 100.0 7.4E-54 1.6E-58 383.3 19.0 224 6-261 4-242 (243)
3 cd02620 Peptidase_C1A_Cathepsi 100.0 1.8E-53 3.9E-58 379.2 19.2 230 5-258 2-235 (236)
4 KOG1543|consensus 100.0 1.2E-53 2.5E-58 395.3 17.1 213 5-262 111-324 (325)
5 cd02698 Peptidase_C1A_Cathepsi 100.0 1.5E-52 3.2E-57 374.0 18.9 220 6-261 4-238 (239)
6 PTZ00203 cathepsin L protease; 100.0 7.3E-52 1.6E-56 386.0 19.7 208 6-260 129-339 (348)
7 cd02248 Peptidase_C1A Peptidas 100.0 1.1E-51 2.5E-56 360.5 19.0 207 6-258 3-209 (210)
8 PTZ00049 cathepsin C-like prot 100.0 9.7E-51 2.1E-55 397.8 19.1 234 6-268 384-683 (693)
9 PTZ00021 falcipain-2; Provisio 100.0 1.3E-50 2.8E-55 388.7 17.1 205 6-261 269-488 (489)
10 PTZ00462 Serine-repeat antigen 100.0 4.5E-50 9.7E-55 403.1 19.9 252 8-294 534-813 (1004)
11 PTZ00200 cysteine proteinase; 100.0 4E-50 8.7E-55 383.8 18.2 203 6-261 237-445 (448)
12 PTZ00364 dipeptidyl-peptidase 100.0 2.2E-49 4.8E-54 384.2 19.1 227 6-262 208-460 (548)
13 PF00112 Peptidase_C1: Papain 100.0 1.7E-49 3.8E-54 347.5 14.1 213 5-260 3-219 (219)
14 smart00645 Pept_C1 Papain fami 100.0 2.6E-45 5.6E-50 312.3 17.9 165 6-257 4-171 (174)
15 cd02619 Peptidase_C1 C1 Peptid 100.0 2.3E-42 4.9E-47 303.0 17.1 196 19-247 9-213 (223)
16 KOG1544|consensus 100.0 8.8E-43 1.9E-47 309.7 3.4 218 16-263 220-462 (470)
17 COG4870 Cysteine protease [Pos 99.9 1.7E-24 3.8E-29 197.2 5.1 193 18-247 110-314 (372)
18 cd00585 Peptidase_C1B Peptidas 99.8 1.5E-20 3.2E-25 179.5 11.3 75 165-246 303-399 (437)
19 PF03051 Peptidase_C1_2: Pepti 99.4 1E-12 2.2E-17 125.9 10.0 80 20-103 56-157 (438)
20 COG3579 PepC Aminopeptidase C 98.2 6.9E-06 1.5E-10 75.1 7.9 40 206-245 360-401 (444)
21 PF05543 Peptidase_C47: Stapho 94.4 0.19 4.1E-06 42.4 7.2 132 22-247 16-155 (175)
22 PF13529 Peptidase_C39_2: Pept 92.7 0.12 2.7E-06 40.9 3.4 54 162-231 91-144 (144)
23 KOG4128|consensus 82.9 0.14 3E-06 47.3 -2.8 80 19-102 62-165 (457)
24 PF12385 Peptidase_C70: Papain 79.2 16 0.00035 30.5 8.2 36 162-219 100-135 (166)
25 KOG4128|consensus 77.1 2.8 6E-05 39.1 3.5 39 207-245 371-413 (457)
26 PF14399 Transpep_BrtH: NlpC/p 72.9 4.7 0.0001 36.9 4.1 54 162-229 80-133 (317)
27 cd00044 CysPc Calpains, domain 60.3 14 0.0003 34.2 4.5 28 206-233 234-263 (315)
28 PF09778 Guanylate_cyc_2: Guan 59.5 15 0.00033 32.2 4.3 58 162-229 115-180 (212)
29 cd02549 Peptidase_C39A A sub-f 36.9 52 0.0011 25.7 3.9 43 165-231 72-114 (141)
30 smart00230 CysPc Calpain-like 26.6 1.1E+02 0.0023 28.4 4.6 28 206-233 226-255 (318)
31 COG4990 Uncharacterized protei 26.4 45 0.00097 28.5 1.8 44 162-232 125-168 (195)
No 1
>KOG1542|consensus
Probab=100.00 E-value=5.3e-55 Score=392.13 Aligned_cols=206 Identities=20% Similarity=0.267 Sum_probs=173.5
Q ss_pred ccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCcH
Q psy15348 6 SSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~~ 85 (298)
+=+|||+ -.++||||||+||||||||+++++|.+++|++++ .+.||+|+|+||.. ++.||+||.+
T Consensus 160 ~fDWR~k----gaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~-----LvsLSEQeLvDCD~------~d~gC~GGl~ 224 (372)
T KOG1542|consen 160 SFDWRDK----GAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGK-----LVSLSEQELVDCDS------CDNGCNGGLM 224 (372)
T ss_pred ccchhcc----CCccccccCCcCcchhhhhhhhhhhhHHHhhcCc-----ccccchhhhhcccC------cCCcCCCCCh
Confidence 3456665 4788999999999999999999999999999998 89999999999996 4899999999
Q ss_pred HHHHHHHHH-cCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCC-CCCCCCCCCCccceeeccCCccceeecccccccccc
Q psy15348 86 SSTWAWVHK-RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP-ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163 (298)
Q Consensus 86 ~~a~~~~~~-~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~-~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 163 (298)
..||+|+++ .||..|.+| ||+ ++.+ .|.... ......+ .+...+. ..+++|
T Consensus 225 ~nA~~~~~~~gGL~~E~dY-------PY~----------g~~~~~C~~~~---~~~~v~I-~~f~~l~------~nE~~i 277 (372)
T KOG1542|consen 225 DNAFKYIKKAGGLEKEKDY-------PYT----------GKKGNQCHFDK---SKIVVSI-KDFSMLS------NNEDQI 277 (372)
T ss_pred hHHHHHHHHhCCccccccC-------Ccc----------ccCCCccccch---hhceEEE-eccEecC------CCHHHH
Confidence 999999555 589999999 999 4555 898771 2233333 2222222 122267
Q ss_pred ccccccCCceeEEEeccccccCCCCccCCCCceEEe--cCCcccccCeEEEEEeeeccC-CeeEEEEEcCCCCCcCCCce
Q psy15348 164 LYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV--SASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGT 240 (298)
Q Consensus 164 ~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~--~~~~~~~~~HaV~IVGyg~~~-g~~yWivkNSWG~~WGe~Gy 240 (298)
...|..+|||+|+| ++ ..++.|++| |..+ ..|.+..++|+|+|||||.+. .++|||||||||+.|||+||
T Consensus 278 a~wLv~~GPi~vgi-----Na-~~mQ~YrgG-V~~P~~~~Cs~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY 350 (372)
T KOG1542|consen 278 AAWLVTFGPLSVGI-----NA-KPMQFYRGG-VSCPSKYICSPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGY 350 (372)
T ss_pred HHHHHhcCCeEEEE-----ch-HHHHHhccc-ccCCCcccCCccccCceEEEEeecCCCCCCceEEEECCccccccccce
Confidence 77888999999999 85 578999999 9988 568877899999999999998 89999999999999999999
Q ss_pred EEEEecCCcccccceeeeee
Q psy15348 241 IKILRGRNEAIIESLVNGAL 260 (298)
Q Consensus 241 ~~i~rg~n~cgie~~~~~~~ 260 (298)
|||.||.|.|||++++..+.
T Consensus 351 ~~l~RG~N~CGi~~mvss~~ 370 (372)
T KOG1542|consen 351 YKLCRGSNACGIADMVSSAA 370 (372)
T ss_pred EEEeccccccccccchhhhh
Confidence 99999999999999988765
No 2
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00 E-value=7.4e-54 Score=383.34 Aligned_cols=224 Identities=24% Similarity=0.284 Sum_probs=173.6
Q ss_pred ccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCccc-ccccccCHHHHhhccCcccccCCCCCCCCCc
Q psy15348 6 SSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVE-CTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~-~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~ 84 (298)
+-+||+..-...+++||+||+.||+|||||++++||++++|++++..+ ...+.||+|+|++|+.. +.||+||+
T Consensus 4 ~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~------~~GC~GG~ 77 (243)
T cd02621 4 SFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQY------SQGCDGGF 77 (243)
T ss_pred cccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCC------CCCCCCCC
Confidence 456777542345789999999999999999999999999998775110 12588999999999864 68999999
Q ss_pred HHHHHHHHHHcCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeeccccccccccc
Q psy15348 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGL 164 (298)
Q Consensus 85 ~~~a~~~~~~~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 164 (298)
+..|++|++++|+++|++| ||.. .....|+... .... .... .....+... ......++|+
T Consensus 78 ~~~a~~~~~~~Gi~~e~~y-------PY~~---------~~~~~C~~~~-~~~~-~~~~-~~~~~i~~~-~~~~~~~~ik 137 (243)
T cd02621 78 PFLVGKFAEDFGIVTEDYF-------PYTA---------DDDRPCKASP-SECR-RYYF-SDYNYVGGC-YGCTNEDEMK 137 (243)
T ss_pred HHHHHHHHHhcCcCCCcee-------CCCC---------CCCCCCCCCc-cccc-cccc-cceeEcccc-cccCCHHHHH
Confidence 9999999999999997777 9983 1456776541 0000 0000 011111100 0011123688
Q ss_pred cccccCCceeEEEeccccccCCCCccCCCCceEEecC----Ccc--------cccCeEEEEEeeeccC--CeeEEEEEcC
Q psy15348 165 YFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA----SAE--------IVAYATVKIVGWGEEN--GRPYWTIVST 230 (298)
Q Consensus 165 ~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~----~~~--------~~~~HaV~IVGyg~~~--g~~yWivkNS 230 (298)
.+|+++|||+++| .+.++|++|++| ||..+. |.. ..++|||+|||||+++ +.+|||||||
T Consensus 138 ~~i~~~GPv~v~~-----~~~~~F~~Y~~G-Iy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiirNS 211 (243)
T cd02621 138 WEIYRNGPIVVAF-----EVYSDFDFYKEG-VYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNS 211 (243)
T ss_pred HHHHHcCCEEEEE-----EecccccccCCe-EECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEEEcC
Confidence 8999999999999 988899999999 998864 532 2579999999999886 8999999999
Q ss_pred CCCCcCCCceEEEEecCCcccccceeeeeec
Q psy15348 231 FGEQFGDKGTIKILRGRNEAIIESLVNGALP 261 (298)
Q Consensus 231 WG~~WGe~Gy~~i~rg~n~cgie~~~~~~~p 261 (298)
||++||++|||||+|+.|.|||++++++++|
T Consensus 212 WG~~WGe~Gy~~i~~~~~~cgi~~~~~~~~~ 242 (243)
T cd02621 212 WGSSWGEKGYFKIRRGTNECGIESQAVFAYP 242 (243)
T ss_pred CCCCCCcCCeEEEecCCcccCcccceEeecc
Confidence 9999999999999999999999999999987
No 3
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00 E-value=1.8e-53 Score=379.19 Aligned_cols=230 Identities=28% Similarity=0.438 Sum_probs=170.2
Q ss_pred CccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCc
Q psy15348 5 TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84 (298)
Q Consensus 5 ~~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~ 84 (298)
.|.+||++.-....++||+|||.||+|||||++++||++++|+.+... .+.||+|+|+||+.. .+.||+||+
T Consensus 2 ~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~---~~~LS~Q~lidC~~~-----~~~gC~GG~ 73 (236)
T cd02620 2 ESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKE---NVLLSAQDLLSCCSG-----CGDGCNGGY 73 (236)
T ss_pred CcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCC---ccccCHHHHHhhcCC-----CCCCCCCCC
Confidence 466788863111124799999999999999999999999999887322 689999999999875 368999999
Q ss_pred HHHHHHHHHHcCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeec----cCCccceeeccccccc
Q psy15348 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT----NDNYGRGFFQDKYQIN 160 (298)
Q Consensus 85 ~~~a~~~~~~~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 160 (298)
+..||+|++++|+++|++| ||....+..+.. ....|.......+.|..... ...+.+..........
T Consensus 74 ~~~a~~~i~~~G~~~e~~y-------PY~~~~~~~~~~--~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~ 144 (236)
T cd02620 74 PDAAWKYLTTTGVVTGGCQ-------PYTIPPCGHHPE--GPPPCCGTPYCTPKCQDGCEKTYEEDKHKGKSAYSVPSDE 144 (236)
T ss_pred HHHHHHHHHhcCCCcCCEe-------cCcCCCCccCCC--CCCCCCCCCCCCCCCCcCCccccceeeeeecceeeeCCHH
Confidence 9999999999999998777 998544322110 00112211001111211100 0001110000011112
Q ss_pred cccccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEcCCCCCcCCCce
Q psy15348 161 GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240 (298)
Q Consensus 161 e~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkNSWG~~WGe~Gy 240 (298)
+++|.+|.++|||+++| .+.++|+.|++| ||+.+ |....++|||+|||||++++++|||||||||+.|||+||
T Consensus 145 ~~ik~~l~~~GPv~v~i-----~~~~~f~~Y~~G-iy~~~-~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy 217 (236)
T cd02620 145 TDIMKEIMTNGPVQAAF-----TVYEDFLYYKSG-VYQHT-SGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGY 217 (236)
T ss_pred HHHHHHHHHCCCeEEEE-----EechhhhhcCCc-EEeec-CCCCcCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcE
Confidence 36888999999999999 988899999999 99876 455668999999999999999999999999999999999
Q ss_pred EEEEecCCcccccceeee
Q psy15348 241 IKILRGRNEAIIESLVNG 258 (298)
Q Consensus 241 ~~i~rg~n~cgie~~~~~ 258 (298)
|||+|+.|.|+|++.++.
T Consensus 218 ~ri~~~~~~cgi~~~~~~ 235 (236)
T cd02620 218 FRILRGSNECGIESEVVA 235 (236)
T ss_pred EEEEccCcccccccceec
Confidence 999999999999999875
No 4
>KOG1543|consensus
Probab=100.00 E-value=1.2e-53 Score=395.26 Aligned_cols=213 Identities=22% Similarity=0.233 Sum_probs=179.7
Q ss_pred CccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCc
Q psy15348 5 TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84 (298)
Q Consensus 5 ~~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~ 84 (298)
.|.+||++. .+++|||||+.||||||||++++||++++|++++ . .+.||+|+|+||... ++.||.||.
T Consensus 111 ~s~DwR~~~---~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~-~---l~sLSeq~lvdC~~~-----~~~GC~GG~ 178 (325)
T KOG1543|consen 111 DSFDWRDKG---AVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGG-K---LLSLSEQDLVDCCGE-----CGDGCNGGE 178 (325)
T ss_pred CCccccccC---CcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCC-c---cCccChhhhhhccCC-----CCCCcCCCC
Confidence 466777776 7888999999999999999999999999999994 2 799999999999986 578999999
Q ss_pred HHHHHHHHHHcCcCC-CCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeecccccccccc
Q psy15348 85 SSSTWAWVHKRGLVT-GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163 (298)
Q Consensus 85 ~~~a~~~~~~~Gl~~-e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 163 (298)
+..|++|++++|+++ +.+| ||. .....|..... ...... ...+.++ ...+++
T Consensus 179 ~~~A~~yi~~~G~~t~~~~Y-------py~----------~~~~~C~~~~~---~~~~~~-~~~~~~~------~~e~~i 231 (325)
T KOG1543|consen 179 PKNAFKYIKKNGGVTECENY-------PYI----------GKDGTCKSNKK---DKTVTI-KGFYNVP------ANEEAI 231 (325)
T ss_pred HHHHHHHHHHhCCCCCCcCC-------CCc----------CCCCCccCCCc---cceeEe-eeeeecC------cCHHHH
Confidence 999999999999888 8888 998 55678887721 111111 1222222 113368
Q ss_pred ccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEcCCCCCcCCCceEEE
Q psy15348 164 LYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243 (298)
Q Consensus 164 ~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkNSWG~~WGe~Gy~~i 243 (298)
+.+|+.+|||+++| .+..+|+.|++| ||.++.|.....+|||+|||||+.++.+|||||||||+.|||+|||||
T Consensus 232 ~~~v~~~GPv~v~~-----~a~~~F~~Y~~G-Vy~~~~~~~~~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri 305 (325)
T KOG1543|consen 232 AEAVAKNGPVSVAI-----DAYEDFSLYKGG-VYAEEKGDDKEGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRI 305 (325)
T ss_pred HHHHHhcCCeEEEE-----eehhhhhhccCc-eEeCCCCCCCCCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEE
Confidence 99999999999999 999999999999 999998654469999999999996679999999999999999999999
Q ss_pred EecCCcccccceeeeeecc
Q psy15348 244 LRGRNEAIIESLVNGALPK 262 (298)
Q Consensus 244 ~rg~n~cgie~~~~~~~p~ 262 (298)
.|+++.|+|++.+++..|+
T Consensus 306 ~r~~~~~~I~~~~~~~p~~ 324 (325)
T KOG1543|consen 306 ARGVNKCGIASEASYGPIK 324 (325)
T ss_pred ecCCCchhhhcccccCCCC
Confidence 9999999999999885543
No 5
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00 E-value=1.5e-52 Score=373.98 Aligned_cols=220 Identities=20% Similarity=0.286 Sum_probs=171.2
Q ss_pred ccccCCcCCcccccCCCCCCC---CCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCC
Q psy15348 6 SSRIRDMSYGATVYNRRPYAL---SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg---~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~G 82 (298)
+.+||+.. ....++|||||| .||||||||++++||++++|+.+... +.+.||+|+|+||+. +.||+|
T Consensus 4 ~~Dwr~~~-~~~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~--~~~~lS~Q~lldC~~-------~~gC~G 73 (239)
T cd02698 4 SWDWRNVN-GVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAW--PSVYLSVQVVIDCAG-------GGSCHG 73 (239)
T ss_pred CcccccCC-CCcccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCC--CCcccCHHHHHhCCC-------CCCccC
Confidence 45677754 445788999998 89999999999999999999876432 157899999999985 479999
Q ss_pred CcHHHHHHHHHHcCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccc--eeec----cCCccceeeccc
Q psy15348 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCH--TRCT----NDNYGRGFFQDK 156 (298)
Q Consensus 83 G~~~~a~~~~~~~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~--~~~~----~~~~~~~~~~~~ 156 (298)
|++..|++|++++|+++|.+| ||.. ....|+.. .....|. ..|. ...+.+.....
T Consensus 74 G~~~~a~~~~~~~Gl~~e~~y-------PY~~----------~~~~C~~~-~~~~~c~~~~~c~~~~~~~~~~i~~~~~- 134 (239)
T cd02698 74 GDPGGVYEYAHKHGIPDETCN-------PYQA----------KDGECNPF-NRCGTCNPFGECFAIKNYTLYFVSDYGS- 134 (239)
T ss_pred cCHHHHHHHHHHcCcCCCCee-------CCcC----------CCCCCcCC-CCCCCcccCcccccccccceEEeeecee-
Confidence 999999999999999997777 9983 33344321 0000110 0110 00111110000
Q ss_pred cccccccccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccC-CeeEEEEEcCCCCCc
Q psy15348 157 YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQF 235 (298)
Q Consensus 157 ~~~~e~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~-g~~yWivkNSWG~~W 235 (298)
...++++|.+|.++|||+++| .+.++|+.|++| ||+...| ...++|||+|||||+++ +++|||||||||++|
T Consensus 135 ~~~~~~i~~~l~~~GPV~v~i-----~~~~~f~~Y~~G-Iy~~~~~-~~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~W 207 (239)
T cd02698 135 VSGRDKMMAEIYARGPISCGI-----MATEALENYTGG-VYKEYVQ-DPLINHIISVAGWGVDENGVEYWIVRNSWGEPW 207 (239)
T ss_pred cCCHHHHHHHHHHcCCEEEEE-----EecccccccCCe-EEccCCC-CCcCCeEEEEEEEEecCCCCEEEEEEcCCCccc
Confidence 012336888899999999999 998899999999 9988765 34589999999999886 999999999999999
Q ss_pred CCCceEEEEecC-----Ccccccceeeeeec
Q psy15348 236 GDKGTIKILRGR-----NEAIIESLVNGALP 261 (298)
Q Consensus 236 Ge~Gy~~i~rg~-----n~cgie~~~~~~~p 261 (298)
|++|||||+|+. |+|+||+.+++++|
T Consensus 208 Ge~Gy~~i~rg~~~~~~~~~~i~~~~~~~~~ 238 (239)
T cd02698 208 GERGWFRIVTSSYKGARYNLAIEEDCAWADP 238 (239)
T ss_pred CcCceEEEEccCCcccccccccccceEEEee
Confidence 999999999999 99999999999998
No 6
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=7.3e-52 Score=385.97 Aligned_cols=208 Identities=17% Similarity=0.249 Sum_probs=164.1
Q ss_pred ccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCcH
Q psy15348 6 SSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~~ 85 (298)
|.+||++ ..++||||||.||||||||++++||++++|++++ .+.||+|+|+||+.. +.||+||.+
T Consensus 129 ~~DWR~~----g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~-----~~~LSeQqLvdC~~~------~~GC~GG~~ 193 (348)
T PTZ00203 129 AVDWREK----GAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHK-----LVRLSEQQLVSCDHV------DNGCGGGLM 193 (348)
T ss_pred CCcCCcC----CCCCCccccCCCccHHHHhhHHHHHHHHHHhcCC-----CccCCHHHHHhccCC------CCCCCCCCH
Confidence 4555554 3578999999999999999999999999999887 689999999999864 689999999
Q ss_pred HHHHHHHHHc---CcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeeccccccccc
Q psy15348 86 SSTWAWVHKR---GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGL 162 (298)
Q Consensus 86 ~~a~~~~~~~---Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 162 (298)
..||+|++++ |+++|++| ||.... .....|... .. ......+ ..+.. .....+.
T Consensus 194 ~~a~~yi~~~~~ggi~~e~~Y-------PY~~~~-------~~~~~C~~~-~~-~~~~~~i--~~~~~-----i~~~e~~ 250 (348)
T PTZ00203 194 LQAFEWVLRNMNGTVFTEKSY-------PYVSGN-------GDVPECSNS-SE-LAPGARI--DGYVS-----MESSERV 250 (348)
T ss_pred HHHHHHHHHhcCCCCCccccC-------CCccCC-------CCCCcCCCC-cc-cccceEe--cceee-----cCcCHHH
Confidence 9999999864 58898888 998321 112256532 10 0001111 11211 0111225
Q ss_pred cccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEcCCCCCcCCCceEE
Q psy15348 163 GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242 (298)
Q Consensus 163 ~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkNSWG~~WGe~Gy~~ 242 (298)
++.+|..+|||+++| ++. +|++|++| ||.. |.....+|||+|||||+++|++|||||||||++||++||||
T Consensus 251 ~~~~l~~~GPv~v~i-----~a~-~f~~Y~~G-Iy~~--c~~~~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~r 321 (348)
T PTZ00203 251 MAAWLAKNGPISIAV-----DAS-SFMSYHSG-VLTS--CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVR 321 (348)
T ss_pred HHHHHHhCCCEEEEE-----Ehh-hhcCccCc-eeec--cCCCCCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEE
Confidence 778888899999999 884 89999999 9974 55556799999999999999999999999999999999999
Q ss_pred EEecCCcccccceeeeee
Q psy15348 243 ILRGRNEAIIESLVNGAL 260 (298)
Q Consensus 243 i~rg~n~cgie~~~~~~~ 260 (298)
|+|+.|.|||+++++.+.
T Consensus 322 i~rg~n~Cgi~~~~~~~~ 339 (348)
T PTZ00203 322 VTMGVNACLLTGYPVSVH 339 (348)
T ss_pred EEcCCCcccccceEEEEe
Confidence 999999999999988875
No 7
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00 E-value=1.1e-51 Score=360.47 Aligned_cols=207 Identities=21% Similarity=0.276 Sum_probs=170.7
Q ss_pred ccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCcH
Q psy15348 6 SSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~~ 85 (298)
+.+||+... ++||+|||.||+|||||++++||++++++++. ..+||+|+|++|... .+.+|.||.+
T Consensus 3 ~~d~r~~~~----~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~~-----~~~lS~q~l~~c~~~-----~~~gC~GG~~ 68 (210)
T cd02248 3 SVDWREKGA----VTPVKDQGSCGSCWAFSTVGALEGAYAIKTGK-----LVSLSEQQLVDCSTS-----GNNGCNGGNP 68 (210)
T ss_pred cccCCcCCC----CCCCccCCCCcchHHhHHHHHHHHHHHHHcCC-----CcccCHHHHhccCCC-----CCCCCCCCCH
Confidence 456777643 78999999999999999999999999998885 689999999999874 3689999999
Q ss_pred HHHHHHHHHcCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeecccccccccccc
Q psy15348 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLY 165 (298)
Q Consensus 86 ~~a~~~~~~~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 165 (298)
..|+++++++|+++|++| ||. .....|+... . ....++ .....+. . ...+++|.
T Consensus 69 ~~a~~~~~~~Gi~~e~~y-------PY~----------~~~~~C~~~~-~--~~~~~i-~~~~~i~---~--~~~~~ik~ 122 (210)
T cd02248 69 DNAFEYVKNGGLASESDY-------PYT----------GKDGTCKYNS-S--KVGAKI-TGYSNVP---P--GDEEALKA 122 (210)
T ss_pred HHhHHHHHHCCcCccccC-------Ccc----------CCCCCccCCC-C--cccEEE-eeEEEcC---C--CcHHHHHH
Confidence 999999999999998888 998 4456776541 1 111111 1111111 0 01236899
Q ss_pred ccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEcCCCCCcCCCceEEEEe
Q psy15348 166 FDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245 (298)
Q Consensus 166 ~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkNSWG~~WGe~Gy~~i~r 245 (298)
+|+.+|||+++| .+.++|+.|++| ||..+.|....++|||+|||||++.+.+|||||||||+.||++|||||+|
T Consensus 123 ~l~~~gPV~~~~-----~~~~~f~~y~~G-iy~~~~~~~~~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~ 196 (210)
T cd02248 123 ALANYGPVSVAI-----DASSSFQFYKGG-IYSGPCCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIAR 196 (210)
T ss_pred HHhhcCCEEEEE-----ecCcccccCCCC-ceeCCCCCCCcCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEEEc
Confidence 999999999999 998899999999 99998765567899999999999989999999999999999999999999
Q ss_pred cCCcccccceeee
Q psy15348 246 GRNEAIIESLVNG 258 (298)
Q Consensus 246 g~n~cgie~~~~~ 258 (298)
+.+.|+|++++.+
T Consensus 197 ~~~~cgi~~~~~~ 209 (210)
T cd02248 197 GSNLCGIASYASY 209 (210)
T ss_pred CCCccCceeeeec
Confidence 9999999987764
No 8
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00 E-value=9.7e-51 Score=397.80 Aligned_cols=234 Identities=18% Similarity=0.229 Sum_probs=172.8
Q ss_pred ccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCccc-----ccccccCHHHHhhccCcccccCCCCCC
Q psy15348 6 SSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVE-----CTSFRFIAGVKQRCAWLVSRWMTIWVC 80 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~-----~~~~~LS~q~l~~c~~~~~~~~~~~gC 80 (298)
+.+||+.-.....++||+|||.||||||||++++||+|++|+..+... .....||+|+|+||+.. +.||
T Consensus 384 sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~------nqGC 457 (693)
T PTZ00049 384 NFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFY------DQGC 457 (693)
T ss_pred CEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCC------CCCc
Confidence 445666432224578999999999999999999999999998753110 01248999999999864 7899
Q ss_pred CCCcHHHHHHHHHHcCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCC-------------------ccce
Q psy15348 81 SSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-------------------KCHT 141 (298)
Q Consensus 81 ~GG~~~~a~~~~~~~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~-------------------~~~~ 141 (298)
+||.+..|++|++++||++|..| ||.. ..+.|+......+ .+..
T Consensus 458 ~GG~~~~A~kya~~~GI~tEscY-------PY~a----------~~g~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 520 (693)
T PTZ00049 458 NGGFPYLVSKMAKLQGIPLDKVF-------PYTA----------TEQTCPYQVDQSANSMNGSANLRQINAVFFSSETQS 520 (693)
T ss_pred CCCcHHHHHHHHHHCCCCcCCcc-------CCcC----------CCCCCCCCCCCccccccccccccccccccccccccc
Confidence 99999999999999999997666 9983 3445543210000 0000
Q ss_pred eec-----------cCCcccee--eccc-----cccccccccccccCCceeEEEeccccccCCCCccCCCCceEEec---
Q psy15348 142 RCT-----------NDNYGRGF--FQDK-----YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS--- 200 (298)
Q Consensus 142 ~~~-----------~~~~~~~~--~~~~-----~~~~e~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~--- 200 (298)
.+. ...+.-.| .... ....+.||.+|+.+|||+|+| ++.++|++|++| ||..+
T Consensus 521 ~~~~~~~~~~~~~~~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsI-----da~~dF~~YksG-VY~~~~~~ 594 (693)
T PTZ00049 521 DMHADFEAPISSEPARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASF-----EASPDFYDYADG-VYYVEDFP 594 (693)
T ss_pred cccccccccccccccceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEE-----EechhhhcCCCc-cccCcccc
Confidence 000 00011011 0000 011235888999999999999 988899999999 99864
Q ss_pred ---CCccc--------------ccCeEEEEEeeecc--CCe--eEEEEEcCCCCCcCCCceEEEEecCCcccccceeeee
Q psy15348 201 ---ASAEI--------------VAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259 (298)
Q Consensus 201 ---~~~~~--------------~~~HaV~IVGyg~~--~g~--~yWivkNSWG~~WGe~Gy~~i~rg~n~cgie~~~~~~ 259 (298)
.|... ..+|||+|||||++ +|. +|||||||||+.||++|||||.||.|.||||++++++
T Consensus 595 h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~ 674 (693)
T PTZ00049 595 HARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFI 674 (693)
T ss_pred cccccCCccccccccccccccccCceEEEEEEeccccCCCcccCEEEEECCCCCCcccCceEEEEcCCCccCCccceeEE
Confidence 25321 46999999999975 463 7999999999999999999999999999999999999
Q ss_pred ecccCCCCC
Q psy15348 260 LPKDNYGVE 268 (298)
Q Consensus 260 ~p~~~~~~~ 268 (298)
+|++.++..
T Consensus 675 ~pd~~rg~~ 683 (693)
T PTZ00049 675 EPDFSRGAG 683 (693)
T ss_pred eeeccccHH
Confidence 999987654
No 9
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=1.3e-50 Score=388.73 Aligned_cols=205 Identities=17% Similarity=0.193 Sum_probs=163.6
Q ss_pred ccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCcH
Q psy15348 6 SSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~~ 85 (298)
|.+||+.. .++||+|||.||||||||++++||++++|+++. .+.||+|+|+||+.. +.||+||++
T Consensus 269 s~DWR~~g----~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~-----~v~LSeQqLVDCs~~------n~GC~GG~~ 333 (489)
T PTZ00021 269 KYDWRLHN----GVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNE-----LVSLSEQELVDCSFK------NNGCYGGLI 333 (489)
T ss_pred ccccccCC----CCCCcccccccccHHHHHHHHHHHHHHHHHcCC-----CcccCHHHHhhhccC------CCCCCCcch
Confidence 34555542 468999999999999999999999999999887 689999999999964 789999999
Q ss_pred HHHHHHHHHc-CcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeeccccccccccc
Q psy15348 86 SSTWAWVHKR-GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGL 164 (298)
Q Consensus 86 ~~a~~~~~~~-Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 164 (298)
..||+|+.+. ||++|++| ||... ..+.|... . +. ..+.+....... .+.++
T Consensus 334 ~~Af~yi~~~gGl~tE~~Y-------PY~~~---------~~~~C~~~-~----~~-----~~~~i~~y~~i~--~~~lk 385 (489)
T PTZ00021 334 PNAFEDMIELGGLCSEDDY-------PYVSD---------TPELCNID-R----CK-----EKYKIKSYVSIP--EDKFK 385 (489)
T ss_pred HhhhhhhhhccccCccccc-------CccCC---------CCCccccc-c----cc-----ccceeeeEEEec--HHHHH
Confidence 9999999776 89998888 99831 13567643 1 10 111111100111 12578
Q ss_pred cccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCC----------eeEEEEEcCCCCC
Q psy15348 165 YFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENG----------RPYWTIVSTFGEQ 234 (298)
Q Consensus 165 ~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g----------~~yWivkNSWG~~ 234 (298)
.+|..+|||+|+| .+.++|++|++| ||.. .|+. .++|||+|||||++++ .+|||||||||++
T Consensus 386 ~al~~~GPVsv~i-----~a~~~f~~YkgG-Iy~~-~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~ 457 (489)
T PTZ00021 386 EAIRFLGPISVSI-----AVSDDFAFYKGG-IFDG-ECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGES 457 (489)
T ss_pred HHHHhcCCeEEEE-----EeecccccCCCC-cCCC-CCCC-ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCC
Confidence 8898899999999 988899999999 9976 4654 4799999999997642 4799999999999
Q ss_pred cCCCceEEEEecC----Ccccccceeeeeec
Q psy15348 235 FGDKGTIKILRGR----NEAIIESLVNGALP 261 (298)
Q Consensus 235 WGe~Gy~~i~rg~----n~cgie~~~~~~~p 261 (298)
|||+|||||+|+. |.|||.+.+.+|+.
T Consensus 458 WGE~GY~rI~r~~~g~~n~CGI~t~a~yP~~ 488 (489)
T PTZ00021 458 WGEKGFIRIETDENGLMKTCSLGTEAYVPLI 488 (489)
T ss_pred cccCeEEEEEcCCCCCCCCCCCcccceeEec
Confidence 9999999999986 59999999998874
No 10
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00 E-value=4.5e-50 Score=403.05 Aligned_cols=252 Identities=15% Similarity=0.169 Sum_probs=193.5
Q ss_pred ccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCc-HH
Q psy15348 8 RIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI-SS 86 (298)
Q Consensus 8 ~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~-~~ 86 (298)
||+|++.|.+ +.+|+|||.||+|||||++++||++++|+.+. .+.||+|+|+||+... .+.||.||. +.
T Consensus 534 R~kD~~sC~s-~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~-----~v~LSeQqLVDCs~~~----gn~GC~GG~~~~ 603 (1004)
T PTZ00462 534 RLKDENNCIS-KIQIEDQGNCAISWIFASKYHLETIKCMKGYE-----PHAISALYIANCSKGE----HKDRCDEGSNPL 603 (1004)
T ss_pred ccccCCCCCC-CCCcccCCcchHHHHHHHHHHHHHHHHHhcCC-----CcccCHHHHHhccccc----CCCCCCCCCcHH
Confidence 8999999988 45599999999999999999999999999876 6899999999998642 357999997 55
Q ss_pred HHHHHHHHcC-cCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCC------C-C------ccceeeccCCcccee
Q psy15348 87 STWAWVHKRG-LVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP------Q-P------KCHTRCTNDNYGRGF 152 (298)
Q Consensus 87 ~a~~~~~~~G-l~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~------~-~------~~~~~~~~~~~~~~~ 152 (298)
.++.|++++| |++|++| ||... ...+.|+..... . . ..........|. .+
T Consensus 604 efl~yI~e~GgLptESdY-------PYt~k--------~~~g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~-~~ 667 (1004)
T PTZ00462 604 EFLQIIEDNGFLPADSNY-------LYNYT--------KVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYR-AY 667 (1004)
T ss_pred HHHHHHHHcCCCcccccC-------CCccC--------CCCCCCCCCcccccccccccccccccccccceeeccceE-Ee
Confidence 6679998885 8887777 99742 134567643100 0 0 000000001111 11
Q ss_pred ecccccc-----ccccccccccCCceeEEEeccccccCCCCccCC-CCceEEecCCcccccCeEEEEEeeecc-----CC
Q psy15348 153 FQDKYQI-----NGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT-NGRVYAVSASAEIVAYATVKIVGWGEE-----NG 221 (298)
Q Consensus 153 ~~~~~~~-----~e~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~-~G~Iy~~~~~~~~~~~HaV~IVGyg~~-----~g 221 (298)
....... .+.++.+|+.+|||+|+| ++. +|+.|. +| ||..+.|+...++|||+|||||++ ++
T Consensus 668 ~s~~~~~n~d~~i~~IK~eI~~kGPVaV~I-----dAs-df~~Y~~sG-Iyv~~~Cgs~~~nHAVlIVGYGt~in~eg~g 740 (1004)
T PTZ00462 668 ESEHFHDKMDAFIKIIKDEIMNKGSVIAYI-----KAE-NVLGYEFNG-KKVQNLCGDDTADHAVNIVGYGNYINDEDEK 740 (1004)
T ss_pred cccccccchhhHHHHHHHHHHhcCCEEEEE-----Eee-hHHhhhcCC-ccccCCCCCCcCCceEEEEEecccccccCCC
Confidence 1111011 125788899999999999 874 788885 89 988877876678999999999974 25
Q ss_pred eeEEEEEcCCCCCcCCCceEEEEe-cCCcccccceeeeeecccCCCCCcccCcccccc-cccccccCCchHHhhc
Q psy15348 222 RPYWTIVSTFGEQFGDKGTIKILR-GRNEAIIESLVNGALPKDNYGVEFGEESGERLS-EEFGVRAESSEEFREN 294 (298)
Q Consensus 222 ~~yWivkNSWG~~WGe~Gy~~i~r-g~n~cgie~~~~~~~p~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 294 (298)
++|||||||||+.||++|||||.| +.+.|+|.+.+.+++.+ ..++..+....... +.|.|++|.||+|+.|
T Consensus 741 k~YWIVRNSWGt~WGEnGYFKI~r~g~n~CGin~i~t~~~fn--~d~~~~~~~~~~~~~~~~~y~~k~spdf~~n 813 (1004)
T PTZ00462 741 KSYWIVRNSWGKYWGDEGYFKVDMYGPSHCEDNFIHSVVIFN--IDLPKNKKSPKKESFKIYDYYLKASPDFYHN 813 (1004)
T ss_pred CceEEEEcCCCCCcCCCeEEEEEeCCCCCCccchheeeeeEe--eccccccCCccccccchheeeeccChhHhhh
Confidence 799999999999999999999998 68999999999999955 46777777777777 9999999999999987
No 11
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=4e-50 Score=383.79 Aligned_cols=203 Identities=16% Similarity=0.179 Sum_probs=163.9
Q ss_pred ccccCCcCCcccccCCCCCCC-CCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCc
Q psy15348 6 SSRIRDMSYGATVYNRRPYAL-SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg-~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~ 84 (298)
+.+||+. ..++|||||| .||||||||++++||++++++.+. .+.||+|+|+||... +.||+||+
T Consensus 237 ~~DWR~~----g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~~~~-----~~~LSeQqLvDC~~~------~~GC~GG~ 301 (448)
T PTZ00200 237 GLDWRRA----DAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDK-----SVDLSEQELVNCDTK------SQGCSGGY 301 (448)
T ss_pred CccCCCC----CCCCCcccCCCccchHHHHhHHHHHHHHHHHhcCC-----CeecCHHHHhhccCc------cCCCCCCc
Confidence 4445554 3578999999 999999999999999999998776 689999999999864 68999999
Q ss_pred HHHHHHHHHHcCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeeccccccccccc
Q psy15348 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGL 164 (298)
Q Consensus 85 ~~~a~~~~~~~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 164 (298)
+..|++|++++||++|++| ||. +..+.|... . .....+ ..+.+.. ..+.++
T Consensus 302 ~~~A~~yi~~~Gi~~e~~Y-------PY~----------~~~~~C~~~-~---~~~~~i--~~y~~~~------~~~~l~ 352 (448)
T PTZ00200 302 PDTALEYVKNKGLSSSSDV-------PYL----------AKDGKCVVS-S---TKKVYI--DSYLVAK------GKDVLN 352 (448)
T ss_pred HHHHHHHHhhcCccccccC-------CCC----------CCCCCCcCC-C---CCeeEe--cceEecC------HHHHHH
Confidence 9999999999999998888 998 556788654 1 111122 2232211 112344
Q ss_pred cccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeec--cCCeeEEEEEcCCCCCcCCCceEE
Q psy15348 165 YFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE--ENGRPYWTIVSTFGEQFGDKGTIK 242 (298)
Q Consensus 165 ~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~--~~g~~yWivkNSWG~~WGe~Gy~~ 242 (298)
.++ .+|||+|+| .+..+|+.|++| ||.++ |+.. ++|||+|||||. ++|.+|||||||||++||++||||
T Consensus 353 ~~l-~~GPV~v~i-----~~~~~f~~Yk~G-Iy~~~-C~~~-~nHaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~r 423 (448)
T PTZ00200 353 KSL-VISPTVVYI-----AVSRELLKYKSG-VYNGE-CGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMR 423 (448)
T ss_pred HHH-hcCCEEEEe-----ecccccccCCCC-ccccc-cCCC-CcEEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEE
Confidence 444 689999999 988899999999 99864 6654 899999999995 468899999999999999999999
Q ss_pred EEec---CCcccccceeeeeec
Q psy15348 243 ILRG---RNEAIIESLVNGALP 261 (298)
Q Consensus 243 i~rg---~n~cgie~~~~~~~p 261 (298)
|+|+ .|.|||++.+.+|+.
T Consensus 424 i~r~~~g~n~CGI~~~~~~P~~ 445 (448)
T PTZ00200 424 LERTNEGTDKCGILTVGLTPVF 445 (448)
T ss_pred EEeCCCCCCcCCccccceeeEE
Confidence 9996 489999999988763
No 12
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00 E-value=2.2e-49 Score=384.23 Aligned_cols=227 Identities=22% Similarity=0.226 Sum_probs=170.8
Q ss_pred ccccCCcCCcccccCCCCCCCC---CccHHHHHHHHHHHHHHHHHcCCccc-ccccccCHHHHhhccCcccccCCCCCCC
Q psy15348 6 SSRIRDMSYGATVYNRRPYALS---CIEARAVATATPLAFAVCRSSKMHVE-CTSFRFIAGVKQRCAWLVSRWMTIWVCS 81 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg~---cGsCwAfA~~~~le~~~~i~~~~~~~-~~~~~LS~q~l~~c~~~~~~~~~~~gC~ 81 (298)
+.+||+.. ....++|||||+. ||||||||++++||++++|+++...+ ...+.||+|+|+||+.. +.||+
T Consensus 208 sfDWR~~g-g~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~------n~GCd 280 (548)
T PTZ00364 208 AWSWGDVG-GASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQY------GQGCA 280 (548)
T ss_pred ccccCcCC-CCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCC------CCCCC
Confidence 45666653 4457899999999 99999999999999999998864311 12578999999999864 78999
Q ss_pred CCcHHHHHHHHHHcCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccce-eeccccccc
Q psy15348 82 SGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRG-FFQDKYQIN 160 (298)
Q Consensus 82 GG~~~~a~~~~~~~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 160 (298)
||++..|++|++++||++|++| |.||.... .....|+.. . ....... .....+. +.. .....
T Consensus 281 GG~p~~A~~yi~~~GI~tE~dY-----~~PY~~~d-------g~~~~Ck~~-~--~~~~y~~-~~~~~I~gyy~-~~~~e 343 (548)
T PTZ00364 281 GGFPEEVGKFAETFGILTTDSY-----YIPYDSGD-------GVERACKTR-R--PSRRYYF-TNYGPLGGYYG-AVTDP 343 (548)
T ss_pred CCcHHHHHHHHHhCCccccccc-----CCCCCCCC-------CCCCCCCCC-c--ccceeee-eeeEEecceee-cCCcH
Confidence 9999999999999999996665 55997321 122357643 1 1101011 0111110 100 01122
Q ss_pred cccccccccCCceeEEEeccccccCCCCccCCCCceEEec--------CCc----------ccccCeEEEEEeeec-cCC
Q psy15348 161 GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS--------ASA----------EIVAYATVKIVGWGE-ENG 221 (298)
Q Consensus 161 e~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~--------~~~----------~~~~~HaV~IVGyg~-~~g 221 (298)
++++.+|+.+|||+|+| ++..+|+.|++| ||... .|. ....+|||+|||||+ ++|
T Consensus 344 ~~I~~eI~~~GPVsVaI-----da~~df~~YksG-iy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G 417 (548)
T PTZ00364 344 DEIIWEIYRHGPVPASV-----YANSDWYNCDEN-STEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENG 417 (548)
T ss_pred HHHHHHHHHcCCeEEEE-----EechHHHhcCCC-CccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCC
Confidence 36888999999999999 998899999999 98621 111 135799999999997 478
Q ss_pred eeEEEEEcCCCC--CcCCCceEEEEecCCcccccceeeeeecc
Q psy15348 222 RPYWTIVSTFGE--QFGDKGTIKILRGRNEAIIESLVNGALPK 262 (298)
Q Consensus 222 ~~yWivkNSWG~--~WGe~Gy~~i~rg~n~cgie~~~~~~~p~ 262 (298)
.+|||||||||+ +|||+|||||+||.|+||||+.++++.|.
T Consensus 418 ~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~CGIes~~v~~~~~ 460 (548)
T PTZ00364 418 GDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEVVVMYWA 460 (548)
T ss_pred ceEEEEECCCCCCCCcccCCeEEEEcCCCcccccceeeeeeee
Confidence 999999999999 99999999999999999999999988884
No 13
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00 E-value=1.7e-49 Score=347.49 Aligned_cols=213 Identities=22% Similarity=0.282 Sum_probs=166.9
Q ss_pred CccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCc
Q psy15348 5 TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84 (298)
Q Consensus 5 ~~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~ 84 (298)
.+.+||+.. ..++||+||+.||+|||||++++||++++++.. .. .++||+|+|++|... .+.+|+||+
T Consensus 3 ~~~D~r~~~---~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~-~~---~~~lS~q~l~~~~~~-----~~~~c~gg~ 70 (219)
T PF00112_consen 3 KSFDWRDKG---GRITPVRDQGSCGSCWAFAAAAALESRLAIQNN-GK---NVDLSEQYLIDCSNK-----YNKGCDGGS 70 (219)
T ss_dssp SSEEGGGTT---TCSG---BTTSSBTHHHHHHHHHHHHHHHHHHT-SS---CEEB-HHHHHHHSTG-----TSSTTBBBE
T ss_pred CCEecccCC---CCcCccccCCcccccccchhccceecccccccc-cc---ccccccccccccccc-----cccccccCc
Confidence 356677732 238899999999999999999999999999985 22 799999999999973 367999999
Q ss_pred HHHHHHHHHH-cCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCC-CCCCCCCCCCCccceeeccCCccceeeccccccccc
Q psy15348 85 SSSTWAWVHK-RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGL 162 (298)
Q Consensus 85 ~~~a~~~~~~-~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~-~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 162 (298)
+..|++++++ +|+++|.+| ||. ... ..|... . ......++ ..+... ... ..++
T Consensus 71 ~~~a~~~~~~~~Gi~~e~~~-------pY~----------~~~~~~c~~~-~-~~~~~~~i--~~~~~~--~~~--~~~~ 125 (219)
T PF00112_consen 71 PFDALKYIKNNNGIVTEEDY-------PYN----------GNENPTCKSK-K-SNSYYVKI--KGYGKV--KDN--DIED 125 (219)
T ss_dssp HHHHHHHHHHHTSBEBTTTS---------S----------SSSSCSSCHS-G-GGEEEBEE--SEEEEE--EST--CHHH
T ss_pred ccccceeecccCcccccccc-------ccc----------cccccccccc-c-cccccccc--cccccc--ccc--chhH
Confidence 9999999999 999998888 999 333 567654 1 11001111 111110 011 1236
Q ss_pred cccccccCCceeEEEeccccccCC-CCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEcCCCCCcCCCceE
Q psy15348 163 GLYFDPHFGPFWPAFWRSFCTKYT-RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241 (298)
Q Consensus 163 ~~~~l~~~GPV~v~i~~~~~~v~~-~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkNSWG~~WGe~Gy~ 241 (298)
++.+|+.+|||+++| .+.+ +|..|++| ||..+.|....++|||+|||||++.+++|||||||||++||++|||
T Consensus 126 ik~~L~~~gpV~~~~-----~~~~~~f~~~~~g-i~~~~~~~~~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~ 199 (219)
T PF00112_consen 126 IKKALMKYGPVVASI-----DVSSEDFQNYKSG-IYDPPDCSNESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYF 199 (219)
T ss_dssp HHHHHHHHSSEEEEE-----EEESHHHHTEESS-EECSTSSSSSSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEE
T ss_pred HHHHHhhCceeeeee-----eccccccccccce-eeeccccccccccccccccccccccceeeEeeehhhCCccCCCeEE
Confidence 899999999999999 9888 59999999 9999877777899999999999999999999999999999999999
Q ss_pred EEEecCC-cccccceeeeee
Q psy15348 242 KILRGRN-EAIIESLVNGAL 260 (298)
Q Consensus 242 ~i~rg~n-~cgie~~~~~~~ 260 (298)
||.|+.+ +|+|++++++|+
T Consensus 200 ~i~~~~~~~c~i~~~~~~~~ 219 (219)
T PF00112_consen 200 RISYDYNNECGIESQAVYPI 219 (219)
T ss_dssp EEESSSSSGGGTTSSEEEEE
T ss_pred EEeeCCCCcCccCceeeecC
Confidence 9999997 999999999985
No 14
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00 E-value=2.6e-45 Score=312.29 Aligned_cols=165 Identities=24% Similarity=0.371 Sum_probs=143.2
Q ss_pred ccccCCcCCcccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCcH
Q psy15348 6 SSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85 (298)
Q Consensus 6 ~~~~~~~~~~~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~~ 85 (298)
+.+||++. .++||+||+.||+|||||++++||+++++++++ .++||+|+|++|... .+.+|+||.+
T Consensus 4 ~~D~R~~~----~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~-----~~~lS~q~l~~C~~~-----~~~gC~GG~~ 69 (174)
T smart00645 4 SFDWRKKG----AVTPVKDQGQCGSCWAFSATGALEGRYCIKTGK-----LVSLSEQQLVDCSTG-----GNNGCNGGLP 69 (174)
T ss_pred cCcccccC----CCCccccCcccchHHHHHHHHHHHHHHHHhcCC-----ccccCHHHHhhhcCC-----CCCCCCCcCH
Confidence 45566654 567899999999999999999999999999887 689999999999974 3569999999
Q ss_pred HHHHHHHHHc-CcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeeccccccccccc
Q psy15348 86 SSTWAWVHKR-GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGL 164 (298)
Q Consensus 86 ~~a~~~~~~~-Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 164 (298)
..|++|++++ |+++|++| ||. .
T Consensus 70 ~~a~~~~~~~~Gi~~e~~~-------PY~-------------------------------~------------------- 92 (174)
T smart00645 70 DNAFEYIKKNGGLETESCY-------PYT-------------------------------G------------------- 92 (174)
T ss_pred HHHHHHHHHcCCccccccc-------Ccc-------------------------------c-------------------
Confidence 9999999998 99997777 997 1
Q ss_pred cccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeecc-CCeeEEEEEcCCCCCcCCCceEEE
Q psy15348 165 YFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243 (298)
Q Consensus 165 ~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~-~g~~yWivkNSWG~~WGe~Gy~~i 243 (298)
++.+ .+. +|+.|++| ||+.+.|....++|+|+|||||++ +|++|||||||||+.||++|||||
T Consensus 93 ---------~~~~-----~~~-~f~~Y~~G-i~~~~~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i 156 (174)
T smart00645 93 ---------SVAI-----DAS-DFQFYKSG-IYDHPGCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRI 156 (174)
T ss_pred ---------EEEE-----Ecc-cccCCcCe-EECCCCCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEE
Confidence 3445 443 69999999 998876665557999999999987 899999999999999999999999
Q ss_pred EecC-Ccccccceee
Q psy15348 244 LRGR-NEAIIESLVN 257 (298)
Q Consensus 244 ~rg~-n~cgie~~~~ 257 (298)
.|+. |.|+|+....
T Consensus 157 ~~~~~~~c~i~~~~~ 171 (174)
T smart00645 157 ARGKNNECGIEASVA 171 (174)
T ss_pred EcCCCCccCceeeee
Confidence 9998 9999987654
No 15
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00 E-value=2.3e-42 Score=302.99 Aligned_cols=196 Identities=16% Similarity=0.082 Sum_probs=148.9
Q ss_pred cCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCcHHHHHH-HHHHcCc
Q psy15348 19 YNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA-WVHKRGL 97 (298)
Q Consensus 19 ~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~~~~a~~-~~~~~Gl 97 (298)
++||+|||.||+|||||++++||++++++..... .++||+|+|++|...... ....+|.||.+..++. +++++|+
T Consensus 9 ~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~---~~~lS~q~l~~c~~~~~~-~~~~~c~gG~~~~~~~~~~~~~Gi 84 (223)
T cd02619 9 LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDE---YVDLSPQYLYICANDECL-GINGSCDGGGPLSALLKLVALKGI 84 (223)
T ss_pred CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcc---cccCCHHHHHHhcccccc-ccCCCCCCCcHHHHHHHHHHHcCC
Confidence 7899999999999999999999999999887212 689999999999875100 0026999999999998 8899999
Q ss_pred CCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCC-CCCCccceeeccCCccceeeccccccccccccccccCCceeEE
Q psy15348 98 VTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA-TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPA 176 (298)
Q Consensus 98 ~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~l~~~GPV~v~ 176 (298)
++|.+| ||.. ....|.... ........+. ..+.+.. . ...+.+|.+|.++|||+++
T Consensus 85 ~~e~~~-------Py~~----------~~~~~~~~~~~~~~~~~~~~--~~y~~~~--~--~~~~~ik~aL~~~gPv~~~ 141 (223)
T cd02619 85 PPEEDY-------PYGA----------ESDGEEPKSEAALNAAKVKL--KDYRRVL--K--NNIEDIKEALAKGGPVVAG 141 (223)
T ss_pred CccccC-------CCCC----------CCCCCCCCCccchhhcceee--cceeEeC--c--hhHHHHHHHHHHCCCEEEE
Confidence 998888 9983 233333210 0000011111 2222111 0 0123689999999999999
Q ss_pred EeccccccCCCCccCCCCceEE-----ecCCcccccCeEEEEEeeeccC--CeeEEEEEcCCCCCcCCCceEEEEecC
Q psy15348 177 FWRSFCTKYTRPLFQTNGRVYA-----VSASAEIVAYATVKIVGWGEEN--GRPYWTIVSTFGEQFGDKGTIKILRGR 247 (298)
Q Consensus 177 i~~~~~~v~~~f~~y~~G~Iy~-----~~~~~~~~~~HaV~IVGyg~~~--g~~yWivkNSWG~~WGe~Gy~~i~rg~ 247 (298)
| .+.+.|+.|++| ++. ...+....++|||+|||||++. +++|||||||||+.||++|||||.++.
T Consensus 142 ~-----~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~~ 213 (223)
T cd02619 142 F-----DVYSGFDRLKEG-IIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRISYED 213 (223)
T ss_pred E-----EcccchhcccCc-cccccccccccCCCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCccccCCEEEEehhh
Confidence 9 999999999999 862 3334556789999999999987 899999999999999999999999974
No 16
>KOG1544|consensus
Probab=100.00 E-value=8.8e-43 Score=309.73 Aligned_cols=218 Identities=23% Similarity=0.344 Sum_probs=172.5
Q ss_pred ccccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCCCCCCcHHHHHHHHHHc
Q psy15348 16 ATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWAWVHKR 95 (298)
Q Consensus 16 ~~~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~~~~a~~~~~~~ 95 (298)
+.++-++.|||+|++.|||+++++..+|++|.+.++. ...||+|+|++|... ...||.||++..|+=|+++.
T Consensus 220 p~liH~plDQgnCa~SWafSTaavasDRiAI~S~GR~---t~~LSpQnLlSC~~h-----~q~GC~gG~lDRAWWYlRKr 291 (470)
T KOG1544|consen 220 PNLIHEPLDQGNCAGSWAFSTAAVASDRVAIHSLGRM---TPVLSPQNLLSCDTH-----QQQGCRGGRLDRAWWYLRKR 291 (470)
T ss_pred CccccCccccCCcccceeeeeehhccceeEEeecccc---ccccChHHhcchhhh-----hhccCccCcccchheeeecc
Confidence 4578899999999999999999999999999999988 899999999999987 36899999999999999999
Q ss_pred CcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCC----CCCccceeeccCC-------ccc--eeeccccccccc
Q psy15348 96 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT----PQPKCHTRCTNDN-------YGR--GFFQDKYQINGL 162 (298)
Q Consensus 96 Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~----~~~~~~~~~~~~~-------~~~--~~~~~~~~~~e~ 162 (298)
|++. ..||||.... .+..+.|...+. .+.+..+.| +.. |.. +|. ....+++
T Consensus 292 GvVs-------dhCYP~~~dQ------~~~~~~C~m~sR~~grgkRqat~~C-Pn~~~~Sn~iyq~tPPYr--VSSnE~e 355 (470)
T KOG1544|consen 292 GVVS-------DHCYPFSGDQ------AGPAPPCMMHSRAMGRGKRQATAHC-PNSYVNSNDIYQVTPPYR--VSSNEKE 355 (470)
T ss_pred cccc-------cccccccCCC------CCCCCCceeeccccCcccccccCcC-CCcccccCceeeecCCee--ccCCHHH
Confidence 9998 6788998321 133445533210 011111222 221 221 111 1112336
Q ss_pred cccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcc-------cccCeEEEEEeeeccCC-----eeEEEEEcC
Q psy15348 163 GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE-------IVAYATVKIVGWGEENG-----RPYWTIVST 230 (298)
Q Consensus 163 ~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~-------~~~~HaV~IVGyg~~~g-----~~yWivkNS 230 (298)
||++|+++|||.+.| .|+++|++|++| ||.+.+-.. ..+.|+|.|.|||++.+ .+|||..||
T Consensus 356 ImkElM~NGPVQA~m-----~VHEDFF~YkgG-iY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANS 429 (470)
T KOG1544|consen 356 IMKELMENGPVQALM-----EVHEDFFLYKGG-IYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANS 429 (470)
T ss_pred HHHHHHhCCChhhhh-----hhhhhhhhhccc-eeeccccccCCchhhhhcccceEEEeecccccCCCCCeeEEEEeecc
Confidence 899999999999999 999999999999 999865221 35889999999998743 789999999
Q ss_pred CCCCcCCCceEEEEecCCcccccceeeeeeccc
Q psy15348 231 FGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 263 (298)
Q Consensus 231 WG~~WGe~Gy~~i~rg~n~cgie~~~~~~~p~~ 263 (298)
||+.|||+|||||.||.|+|-||+.+.+++-.+
T Consensus 430 WG~~WGE~GYFriLRGvNecdIEsfvIgAWGr~ 462 (470)
T KOG1544|consen 430 WGPAWGERGYFRILRGVNECDIESFVIGAWGRV 462 (470)
T ss_pred cccccccCceEEEeccccchhhhHhhhhhhhcc
Confidence 999999999999999999999999999998644
No 17
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.90 E-value=1.7e-24 Score=197.17 Aligned_cols=193 Identities=16% Similarity=0.035 Sum_probs=111.5
Q ss_pred ccCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhhccCcccccCCCCC-CCCCcHHHHHHHHHH-c
Q psy15348 18 VYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWV-CSSGISSSTWAWVHK-R 95 (298)
Q Consensus 18 ~~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~c~~~~~~~~~~~g-C~GG~~~~a~~~~~~-~ 95 (298)
-+++|||||.||+||||+++++||+.+.-...-.. ....+..+..+-|... +..+ -+||....+..|+.+ .
T Consensus 110 ~vs~v~dQg~~Gscwaf~t~~sles~l~~~~~w~~--s~~nm~~ll~~~ye~~-----fd~~~~d~g~~~m~~a~l~e~s 182 (372)
T COG4870 110 KVSPVKDQGSGGSCWAFATTRSLESYLNPESAWDF--SENNMKNLLGVPYEKG-----FDYTSNDGGNADMSAAYLTEWS 182 (372)
T ss_pred CcccccccCcccceEeeeehhhhhheecccccccc--cccchhhhcCCCcccc-----CCCccccCCccccccccccccC
Confidence 36789999999999999999999998765442100 0122222222222222 1111 125666555556655 4
Q ss_pred CcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeeccccccccccccccccCCceeE
Q psy15348 96 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWP 175 (298)
Q Consensus 96 Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~l~~~GPV~v 175 (298)
|.+.+.+- ||. .....|+.. .+..+....+ ..+. .......+..++.++..+|-+..
T Consensus 183 gpv~et~d-------~y~----------~~s~~~~~~-~p~~k~~~~~----~~i~-~~~~~LdnG~i~~~~~~yg~~s~ 239 (372)
T COG4870 183 GPVYETDD-------PYS----------ENSYFSPTN-LPVTKHVQEA----QIIP-SRKKYLDNGNIKAMFGFYGAVSS 239 (372)
T ss_pred CcchhhcC-------ccc----------cccccCCcC-Cchhhccccc----eecc-cchhhhcccchHHHHhhhccccc
Confidence 77766555 777 333444432 1111111111 1111 11122233357778888888876
Q ss_pred EEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeecc----------CCeeEEEEEcCCCCCcCCCceEEEEe
Q psy15348 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEE----------NGRPYWTIVSTFGEQFGDKGTIKILR 245 (298)
Q Consensus 176 ~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~----------~g~~yWivkNSWG~~WGe~Gy~~i~r 245 (298)
.||. ++...+. ...+ .+.... . ...+|||+||||++. .|.+.||||||||++||++|||||.+
T Consensus 240 ~~~i---d~~~~~~-~~~~-~~~~~s-~-~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY 312 (372)
T COG4870 240 SMYI---DATNSLG-ICIP-YPYVDS-G-ENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISY 312 (372)
T ss_pred eeEE---ecccccc-cccC-CCCCCc-c-ccccceEEEEeccccccccccccCCCCCceEEEECccccccccCceEEEEe
Confidence 6631 3332222 2233 332222 2 568999999999975 25669999999999999999999999
Q ss_pred cC
Q psy15348 246 GR 247 (298)
Q Consensus 246 g~ 247 (298)
..
T Consensus 313 ~y 314 (372)
T COG4870 313 YY 314 (372)
T ss_pred ee
Confidence 75
No 18
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.83 E-value=1.5e-20 Score=179.47 Aligned_cols=75 Identities=16% Similarity=0.272 Sum_probs=62.6
Q ss_pred cccccCCceeEEEeccccccCCCCccCCCCceEEecC--------------------CcccccCeEEEEEeeeccC-Ce-
Q psy15348 165 YFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA--------------------SAEIVAYATVKIVGWGEEN-GR- 222 (298)
Q Consensus 165 ~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~--------------------~~~~~~~HaV~IVGyg~~~-g~- 222 (298)
.+|..++||.++. ++. .|+.|++| |+.... |.....+|||+|||||.+. |+
T Consensus 303 ~~L~~g~pV~~g~-----Dv~-~~~~~k~G-I~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~D~~g~p 375 (437)
T cd00585 303 AQLKDGEPVWFGC-----DVG-KFSDRKSG-ILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDLDEDGKP 375 (437)
T ss_pred HHHhcCCCEEEEE-----EcC-hhhccCCc-cccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEecCCCCc
Confidence 4566889999999 986 67899999 996531 2234578999999999764 76
Q ss_pred eEEEEEcCCCCCcCCCceEEEEec
Q psy15348 223 PYWTIVSTFGEQFGDKGTIKILRG 246 (298)
Q Consensus 223 ~yWivkNSWG~~WGe~Gy~~i~rg 246 (298)
.||+||||||+.||++|||+|++.
T Consensus 376 ~yw~VkNSWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 376 VKWKVENSWGEKVGKKGYFVMSDD 399 (437)
T ss_pred ceEEEEcccCCCCCCCcceehhHH
Confidence 699999999999999999999986
No 19
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.40 E-value=1e-12 Score=125.87 Aligned_cols=80 Identities=14% Similarity=0.049 Sum_probs=47.6
Q ss_pred CCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHh----------------hccCcc--cc----cCCC
Q psy15348 20 NRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQ----------------RCAWLV--SR----WMTI 77 (298)
Q Consensus 20 ~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~----------------~c~~~~--~~----~~~~ 77 (298)
.+|.||++-|.||.||++..|+..+..+.+.+ .++||..++. ++.... +| ....
T Consensus 56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~----~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~ 131 (438)
T PF03051_consen 56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNLK----DFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKN 131 (438)
T ss_dssp -S--B--BSSTHHHHHHHHHHHHHHHHHCT-S----S--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHS
T ss_pred CCCCCCCCCCCcchhhchHHHHHHHHHHcCCC----ceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhc
Confidence 58999999999999999999999988877642 6999998865 222110 00 0012
Q ss_pred CCCCCCcHHHHHHHHHHcCcCCCCCC
Q psy15348 78 WVCSSGISSSTWAWVHKRGLVTGGAH 103 (298)
Q Consensus 78 ~gC~GG~~~~a~~~~~~~Gl~~e~~y 103 (298)
...+||.-..+..-++++||++.+.|
T Consensus 132 ~~~DGGqw~~~~nli~KYGvVPk~~m 157 (438)
T PF03051_consen 132 PVSDGGQWDMVVNLIKKYGVVPKSVM 157 (438)
T ss_dssp TT-S-B-HHHHHHHHHHH---BGGGS
T ss_pred CCCCCCchHHHHHHHHHcCcCcHhhC
Confidence 34579999999999999999997777
No 20
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=98.16 E-value=6.9e-06 Score=75.08 Aligned_cols=40 Identities=18% Similarity=0.411 Sum_probs=33.4
Q ss_pred ccCeEEEEEeeecc-CC-eeEEEEEcCCCCCcCCCceEEEEe
Q psy15348 206 VAYATVKIVGWGEE-NG-RPYWTIVSTFGEQFGDKGTIKILR 245 (298)
Q Consensus 206 ~~~HaV~IVGyg~~-~g-~~yWivkNSWG~~WGe~Gy~~i~r 245 (298)
...|||+|.|.+.+ +| ---|.|.||||++=|.+|||-++-
T Consensus 360 LmTHAMvlTGvd~d~~g~p~rwkVENSWG~d~G~~GyfvaSd 401 (444)
T COG3579 360 LMTHAMVLTGVDLDETGNPLRWKVENSWGKDVGKKGYFVASD 401 (444)
T ss_pred HHHHHHHhhccccccCCCceeeEeecccccccCCCceEeehH
Confidence 35699999999965 34 336999999999999999998764
No 21
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=94.36 E-value=0.19 Score=42.39 Aligned_cols=132 Identities=11% Similarity=-0.031 Sum_probs=71.3
Q ss_pred CCCCCCCccHHHHHHHHHHHHHHHHHcCCccc-------ccccccCHHHHhhccCcccccCCCCCCCCCcHHHHHHHHHH
Q psy15348 22 RPYALSCIEARAVATATPLAFAVCRSSKMHVE-------CTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWAWVHK 94 (298)
Q Consensus 22 v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~-------~~~~~LS~q~l~~c~~~~~~~~~~~gC~GG~~~~a~~~~~~ 94 (298)
-.-|+.-+-|-+|+.+++|-... +.+.. -..+.+|.++|..+.- .+...++|.+.
T Consensus 16 ~EtQg~~pWCa~Ya~aailN~~~----~~~~~~A~~iMr~~yPn~s~~~l~~~~~--------------~~~~~i~y~ks 77 (175)
T PF05543_consen 16 RETQGYNPWCAGYAMAAILNATT----NTKIYNAKDIMRYLYPNVSEEQLKFTSL--------------TPNQMIKYAKS 77 (175)
T ss_dssp ----SSSS-HHHHHHHHHHHHHC----T-S---HHHHHHHHSTTS-CCCHHH--B---------------HHHHHHHHHH
T ss_pred eeccCcCcHHHHHHHHHHHHhhh----CcCcCCHHHHHHHHCCCCCHHHHhhcCC--------------CHHHHHHHHHH
Confidence 35678889999999999887652 11100 0145566666655542 46788999888
Q ss_pred cCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCccceeeccCCccceeeccccccccccccccccCCcee
Q psy15348 95 RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFW 174 (298)
Q Consensus 95 ~Gl~~e~~y~~~~~c~PY~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~l~~~GPV~ 174 (298)
.|... -|. .. ... .. +++..+..+-|+.
T Consensus 78 ~g~~~-----------~~~-------------------------------n~--~~s------~~--eV~~~~~~nk~i~ 105 (175)
T PF05543_consen 78 QGRNP-----------QYN-------------------------------NR--MPS------FD--EVKKLIDNNKGIA 105 (175)
T ss_dssp TTEEE-----------EEE-------------------------------CS-----------HH--HHHHHHHTT-EEE
T ss_pred cCcch-----------hHh-------------------------------cC--CCC------HH--HHHHHHHcCCCeE
Confidence 87654 111 00 000 01 3444555667777
Q ss_pred EEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeec-cCCeeEEEEEcCCCCCcCCCceEEEEecC
Q psy15348 175 PAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGR 247 (298)
Q Consensus 175 v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~-~~g~~yWivkNSWG~~WGe~Gy~~i~rg~ 247 (298)
+.. ...+ ..+| ...+|||+||||-. .+|.++.++=|-| ++++|-+....
T Consensus 106 i~~-----~~v~----~~~~----------~~~gHAlavvGya~~~~g~~~y~~WNPW-----~~~~~~~sa~s 155 (175)
T PF05543_consen 106 ILA-----DRVE----QTNG----------PHAGHALAVVGYAKPNNGQKTYYFWNPW-----WNDVMIQSAKS 155 (175)
T ss_dssp EEE-----EETT----SCTT----------B--EEEEEEEEEEEETTSEEEEEEE-TT------SS-EEEETT-
T ss_pred EEe-----cccc----cCCC----------CccceeEEEEeeeecCCCCeEEEEeCCc-----cCCcEEEecCC
Confidence 666 3210 1122 24689999999987 4579999999999 67777776653
No 22
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=92.70 E-value=0.12 Score=40.89 Aligned_cols=54 Identities=13% Similarity=-0.036 Sum_probs=30.9
Q ss_pred ccccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEcCC
Q psy15348 162 LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231 (298)
Q Consensus 162 ~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkNSW 231 (298)
.++..|....||++.+ ... +....+. .+... ...|.|+|+||.++. +++|..+|
T Consensus 91 ~i~~~i~~G~Pvi~~~-----~~~--~~~~~~~-~~~~~-----~~~H~vvi~Gy~~~~---~~~v~DP~ 144 (144)
T PF13529_consen 91 DIKQEIDAGRPVIVSV-----NSG--WRPPNGD-GYDGT-----YGGHYVVIIGYDEDG---YVYVNDPW 144 (144)
T ss_dssp HHHHHHHTT--EEEEE-----ETT--SS--TTE-EEEE------TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred HHHHHHHCCCcEEEEE-----Ecc--cccCCCC-CcCCC-----cCCEEEEEEEEeCCC---EEEEeCCC
Confidence 4677888889999999 521 1111222 33222 378999999997643 78888777
No 23
>KOG4128|consensus
Probab=82.89 E-value=0.14 Score=47.35 Aligned_cols=80 Identities=15% Similarity=0.055 Sum_probs=52.5
Q ss_pred cCCCCCCCCCccHHHHHHHHHHHHHHHHHcCCcccccccccCHHHHhh--------------------ccCcccc----c
Q psy15348 19 YNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQR--------------------CAWLVSR----W 74 (298)
Q Consensus 19 ~~~v~dQg~cGsCwAfA~~~~le~~~~i~~~~~~~~~~~~LS~q~l~~--------------------c~~~~~~----~ 74 (298)
-.||-+|.+-|-||.|+.+..|---+..+-+- +.+.||..+|+- |-+..+| -
T Consensus 62 ~~pvtnqkssGrcWift~ln~lrl~~~~kLnl----~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~L 137 (457)
T KOG4128|consen 62 RQPVTNQKSSGRCWIFTGLNLLRLEMDRKLNL----PEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNL 137 (457)
T ss_pred CcccccCcCCCceEEEechhHHHHHHHhcCCc----chhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHH
Confidence 46899999999999999998876555544433 268888866651 2221111 0
Q ss_pred CCCCCCCCCcHHHHHHHHHHcCcCCCCC
Q psy15348 75 MTIWVCSSGISSSTWAWVHKRGLVTGGA 102 (298)
Q Consensus 75 ~~~~gC~GG~~~~a~~~~~~~Gl~~e~~ 102 (298)
..+..-+||.-....+.++++|+....-
T Consensus 138 l~nP~~DGGqw~MfvNlVkKYGviPKkc 165 (457)
T KOG4128|consen 138 LKNPVPDGGQWQMFVNLVKKYGVIPKKC 165 (457)
T ss_pred HhCCCCCCchHHHHHHHHHHhCCCcHHh
Confidence 0123457888888888888899776333
No 24
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=79.18 E-value=16 Score=30.49 Aligned_cols=36 Identities=17% Similarity=0.235 Sum_probs=24.7
Q ss_pred ccccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeecc
Q psy15348 162 LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEE 219 (298)
Q Consensus 162 ~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~ 219 (298)
.+...|.++||+.+++ .... .....|+++|+|-..+
T Consensus 100 ~~~~LL~~yGPLwv~~-----~~P~-----------------~~~~~H~~ViTGI~~d 135 (166)
T PF12385_consen 100 GLANLLREYGPLWVAW-----EAPG-----------------DSWVAHASVITGIDGD 135 (166)
T ss_pred HHHHHHHHcCCeEEEe-----cCCC-----------------CcceeeEEEEEeecCC
Confidence 4566788999999998 5421 1134688888887654
No 25
>KOG4128|consensus
Probab=77.07 E-value=2.8 Score=39.09 Aligned_cols=39 Identities=18% Similarity=0.376 Sum_probs=32.3
Q ss_pred cCeEEEEEeee-cc---CCeeEEEEEcCCCCCcCCCceEEEEe
Q psy15348 207 AYATVKIVGWG-EE---NGRPYWTIVSTFGEQFGDKGTIKILR 245 (298)
Q Consensus 207 ~~HaV~IVGyg-~~---~g~~yWivkNSWG~~WGe~Gy~~i~r 245 (298)
-.||++|.|-| ++ .+-.-|-|.||||++-|.+|+.+|..
T Consensus 371 mthAml~T~v~~kd~~~g~~~~~rVenswgkd~gkkg~~~mt~ 413 (457)
T KOG4128|consen 371 MTHAMLLTSVGLKDPATGGLNEHRVENSWGKDLGKKGVNKMTA 413 (457)
T ss_pred HHHHHHhhhccccCcccCCchhhhhhchhhhhccccchhhhhH
Confidence 57999999998 33 34557999999999999999977755
No 26
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=72.89 E-value=4.7 Score=36.93 Aligned_cols=54 Identities=15% Similarity=-0.002 Sum_probs=33.9
Q ss_pred ccccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEc
Q psy15348 162 LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVS 229 (298)
Q Consensus 162 ~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkN 229 (298)
.++..|..+.||.+.+ +.+ +.-|... -| ......|.|+|+||+++ ...|.++-.
T Consensus 80 ~l~~~l~~g~pv~~~~-----D~~--~lpy~~~-~~-----~~~~~~H~i~v~G~d~~-~~~~~v~D~ 133 (317)
T PF14399_consen 80 ELKEALDAGRPVIVWV-----DMY--YLPYRPN-YY-----KKHHADHYIVVYGYDEE-EDVFYVSDP 133 (317)
T ss_pred HHHHHHhCCCceEEEe-----ccc--cCCCCcc-cc-----ccccCCcEEEEEEEeCC-CCEEEEEcC
Confidence 4677887777999999 653 1223222 11 12246899999999865 345666544
No 27
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=60.34 E-value=14 Score=34.21 Aligned_cols=28 Identities=14% Similarity=0.172 Sum_probs=24.7
Q ss_pred ccCeEEEEEeeeccC--CeeEEEEEcCCCC
Q psy15348 206 VAYATVKIVGWGEEN--GRPYWTIVSTFGE 233 (298)
Q Consensus 206 ~~~HaV~IVGyg~~~--g~~yWivkNSWG~ 233 (298)
..+||-.|++.-+-+ +.+.-.+||.||.
T Consensus 234 ~~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~ 263 (315)
T cd00044 234 VKGHAYSVLDVREVQEEGLRLLRLRNPWGV 263 (315)
T ss_pred ccCcceEEeEEEEEccCceEEEEecCCccC
Confidence 578999999998766 8999999999994
No 28
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=59.51 E-value=15 Score=32.18 Aligned_cols=58 Identities=10% Similarity=-0.025 Sum_probs=32.1
Q ss_pred ccccccccCCceeEEEeccccccCCCCcc---CCCCceEE--ecCC---cccccCeEEEEEeeeccCCeeEEEEEc
Q psy15348 162 LGLYFDPHFGPFWPAFWRSFCTKYTRPLF---QTNGRVYA--VSAS---AEIVAYATVKIVGWGEENGRPYWTIVS 229 (298)
Q Consensus 162 ~~~~~l~~~GPV~v~i~~~~~~v~~~f~~---y~~G~Iy~--~~~~---~~~~~~HaV~IVGyg~~~g~~yWivkN 229 (298)
++...|..+||+++-+ +.. +.. -+.- +.. .+.| .....+|-|+|+||+.+. +-+++||
T Consensus 115 ei~~hl~~g~~aIvLV-----d~~--~L~C~~Ck~~-~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~--~~~~yrd 180 (212)
T PF09778_consen 115 EIIEHLSSGGPAIVLV-----DAS--LLHCDLCKSN-CFDPIGSKCFGRSPDYQGHYVVLCGYDAAT--KEFEYRD 180 (212)
T ss_pred HHHHHHhCCCcEEEEE-----ccc--cccChhhccc-ccccccccccCCCCCccEEEEEEEeecCCC--CeEEEeC
Confidence 4567777888777766 432 211 0111 111 1111 234789999999997653 3455555
No 29
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=36.93 E-value=52 Score=25.72 Aligned_cols=43 Identities=5% Similarity=-0.098 Sum_probs=27.6
Q ss_pred cccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEcCC
Q psy15348 165 YFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231 (298)
Q Consensus 165 ~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkNSW 231 (298)
..+....||++.+ .. +. .....+|.|+|+||.. ....+|.+.|
T Consensus 72 ~~l~~~~Pvi~~~-----~~--------~~--------~~~~~gH~vVv~g~~~---~~~~~i~DP~ 114 (141)
T cd02549 72 RQLAAGHPVIVSV-----NL--------GV--------SITPSGHAMVVIGYDR---KGNVYVNDPG 114 (141)
T ss_pred HHHHCCCeEEEEE-----ec--------Cc--------ccCCCCeEEEEEEEcC---CCCEEEECCC
Confidence 4566788999888 53 11 1113679999999961 1235666765
No 30
>smart00230 CysPc Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
Probab=26.64 E-value=1.1e+02 Score=28.44 Aligned_cols=28 Identities=18% Similarity=0.275 Sum_probs=22.3
Q ss_pred ccCeEEEEEeeeccCCee--EEEEEcCCCC
Q psy15348 206 VAYATVKIVGWGEENGRP--YWTIVSTFGE 233 (298)
Q Consensus 206 ~~~HaV~IVGyg~~~g~~--yWivkNSWG~ 233 (298)
..+||=.|++.-.-++.+ -..+||-||.
T Consensus 226 v~~HaYsVl~v~~~~~~~~~Ll~lrNPWg~ 255 (318)
T smart00230 226 VKGHAYSVTDVREVQGRRQELLRLRNPWGQ 255 (318)
T ss_pred ccCccEEEEEEEEEecCCeEEEEEECCCCC
Confidence 468999999887655545 8999999993
No 31
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.39 E-value=45 Score=28.53 Aligned_cols=44 Identities=14% Similarity=-0.121 Sum_probs=30.7
Q ss_pred ccccccccCCceeEEEeccccccCCCCccCCCCceEEecCCcccccCeEEEEEeeeccCCeeEEEEEcCCC
Q psy15348 162 LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFG 232 (298)
Q Consensus 162 ~~~~~l~~~GPV~v~i~~~~~~v~~~f~~y~~G~Iy~~~~~~~~~~~HaV~IVGyg~~~g~~yWivkNSWG 232 (298)
.++..|....||.+.. .. |.. ..-|+|+|+||++. ++..-++||
T Consensus 125 ~ik~ql~kg~PV~iw~-----T~---~~~---------------~s~H~v~itgyDk~----n~yynDpyG 168 (195)
T COG4990 125 DIKGQLLKGRPVVIWV-----TN---FHS---------------YSIHSVLITGYDKY----NIYYNDPYG 168 (195)
T ss_pred HHHHHHhcCCcEEEEE-----ec---ccc---------------cceeeeEeeccccc----ceEeccccc
Confidence 5677788899998777 32 321 23599999999765 556666664
Done!