Query psy12185
Match_columns 317
No_of_seqs 244 out of 1617
Neff 8.4
Searched_HMMs 46136
Date Fri Aug 16 20:53:05 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12185.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12185hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1542|consensus 100.0 1.8E-70 4E-75 487.2 22.9 260 33-316 67-335 (372)
2 PTZ00203 cathepsin L protease; 100.0 4E-67 8.8E-72 486.2 31.4 265 32-316 33-304 (348)
3 PTZ00021 falcipain-2; Provisio 100.0 9.1E-64 2E-68 476.8 28.4 265 32-312 164-432 (489)
4 PTZ00200 cysteine proteinase; 100.0 5.5E-62 1.2E-66 463.1 29.8 282 18-316 107-406 (448)
5 KOG1543|consensus 100.0 6.3E-57 1.4E-61 415.2 26.9 251 41-316 30-287 (325)
6 cd02621 Peptidase_C1A_Cathepsi 100.0 3.4E-45 7.3E-50 326.8 18.6 175 138-316 1-206 (243)
7 cd02698 Peptidase_C1A_Cathepsi 100.0 3.4E-44 7.3E-49 319.3 18.5 170 138-316 1-197 (239)
8 cd02620 Peptidase_C1A_Cathepsi 100.0 5.3E-44 1.2E-48 317.5 17.8 172 139-316 1-202 (236)
9 cd02248 Peptidase_C1A Peptidas 100.0 2.5E-43 5.3E-48 307.9 19.6 172 139-316 1-176 (210)
10 PTZ00049 cathepsin C-like prot 100.0 8.7E-42 1.9E-46 332.8 19.4 175 135-312 378-630 (693)
11 PTZ00364 dipeptidyl-peptidase 100.0 1.3E-41 2.9E-46 328.3 18.7 178 135-316 202-421 (548)
12 PF00112 Peptidase_C1: Papain 100.0 6.1E-41 1.3E-45 293.9 14.2 175 138-316 1-183 (219)
13 cd02619 Peptidase_C1 C1 Peptid 100.0 7.7E-37 1.7E-41 268.4 17.8 169 141-316 1-191 (223)
14 smart00645 Pept_C1 Papain fami 100.0 4E-37 8.7E-42 260.9 14.0 134 138-316 1-138 (174)
15 PTZ00462 Serine-repeat antigen 100.0 1.6E-33 3.4E-38 282.3 17.3 160 150-312 544-733 (1004)
16 KOG1544|consensus 100.0 5.8E-33 1.3E-37 244.6 5.2 229 65-313 151-414 (470)
17 PF08246 Inhibitor_I29: Cathep 99.7 2.6E-17 5.7E-22 113.8 7.4 57 37-95 1-58 (58)
18 COG4870 Cysteine protease [Pos 99.7 2.3E-17 4.9E-22 149.6 3.6 171 136-312 97-276 (372)
19 smart00848 Inhibitor_I29 Cathe 99.6 5.1E-15 1.1E-19 101.9 5.1 56 37-94 1-57 (57)
20 cd00585 Peptidase_C1B Peptidas 99.4 4.7E-12 1E-16 120.7 12.0 77 151-229 55-159 (437)
21 PF03051 Peptidase_C1_2: Pepti 97.6 0.00012 2.5E-09 70.5 5.7 77 151-229 56-160 (438)
22 PF08127 Propeptide_C1: Peptid 95.6 0.012 2.6E-07 37.1 2.7 36 64-105 3-38 (41)
23 PF05543 Peptidase_C47: Stapho 93.9 0.53 1.2E-05 39.4 8.9 104 155-311 18-130 (175)
24 COG3579 PepC Aminopeptidase C 92.1 0.37 8E-06 44.4 5.9 76 152-229 59-162 (444)
25 KOG4128|consensus 87.3 0.64 1.4E-05 42.8 3.5 77 150-228 62-168 (457)
26 PF13529 Peptidase_C39_2: Pept 87.0 1.5 3.2E-05 34.6 5.3 48 257-313 87-136 (144)
27 PF14399 Transpep_BrtH: NlpC/p 86.6 1.5 3.2E-05 40.4 5.7 49 260-315 79-127 (317)
28 PF09778 Guanylate_cyc_2: Guan 81.6 4.6 9.9E-05 35.1 6.1 57 258-314 112-174 (212)
29 PF13956 Ibs_toxin: Toxin Ibs, 72.2 2 4.3E-05 21.8 0.8 14 1-14 1-14 (19)
30 PF12385 Peptidase_C70: Papain 67.1 14 0.00029 30.6 5.0 39 259-313 98-136 (166)
31 cd02549 Peptidase_C39A A sub-f 61.5 20 0.00044 28.2 5.2 34 262-310 70-103 (141)
32 COG4990 Uncharacterized protei 56.9 22 0.00047 30.1 4.6 40 257-313 121-160 (195)
33 PF03051 Peptidase_C1_2: Pepti 48.9 47 0.001 32.4 6.2 53 260-312 300-371 (438)
34 PRK00888 ftsB cell division pr 48.7 46 0.00099 25.5 5.0 19 2-20 1-19 (105)
35 PF02013 CBM_10: Cellulose or 44.5 8.8 0.00019 23.4 0.3 10 308-317 25-34 (36)
36 PF09551 Spore_II_R: Stage II 39.8 67 0.0014 25.7 4.8 51 261-317 60-115 (130)
37 PF10717 ODV-E18: Occlusion-de 36.8 36 0.00078 24.9 2.5 21 2-22 28-48 (85)
38 PF08139 LPAM_1: Prokaryotic m 35.9 22 0.00049 19.8 1.1 16 2-17 8-23 (25)
39 PF11873 DUF3393: Domain of un 33.4 46 0.001 28.8 3.2 18 2-19 1-18 (204)
40 COG5510 Predicted small secret 31.5 66 0.0014 20.5 2.7 11 1-11 1-11 (44)
41 KOG4702|consensus 31.2 1E+02 0.0022 21.8 3.9 33 34-66 28-60 (77)
42 TIGR02837 spore_II_R stage II 28.6 1.3E+02 0.0029 25.1 4.9 49 262-317 96-150 (168)
43 PRK10081 entericidin B membran 26.9 80 0.0017 20.6 2.7 14 1-14 1-14 (48)
44 PF00879 Defensin_propep: Defe 25.1 1.1E+02 0.0024 20.3 3.1 27 2-28 1-27 (52)
45 COG2854 Ttg2D ABC-type transpo 24.0 4E+02 0.0086 23.1 7.2 17 79-96 77-93 (202)
46 COG3167 PilO Tfp pilus assembl 23.8 1.1E+02 0.0024 26.1 3.7 62 3-64 24-96 (211)
47 PF11912 DUF3430: Protein of u 21.5 75 0.0016 27.1 2.4 17 2-18 1-17 (212)
48 PHA02661 vascular endothelial 20.7 72 0.0016 26.1 2.0 49 2-50 1-49 (146)
49 PRK09810 entericidin A; Provis 20.1 1.1E+02 0.0025 19.2 2.3 10 1-10 1-10 (41)
No 1
>KOG1542|consensus
Probab=100.00 E-value=1.8e-70 Score=487.16 Aligned_cols=260 Identities=35% Similarity=0.588 Sum_probs=227.0
Q ss_pred HHHHHHHHHHHhCCccCch-hHHHHHHHHHHHHHHHHHHhhcCCCCCCeEEecCcCCCCCHHHHHHhhhccccc-ccccc
Q psy12185 33 KLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLM 110 (317)
Q Consensus 33 ~~~~f~~~~~~~~K~Y~~~-E~~~R~~iF~~n~~~I~~~N~~~~~~~~~~~g~N~fsD~t~eEf~~~~~~~~~~-~~~~~ 110 (317)
....|..|+.+|+|+|.+. |...|+.||+.|+..+++++.. ...+..+|+|+|||||+|||+++++..... .+.+.
T Consensus 67 ~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~--d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~~~~ 144 (372)
T KOG1542|consen 67 LEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQEN--DPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGSKLPG 144 (372)
T ss_pred hHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhc--CccccccCccchhhcCHHHHHHHhhccccccccCcc
Confidence 4789999999999999998 9999999999999999998852 445899999999999999998765532211 00111
Q ss_pred cCCCCCCCCCccccccccccCCCCCCCCCceeecccCCCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcCCCccCCHHHH
Q psy12185 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190 (317)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~DwR~~g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l 190 (317)
.. ......+...||++||||++|+||||||||.||||||||+++++|++++|++|++++||||||
T Consensus 145 ~~---------------~~~~~~~~~~lP~~fDWR~kgaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeL 209 (372)
T KOG1542|consen 145 DA---------------AEAPIEPGESLPESFDWRDKGAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQEL 209 (372)
T ss_pred cc---------------ccCcCCCCCCCCcccchhccCCccccccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhh
Confidence 00 000123456899999999999999999999999999999999999999999999999999999
Q ss_pred hhcCCCCCCCCCCCchHHHHHHHHHhCCCCCCCCCCccccCCC-cccCCCCCCCceEEeeeEEecCCCcHHHHHHHHHhC
Q psy12185 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269 (317)
Q Consensus 191 ~dC~~~~~~gC~GG~~~~a~~y~~~~~~Gi~~e~~yPY~~~~~-~C~~~~~~~~~~~i~~~~~~~~~~~~~~i~~al~~~ 269 (317)
+||+. .++||+||.+..||+|+++. .||..|.+|||++..+ .|..++.. ..+.|++|..+. .||++|...|+++
T Consensus 210 vDCD~-~d~gC~GGl~~nA~~~~~~~-gGL~~E~dYPY~g~~~~~C~~~~~~-~~v~I~~f~~l~--~nE~~ia~wLv~~ 284 (372)
T KOG1542|consen 210 VDCDS-CDNGCNGGLMDNAFKYIKKA-GGLEKEKDYPYTGKKGNQCHFDKSK-IVVSIKDFSMLS--NNEDQIAAWLVTF 284 (372)
T ss_pred hcccC-cCCcCCCCChhHHHHHHHHh-CCccccccCCccccCCCccccchhh-ceEEEeccEecC--CCHHHHHHHHHhc
Confidence 99998 89999999999999997664 8999999999999988 89998877 889999998765 6999999999999
Q ss_pred CCeEEEEecccccccCCCeEeC---CCCCCCCCCCeEEEEEEeecCC---Cee
Q psy12185 270 GPVIAAVNALTWQYYLGGVIQY---NCDGSLANINHAVQIVGYDNYS---RTW 316 (317)
Q Consensus 270 gPv~v~i~~~~f~~Y~~GIy~~---~c~~~~~~~~Hav~iVGyg~~~---~~W 316 (317)
|||+|+|++..+|+|.+||+.+ .|.+.. +||||+|||||.++ .||
T Consensus 285 GPi~vgiNa~~mQ~YrgGV~~P~~~~Cs~~~--~~HaVLlvGyG~~g~~~PYW 335 (372)
T KOG1542|consen 285 GPLSVGINAKPMQFYRGGVSCPSKYICSPKL--LNHAVLLVGYGSSGYEKPYW 335 (372)
T ss_pred CCeEEEEchHHHHHhcccccCCCcccCCccc--cCceEEEEeecCCCCCCceE
Confidence 9999999999999999999998 698876 89999999999876 477
No 2
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=4e-67 Score=486.19 Aligned_cols=265 Identities=32% Similarity=0.600 Sum_probs=215.1
Q ss_pred HHHHHHHHHHHHhCCccCch-hHHHHHHHHHHHHHHHHHHhhcCCCCCCeEEecCcCCCCCHHHHHHhhhcccccccccc
Q psy12185 32 QKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110 (317)
Q Consensus 32 ~~~~~f~~~~~~~~K~Y~~~-E~~~R~~iF~~n~~~I~~~N~~~~~~~~~~~g~N~fsD~t~eEf~~~~~~~~~~~~~~~ 110 (317)
.+..+|++|+++|+|.|.+. |+.+|++||++|+++|++||+ .+.+|++|+|+|+|||+|||.++++.+.... ...
T Consensus 33 ~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~---~~~~~~lg~N~FaDlT~eEf~~~~l~~~~~~-~~~ 108 (348)
T PTZ00203 33 PAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQA---RNPHARFGITKFFDLSEAEFAARYLNGAAYF-AAA 108 (348)
T ss_pred HHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhc---cCCCeEEeccccccCCHHHHHHHhcCCCccc-ccc
Confidence 46778999999999999998 999999999999999999995 3568999999999999999976554322110 000
Q ss_pred cCCCCCCCCCccccccccccCCCCCCCCCceeecccCCCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcCCCccCCHHHH
Q psy12185 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190 (317)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~DwR~~g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l 190 (317)
.......+. . ......++|++||||++|+|+||||||.||||||||+++++|++++|++++.+.||+|||
T Consensus 109 ~~~~~~~~~----~------~~~~~~~lP~~~DWR~~g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqL 178 (348)
T PTZ00203 109 KQHAGQHYR----K------ARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQL 178 (348)
T ss_pred ccccccccc----c------cccccccCCCCCcCCcCCCCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHH
Confidence 000000000 0 001124689999999999999999999999999999999999999999999999999999
Q ss_pred hhcCCCCCCCCCCCchHHHHHHHHHh-CCCCCCCCCCccccCCC---cccCCCCCCCceEEeeeEEecCCCcHHHHHHHH
Q psy12185 191 IDCAGNGNMGCSGGDFCALLDWMDVN-KVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266 (317)
Q Consensus 191 ~dC~~~~~~gC~GG~~~~a~~y~~~~-~~Gi~~e~~yPY~~~~~---~C~~~~~~~~~~~i~~~~~~~~~~~~~~i~~al 266 (317)
+||+. .+.||+||++..||+|+.++ .+|+++|++|||.+.++ .|..........++++|..+. .++++|+++|
T Consensus 179 vdC~~-~~~GC~GG~~~~a~~yi~~~~~ggi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~~~~i~--~~e~~~~~~l 255 (348)
T PTZ00203 179 VSCDH-VDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSME--SSERVMAAWL 255 (348)
T ss_pred HhccC-CCCCCCCCCHHHHHHHHHHhcCCCCCccccCCCccCCCCCCcCCCCcccccceEecceeecC--cCHHHHHHHH
Confidence 99997 67899999999999999763 24689999999998776 586433221346788887654 4788999999
Q ss_pred HhCCCeEEEEecccccccCCCeEeCCCCCCCCCCCeEEEEEEeecCC--Cee
Q psy12185 267 ATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS--RTW 316 (317)
Q Consensus 267 ~~~gPv~v~i~~~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~--~~W 316 (317)
+++|||+|+|++.+|++|++|||+. |.... +||||+|||||+++ +||
T Consensus 256 ~~~GPv~v~i~a~~f~~Y~~GIy~~-c~~~~--~nHaVliVGYG~~~g~~YW 304 (348)
T PTZ00203 256 AKNGPISIAVDASSFMSYHSGVLTS-CIGEQ--LNHGVLLVGYNMTGEVPYW 304 (348)
T ss_pred HhCCCEEEEEEhhhhcCccCceeec-cCCCC--CCeEEEEEEEecCCCceEE
Confidence 9999999999998899999999985 86544 79999999999876 488
No 3
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=9.1e-64 Score=476.78 Aligned_cols=265 Identities=30% Similarity=0.432 Sum_probs=212.2
Q ss_pred HHHHHHHHHHHHhCCccCch-hHHHHHHHHHHHHHHHHHHhhcCCCCCCeEEecCcCCCCCHHHHHHhhhcccccccccc
Q psy12185 32 QKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110 (317)
Q Consensus 32 ~~~~~f~~~~~~~~K~Y~~~-E~~~R~~iF~~n~~~I~~~N~~~~~~~~~~~g~N~fsD~t~eEf~~~~~~~~~~~~~~~ 110 (317)
+....|++|+.+|+|+|.+. |+.+|+.||++|+++|++||++ .+.+|++|+|+|+|||.|||+++++. ......+.
T Consensus 164 e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~--~~~ty~lgiNqFsDlT~EEF~~~~l~-~~~~~~~~ 240 (489)
T PTZ00021 164 ENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNK--ENVLYKKGMNRFGDLSFEEFKKKYLT-LKSFDFKS 240 (489)
T ss_pred HHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhcc--CCCCEEEeccccccCCHHHHHHHhcc-cccccccc
Confidence 34578999999999999998 8899999999999999999953 45789999999999999999765432 11100000
Q ss_pred cCCCCCCCCCccccccccccCCCC-CCCCCceeecccCCCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcCCCccCCHHH
Q psy12185 111 SHHKHHDHHHNHVKKRSITTGITI-PTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189 (317)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~lP~~~DwR~~g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~ 189 (317)
......... . .........+. ....|+++|||+.|.|+||||||.||||||||+++++|++++|+++..+.||+||
T Consensus 241 ~~~~~~~~~-~--~~~~~~~~~~~~~~~~P~s~DWR~~g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQq 317 (489)
T PTZ00021 241 NGKKSPRVI-N--YDDVIKKYKPKDATFDHAKYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQE 317 (489)
T ss_pred ccccccccc-c--ccccccccccccccCCccccccccCCCCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHH
Confidence 000000000 0 00000000000 1124999999999999999999999999999999999999999999999999999
Q ss_pred HhhcCCCCCCCCCCCchHHHHHHHHHhCCCCCCCCCCccccC-CCcccCCCCCCCceEEeeeEEecCCCcHHHHHHHHHh
Q psy12185 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK-DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268 (317)
Q Consensus 190 l~dC~~~~~~gC~GG~~~~a~~y~~~~~~Gi~~e~~yPY~~~-~~~C~~~~~~~~~~~i~~~~~~~~~~~~~~i~~al~~ 268 (317)
|+||+. .+.||+||++..||+|+.++ +||++|++|||.+. .+.|...... ..++|++|..++ +.+|+++|+.
T Consensus 318 LVDCs~-~n~GC~GG~~~~Af~yi~~~-gGl~tE~~YPY~~~~~~~C~~~~~~-~~~~i~~y~~i~----~~~lk~al~~ 390 (489)
T PTZ00021 318 LVDCSF-KNNGCYGGLIPNAFEDMIEL-GGLCSEDDYPYVSDTPELCNIDRCK-EKYKIKSYVSIP----EDKFKEAIRF 390 (489)
T ss_pred Hhhhcc-CCCCCCCcchHhhhhhhhhc-cccCcccccCccCCCCCcccccccc-ccceeeeEEEec----HHHHHHHHHh
Confidence 999997 68999999999999999764 69999999999987 4789866544 567899997653 5789999999
Q ss_pred CCCeEEEEec-ccccccCCCeEeCCCCCCCCCCCeEEEEEEeecC
Q psy12185 269 HGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNY 312 (317)
Q Consensus 269 ~gPv~v~i~~-~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~ 312 (317)
+|||+|+|++ .+|++|++|||++.|+.. +||||+|||||++
T Consensus 391 ~GPVsv~i~a~~~f~~YkgGIy~~~C~~~---~nHAVlIVGYG~e 432 (489)
T PTZ00021 391 LGPISVSIAVSDDFAFYKGGIFDGECGEE---PNHAVILVGYGME 432 (489)
T ss_pred cCCeEEEEEeecccccCCCCcCCCCCCCc---cceEEEEEEecCc
Confidence 9999999999 899999999999889753 7999999999975
No 4
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=5.5e-62 Score=463.14 Aligned_cols=282 Identities=27% Similarity=0.379 Sum_probs=214.7
Q ss_pred HhhccccCCCCchHHHHHHHHHHHHHhCCccCch-hHHHHHHHHHHHHHHHHHHhhcCCCCCCeEEecCcCCCCCHHHHH
Q psy12185 18 FLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96 (317)
Q Consensus 18 ~~~~~~~~~~~~~~~~~~~f~~~~~~~~K~Y~~~-E~~~R~~iF~~n~~~I~~~N~~~~~~~~~~~g~N~fsD~t~eEf~ 96 (317)
+-.-.+..+...+.+...+|++|+++|+|.|.+. |+.+|+.||++|+++|++||. ..+|++|+|+|||||+|||.
T Consensus 107 ~~~~~i~~~~~~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~----~~~y~lgiN~FsDlT~eEF~ 182 (448)
T PTZ00200 107 FKDGYISDDPKLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKG----DEPYSKEINKFSDLTEEEFR 182 (448)
T ss_pred hhCCCcCCCccchHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcC----cCCeEEeccccccCCHHHHH
Confidence 3334444445555567789999999999999998 999999999999999999993 46899999999999999997
Q ss_pred Hhhhcccccc-cccccCC---------CCCCCCCccccc-cccccCCCCCCCCCceeecccCCCCCCCcCCC-CCccHHH
Q psy12185 97 TRHLRHSVNK-HVLMSHH---------KHHDHHHNHVKK-RSITTGITIPTGIPVKKDWREAGIIGKVRNQQ-TCGACWA 164 (317)
Q Consensus 97 ~~~~~~~~~~-~~~~~~~---------~~~~~~~~~~~~-~~~~~~~~~~~~lP~~~DwR~~g~v~~v~dQg-~CGsCwA 164 (317)
+.++ +.... ....... ....+....... +...........+|++||||+.|.|+|||||| .||||||
T Consensus 183 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtpVkdQG~~CGSCWA 261 (448)
T PTZ00200 183 KLFP-VIKVPPKSNSTSHNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWA 261 (448)
T ss_pred HHhc-cCCCcccccccccccccccccccccccccccccccccccccccccccCCCCccCCCCCCCCCcccCCCccchHHH
Confidence 6533 21110 0000000 000010000000 00000000112369999999999999999999 9999999
Q ss_pred HHHHHHHHHHHHHHcCCCccCCHHHHhhcCCCCCCCCCCCchHHHHHHHHHhCCCCCCCCCCccccCCCcccCCCCCCCc
Q psy12185 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244 (317)
Q Consensus 165 fa~~~~le~~~~i~~~~~~~lS~q~l~dC~~~~~~gC~GG~~~~a~~y~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~ 244 (317)
||+++++|++++|+++..+.||+|||+||+. .+.||+||++..||+|+.+ .||++|++|||.+..+.|..... ..
T Consensus 262 Fat~~aiEs~~~i~~~~~~~LSeQqLvDC~~-~~~GC~GG~~~~A~~yi~~--~Gi~~e~~YPY~~~~~~C~~~~~--~~ 336 (448)
T PTZ00200 262 FSSVGSVESLYKIYRDKSVDLSEQELVNCDT-KSQGCSGGYPDTALEYVKN--KGLSSSSDVPYLAKDGKCVVSST--KK 336 (448)
T ss_pred HhHHHHHHHHHHHhcCCCeecCHHHHhhccC-ccCCCCCCcHHHHHHHHhh--cCccccccCCCCCCCCCCcCCCC--Ce
Confidence 9999999999999999999999999999997 6889999999999999976 69999999999999999986543 34
Q ss_pred eEEeeeEEecCCCcHHHHHHHHHhCCCeEEEEec-ccccccCCCeEeCCCCCCCCCCCeEEEEEEeecC--C--Cee
Q psy12185 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNY--S--RTW 316 (317)
Q Consensus 245 ~~i~~~~~~~~~~~~~~i~~al~~~gPv~v~i~~-~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~--~--~~W 316 (317)
..|.+|... ...+.+++++ .+|||+|+|++ .+|++|++|||++.|+.. +||||+|||||.+ + +||
T Consensus 337 ~~i~~y~~~---~~~~~l~~~l-~~GPV~v~i~~~~~f~~Yk~GIy~~~C~~~---~nHaV~lVGyG~d~~~g~~YW 406 (448)
T PTZ00200 337 VYIDSYLVA---KGKDVLNKSL-VISPTVVYIAVSRELLKYKSGVYNGECGKS---LNHAVLLVGEGYDEKTKKRYW 406 (448)
T ss_pred eEecceEec---CHHHHHHHHH-hcCCEEEEeecccccccCCCCccccccCCC---CcEEEEEEEecccCCCCCceE
Confidence 678888643 2344555555 57999999999 899999999999889754 7999999999853 2 478
No 5
>KOG1543|consensus
Probab=100.00 E-value=6.3e-57 Score=415.24 Aligned_cols=251 Identities=30% Similarity=0.510 Sum_probs=207.7
Q ss_pred HHHhCCccCch-hHHHHHHHHHHHHHHHHHHhhcCCCCCCeEEecCcCCCCCHHHHHHhhhcccccccccccCCCCCCCC
Q psy12185 41 QQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119 (317)
Q Consensus 41 ~~~~~K~Y~~~-E~~~R~~iF~~n~~~I~~~N~~~~~~~~~~~g~N~fsD~t~eEf~~~~~~~~~~~~~~~~~~~~~~~~ 119 (317)
+.+|.+.|.+. |...|+.+|.+|++.|+.||.. ...+|.+|+|+|+|++.+||...+...... .. ....+.
T Consensus 30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~--~~~~~~~g~n~~~d~~~ee~~~~~~~~~~~----~~--~~~~~~ 101 (325)
T KOG1543|consen 30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLK--YVLSFLMGVNQFADLTTEEFKRKKTGKKPP----EI--KRDKFT 101 (325)
T ss_pred hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhh--hceeeeeccccccccchHHHHHhhccccCc----cc--cccccc
Confidence 67788888866 9999999999999999999953 378899999999999999996543321111 00 000000
Q ss_pred CccccccccccCCCCCCCCCceeecccCC-CCCCCcCCCCCccHHHHHHHHHHHHHHHHHcC-CCccCCHHHHhhcCCCC
Q psy12185 120 HNHVKKRSITTGITIPTGIPVKKDWREAG-IIGKVRNQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVIDCAGNG 197 (317)
Q Consensus 120 ~~~~~~~~~~~~~~~~~~lP~~~DwR~~g-~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~-~~~~lS~q~l~dC~~~~ 197 (317)
. .....++|++||||++| .++||||||.||||||||++++||++++|+++ .++.||+|||+||+..+
T Consensus 102 ~-----------~~~~~~~p~s~DwR~~~~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~ 170 (325)
T KOG1543|consen 102 E-----------KLDGDDLPDSFDWRDKGAVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCGEC 170 (325)
T ss_pred c-----------ccchhhCCCCccccccCCcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccCCC
Confidence 0 12246899999999986 55569999999999999999999999999999 99999999999999855
Q ss_pred CCCCCCCchHHHHHHHHHhCCCCCC-CCCCccccCCCcccCCCCCCCceEEeeeEEecCCCcHHHHHHHHHhCCCeEEEE
Q psy12185 198 NMGCSGGDFCALLDWMDVNKVVLEP-ESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276 (317)
Q Consensus 198 ~~gC~GG~~~~a~~y~~~~~~Gi~~-e~~yPY~~~~~~C~~~~~~~~~~~i~~~~~~~~~~~~~~i~~al~~~gPv~v~i 276 (317)
+.||+||.+..||+|+.. +|+++ +.+|||.+..+.|..+... ..+.+.++..++ .+|.+|+.+|+.+|||+|+|
T Consensus 171 ~~GC~GG~~~~A~~yi~~--~G~~t~~~~Ypy~~~~~~C~~~~~~-~~~~~~~~~~~~--~~e~~i~~~v~~~GPv~v~~ 245 (325)
T KOG1543|consen 171 GDGCNGGEPKNAFKYIKK--NGGVTECENYPYIGKDGTCKSNKKD-KTVTIKGFYNVP--ANEEAIAEAVAKNGPVSVAI 245 (325)
T ss_pred CCCcCCCCHHHHHHHHHH--hCCCCCCcCCCCcCCCCCccCCCcc-ceeEeeeeeecC--cCHHHHHHHHHhcCCeEEEE
Confidence 889999999999999998 46666 9999999999999988765 677788887664 55999999999999999999
Q ss_pred ec-ccccccCCCeEeCCCCCCCCCCCeEEEEEEeecCC--Cee
Q psy12185 277 NA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS--RTW 316 (317)
Q Consensus 277 ~~-~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~--~~W 316 (317)
++ .+|++|++|||.++|.... .++|||+|||||+.+ +||
T Consensus 246 ~a~~~F~~Y~~GVy~~~~~~~~-~~~Hav~iVGyG~~~~~~YW 287 (325)
T KOG1543|consen 246 DAYEDFSLYKGGVYAEEKGDDK-EGDHAVLIVGYGTGDGVDYW 287 (325)
T ss_pred eehhhhhhccCceEeCCCCCCC-CCCceEEEEEEcCCCCceeE
Confidence 99 9999999999999544321 389999999999843 477
No 6
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00 E-value=3.4e-45 Score=326.76 Aligned_cols=175 Identities=26% Similarity=0.505 Sum_probs=149.0
Q ss_pred CCceeecccCC----CCCCCcCCCCCccHHHHHHHHHHHHHHHHHcCC------CccCCHHHHhhcCCCCCCCCCCCchH
Q psy12185 138 IPVKKDWREAG----IIGKVRNQQTCGACWAFSTVETAESMHALKNGT------LSLLSVQEVIDCAGNGNMGCSGGDFC 207 (317)
Q Consensus 138 lP~~~DwR~~g----~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~------~~~lS~q~l~dC~~~~~~gC~GG~~~ 207 (317)
||++||||+.+ +|+||||||.||||||||+++++|++++|+++. .+.||+|||+||+. .+.||+||++.
T Consensus 1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~-~~~GC~GG~~~ 79 (243)
T cd02621 1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQ-YSQGCDGGFPF 79 (243)
T ss_pred CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcC-CCCCCCCCCHH
Confidence 69999999988 999999999999999999999999999999876 78999999999987 67899999999
Q ss_pred HHHHHHHHhCCCCCCCCCCcccc-CCCcccCCC-CCCCceEEeeeEEec---CCCcHHHHHHHHHhCCCeEEEEec-ccc
Q psy12185 208 ALLDWMDVNKVVLEPESEYPLLL-KDAACKRKA-TSPNGVKIKSYTCDT---LIPSESSILTDIATHGPVIAAVNA-LTW 281 (317)
Q Consensus 208 ~a~~y~~~~~~Gi~~e~~yPY~~-~~~~C~~~~-~~~~~~~i~~~~~~~---~~~~~~~i~~al~~~gPv~v~i~~-~~f 281 (317)
.|++|+.+ .|+++|++|||.. ..+.|.... .. ..++++.|..+. ...++++||++|+++|||+|+|++ ++|
T Consensus 80 ~a~~~~~~--~Gi~~e~~yPY~~~~~~~C~~~~~~~-~~~~~~~~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~F 156 (243)
T cd02621 80 LVGKFAED--FGIVTEDYFPYTADDDRPCKASPSEC-RRYYFSDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDF 156 (243)
T ss_pred HHHHHHHh--cCcCCCceeCCCCCCCCCCCCCcccc-ccccccceeEcccccccCCHHHHHHHHHHcCCEEEEEEecccc
Confidence 99999987 7999999999998 778898654 33 344555554321 135789999999999999999999 899
Q ss_pred cccCCCeEeCC-----CCCCC------CCCCeEEEEEEeecCC----Cee
Q psy12185 282 QYYLGGVIQYN-----CDGSL------ANINHAVQIVGYDNYS----RTW 316 (317)
Q Consensus 282 ~~Y~~GIy~~~-----c~~~~------~~~~Hav~iVGyg~~~----~~W 316 (317)
++|++|||+.. |.... ..++|||+|||||++. +||
T Consensus 157 ~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YW 206 (243)
T cd02621 157 DFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYW 206 (243)
T ss_pred cccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEE
Confidence 99999999873 75321 1379999999999876 577
No 7
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00 E-value=3.4e-44 Score=319.33 Aligned_cols=170 Identities=27% Similarity=0.523 Sum_probs=144.2
Q ss_pred CCceeecccCC---CCCCCcCCC---CCccHHHHHHHHHHHHHHHHHcC---CCccCCHHHHhhcCCCCCCCCCCCchHH
Q psy12185 138 IPVKKDWREAG---IIGKVRNQQ---TCGACWAFSTVETAESMHALKNG---TLSLLSVQEVIDCAGNGNMGCSGGDFCA 208 (317)
Q Consensus 138 lP~~~DwR~~g---~v~~v~dQg---~CGsCwAfa~~~~le~~~~i~~~---~~~~lS~q~l~dC~~~~~~gC~GG~~~~ 208 (317)
||++||||+.+ +|+|||||| .||||||||++++||++++|+++ ..+.||+|||+||+. +.||+||++..
T Consensus 1 lP~~~Dwr~~~~~~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~--~~gC~GG~~~~ 78 (239)
T cd02698 1 LPKSWDWRNVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG--GGSCHGGDPGG 78 (239)
T ss_pred CCCCcccccCCCCcccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC--CCCccCcCHHH
Confidence 69999999987 999999998 89999999999999999999876 367899999999986 68999999999
Q ss_pred HHHHHHHhCCCCCCCCCCccccCCCcccCCCC--------------CCCceEEeeeEEecCCCcHHHHHHHHHhCCCeEE
Q psy12185 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKAT--------------SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274 (317)
Q Consensus 209 a~~y~~~~~~Gi~~e~~yPY~~~~~~C~~~~~--------------~~~~~~i~~~~~~~~~~~~~~i~~al~~~gPv~v 274 (317)
+++|+.+ .|+++|++|||....+.|..... ....+++++|..+ .++++||++|+++|||+|
T Consensus 79 a~~~~~~--~Gl~~e~~yPY~~~~~~C~~~~~~~~c~~~~~c~~~~~~~~~~i~~~~~~---~~~~~i~~~l~~~GPV~v 153 (239)
T cd02698 79 VYEYAHK--HGIPDETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSV---SGRDKMMAEIYARGPISC 153 (239)
T ss_pred HHHHHHH--cCcCCCCeeCCcCCCCCCcCCCCCCCcccCcccccccccceEEeeeceec---CCHHHHHHHHHHcCCEEE
Confidence 9999987 69999999999987777753110 0023566776544 357889999999999999
Q ss_pred EEec-ccccccCCCeEeCCCCCCCCCCCeEEEEEEeecCC---Cee
Q psy12185 275 AVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS---RTW 316 (317)
Q Consensus 275 ~i~~-~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~---~~W 316 (317)
+|.+ ++|++|++|||+..+.... ++|||+|||||+++ +||
T Consensus 154 ~i~~~~~f~~Y~~GIy~~~~~~~~--~~HaV~IVGyG~~~~g~~YW 197 (239)
T cd02698 154 GIMATEALENYTGGVYKEYVQDPL--INHIISVAGWGVDENGVEYW 197 (239)
T ss_pred EEEecccccccCCeEEccCCCCCc--CCeEEEEEEEEecCCCCEEE
Confidence 9999 8999999999988443333 79999999999875 477
No 8
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00 E-value=5.3e-44 Score=317.46 Aligned_cols=172 Identities=31% Similarity=0.523 Sum_probs=141.6
Q ss_pred CceeecccC--CCC--CCCcCCCCCccHHHHHHHHHHHHHHHHHcC--CCccCCHHHHhhcCCCCCCCCCCCchHHHHHH
Q psy12185 139 PVKKDWREA--GII--GKVRNQQTCGACWAFSTVETAESMHALKNG--TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212 (317)
Q Consensus 139 P~~~DwR~~--g~v--~~v~dQg~CGsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~dC~~~~~~gC~GG~~~~a~~y 212 (317)
|++||||++ +++ +||+|||.||||||||++++||++++|+++ +.+.||+|||+||+...+.||+||++..|++|
T Consensus 1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~ 80 (236)
T cd02620 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY 80 (236)
T ss_pred CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence 889999986 554 599999999999999999999999999988 78999999999998733789999999999999
Q ss_pred HHHhCCCCCCCCCCccccCCCc------------------ccCCCC---CCCceEEeeeEEecCCCcHHHHHHHHHhCCC
Q psy12185 213 MDVNKVVLEPESEYPLLLKDAA------------------CKRKAT---SPNGVKIKSYTCDTLIPSESSILTDIATHGP 271 (317)
Q Consensus 213 ~~~~~~Gi~~e~~yPY~~~~~~------------------C~~~~~---~~~~~~i~~~~~~~~~~~~~~i~~al~~~gP 271 (317)
+++ .|+++|++|||.+.... |..... ....+++..+..+. .++++||.+|+++||
T Consensus 81 i~~--~G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~--~~~~~ik~~l~~~GP 156 (236)
T cd02620 81 LTT--TGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHKGKSAYSVP--SDETDIMKEIMTNGP 156 (236)
T ss_pred HHh--cCCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeeeecceeeeC--CHHHHHHHHHHHCCC
Confidence 987 69999999999886543 322111 00123444444332 478999999999999
Q ss_pred eEEEEec-ccccccCCCeEeCCCCCCCCCCCeEEEEEEeecCC--Cee
Q psy12185 272 VIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS--RTW 316 (317)
Q Consensus 272 v~v~i~~-~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~--~~W 316 (317)
|+|+|++ ++|+.|++|||+..|.... ++|||+|||||+++ +||
T Consensus 157 v~v~i~~~~~f~~Y~~Giy~~~~~~~~--~~HaV~iVGyg~~~g~~YW 202 (236)
T cd02620 157 VQAAFTVYEDFLYYKSGVYQHTSGKQL--GGHAVKIIGWGVENGVPYW 202 (236)
T ss_pred eEEEEEechhhhhcCCcEEeecCCCCc--CCeEEEEEEEeccCCeeEE
Confidence 9999999 8999999999987665443 79999999999875 377
No 9
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00 E-value=2.5e-43 Score=307.90 Aligned_cols=172 Identities=39% Similarity=0.746 Sum_probs=153.6
Q ss_pred CceeecccCCCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcCCCccCCHHHHhhcCCCCCCCCCCCchHHHHHHHHHhCC
Q psy12185 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV 218 (317)
Q Consensus 139 P~~~DwR~~g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~dC~~~~~~gC~GG~~~~a~~y~~~~~~ 218 (317)
|++||||+.+.++||+|||.||+|||||++++||++++++++....||+|+|++|....+.||.||++..|++++.+ .
T Consensus 1 P~~~d~r~~~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~--~ 78 (210)
T cd02248 1 PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKN--G 78 (210)
T ss_pred CCcccCCcCCCCCCCccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHH--C
Confidence 78999999999999999999999999999999999999999999999999999999733789999999999999876 7
Q ss_pred CCCCCCCCccccCCCcccCCCCCCCceEEeeeEEecCCCcHHHHHHHHHhCCCeEEEEec-ccccccCCCeEeCC-CCCC
Q psy12185 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYN-CDGS 296 (317)
Q Consensus 219 Gi~~e~~yPY~~~~~~C~~~~~~~~~~~i~~~~~~~~~~~~~~i~~al~~~gPv~v~i~~-~~f~~Y~~GIy~~~-c~~~ 296 (317)
|+++|++|||......|...... ..++|++|..+. ..++++||++|+++|||+++|.+ ++|+.|++|||... |...
T Consensus 79 Gi~~e~~yPY~~~~~~C~~~~~~-~~~~i~~~~~i~-~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~~ 156 (210)
T cd02248 79 GLASESDYPYTGKDGTCKYNSSK-VGAKITGYSNVP-PGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCSNT 156 (210)
T ss_pred CcCccccCCccCCCCCccCCCCc-ccEEEeeEEEcC-CCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCCCC
Confidence 99999999999988899876655 678999998775 34578999999999999999999 89999999999874 4333
Q ss_pred CCCCCeEEEEEEeecCC--Cee
Q psy12185 297 LANINHAVQIVGYDNYS--RTW 316 (317)
Q Consensus 297 ~~~~~Hav~iVGyg~~~--~~W 316 (317)
.++|||+|||||++. +||
T Consensus 157 --~~~Hav~iVGy~~~~~~~yw 176 (210)
T cd02248 157 --NLNHAVLLVGYGTENGVDYW 176 (210)
T ss_pred --cCCEEEEEEEEeecCCceEE
Confidence 379999999999973 566
No 10
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00 E-value=8.7e-42 Score=332.79 Aligned_cols=175 Identities=23% Similarity=0.410 Sum_probs=143.0
Q ss_pred CCCCCceeecccC----CCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcCC-----C-----ccCCHHHHhhcCCCCCCC
Q psy12185 135 PTGIPVKKDWREA----GIIGKVRNQQTCGACWAFSTVETAESMHALKNGT-----L-----SLLSVQEVIDCAGNGNMG 200 (317)
Q Consensus 135 ~~~lP~~~DwR~~----g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~-----~-----~~lS~q~l~dC~~~~~~g 200 (317)
..+||++||||+. +.++||+|||.||||||||++++||++++|+++. . ..||+|+|+||+. .+.|
T Consensus 378 ~~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~-~nqG 456 (693)
T PTZ00049 378 IDELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSF-YDQG 456 (693)
T ss_pred cccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCC-CCCC
Confidence 4689999999984 6899999999999999999999999999999742 1 2799999999997 6899
Q ss_pred CCCCchHHHHHHHHHhCCCCCCCCCCccccCCCcccCCCCC--------------------------------------C
Q psy12185 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS--------------------------------------P 242 (317)
Q Consensus 201 C~GG~~~~a~~y~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~--------------------------------------~ 242 (317)
|+||++..|++|+.+ .||++|++|||.+..+.|+..... .
T Consensus 457 C~GG~~~~A~kya~~--~GI~tEscYPY~a~~g~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (693)
T PTZ00049 457 CNGGFPYLVSKMAKL--QGIPLDKVFPYTATEQTCPYQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEP 534 (693)
T ss_pred cCCCcHHHHHHHHHH--CCCCcCCccCCcCCCCCCCCCCCCccccccccccccccccccccccccccccccccccccccc
Confidence 999999999999977 699999999999988888653211 0
Q ss_pred CceEEeeeEEec------CCCcHHHHHHHHHhCCCeEEEEec-ccccccCCCeEeC-------CCCCCC-----------
Q psy12185 243 NGVKIKSYTCDT------LIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQY-------NCDGSL----------- 297 (317)
Q Consensus 243 ~~~~i~~~~~~~------~~~~~~~i~~al~~~gPv~v~i~~-~~f~~Y~~GIy~~-------~c~~~~----------- 297 (317)
.++.++.|..+. ...++++||++|+++|||+|+|++ ++|++|++|||+. .|....
T Consensus 535 ~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G 614 (693)
T PTZ00049 535 ARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITG 614 (693)
T ss_pred cceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccc
Confidence 123344454331 124688999999999999999999 8999999999985 374320
Q ss_pred -CCCCeEEEEEEeecC
Q psy12185 298 -ANINHAVQIVGYDNY 312 (317)
Q Consensus 298 -~~~~Hav~iVGyg~~ 312 (317)
..+||||+|||||.+
T Consensus 615 ~e~~NHAVlIVGwG~d 630 (693)
T PTZ00049 615 WEKVNHAIVLVGWGEE 630 (693)
T ss_pred cccCceEEEEEEeccc
Confidence 136999999999975
No 11
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00 E-value=1.3e-41 Score=328.25 Aligned_cols=178 Identities=21% Similarity=0.388 Sum_probs=145.3
Q ss_pred CCCCCceeecccCC---CCCCCcCCCC---CccHHHHHHHHHHHHHHHHHcC------CCccCCHHHHhhcCCCCCCCCC
Q psy12185 135 PTGIPVKKDWREAG---IIGKVRNQQT---CGACWAFSTVETAESMHALKNG------TLSLLSVQEVIDCAGNGNMGCS 202 (317)
Q Consensus 135 ~~~lP~~~DwR~~g---~v~~v~dQg~---CGsCwAfa~~~~le~~~~i~~~------~~~~lS~q~l~dC~~~~~~gC~ 202 (317)
..+||++||||+.| +|+||||||. ||||||||++++||++++|+++ ..+.||+|||+||+. .+.||+
T Consensus 202 ~~~LP~sfDWR~~gg~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~-~n~GCd 280 (548)
T PTZ00364 202 GDPPPAAWSWGDVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQ-YGQGCA 280 (548)
T ss_pred ccCCCCccccCcCCCCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccC-CCCCCC
Confidence 46799999999987 7999999999 9999999999999999999984 468899999999987 688999
Q ss_pred CCchHHHHHHHHHhCCCCCCCCCC--ccccCCC---cccCCCCCCCceEEeeeEEec----CCCcHHHHHHHHHhCCCeE
Q psy12185 203 GGDFCALLDWMDVNKVVLEPESEY--PLLLKDA---ACKRKATSPNGVKIKSYTCDT----LIPSESSILTDIATHGPVI 273 (317)
Q Consensus 203 GG~~~~a~~y~~~~~~Gi~~e~~y--PY~~~~~---~C~~~~~~~~~~~i~~~~~~~----~~~~~~~i~~al~~~gPv~ 273 (317)
||++..|++|+.+ .||++|++| ||.+.++ .|...... ..+.++.+..+. ...++++||.+|+++|||+
T Consensus 281 GG~p~~A~~yi~~--~GI~tE~dY~~PY~~~dg~~~~Ck~~~~~-~~y~~~~~~~I~gyy~~~~~e~~I~~eI~~~GPVs 357 (548)
T PTZ00364 281 GGFPEEVGKFAET--FGILTTDSYYIPYDSGDGVERACKTRRPS-RRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVP 357 (548)
T ss_pred CCcHHHHHHHHHh--CCcccccccCCCCCCCCCCCCCCCCCccc-ceeeeeeeEEecceeecCCcHHHHHHHHHHcCCeE
Confidence 9999999999977 699999999 9988665 58765444 444454433221 1257889999999999999
Q ss_pred EEEec-ccccccCCCeEeC---------CCCCC--------CCCCCeEEEEEEeecC-C--Cee
Q psy12185 274 AAVNA-LTWQYYLGGVIQY---------NCDGS--------LANINHAVQIVGYDNY-S--RTW 316 (317)
Q Consensus 274 v~i~~-~~f~~Y~~GIy~~---------~c~~~--------~~~~~Hav~iVGyg~~-~--~~W 316 (317)
|+|++ .+|++|++|||.+ .|... ...+||||+|||||++ + +||
T Consensus 358 VaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G~~YW 421 (548)
T PTZ00364 358 ASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYW 421 (548)
T ss_pred EEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCCceEE
Confidence 99999 8999999999862 12110 1237999999999974 3 488
No 12
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00 E-value=6.1e-41 Score=293.89 Aligned_cols=175 Identities=35% Similarity=0.622 Sum_probs=147.5
Q ss_pred CCceeecccC-CCCCCCcCCCCCccHHHHHHHHHHHHHHHHHc-CCCccCCHHHHhhcCCCCCCCCCCCchHHHHHHHHH
Q psy12185 138 IPVKKDWREA-GIIGKVRNQQTCGACWAFSTVETAESMHALKN-GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215 (317)
Q Consensus 138 lP~~~DwR~~-g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~-~~~~~lS~q~l~dC~~~~~~gC~GG~~~~a~~y~~~ 215 (317)
||++||||+. +.++||+|||.||+|||||+++++|+++++++ ...+.||+|+|++|....+.+|+||++..|++++.+
T Consensus 1 lP~~~D~r~~~~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~~~ 80 (219)
T PF00112_consen 1 LPKSFDWRDKGGRITPVRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYIKN 80 (219)
T ss_dssp STSSEEGGGTTTCSG---BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHHHH
T ss_pred CCCCEecccCCCCcCccccCCcccccccchhccceeccccccccccccccccccccccccccccccccCcccccceeecc
Confidence 7999999997 58999999999999999999999999999999 789999999999999734679999999999999988
Q ss_pred hCCCCCCCCCCccccCC-CcccCCCCCCCceEEeeeEEecCCCcHHHHHHHHHhCCCeEEEEec-c-cccccCCCeEeC-
Q psy12185 216 NKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-L-TWQYYLGGVIQY- 291 (317)
Q Consensus 216 ~~~Gi~~e~~yPY~~~~-~~C~~~~~~~~~~~i~~~~~~~~~~~~~~i~~al~~~gPv~v~i~~-~-~f~~Y~~GIy~~- 291 (317)
+ .|+++|++|||.... ..|..........++..|..+. ..+.++||++|+++|||+++|.+ + +|+.|++|||+.
T Consensus 81 ~-~Gi~~e~~~pY~~~~~~~c~~~~~~~~~~~i~~~~~~~-~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~ 158 (219)
T PF00112_consen 81 N-NGIVTEEDYPYNGNENPTCKSKKSNSYYVKIKGYGKVK-DNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPP 158 (219)
T ss_dssp H-TSBEBTTTS--SSSSSCSSCHSGGGEEEBEESEEEEEE-STCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECST
T ss_pred c-Cccccccccccccccccccccccccccccccccccccc-ccchhHHHHHHhhCceeeeeeeccccccccccceeeecc
Confidence 4 699999999999887 6898764431136888888775 34689999999999999999999 6 699999999998
Q ss_pred CCCCCCCCCCeEEEEEEeecC--CCee
Q psy12185 292 NCDGSLANINHAVQIVGYDNY--SRTW 316 (317)
Q Consensus 292 ~c~~~~~~~~Hav~iVGyg~~--~~~W 316 (317)
.|.... ++|||+|||||++ .++|
T Consensus 159 ~~~~~~--~~Hav~iVGy~~~~~~~~w 183 (219)
T PF00112_consen 159 DCSNES--GGHAVLIVGYDDENGKGYW 183 (219)
T ss_dssp SSSSSS--EEEEEEEEEEEEETTEEEE
T ss_pred cccccc--ccccccccccccccceeeE
Confidence 576544 8999999999998 4566
No 13
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00 E-value=7.7e-37 Score=268.42 Aligned_cols=169 Identities=31% Similarity=0.453 Sum_probs=143.9
Q ss_pred eeecccCCCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcC--CCccCCHHHHhhcCCCC----CCCCCCCchHHHHH-HH
Q psy12185 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG--TLSLLSVQEVIDCAGNG----NMGCSGGDFCALLD-WM 213 (317)
Q Consensus 141 ~~DwR~~g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~dC~~~~----~~gC~GG~~~~a~~-y~ 213 (317)
++|||+.+ ++||+|||.||+|||||+++++|++++++++ +.+.||+|+|++|.... ..||.||++..++. ++
T Consensus 1 ~~d~r~~~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~~~~~ 79 (223)
T cd02619 1 SVDLRPLR-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSALLKLV 79 (223)
T ss_pred CCcchhcC-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHHHHHH
Confidence 48999988 9999999999999999999999999999988 88999999999998742 37999999999998 77
Q ss_pred HHhCCCCCCCCCCccccCCCcccCC----CCCCCceEEeeeEEecCCCcHHHHHHHHHhCCCeEEEEec-ccccccCCCe
Q psy12185 214 DVNKVVLEPESEYPLLLKDAACKRK----ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGV 288 (317)
Q Consensus 214 ~~~~~Gi~~e~~yPY~~~~~~C~~~----~~~~~~~~i~~~~~~~~~~~~~~i~~al~~~gPv~v~i~~-~~f~~Y~~GI 288 (317)
.+ .|+++|++|||......|... ... ..+++..|..+. ..++++||++|+++|||+++|.+ ..|..|++|+
T Consensus 80 ~~--~Gi~~e~~~Py~~~~~~~~~~~~~~~~~-~~~~~~~y~~~~-~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~ 155 (223)
T cd02619 80 AL--KGIPPEEDYPYGAESDGEEPKSEAALNA-AKVKLKDYRRVL-KNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGI 155 (223)
T ss_pred HH--cCCCccccCCCCCCCCCCCCCCccchhh-cceeecceeEeC-chhHHHHHHHHHHCCCEEEEEEcccchhcccCcc
Confidence 66 799999999999988777543 222 457888888764 45678999999999999999999 8999999999
Q ss_pred Ee----C--CCCCCCCCCCeEEEEEEeecCC----Cee
Q psy12185 289 IQ----Y--NCDGSLANINHAVQIVGYDNYS----RTW 316 (317)
Q Consensus 289 y~----~--~c~~~~~~~~Hav~iVGyg~~~----~~W 316 (317)
|. . .|... .++|||+|||||++. ++|
T Consensus 156 ~~~~~~~~~~~~~~--~~~Hav~ivGy~~~~~~~~~~~ 191 (223)
T cd02619 156 IYEEIVYLLYEDGD--LGGHAVVIVGYDDNYVEGKGAF 191 (223)
T ss_pred ccccccccccCCCc--cCCeEEEEEeecCCCCCCCCEE
Confidence 73 2 34333 389999999999975 666
No 14
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00 E-value=4e-37 Score=260.95 Aligned_cols=134 Identities=37% Similarity=0.731 Sum_probs=118.9
Q ss_pred CCceeecccCCCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcCCCccCCHHHHhhcCCCCCCCCCCCchHHHHHHHHHhC
Q psy12185 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNK 217 (317)
Q Consensus 138 lP~~~DwR~~g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~dC~~~~~~gC~GG~~~~a~~y~~~~~ 217 (317)
||++||||+.++++||+|||.||+|||||+++++|++++++++..++||+|+|++|....+.||+||++..|++|+.++
T Consensus 1 lP~~~D~R~~~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~~- 79 (174)
T smart00645 1 LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKKN- 79 (174)
T ss_pred CCCcCcccccCCCCccccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHHc-
Confidence 6999999999999999999999999999999999999999999999999999999997336699999999999999873
Q ss_pred CCCCCCCCCccccCCCcccCCCCCCCceEEeeeEEecCCCcHHHHHHHHHhCCCeEEEEecccccccCCCeEeC-CCCCC
Q psy12185 218 VVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQY-NCDGS 296 (317)
Q Consensus 218 ~Gi~~e~~yPY~~~~~~C~~~~~~~~~~~i~~~~~~~~~~~~~~i~~al~~~gPv~v~i~~~~f~~Y~~GIy~~-~c~~~ 296 (317)
.|+++|++|||.. ++.+.+.+|++|++|||+. .|...
T Consensus 80 ~Gi~~e~~~PY~~------------------------------------------~~~~~~~~f~~Y~~Gi~~~~~~~~~ 117 (174)
T smart00645 80 GGLETESCYPYTG------------------------------------------SVAIDASDFQFYKSGIYDHPGCGSG 117 (174)
T ss_pred CCcccccccCccc------------------------------------------EEEEEcccccCCcCeEECCCCCCCC
Confidence 4999999999974 5666666799999999988 58654
Q ss_pred CCCCCeEEEEEEeecC-C--Cee
Q psy12185 297 LANINHAVQIVGYDNY-S--RTW 316 (317)
Q Consensus 297 ~~~~~Hav~iVGyg~~-~--~~W 316 (317)
. ++|+|+|||||.+ + .||
T Consensus 118 ~--~~Hav~ivGyg~~~~g~~yW 138 (174)
T smart00645 118 T--LDHAVLIVGYGTEENGKDYW 138 (174)
T ss_pred c--ccEEEEEEEEeecCCCeeEE
Confidence 3 7999999999976 3 376
No 15
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00 E-value=1.6e-33 Score=282.30 Aligned_cols=160 Identities=24% Similarity=0.391 Sum_probs=125.9
Q ss_pred CCCCcCCCCCccHHHHHHHHHHHHHHHHHcCCCccCCHHHHhhcCCC-CCCCCCCCchH-HHHHHHHHhCCCCCCCCCCc
Q psy12185 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFC-ALLDWMDVNKVVLEPESEYP 227 (317)
Q Consensus 150 v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~dC~~~-~~~gC~GG~~~-~a~~y~~~~~~Gi~~e~~yP 227 (317)
..||||||.||+|||||+++++|++++|+++..+.||+|+|+||+.. ++.||.||+.. .++.|+.++ +|+++|++||
T Consensus 544 ~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~-GgLptESdYP 622 (1004)
T PTZ00462 544 KIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDN-GFLPADSNYL 622 (1004)
T ss_pred CCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHc-CCCcccccCC
Confidence 57999999999999999999999999999999999999999999863 46899999755 555888763 5789999999
Q ss_pred ccc--CCCcccCCCCC-----------------CCceEEeeeEEecC---CCc----HHHHHHHHHhCCCeEEEEecccc
Q psy12185 228 LLL--KDAACKRKATS-----------------PNGVKIKSYTCDTL---IPS----ESSILTDIATHGPVIAAVNALTW 281 (317)
Q Consensus 228 Y~~--~~~~C~~~~~~-----------------~~~~~i~~~~~~~~---~~~----~~~i~~al~~~gPv~v~i~~~~f 281 (317)
|.. ..+.|+..... ...+.+++|..+.. ..+ +++||++|+++|||+|+|++.+|
T Consensus 623 Yt~k~~~g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAsdf 702 (1004)
T PTZ00462 623 YNYTKVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAENV 702 (1004)
T ss_pred CccCCCCCCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEeehH
Confidence 985 56779743210 01223455554321 111 36899999999999999999778
Q ss_pred ccc-CCCeEeC-CCCCCCCCCCeEEEEEEeecC
Q psy12185 282 QYY-LGGVIQY-NCDGSLANINHAVQIVGYDNY 312 (317)
Q Consensus 282 ~~Y-~~GIy~~-~c~~~~~~~~Hav~iVGyg~~ 312 (317)
+.| .+|||.. .|+... ++|||+|||||.+
T Consensus 703 ~~Y~~sGIyv~~~Cgs~~--~nHAVlIVGYGt~ 733 (1004)
T PTZ00462 703 LGYEFNGKKVQNLCGDDT--ADHAVNIVGYGNY 733 (1004)
T ss_pred HhhhcCCccccCCCCCCc--CCceEEEEEeccc
Confidence 888 4898765 697644 7999999999974
No 16
>KOG1544|consensus
Probab=99.97 E-value=5.8e-33 Score=244.58 Aligned_cols=229 Identities=20% Similarity=0.295 Sum_probs=168.6
Q ss_pred HHHHHHhhcCCCCCCeEEe-cCcCCCCCHHHHHHhhhcccccccccccCCCCCCCCCccccccccccCCCCCCCCCceee
Q psy12185 65 DIIEELNKNRQSPESARYG-ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143 (317)
Q Consensus 65 ~~I~~~N~~~~~~~~~~~g-~N~fsD~t~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~D 143 (317)
++|+++| .+..+|+.+ ..+|=.||.++=. ++.++.+. |..+..+ .+. +....++..+||+.||
T Consensus 151 d~iE~in---~G~YgW~A~NYSaFWGmtL~DGi-KyRLGTL~---Ps~sv~n--------MNE-i~~~l~p~~~LPE~F~ 214 (470)
T KOG1544|consen 151 DMIEAIN---QGNYGWQAGNYSAFWGMTLDDGI-KYRLGTLR---PSSSVMN--------MNE-IYTVLNPGEVLPEAFE 214 (470)
T ss_pred HHHHHHh---cCCccccccchhhhhcccccccc-eeeecccC---chhhhhh--------HHh-HhhccCcccccchhhh
Confidence 4789999 677777765 2368788877632 44455543 2111100 000 0001233478999999
Q ss_pred cccC--CCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcC--CCccCCHHHHhhcCCCCCCCCCCCchHHHHHHHHHhCCC
Q psy12185 144 WREA--GIIGKVRNQQTCGACWAFSTVETAESMHALKNG--TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVV 219 (317)
Q Consensus 144 wR~~--g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~dC~~~~~~gC~GG~~~~a~~y~~~~~~G 219 (317)
-+++ +++.|+.|||+|++.|||+++++...+++|... -...||+|+|++|......||+||+.+.||=||.+ .|
T Consensus 215 As~KWp~liH~plDQgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRK--rG 292 (470)
T KOG1544|consen 215 ASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRK--RG 292 (470)
T ss_pred hhhcCCccccCccccCCcccceeeeeehhccceeEEeeccccccccChHHhcchhhhhhccCccCcccchheeeec--cc
Confidence 9975 899999999999999999999999999999876 45789999999999877899999999999999977 79
Q ss_pred CCCCCCCccccC----CCcc------------------cCCCCCC-CceEEeeeEEecCCCcHHHHHHHHHhCCCeEEEE
Q psy12185 220 LEPESEYPLLLK----DAAC------------------KRKATSP-NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276 (317)
Q Consensus 220 i~~e~~yPY~~~----~~~C------------------~~~~~~~-~~~~i~~~~~~~~~~~~~~i~~al~~~gPv~v~i 276 (317)
++...||||... .+.| +.+...+ ..++.+.-..+ +.+|++||++|+++|||.+.|
T Consensus 293 vVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkRqat~~CPn~~~~Sn~iyq~tPPYrV--SSnE~eImkElM~NGPVQA~m 370 (470)
T KOG1544|consen 293 VVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNSNDIYQVTPPYRV--SSNEKEIMKELMENGPVQALM 370 (470)
T ss_pred ccccccccccCCCCCCCCCceeeccccCcccccccCcCCCcccccCceeeecCCeec--cCCHHHHHHHHHhCCChhhhh
Confidence 999999999852 2334 3321110 22333332222 578999999999999999999
Q ss_pred ec-ccccccCCCeEeCCCCC------CCCCCCeEEEEEEeecCC
Q psy12185 277 NA-LTWQYYLGGVIQYNCDG------SLANINHAVQIVGYDNYS 313 (317)
Q Consensus 277 ~~-~~f~~Y~~GIy~~~c~~------~~~~~~Hav~iVGyg~~~ 313 (317)
.+ ++|.+|++|||.+.-.. ....+.|+|.|.|||++.
T Consensus 371 ~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~ 414 (470)
T KOG1544|consen 371 EVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEET 414 (470)
T ss_pred hhhhhhhhhccceeeccccccCCchhhhhcccceEEEeeccccc
Confidence 99 99999999999872111 112478999999999864
No 17
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.71 E-value=2.6e-17 Score=113.82 Aligned_cols=57 Identities=39% Similarity=0.714 Sum_probs=50.2
Q ss_pred HHHHHHHhCCccCch-hHHHHHHHHHHHHHHHHHHhhcCCCCCCeEEecCcCCCCCHHHH
Q psy12185 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95 (317)
Q Consensus 37 f~~~~~~~~K~Y~~~-E~~~R~~iF~~n~~~I~~~N~~~~~~~~~~~g~N~fsD~t~eEf 95 (317)
|++|+++|+|.|.+. |+.+|+.+|.+|+++|++||+ .+..+|++|+|+|||||.+||
T Consensus 1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~--~~~~~~~~~~N~fsD~t~eEf 58 (58)
T PF08246_consen 1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNA--NGNNTYKLGLNQFSDMTPEEF 58 (58)
T ss_dssp HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHH--TTSSSEEE-SSTTTTSSHHHH
T ss_pred CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhc--CCCCCeEEeCccccCcChhhC
Confidence 899999999999888 999999999999999999994 578899999999999999998
No 18
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.67 E-value=2.3e-17 Score=149.57 Aligned_cols=171 Identities=18% Similarity=0.165 Sum_probs=106.2
Q ss_pred CCCCceeecccCCCCCCCcCCCCCccHHHHHHHHHHHHHHHHHcC---CCccCCHHHHhhcCCCCCCC-CCCCchHHHHH
Q psy12185 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG---TLSLLSVQEVIDCAGNGNMG-CSGGDFCALLD 211 (317)
Q Consensus 136 ~~lP~~~DwR~~g~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~---~~~~lS~q~l~dC~~~~~~g-C~GG~~~~a~~ 211 (317)
..+|+.||||..|.|+||||||.||+||||++++++|+.+.-... ....+..+..+-|......+ -+||....+..
T Consensus 97 ~s~~~~fd~r~~g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~~w~~s~~nm~~ll~~~ye~~fd~~~~d~g~~~m~~a 176 (372)
T COG4870 97 ASLPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLESYLNPESAWDFSENNMKNLLGVPYEKGFDYTSNDGGNADMSAA 176 (372)
T ss_pred ccchhheeeeccCCcccccccCcccceEeeeehhhhhheecccccccccccchhhhcCCCccccCCCccccCCccccccc
Confidence 458999999999999999999999999999999999998765441 11223233333333211111 12677777777
Q ss_pred HHHHhCCCCCCCCCCccccCCCcccCCCCCCCceEEeeeEEec---CCCcHHHHHHHHHhCCCeEEEEec--ccccccCC
Q psy12185 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT---LIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286 (317)
Q Consensus 212 y~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~~i~~~~~~~---~~~~~~~i~~al~~~gPv~v~i~~--~~f~~Y~~ 286 (317)
|+.++ .|.+.|.+-||......|..+... ..++..-..++ -..+.-.|++++..+|-++..|.. ..+..-.-
T Consensus 177 ~l~e~-sgpv~et~d~y~~~s~~~~~~~p~--~k~~~~~~~i~~~~~~LdnG~i~~~~~~yg~~s~~~~id~~~~~~~~~ 253 (372)
T COG4870 177 YLTEW-SGPVYETDDPYSENSYFSPTNLPV--TKHVQEAQIIPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICI 253 (372)
T ss_pred ccccc-CCcchhhcCccccccccCCcCCch--hhccccceecccchhhhcccchHHHHhhhccccceeEEeccccccccc
Confidence 88776 788999999998766666543221 11111111111 011334477778778877755544 33322333
Q ss_pred CeEeCCCCCCCCCCCeEEEEEEeecC
Q psy12185 287 GVIQYNCDGSLANINHAVQIVGYDNY 312 (317)
Q Consensus 287 GIy~~~c~~~~~~~~Hav~iVGyg~~ 312 (317)
+.|...-. . ..+|||+||||+|.
T Consensus 254 ~~~~~~s~-~--~~gHAv~iVGyDDs 276 (372)
T COG4870 254 PYPYVDSG-E--NWGHAVLIVGYDDS 276 (372)
T ss_pred CCCCCCcc-c--cccceEEEEecccc
Confidence 44443221 2 37999999999985
No 19
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=99.55 E-value=5.1e-15 Score=101.93 Aligned_cols=56 Identities=41% Similarity=0.703 Sum_probs=51.7
Q ss_pred HHHHHHHhCCccCch-hHHHHHHHHHHHHHHHHHHhhcCCCCCCeEEecCcCCCCCHHH
Q psy12185 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94 (317)
Q Consensus 37 f~~~~~~~~K~Y~~~-E~~~R~~iF~~n~~~I~~~N~~~~~~~~~~~g~N~fsD~t~eE 94 (317)
|++|+.+|+|.|.+. |+.+|+.+|.+|++.|+.||+. +..+|++|+|+|||||++|
T Consensus 1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~--~~~~~~~~~N~fsDlt~eE 57 (57)
T smart00848 1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKK--NDHSYTLGLNQFADLTNEE 57 (57)
T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhc--CCCCeEecCcccccCCCCC
Confidence 689999999999999 9999999999999999999963 4578999999999999886
No 20
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.36 E-value=4.7e-12 Score=120.69 Aligned_cols=77 Identities=23% Similarity=0.308 Sum_probs=63.0
Q ss_pred CCCcCCCCCccHHHHHHHHHHHHHHHHHc-CCCccCCHHHHhh----------------cCCC-----------CCCCCC
Q psy12185 151 GKVRNQQTCGACWAFSTVETAESMHALKN-GTLSLLSVQEVID----------------CAGN-----------GNMGCS 202 (317)
Q Consensus 151 ~~v~dQg~CGsCwAfa~~~~le~~~~i~~-~~~~~lS~q~l~d----------------C~~~-----------~~~gC~ 202 (317)
.||+||+.-|.||.||+..+++..+..+. .+.+.||+.+|.- +... .....+
T Consensus 55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~D 134 (437)
T cd00585 55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNLKEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQND 134 (437)
T ss_pred CCcccCCCCchhHHHHCHHHHHHHHHHHcCCCCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcCC
Confidence 48999999999999999999999888754 4789999876654 3211 245678
Q ss_pred CCchHHHHHHHHHhCCCCCCCCCCccc
Q psy12185 203 GGDFCALLDWMDVNKVVLEPESEYPLL 229 (317)
Q Consensus 203 GG~~~~a~~y~~~~~~Gi~~e~~yPY~ 229 (317)
||.-..+...+.+ +|+++.+.||-+
T Consensus 135 GGqw~m~~~li~K--YGvVPk~~~pet 159 (437)
T cd00585 135 GGQWDMLVNLIEK--YGLVPKSVMPES 159 (437)
T ss_pred CCchHHHHHHHHH--cCCCcccccCCC
Confidence 9999999999977 899999999943
No 21
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=97.56 E-value=0.00012 Score=70.47 Aligned_cols=77 Identities=25% Similarity=0.359 Sum_probs=51.3
Q ss_pred CCCcCCCCCccHHHHHHHHHHHHHHHHHcC-CCccCCHHHHh----------------hcCCC-----------CCCCCC
Q psy12185 151 GKVRNQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVI----------------DCAGN-----------GNMGCS 202 (317)
Q Consensus 151 ~~v~dQg~CGsCwAfa~~~~le~~~~i~~~-~~~~lS~q~l~----------------dC~~~-----------~~~gC~ 202 (317)
.||.||..-|.||.||+...++..+..+.+ +...||+-+|. ++... .....+
T Consensus 56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~D 135 (438)
T PF03051_consen 56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNLKDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVSD 135 (438)
T ss_dssp -S--B--BSSTHHHHHHHHHHHHHHHHHCT-SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-S
T ss_pred CCCCCCCCCCCcchhhchHHHHHHHHHHcCCCceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCCC
Confidence 489999999999999999999999998876 78999998764 43321 134578
Q ss_pred CCchHHHHHHHHHhCCCCCCCCCCccc
Q psy12185 203 GGDFCALLDWMDVNKVVLEPESEYPLL 229 (317)
Q Consensus 203 GG~~~~a~~y~~~~~~Gi~~e~~yPY~ 229 (317)
||.-..+.+-+.. +||++.+.||-+
T Consensus 136 GGqw~~~~nli~K--YGvVPk~~mpet 160 (438)
T PF03051_consen 136 GGQWDMVVNLIKK--YGVVPKSVMPET 160 (438)
T ss_dssp -B-HHHHHHHHHH--H---BGGGSTTG
T ss_pred CCchHHHHHHHHH--cCcCcHhhCCCC
Confidence 9999998888877 899999999954
No 22
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=95.64 E-value=0.012 Score=37.14 Aligned_cols=36 Identities=19% Similarity=0.240 Sum_probs=22.6
Q ss_pred HHHHHHHhhcCCCCCCeEEecCcCCCCCHHHHHHhhhccccc
Q psy12185 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN 105 (317)
Q Consensus 64 ~~~I~~~N~~~~~~~~~~~g~N~fsD~t~eEf~~~~~~~~~~ 105 (317)
-++|+.+|+ .+.+|+.|.| |.+.+.+.+ +.|+|...
T Consensus 3 de~I~~IN~---~~~tWkAG~N-F~~~~~~~i--k~LlGv~~ 38 (41)
T PF08127_consen 3 DEFIDYINS---KNTTWKAGRN-FENTSIEYI--KRLLGVLP 38 (41)
T ss_dssp HHHHHHHHH---CT-SEEE-----SSB-HHHH--HHCS-B-T
T ss_pred HHHHHHHHc---CCCcccCCCC-CCCCCHHHH--HHHcCCCC
Confidence 368999995 4889999999 899999999 56777643
No 23
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=93.91 E-value=0.53 Score=39.39 Aligned_cols=104 Identities=13% Similarity=0.112 Sum_probs=59.9
Q ss_pred CCCCCccHHHHHHHHHHHHHHHHH--------cCCCccCCHHHHhhcCCCCCCCCCCCchHHHHHHHHHhCCCCCCCCCC
Q psy12185 155 NQQTCGACWAFSTVETAESMHALK--------NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEY 226 (317)
Q Consensus 155 dQg~CGsCwAfa~~~~le~~~~i~--------~~~~~~lS~q~l~dC~~~~~~gC~GG~~~~a~~y~~~~~~Gi~~e~~y 226 (317)
.||.=+=|-+|+.++.|-+....+ +...+.+|+++|.+++. .+...++|.+. .|.
T Consensus 18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~~yPn~s~~~l~~~~~---------~~~~~i~y~ks--~g~------ 80 (175)
T PF05543_consen 18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRYLYPNVSEEQLKFTSL---------TPNQMIKYAKS--QGR------ 80 (175)
T ss_dssp --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHHHSTTS-CCCHHH--B----------HHHHHHHHHH--TTE------
T ss_pred ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHHHCCCCCHHHHhhcCC---------CHHHHHHHHHH--cCc------
Confidence 588888899999999886642211 11357788888888754 45678888765 332
Q ss_pred ccccCCCcccCCCCCCCceEEeeeEEecCCCcHHHHHHHHHhCCCeEEEEec-ccccccCCCeEeCCCCCCCCCCCeEEE
Q psy12185 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQ 305 (317)
Q Consensus 227 PY~~~~~~C~~~~~~~~~~~i~~~~~~~~~~~~~~i~~al~~~gPv~v~i~~-~~f~~Y~~GIy~~~c~~~~~~~~Hav~ 305 (317)
.+ .+... ..+-+++++.+-++-|+.+..+. +. ......+||++
T Consensus 81 -------------------~~-~~~n~--~~s~~eV~~~~~~nk~i~i~~~~v~~--------------~~~~~~gHAla 124 (175)
T PF05543_consen 81 -------------------NP-QYNNR--MPSFDEVKKLIDNNKGIAILADRVEQ--------------TNGPHAGHALA 124 (175)
T ss_dssp -------------------EE-EEECS-----HHHHHHHHHTT-EEEEEEEETTS--------------CTTB--EEEEE
T ss_pred -------------------ch-hHhcC--CCCHHHHHHHHHcCCCeEEEeccccc--------------CCCCccceeEE
Confidence 11 22211 12457889899888888886653 11 11123699999
Q ss_pred EEEeec
Q psy12185 306 IVGYDN 311 (317)
Q Consensus 306 iVGyg~ 311 (317)
||||-.
T Consensus 125 vvGya~ 130 (175)
T PF05543_consen 125 VVGYAK 130 (175)
T ss_dssp EEEEEE
T ss_pred EEeeee
Confidence 999976
No 24
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=92.09 E-value=0.37 Score=44.42 Aligned_cols=76 Identities=24% Similarity=0.341 Sum_probs=51.1
Q ss_pred CCcCCCCCccHHHHHHHHHHHHHHHHHcC-CCccCCHHHHh----------------hcCCC-----------CCCCCCC
Q psy12185 152 KVRNQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVI----------------DCAGN-----------GNMGCSG 203 (317)
Q Consensus 152 ~v~dQg~CGsCwAfa~~~~le~~~~i~~~-~~~~lS~q~l~----------------dC~~~-----------~~~gC~G 203 (317)
||-||..-|-||-||+..++--.+.-.-+ +.+.||..++. .-... ...-=+|
T Consensus 59 ~vtNQk~SGRCWmFAAlNtfRhk~~~el~le~fElSQaytfFwDKlEKaN~FleqIi~tadq~ldsRlv~~LL~~PqqDG 138 (444)
T COG3579 59 KVTNQKQSGRCWMFAALNTFRHKLISELKLEDFELSQAYTFFWDKLEKANWFLEQIIETADQELDSRLVSFLLATPQQDG 138 (444)
T ss_pred ccccccccceehHHHHHHHHHHHHHHhcCcceeehhhHHHHHHHHHHHhhHHHHHHHhhcccchHHHHHHHHHcCccccC
Confidence 88999999999999999887654443333 45677765432 21110 1333457
Q ss_pred CchHHHHHHHHHhCCCCCCCCCCccc
Q psy12185 204 GDFCALLDWMDVNKVVLEPESEYPLL 229 (317)
Q Consensus 204 G~~~~a~~y~~~~~~Gi~~e~~yPY~ 229 (317)
|--......+.. +|+++-++||-+
T Consensus 139 GQwdM~v~l~eK--YGvVpK~~ypes 162 (444)
T COG3579 139 GQWDMFVSLFEK--YGVVPKSVYPES 162 (444)
T ss_pred chHHHHHHHHHH--hCCCchhhcccc
Confidence 766666666655 899999999954
No 25
>KOG4128|consensus
Probab=87.28 E-value=0.64 Score=42.77 Aligned_cols=77 Identities=21% Similarity=0.253 Sum_probs=56.0
Q ss_pred CCCCcCCCCCccHHHHHHHHHHHHHHHHHcC-CCccCCHHHHhh--------------------cCCC---------CCC
Q psy12185 150 IGKVRNQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVID--------------------CAGN---------GNM 199 (317)
Q Consensus 150 v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~-~~~~lS~q~l~d--------------------C~~~---------~~~ 199 (317)
-+||.||..-|-||.|+....+---+..+-+ .-..||..+|.- |... .+.
T Consensus 62 ~~pvtnqkssGrcWift~ln~lrl~~~~kLnl~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP 141 (457)
T KOG4128|consen 62 RQPVTNQKSSGRCWIFTGLNLLRLEMDRKLNLPEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNP 141 (457)
T ss_pred CcccccCcCCCceEEEechhHHHHHHHhcCCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCC
Confidence 4699999999999999999987655544433 457788776641 2221 244
Q ss_pred CCCCCchHHHHHHHHHhCCCCCCCCCCcc
Q psy12185 200 GCSGGDFCALLDWMDVNKVVLEPESEYPL 228 (317)
Q Consensus 200 gC~GG~~~~a~~y~~~~~~Gi~~e~~yPY 228 (317)
-=+||.-..-.+.++. +|+.+..|||-
T Consensus 142 ~~DGGqw~MfvNlVkK--YGviPKkcy~~ 168 (457)
T KOG4128|consen 142 VPDGGQWQMFVNLVKK--YGVIPKKCYLH 168 (457)
T ss_pred CCCCchHHHHHHHHHH--hCCCcHHhccc
Confidence 4568888877788876 89999999984
No 26
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=86.99 E-value=1.5 Score=34.57 Aligned_cols=48 Identities=31% Similarity=0.324 Sum_probs=29.4
Q ss_pred CcHHHHHHHHHhCCCeEEEEec--ccccccCCCeEeCCCCCCCCCCCeEEEEEEeecCC
Q psy12185 257 PSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS 313 (317)
Q Consensus 257 ~~~~~i~~al~~~gPv~v~i~~--~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~ 313 (317)
.+.+.|+++|....||.+.+.. ... .++.+.. . ...|.|+|+||+++.
T Consensus 87 ~~~~~i~~~i~~G~Pvi~~~~~~~~~~---~~~~~~~----~--~~~H~vvi~Gy~~~~ 136 (144)
T PF13529_consen 87 ASFDDIKQEIDAGRPVIVSVNSGWRPP---NGDGYDG----T--YGGHYVVIIGYDEDG 136 (144)
T ss_dssp S-HHHHHHHHHTT--EEEEEETTSS-----TTEEEEE-------TTEEEEEEEEE-SSE
T ss_pred CcHHHHHHHHHCCCcEEEEEEcccccC---CCCCcCC----C--cCCEEEEEEEEeCCC
Confidence 4568899999988899999973 111 1122221 1 169999999999754
No 27
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=86.65 E-value=1.5 Score=40.38 Aligned_cols=49 Identities=18% Similarity=0.277 Sum_probs=31.7
Q ss_pred HHHHHHHHhCCCeEEEEecccccccCCCeEeCCCCCCCCCCCeEEEEEEeecCCCe
Q psy12185 260 SSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315 (317)
Q Consensus 260 ~~i~~al~~~gPv~v~i~~~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~~~ 315 (317)
+.|+++|....||.+.++.-.+ -|...-| .. ....|.|+|+||+++++.
T Consensus 79 ~~l~~~l~~g~pv~~~~D~~~l-py~~~~~----~~--~~~~H~i~v~G~d~~~~~ 127 (317)
T PF14399_consen 79 EELKEALDAGRPVIVWVDMYYL-PYRPNYY----KK--HHADHYIVVYGYDEEEDV 127 (317)
T ss_pred HHHHHHHhCCCceEEEeccccC-CCCcccc----cc--ccCCcEEEEEEEeCCCCE
Confidence 6788888776699999875211 1211111 11 126999999999987654
No 28
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=81.64 E-value=4.6 Score=35.14 Aligned_cols=57 Identities=25% Similarity=0.404 Sum_probs=35.6
Q ss_pred cHHHHHHHHHhCCCeEEEEeccccc--ccCCCeEe---CCC-CCCCCCCCeEEEEEEeecCCC
Q psy12185 258 SESSILTDIATHGPVIAAVNALTWQ--YYLGGVIQ---YNC-DGSLANINHAVQIVGYDNYSR 314 (317)
Q Consensus 258 ~~~~i~~al~~~gPv~v~i~~~~f~--~Y~~GIy~---~~c-~~~~~~~~Hav~iVGyg~~~~ 314 (317)
..++|...|.+.||++|-+++.-.. .-++-... ..| +....-.+|-|+|+||+...+
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~~ 174 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAATK 174 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCCCC
Confidence 4578999999999998888873222 11222221 122 222223799999999997654
No 29
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=72.22 E-value=2 Score=21.77 Aligned_cols=14 Identities=29% Similarity=0.539 Sum_probs=7.9
Q ss_pred CccHHHHHHHHHHH
Q psy12185 1 MFDVKNVLFIVALI 14 (317)
Q Consensus 1 ~~~~~~~~~~~~l~ 14 (317)
||+..+.|++++++
T Consensus 1 MMk~vIIlvvLLli 14 (19)
T PF13956_consen 1 MMKLVIILVVLLLI 14 (19)
T ss_pred CceehHHHHHHHhc
Confidence 67766555554443
No 30
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=67.10 E-value=14 Score=30.61 Aligned_cols=39 Identities=26% Similarity=0.315 Sum_probs=26.6
Q ss_pred HHHHHHHHHhCCCeEEEEecccccccCCCeEeCCCCCCCCCCCeEEEEEEeecCC
Q psy12185 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS 313 (317)
Q Consensus 259 ~~~i~~al~~~gPv~v~i~~~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~ 313 (317)
.+.+...|.++||+-++..++ .... ..|+++|.|-..++
T Consensus 98 ~e~~~~LL~~yGPLwv~~~~P--------------~~~~--~~H~~ViTGI~~dg 136 (166)
T PF12385_consen 98 AEGLANLLREYGPLWVAWEAP--------------GDSW--VAHASVITGIDGDG 136 (166)
T ss_pred HHHHHHHHHHcCCeEEEecCC--------------CCcc--eeeEEEEEeecCCC
Confidence 467889999999999986641 1111 35888888755443
No 31
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=61.47 E-value=20 Score=28.16 Aligned_cols=34 Identities=41% Similarity=0.681 Sum_probs=24.8
Q ss_pred HHHHHHhCCCeEEEEecccccccCCCeEeCCCCCCCCCCCeEEEEEEee
Q psy12185 262 ILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310 (317)
Q Consensus 262 i~~al~~~gPv~v~i~~~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg 310 (317)
+++.+...-||.+.++. +.. .. ..+|.|+|+||+
T Consensus 70 ~~~~l~~~~Pvi~~~~~--------~~~-----~~--~~gH~vVv~g~~ 103 (141)
T cd02549 70 LLRQLAAGHPVIVSVNL--------GVS-----IT--PSGHAMVVIGYD 103 (141)
T ss_pred HHHHHHCCCeEEEEEec--------Ccc-----cC--CCCeEEEEEEEc
Confidence 77888888899998764 000 11 168999999998
No 32
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=56.91 E-value=22 Score=30.10 Aligned_cols=40 Identities=33% Similarity=0.464 Sum_probs=30.0
Q ss_pred CcHHHHHHHHHhCCCeEEEEecccccccCCCeEeCCCCCCCCCCCeEEEEEEeecCC
Q psy12185 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS 313 (317)
Q Consensus 257 ~~~~~i~~al~~~gPv~v~i~~~~f~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~ 313 (317)
.+..+|+..|.+..||.+.... |.- . .-|+|+|.||++.+
T Consensus 121 ksl~~ik~ql~kg~PV~iw~T~--~~~-------------~--s~H~v~itgyDk~n 160 (195)
T COG4990 121 KSLSDIKGQLLKGRPVVIWVTN--FHS-------------Y--SIHSVLITGYDKYN 160 (195)
T ss_pred CcHHHHHHHHhcCCcEEEEEec--ccc-------------c--ceeeeEeecccccc
Confidence 3568999999998999887653 211 1 47999999999864
No 33
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=48.92 E-value=47 Score=32.38 Aligned_cols=53 Identities=15% Similarity=0.171 Sum_probs=33.2
Q ss_pred HHHHHHHHhCCCeEEEEecccccccCCCeEeCCCC-------C------------CCCCCCeEEEEEEeecC
Q psy12185 260 SSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD-------G------------SLANINHAVQIVGYDNY 312 (317)
Q Consensus 260 ~~i~~al~~~gPv~v~i~~~~f~~Y~~GIy~~~c~-------~------------~~~~~~Hav~iVGyg~~ 312 (317)
+.+..+|...-||..+-++..+..-+.||.+...- . .....+|||+|||.+.+
T Consensus 300 ~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D 371 (438)
T PF03051_consen 300 DAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLD 371 (438)
T ss_dssp HHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-
T ss_pred HHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEec
Confidence 56667777778999999995455678898854220 0 01125899999999873
No 34
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=48.69 E-value=46 Score=25.51 Aligned_cols=19 Identities=16% Similarity=0.315 Sum_probs=10.1
Q ss_pred ccHHHHHHHHHHHHHHHhh
Q psy12185 2 FDVKNVLFIVALIALCFLA 20 (317)
Q Consensus 2 ~~~~~~~~~~~l~~~~~~~ 20 (317)
|++++++++++++.+.+.+
T Consensus 1 m~~~~~vll~ll~~l~y~l 19 (105)
T PRK00888 1 MRLLTLLLLALLVWLQYSL 19 (105)
T ss_pred CcHHHHHHHHHHHHHHHHH
Confidence 5555555555555555443
No 35
>PF02013 CBM_10: Cellulose or protein binding domain; InterPro: IPR002883 This domain is found in two distinct sets of proteins with different functions. Those found in aerobic bacteria bind cellulose (or other carbohydrates); but in anaerobic fungi they are protein binding domains, referred to as dockerin domains or docking domains. They are believed to be responsible for the assembly of a multiprotein cellulase/hemicellulase complex, similar to the cellulosome found in certain anaerobic bacteria. The recycling of photosynthetically fixed carbon in plant cell walls is a key microbial process. Enzyme systems that attack the plant cell wall contain noncatalytic carbohydrate-binding modules that mediate attachment to this composite structure and play a pivotal role in maximizing the hydrolytic process. In anaerobes, the degradation is carried out by a high molecular weight, multifunctional complex termed the cellulosome. This consists of a number of independent enzyme components, each of which contains a conserved 40-residue dockerin domain, which functions to bind the enzyme to a cohesin domain within the scaffoldin protein [, ]. In anaerobic bacteria that degrade plant cell walls, exemplified by Clostridium thermocellum, the dockerin domains of the catalytic polypeptides can bind equally well to any cohesin from the same organism. More recently, anaerobic fungi, typified by Piromyces equi, have been suggested to also synthesise a cellulosome complex, although the dockerin sequences of the bacterial and fungal enzymes are completely different []. For example, the fungal enzymes contain one, two or three copies of the dockerin sequence in tandem within the catalytic polypeptide. In contrast, all the C. thermocellum cellulosome catalytic components contain a single dockerin domain. The anaerobic bacterial dockerins are homologous to EF hands (calcium-binding motifs) and require calcium for activity whereas the fungal dockerin does not require calcium. Finally, the interaction between cohesin and dockerin appears to be species specific in bacteria, there is almost no species specificity of binding within fungal species and no identified sites that distinguish different species. The structure of dockerin from P. equi contains two helical stretches and four short beta-strands which form an antiparallel sheet structure adjacent to an additional short twisted parallel strand. The N- and C-termini are adjacent to each other. Aerobic bacteria contain related regions, however these appear to function as cellulose/carbohydrate binding domains.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2J4M_A 2J4N_A 1E8R_A 1QLD_A 1E8P_A 1E8Q_A.
Probab=44.48 E-value=8.8 Score=23.41 Aligned_cols=10 Identities=30% Similarity=1.049 Sum_probs=8.3
Q ss_pred EeecCCCeeC
Q psy12185 308 GYDNYSRTWC 317 (317)
Q Consensus 308 Gyg~~~~~W~ 317 (317)
|||.+++-||
T Consensus 25 ~WGvEN~~wC 34 (36)
T PF02013_consen 25 GWGVENGDWC 34 (36)
T ss_dssp EEEEETTEEE
T ss_pred CEeeECCceE
Confidence 6888888887
No 36
>PF09551 Spore_II_R: Stage II sporulation protein R (spore_II_R); InterPro: IPR014202 This entry is designated stage II sporulation protein R. A comparative genome analysis of all sequenced genomes of Firmicutes shows that the proteins are strictly conserved among the sub-set of endospore-forming species. SpoIIR is a signalling protein that links the activation of sigma E to the transcriptional activity of sigma F during sporulation [, ].
Probab=39.83 E-value=67 Score=25.71 Aligned_cols=51 Identities=18% Similarity=0.274 Sum_probs=31.8
Q ss_pred HHHHHHHhCC---CeEEEEecccc--cccCCCeEeCCCCCCCCCCCeEEEEEEeecCCCeeC
Q psy12185 261 SILTDIATHG---PVIAAVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRTWC 317 (317)
Q Consensus 261 ~i~~al~~~g---Pv~v~i~~~~f--~~Y~~GIy~~~c~~~~~~~~Hav~iVGyg~~~~~W~ 317 (317)
...+.|...| ||.|.+....| ..|.+-+| +++. ..--.+++|=|.-.++||
T Consensus 60 ~A~~~l~~~G~~y~v~v~~~~~~FPtK~YG~~~~--PaG~----YeAlrI~IG~g~G~NWWC 115 (130)
T PF09551_consen 60 IAEEVLAEEGYDYPVKVELGRFYFPTKTYGDIVL--PAGE----YEALRITIGEGKGHNWWC 115 (130)
T ss_pred HHHHHHHHhCCCCcEEEEEEeeeCCCceECCEec--cCCc----eEEEEEEecCccCcceEE
Confidence 3345555555 78888766556 66766666 3332 122345678887778999
No 37
>PF10717 ODV-E18: Occlusion-derived virus envelope protein ODV-E18; InterPro: IPR019655 Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=36.82 E-value=36 Score=24.86 Aligned_cols=21 Identities=24% Similarity=0.414 Sum_probs=12.0
Q ss_pred ccHHHHHHHHHHHHHHHhhcc
Q psy12185 2 FDVKNVLFIVALIALCFLAIP 22 (317)
Q Consensus 2 ~~~~~~~~~~~l~~~~~~~~~ 22 (317)
|-++++|+|+.|++++|-.+.
T Consensus 28 MtILivLVIIiLlImlfqsSS 48 (85)
T PF10717_consen 28 MTILIVLVIIILLIMLFQSSS 48 (85)
T ss_pred HHHHHHHHHHHHHHHHHhccC
Confidence 445556666666666665443
No 38
>PF08139 LPAM_1: Prokaryotic membrane lipoprotein lipid attachment site; InterPro: IPR012640 In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognises a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached [,]. This lipid attachment site is found in homologues of the VirB proteins of type IV secretion systems (T4SS). Conjugal transfer across the cell envelope of Gram-negative bacteria is mediated by a supramolecular structure termed mating pair formation (Mpf) complex. Collectively, secretion pathways ancestrally related to bacterial conjugation systems are now known as T4SS. T4SS are involved in the delivery of effector molecules to eukaryotic target cells; each of these systems exports distinct DNA or protein substrates to effect a myriad of changes in host cell physiology during infection [].
Probab=35.86 E-value=22 Score=19.77 Aligned_cols=16 Identities=19% Similarity=0.274 Sum_probs=8.3
Q ss_pred ccHHHHHHHHHHHHHH
Q psy12185 2 FDVKNVLFIVALIALC 17 (317)
Q Consensus 2 ~~~~~~~~~~~l~~~~ 17 (317)
+++++.++.+++++-|
T Consensus 8 Kkil~~l~a~~~LagC 23 (25)
T PF08139_consen 8 KKILFPLLALFMLAGC 23 (25)
T ss_pred HHHHHHHHHHHHHhhc
Confidence 4555555555545444
No 39
>PF11873 DUF3393: Domain of unknown function (DUF3393); InterPro: IPR024570 Membrane-bound lytic murein transglycosylase C (also known as murein hydrolase C), is a murein-degrading enzyme that may play a role in the recycling of muropeptides during cell elongation and/or cell division. This entry represents the N-terminal domain, whose function is currently not known.
Probab=33.35 E-value=46 Score=28.83 Aligned_cols=18 Identities=22% Similarity=0.209 Sum_probs=12.4
Q ss_pred ccHHHHHHHHHHHHHHHh
Q psy12185 2 FDVKNVLFIVALIALCFL 19 (317)
Q Consensus 2 ~~~~~~~~~~~l~~~~~~ 19 (317)
|++++++++++|++.|..
T Consensus 1 ~k~l~~~~~~~lL~~Cs~ 18 (204)
T PF11873_consen 1 KKKLLLLLIALLLSGCSS 18 (204)
T ss_pred CcCHHHHHHHHHHHHhCC
Confidence 566667777777777774
No 40
>COG5510 Predicted small secreted protein [Function unknown]
Probab=31.48 E-value=66 Score=20.45 Aligned_cols=11 Identities=18% Similarity=0.298 Sum_probs=4.4
Q ss_pred CccHHHHHHHH
Q psy12185 1 MFDVKNVLFIV 11 (317)
Q Consensus 1 ~~~~~~~~~~~ 11 (317)
||+-..+++++
T Consensus 1 mmk~t~l~i~~ 11 (44)
T COG5510 1 MMKKTILLIAL 11 (44)
T ss_pred CchHHHHHHHH
Confidence 44443333333
No 41
>KOG4702|consensus
Probab=31.24 E-value=1e+02 Score=21.79 Aligned_cols=33 Identities=30% Similarity=0.459 Sum_probs=26.3
Q ss_pred HHHHHHHHHHhCCccCchhHHHHHHHHHHHHHH
Q psy12185 34 LELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66 (317)
Q Consensus 34 ~~~f~~~~~~~~K~Y~~~E~~~R~~iF~~n~~~ 66 (317)
..-|++|+..|++.-.++|...|..-|++-++.
T Consensus 28 pe~Fee~v~~~krel~ppe~~~~~EE~~~~lRe 60 (77)
T KOG4702|consen 28 PEIFEEFVRGYKRELSPPEATKRKEEYENFLRE 60 (77)
T ss_pred hHHHHHHHHhccccCCChHHHhhHHHHHHHHHH
Confidence 456999999999988777888888777766543
No 42
>TIGR02837 spore_II_R stage II sporulation protein R. A comparative genome analysis of all sequenced genomes of shows a number of proteins conserved strictly among the endospore-forming subset of the Firmicutes. This protein, a member of this panel, is designated stage II sporulation protein R.
Probab=28.59 E-value=1.3e+02 Score=25.14 Aligned_cols=49 Identities=20% Similarity=0.360 Sum_probs=32.3
Q ss_pred HHHHHHhCC---CeEEEEecccc--cccCCCeEeCCCCCCCCCCCeEEEE-EEeecCCCeeC
Q psy12185 262 ILTDIATHG---PVIAAVNALTW--QYYLGGVIQYNCDGSLANINHAVQI-VGYDNYSRTWC 317 (317)
Q Consensus 262 i~~al~~~g---Pv~v~i~~~~f--~~Y~~GIy~~~c~~~~~~~~Hav~i-VGyg~~~~~W~ 317 (317)
..+.|...| ||.|.+.-..| ..|..-+| +++ ..+|+.| +|=|.-.++||
T Consensus 96 a~~~l~~~G~~y~v~v~~~~~~FPtK~YG~~~~--PaG-----~YeAlrI~IG~g~G~NWWC 150 (168)
T TIGR02837 96 AESVIKAEGADYKVRVELGKYSFPTKLYGNIVL--PAG-----EYEALRILIGEGAGANWWC 150 (168)
T ss_pred HHHHHHHhCCCCCeEEEEEEEeCCCcccCCEec--cCC-----ceEEEEEEecCcCCcceEE
Confidence 345555544 88888866666 67776666 333 2566654 68877778999
No 43
>PRK10081 entericidin B membrane lipoprotein; Provisional
Probab=26.93 E-value=80 Score=20.56 Aligned_cols=14 Identities=14% Similarity=0.183 Sum_probs=6.3
Q ss_pred CccHHHHHHHHHHH
Q psy12185 1 MFDVKNVLFIVALI 14 (317)
Q Consensus 1 ~~~~~~~~~~~~l~ 14 (317)
||+-.+.+++++++
T Consensus 1 MmKk~i~~i~~~l~ 14 (48)
T PRK10081 1 MVKKTIAAIFSVLV 14 (48)
T ss_pred ChHHHHHHHHHHHH
Confidence 55554444443333
No 44
>PF00879 Defensin_propep: Defensin propeptide The pattern for this Prosite entry doesn't match the propeptide.; InterPro: IPR002366 Defensins are 2-6 kDa, cationic, microbicidal peptides active against many Gram-negative and Gram-positive bacteria, fungi, and enveloped viruses [], containing three pairs of intramolecular disulphide bonds []. On the basis of their size and pattern of disulphide bonding, mammalian defensins are classified into alpha, beta and theta categories. Alpha-defensins, which have been identified in humans, monkeys and several rodent species, are particularly abundant in neutrophils, certain macrophage populations and Paneth cells of the small intestine. Every mammalian species explored thus far has beta-defensins. In cows, as many as 13 beta-defensins exist in neutrophils. However, in other species, beta-defensins are more often produced by epithelial cells lining various organs (e.g. the epidermis, bronchial tree and genitourinary tract). Theta-defensins are cyclic and have so far only been identified in primate phagocytes. Defensins are produced constitutively and/or in response to microbial products or proinflammatory cytokines. Some defensins are also called corticostatins (CS) because they inhibit corticotropin-stimulated corticosteroid production. The mechanism(s) by which microorganisms are killed and/or inactivated by defensins is not understood completely. However, it is generally believed that killing is a consequence of disruption of the microbial membrane. The polar topology of defensins, with spatially separated charged and hydrophobic regions, allows them to insert themselves into the phospholipid membranes so that their hydrophobic regions are buried within the lipid membrane interior and their charged (mostly cationic) regions interact with anionic phospholipid head groups and water. Subsequently, some defensins can aggregate to form `channel-like' pores; others might bind to and cover the microbial membrane in a `carpet-like' manner. The net outcome is the disruption of membrane integrity and function, which ultimately leads to the lysis of microorganisms. Some defensins are synthesized as propeptides which may be relevant to this process - in neutrophils only the mature peptides have been identified but in Paneth cells, the propeptide is stored in vesicles [] and appears to be cleaved by trypsin on activation. ; GO: 0006952 defense response
Probab=25.13 E-value=1.1e+02 Score=20.31 Aligned_cols=27 Identities=22% Similarity=0.205 Sum_probs=18.9
Q ss_pred ccHHHHHHHHHHHHHHHhhccccCCCC
Q psy12185 2 FDVKNVLFIVALIALCFLAIPVKVSKP 28 (317)
Q Consensus 2 ~~~~~~~~~~~l~~~~~~~~~~~~~~~ 28 (317)
||.+.+|..++|+.+..-+-+.....+
T Consensus 1 MRTL~LLaAlLLlAlqaQAepl~~~~d 27 (52)
T PF00879_consen 1 MRTLALLAALLLLALQAQAEPLQERAD 27 (52)
T ss_pred CcHHHHHHHHHHHHHHHhccccccccc
Confidence 667777777878888877777654444
No 45
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=24.02 E-value=4e+02 Score=23.08 Aligned_cols=17 Identities=18% Similarity=0.202 Sum_probs=13.4
Q ss_pred CeEEecCcCCCCCHHHHH
Q psy12185 79 SARYGITEFSDLSEEEFK 96 (317)
Q Consensus 79 ~~~~g~N~fsD~t~eEf~ 96 (317)
.+.|| +.+...|+|+..
T Consensus 77 ~~vLG-k~~k~aspeQ~~ 93 (202)
T COG2854 77 KLVLG-KYYKTASPEQRQ 93 (202)
T ss_pred HHHhc-cccccCCHHHHH
Confidence 47788 889999998654
No 46
>COG3167 PilO Tfp pilus assembly protein PilO [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=23.76 E-value=1.1e+02 Score=26.14 Aligned_cols=62 Identities=18% Similarity=0.134 Sum_probs=29.9
Q ss_pred cHHHHHHHHHHHHHHHhhccccCCCCchHHHHHHHHHHHHHhCCc---------cCch--hHHHHHHHHHHHH
Q psy12185 3 DVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKS---------YSKS--EHDIRFKNFEKSL 64 (317)
Q Consensus 3 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~K~---------Y~~~--E~~~R~~iF~~n~ 64 (317)
+++++|++.+++.........+.....-+.....=+.-|.+|+++ +... |.++|+.+-..-+
T Consensus 24 rlv~~lL~~~~V~~lGy~f~~s~k~eel~~~~~eEe~LKs~~q~K~~~aanL~~lr~Ql~emee~~~~llrQL 96 (211)
T COG3167 24 RLVFCLLAVAAVLGLGYAFYLSGKLEELEELEAEEEELKSTYQQKAIQAANLEALRAQLAEMEERFDILLRQL 96 (211)
T ss_pred HHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHhC
Confidence 455666555555544444444444443333344444445555432 3333 5555665555443
No 47
>PF11912 DUF3430: Protein of unknown function (DUF3430); InterPro: IPR021837 This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 209 to 265 amino acids in length.
Probab=21.55 E-value=75 Score=27.13 Aligned_cols=17 Identities=18% Similarity=0.409 Sum_probs=8.6
Q ss_pred ccHHHHHHHHHHHHHHH
Q psy12185 2 FDVKNVLFIVALIALCF 18 (317)
Q Consensus 2 ~~~~~~~~~~~l~~~~~ 18 (317)
||+++.|++++++...+
T Consensus 1 MKll~~lilli~~~~~~ 17 (212)
T PF11912_consen 1 MKLLISLILLILLIINF 17 (212)
T ss_pred CcHHHHHHHHHHHHHhh
Confidence 66655555544444443
No 48
>PHA02661 vascular endothelial growth factor like protein; Provisional
Probab=20.74 E-value=72 Score=26.07 Aligned_cols=49 Identities=18% Similarity=0.242 Sum_probs=24.7
Q ss_pred ccHHHHHHHHHHHHHHHhhccccCCCCchHHHHHHHHHHHHHhCCccCc
Q psy12185 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK 50 (317)
Q Consensus 2 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~K~Y~~ 50 (317)
|+++..|-..+++.+++...+........+.....=..|+..|+|.+=.
T Consensus 1 m~l~~~~~~~~a~l~~~~~~~~~~~~~~~~~~~~~v~~~~ev~~~s~Ck 49 (146)
T PHA02661 1 MKLITTLQFAVALLLCMYNVPWCFSGSTGSSSASSLSSWLDTSDKSSCQ 49 (146)
T ss_pred CchHHHHHHHHHHHHHHHhchhcccccccCCCCcccccHHHHhhccCce
Confidence 4455445444444455555554443332222334455677777777543
No 49
>PRK09810 entericidin A; Provisional
Probab=20.11 E-value=1.1e+02 Score=19.17 Aligned_cols=10 Identities=30% Similarity=0.328 Sum_probs=5.1
Q ss_pred CccHHHHHHH
Q psy12185 1 MFDVKNVLFI 10 (317)
Q Consensus 1 ~~~~~~~~~~ 10 (317)
||+-++.+++
T Consensus 1 mMkk~~~l~~ 10 (41)
T PRK09810 1 MMKRLIVLVL 10 (41)
T ss_pred ChHHHHHHHH
Confidence 5555555443
Done!