RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy1705
         (309 letters)



>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score =  334 bits (858), Expect = e-115
 Identities = 117/302 (38%), Positives = 170/302 (56%), Gaps = 8/302 (2%)

Query: 14  QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
           +K ++K Y  K  +  ++L W+ N K I  HN EA  G+H Y L  NHL D+     +++
Sbjct: 15  KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 74

Query: 74  MTRLTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
           MT L           +    E     PD +D+R+KG++TP  NQ  CG+C+AFS   A++
Sbjct: 75  MTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALE 134

Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
           GQ+ K T ++  LS Q +VDC   S N GC GG + N   YVQ   G+  E+ YPY G++
Sbjct: 135 GQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 192

Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
             C +           +  +P  +E ALK  +A VGP++V+I+AS  +FQ Y+ G+Y DE
Sbjct: 193 ESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDE 252

Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
           +C SD +NHA+L VGY        WI+KN W  +WG+ GY+ + R  NN CGIAN A + 
Sbjct: 253 SCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 312

Query: 308 LI 309
            +
Sbjct: 313 KM 314


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score =  329 bits (847), Expect = e-113
 Identities = 106/304 (34%), Positives = 161/304 (52%), Gaps = 13/304 (4%)

Query: 15  KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
             + + Y     +  ++  W+ N K I  HNQE ++G H +T+  N   D+    + + M
Sbjct: 17  AMHNRLYGMNE-EGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVM 75

Query: 75  TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
               + + R+  V           P  +DWREKG++TP  NQ  CG+C+AFS   A++GQ
Sbjct: 76  NGFQNRKPRKGKVFQEPLFYEA--PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQ 133

Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
           +F+ T  +  LS Q +VDCS   GN GC GG +     YVQ  GGL  EE YPY+  +  
Sbjct: 134 MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES 193

Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
           CK+     V + + +  + P+ E AL   +ATVGPI+V+I+A   +F  Y  GIY +  C
Sbjct: 194 CKYNPKYSVANDAGFVDI-PKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC 252

Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
           +S+ ++H +L+VGY            W++KN W   WG  GY+ + +   N CGIA+ A 
Sbjct: 253 SSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAAS 312

Query: 306 YALI 309
           Y  +
Sbjct: 313 YPTV 316


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score =  326 bits (839), Expect = e-112
 Identities = 104/301 (34%), Positives = 159/301 (52%), Gaps = 8/301 (2%)

Query: 15  KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
           K Y K Y++K  ++ ++L W+ N K +  HN E   G+H Y L  NHL D+     +  M
Sbjct: 17  KTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLM 76

Query: 75  TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
           + L      +  +    +   +L PD +DWREKG +T    Q  CGA +AFS   A++ Q
Sbjct: 77  SSLRVPSQWQRNITYKSNPNRIL-PDSVDWREKGCVTEVKYQGSCGAAWAFSAVGALEAQ 135

Query: 135 IFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
           +   T ++  LS Q +VDCS    GN GC GG +     Y+    G+  +  YPYK    
Sbjct: 136 LKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ 195

Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
            C++         S ++ LP   E  LK  +A  GP++V ++A   +F LY SG+Y + +
Sbjct: 196 KCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPS 255

Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
           CT + VNH +L+VGY     +  W++KN W H++G+ GY+ + R   N CGIA++  Y  
Sbjct: 256 CTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPE 314

Query: 309 I 309
           I
Sbjct: 315 I 315


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  324 bits (833), Expect = e-111
 Identities = 97/301 (32%), Positives = 147/301 (48%), Gaps = 8/301 (2%)

Query: 14  QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
           ++ Y K+Y   A D  ++  W+ N K I  HN     GL  YTL  N  +D+    +  +
Sbjct: 9   KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 67

Query: 74  MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
                           P    +  +PD +DWRE G++T   +Q +CG+ +AFS    ++G
Sbjct: 68  YLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSGWAFSTTGTMEG 127

Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
           Q  K+       S QQ+VDCS   GN GC GG + N   Y++   GL  E  YPY   + 
Sbjct: 128 QYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLK-QFGLETESSYPYTAVEG 186

Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
            C++ +   V  ++ +  +    E  LK  +   GP AV+++     F +Y SGIY  + 
Sbjct: 187 QCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVE-SDFMMYRSGIYQSQT 245

Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
           C+   VNHA+L VGY      + WI+KN W   WG+ GY+ + R   N CGIA+ A   +
Sbjct: 246 CSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCGIASLASLPM 305

Query: 309 I 309
           +
Sbjct: 306 V 306


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  314 bits (807), Expect = e-107
 Identities = 93/309 (30%), Positives = 145/309 (46%), Gaps = 16/309 (5%)

Query: 14  QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
           +  Y + Y     ++ +K  +Q   +    HN++ +QGL  YTL  N  +D+ P      
Sbjct: 26  KTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAY 85

Query: 74  MTRLTHSRIRRTLVRSPESNESVL------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
              L             ++ E +        P   DWR++G ++P  NQ  CG+ +AFS 
Sbjct: 86  THGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVSPVKNQGSCGSSWAFSS 145

Query: 128 ASAIQGQIFKSTSE--IEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
             AI+ Q+  +        +S QQ+VDC      LGC+GG + +   YV   GG+  E  
Sbjct: 146 TGAIESQMKIANGAGYDSSVSEQQLVDCV--PNALGCSGGWMNDAFTYVAQNGGIDSEGA 203

Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
           YPY+     C +    +   +S +  L   DE+ L   +AT GP+AV+ +A    F  Y+
Sbjct: 204 YPYEMADGNCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDAD-DPFGSYS 262

Query: 246 SGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
            G+Y +  C ++   HA+L+VGY   N    W++KN W   WG +GY  + R  NN CGI
Sbjct: 263 GGVYYNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGLDGYFKIARNANNHCGI 322

Query: 301 ANYAVYALI 309
           A  A    +
Sbjct: 323 AGVASVPTL 331


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  312 bits (801), Expect = e-106
 Identities = 90/302 (29%), Positives = 153/302 (50%), Gaps = 9/302 (2%)

Query: 14  QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
           +  +KK Y     + +++L ++ N  KI  HN + ++G   Y+   N   D+    ++  
Sbjct: 31  KLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAY 90

Query: 74  MTRLTHSRIRR-TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
           + R    + +    +R P  +    +   +DWR    ++   +Q  CG+ ++FS   A++
Sbjct: 91  VNRGKAQKPKHPENLRMPYVSSKKPLAASVDWRSNA-VSEVKDQGQCGSSWSFSTTGAVE 149

Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
           GQ+      +  LS Q ++DCS   GN GC GG + +  +Y+    G+M E  YPY+ + 
Sbjct: 150 GQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIH-DYGIMSESAYPYEAQG 208

Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
             C+F     V  +S +  LP  DE++L   +   GP+AV+I+A+    Q Y+ G++ D+
Sbjct: 209 DYCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAIDAT-DELQFYSGGLFYDQ 267

Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
            C    +NH +L+VGY     ++ WILKN W   WG++GY    R   N CGIA  A Y 
Sbjct: 268 TCNQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWRQVRNYGNNCGIATAASYP 327

Query: 308 LI 309
            +
Sbjct: 328 AL 329


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score =  277 bits (711), Expect = 6e-93
 Identities = 85/303 (28%), Positives = 142/303 (46%), Gaps = 20/303 (6%)

Query: 14  QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
              + K Y        +   ++ N   I   N++     + Y L  N  +DL    + ++
Sbjct: 26  MLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKN----NSYWLGLNEFADLSNDEFNEK 81

Query: 74  MT-RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
               L  + I ++      + + V +P+++DWR+KG +TP  +Q  CG+C+AFS  + ++
Sbjct: 82  YVGSLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVE 141

Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
           G     T ++ ELS Q++VDC     + GC GG     L YV    G+     YPYK KQ
Sbjct: 142 GINKIRTGKLVELSEQELVDCE--RRSHGCKGGYPPYALEYVA-KNGIHLRSKYPYKAKQ 198

Query: 193 SICKFKRPNI-VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
             C+ K+    +V  S    + P +E  L   +A   P++V + +    FQLY  GI++ 
Sbjct: 199 GTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEG 257

Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNN----RCGIANY 303
             C +  V+ A+  VGY     +   ++KN W   WG+ GY+ +KR        CG+   
Sbjct: 258 P-CGTK-VDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKS 315

Query: 304 AVY 306
           + Y
Sbjct: 316 SYY 318


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  274 bits (703), Expect = 7e-92
 Identities = 67/307 (21%), Positives = 120/307 (39%), Gaps = 27/307 (8%)

Query: 14  QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
           +K + K Y     +   + ++  + K + ++               NHLSDL    +   
Sbjct: 12  KKAFNKSYATFEDEEAARKNFLESVKYVQSNG-----------GAINHLSDLSLDEFKNR 60

Query: 74  MTRLTHSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
                 +           +  +        P  +D R+   +TP   Q  CG+ +AFS  
Sbjct: 61  FLMSAEAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGV 120

Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
           +A +        +  +L+ Q++VDC+      GC G ++   + Y+Q   G+++E  Y Y
Sbjct: 121 AATESAYLAYRDQSLDLAEQELVDCA---SQHGCHGDTIPRGIEYIQ-HNGVVQESYYRY 176

Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLA-TVGPIAVSINAS-PHTFQLYAS 246
             ++  C+         IS++  + P + + ++  LA T   IAV I       F+ Y  
Sbjct: 177 VAREQSCRRPNAQ-RFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDG 235

Query: 247 GIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGIAN 302
                         HA+ +VGY        WI++N W  +WGDNGY Y     +   I  
Sbjct: 236 RTIIQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 295

Query: 303 YAVYALI 309
           Y    ++
Sbjct: 296 YPYVVIL 302


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score =  270 bits (694), Expect = 8e-92
 Identities = 92/216 (42%), Positives = 132/216 (61%), Gaps = 7/216 (3%)

Query: 99  PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
           PD +D+R+KG++TP  NQ  CG+C+AFS   A++GQ+ K T ++  LS Q +VDC   S 
Sbjct: 2   PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SE 59

Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
           N GC GG + N   YVQ   G+  E+ YPY G++  C +           +  +P  +E 
Sbjct: 60  NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEK 119

Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
           ALK  +A VGP++V+I+AS  +FQ Y+ G+Y DE+C SD +NHA+L VGY        WI
Sbjct: 120 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWI 179

Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
           +KN W  +WG+ GY+ + R  NN CGIAN A +  +
Sbjct: 180 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 215


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  268 bits (689), Expect = 6e-91
 Identities = 80/217 (36%), Positives = 121/217 (55%), Gaps = 8/217 (3%)

Query: 99  PDHLDWREKG-FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           P  +DWR+KG F++P  NQ  CG+C+ FS   A++  +  +T ++  L+ QQ+VDC+   
Sbjct: 2   PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 61

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
            N GC GG       Y+++  G+M E+ YPYKG+   CKF+    +  +   + +   DE
Sbjct: 62  NNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDHCKFQPDKAIAFVKDVANITMNDE 121

Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC--TSDYVNHAMLLVGY-TRNS-- 272
            A+   +A   P++ +   + + F +Y  GIY   +C  T D VNHA+L VGY   N   
Sbjct: 122 EAMVEAVALYNPVSFAFEVT-NDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIP 180

Query: 273 -WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
            WI+KN W   WG NGY  ++RG N CG+A  A Y +
Sbjct: 181 YWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPI 217


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score =  268 bits (687), Expect = 1e-90
 Identities = 89/220 (40%), Positives = 128/220 (58%), Gaps = 10/220 (4%)

Query: 99  PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
           P  +DWREKG++TP  NQ  CG+C+AFS   A++GQ+F+ T  +  LS Q +VDCS   G
Sbjct: 2   PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61

Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
           N GC GG +     YVQ  GGL  EE YPY+  +  CK+     V + + +  + P+ E 
Sbjct: 62  NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDI-PKQEK 120

Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--------TR 270
           AL   +ATVGPI+V+I+A   +F  Y  GIY +  C+S+ ++H +L+VGY          
Sbjct: 121 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 180

Query: 271 NSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
             W++KN W   WG  GY+ + +   N CGIA+ A Y  +
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score =  267 bits (686), Expect = 1e-90
 Identities = 81/216 (37%), Positives = 122/216 (56%), Gaps = 7/216 (3%)

Query: 97  LIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII 156
           ++PD +DWREKG +T    Q  CGAC+AFS   A++ Q+   T ++  LS Q +VDCS  
Sbjct: 1   ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 60

Query: 157 S-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
             GN GC GG +     Y+    G+  +  YPYK     C++         S ++ LP  
Sbjct: 61  KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYG 120

Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRN 271
            E  LK  +A  GP++V ++A   +F LY SG+Y + +CT + VNH +L+VGY     + 
Sbjct: 121 REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQN-VNHGVLVVGYGDLNGKE 179

Query: 272 SWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
            W++KN W H++G+ GY+ + R   N CGIA++  Y
Sbjct: 180 YWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 215


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  259 bits (664), Expect = 3e-87
 Identities = 80/217 (36%), Positives = 113/217 (52%), Gaps = 11/217 (5%)

Query: 99  PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
           P   DWR KG +T   +Q  CG+C+AFS+   ++GQ F +   +  LS Q+++DC     
Sbjct: 2   PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KM 59

Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
           +  C GG   N  + ++  GGL  E+DY Y+G    C+F      V I     L  Q+E 
Sbjct: 60  DKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCQFSAEKAKVYIQDSVEL-SQNEQ 118

Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYD--DEACTSDYVNHAMLLVGY-TRNS--- 272
            L   LA  GPI+V+INA     Q Y  GI       C+   ++HA+LLVGY  R+    
Sbjct: 119 KLAAWLAKRGPISVAINAF--GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPF 176

Query: 273 WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
           W +KN W   WG+ GY YL RG+  CG+   A  A++
Sbjct: 177 WAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 213


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  254 bits (652), Expect = 3e-85
 Identities = 57/224 (25%), Positives = 98/224 (43%), Gaps = 11/224 (4%)

Query: 92  SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVV 151
            + +   P  +D R+   +TP   Q  CG+ +AFS  +A +        +  +L+ Q++V
Sbjct: 4   CSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQELV 63

Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
           DC   +   GC G ++   + Y+Q   G+++E  Y Y  ++  C+         IS++  
Sbjct: 64  DC---ASQHGCHGDTIPRGIEYIQ-HNGVVQESYYRYVAREQSCRRPNAQ-RFGISNYCQ 118

Query: 212 LPPQDEHALKVTLA-TVGPIAVSINAS-PHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
           + P + + ++  LA T   IAV I       F+ Y                HA+ +VGY 
Sbjct: 119 IYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNGYQPNYHAVNIVGYS 178

Query: 269 TRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
                  WI++N W  +WGDNGY Y     +   I  Y    ++
Sbjct: 179 NAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEYPYVVIL 222


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score =  253 bits (649), Expect = 7e-85
 Identities = 77/218 (35%), Positives = 122/218 (55%), Gaps = 12/218 (5%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           +P  +DWR +G +TP  +Q DCG+C+AFS   A++G     T ++  LS Q+++DCS   
Sbjct: 7   LPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAE 66

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
           GN  C+GG + +   YV  +GG+  E+ YPY  +   C+ +    VV I  +  +P + E
Sbjct: 67  GNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSE 126

Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY------TRN 271
            A+K  LA   P++++I A    FQ Y  G++D   C +D ++H +LLVGY       ++
Sbjct: 127 AAMKAALAK-SPVSIAIEADQMPFQFYHEGVFDAS-CGTD-LDHGVLLVGYGTDKESKKD 183

Query: 272 SWILKNWWSHHWGDNGYMYLKRG---NNRCGIANYAVY 306
            WI+KN W   WG +GYMY+        +CG+   A +
Sbjct: 184 FWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASF 221


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score =  250 bits (641), Expect = 1e-83
 Identities = 72/217 (33%), Positives = 119/217 (54%), Gaps = 11/217 (5%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           +PD++DWR  G +    +Q  CG+ +AFS  +A++G    +T ++  LS Q++VDC    
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQD 216
              GC GG + +   ++   GG+  E +YPY  ++  C         V I ++  +P  +
Sbjct: 61  NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120

Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNS 272
           E AL+  +A   P++V++ A+ + FQ Y+SGI+     T+  V+HA+ +VGY      + 
Sbjct: 121 EWALQTAVA-YQPVSVALEAAGYNFQHYSSGIFTGPCGTA--VDHAVTIVGYGTEGGIDY 177

Query: 273 WILKNWWSHHWGDNGYMYLKRG---NNRCGIANYAVY 306
           WI+KN W   WG+ GYM ++R      +CGIA  A Y
Sbjct: 178 WIVKNSWGTTWGEEGYMRIQRNVGGVGQCGIAKKASY 214


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score =  249 bits (638), Expect = 3e-83
 Identities = 81/217 (37%), Positives = 125/217 (57%), Gaps = 13/217 (5%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           +PD +DWRE G + P  NQ  CG+C+AFS  +A++G     T ++  LS QQ+VDC+  +
Sbjct: 3   LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT--T 60

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
            N GC GG +     ++   GG+  EE YPY+G+  IC       VV I S+  +P  +E
Sbjct: 61  ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNE 120

Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSW 273
            +L+  +A   P++V+++A+   FQLY SGI+      S   NHA+ +VGY     ++ W
Sbjct: 121 QSLQKAVAN-QPVSVTMDAAGRDFQLYRSGIFTGSCNIS--ANHALTVVGYGTENDKDFW 177

Query: 274 ILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
           I+KN W  +WG++GY+  +R       +CGI  +A Y
Sbjct: 178 IVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASY 214


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score =  248 bits (635), Expect = 7e-83
 Identities = 74/217 (34%), Positives = 122/217 (56%), Gaps = 14/217 (6%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           +P  +DWR KG +    NQ+ CG+C+AFS  +A++      T ++  LS Q++VDC   +
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCD--T 58

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
            + GC GG + N   Y+   GG+  +++YPY   Q  CK  R   VV I+ +  +   +E
Sbjct: 59  ASHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRLR-VVSINGFQRVTRNNE 117

Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSW 273
            AL+  +A+  P++V++ A+   FQ Y+SGI+     T+   NH +++VGY     +N W
Sbjct: 118 SALQSAVAS-QPVSVTVEAAGAPFQHYSSGIFTGPCGTA--QNHGVVIVGYGTQSGKNYW 174

Query: 274 ILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
           I++N W  +WG+ GY++++R        CGIA    Y
Sbjct: 175 IVRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSY 211


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score =  248 bits (635), Expect = 7e-83
 Identities = 74/213 (34%), Positives = 114/213 (53%), Gaps = 11/213 (5%)

Query: 99  PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
           P+ +DWREKG +TP  NQ  CG+C+AFS  + I+G     T ++  LS Q+++DC     
Sbjct: 2   PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCE--RR 59

Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQDE 217
           + GC GG    +L YV    G+  E +YPY+ KQ  C+ K      V I+ +  +P  DE
Sbjct: 60  SHGCDGGYQTTSLQYVV-DNGVHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDE 118

Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKN 277
            +L   +A   P++V  ++    FQ Y  GIY+    T+   +HA+  VGY +   +LKN
Sbjct: 119 ISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGPCGTN--TDHAVTAVGYGKTYLLLKN 175

Query: 278 WWSHHWGDNGYMYLKRG----NNRCGIANYAVY 306
            W  +WG+ GY+ +KR        CG+   + +
Sbjct: 176 SWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFF 208


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score =  248 bits (635), Expect = 1e-82
 Identities = 73/219 (33%), Positives = 116/219 (52%), Gaps = 14/219 (6%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           +P  +DWR+KG +T   +Q  CG+C+AFS   A++G     T+++  LS Q++VDC    
Sbjct: 2   VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDT-D 60

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI-VVDISSWSVLPPQD 216
            N GC GG +     +++  GG+  E +YPY+     C   + N   V I     +P  D
Sbjct: 61  QNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPEND 120

Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRN 271
           E+AL   +A   P++V+I+A    FQ Y+ G++    C ++ ++H + +VGY        
Sbjct: 121 ENALLKAVAN-QPVSVAIDAGGSDFQFYSEGVFTGS-CGTE-LDHGVAIVGYGTTIDGTK 177

Query: 272 SWILKNWWSHHWGDNGYMYLKRG----NNRCGIANYAVY 306
            W +KN W   WG+ GY+ ++RG       CGIA  A Y
Sbjct: 178 YWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASY 216


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score =  246 bits (631), Expect = 3e-82
 Identities = 73/214 (34%), Positives = 110/214 (51%), Gaps = 11/214 (5%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           IP+++DWR+KG +TP  NQ  CG+C+AFS    I+G I   T  + + S Q+++DC    
Sbjct: 1   IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCD--R 58

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQD 216
            + GC GG   + L  V    G+     YPY+G Q  C+ +              + P +
Sbjct: 59  RSYGCNGGYPWSALQLVA-QYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYN 117

Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILK 276
           E AL  ++A   P++V + A+   FQLY  GI+         V+HA+  VGY  N  ++K
Sbjct: 118 EGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGPCGNK--VDHAVAAVGYGPNYILIK 174

Query: 277 NWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
           N W   WG+NGY+ +KRG       CG+   + Y
Sbjct: 175 NSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFY 208


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score =  246 bits (630), Expect = 3e-82
 Identities = 71/214 (33%), Positives = 112/214 (52%), Gaps = 11/214 (5%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           IP  +DWR+KG +TP  NQ  CG+C+ FS  +A++G     T ++  LS Q+++DC    
Sbjct: 1   IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCE--R 58

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQD 216
            + GC GG     L YV    G+   + YPY+G Q  C+  +     V       +P  +
Sbjct: 59  RSYGCRGGFPLYALQYVA-NSGIHLRQYYPYEGVQRQCRASQAKGPKVKTDGVGRVPRNN 117

Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILK 276
           E AL   +A + P+++ + A    FQ Y  GI+     TS  ++HA+  VGY  +  ++K
Sbjct: 118 EQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGPCGTS--IDHAVAAVGYGNDYILIK 174

Query: 277 NWWSHHWGDNGYMYLKRG----NNRCGIANYAVY 306
           N W   WG+ GY+ +KRG       CG+ + +V+
Sbjct: 175 NSWGTGWGEGGYIRIKRGSGNPQGACGVLSDSVF 208


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  243 bits (623), Expect = 5e-81
 Identities = 69/220 (31%), Positives = 114/220 (51%), Gaps = 16/220 (7%)

Query: 99  PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
           P  +DWR +G +T   +Q  CG+C+AFS    ++ Q F +   +  LS Q +V C     
Sbjct: 2   PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCD--KT 59

Query: 159 NLGCAGGSLRNTLNYVQFA--GGLMKEEDYPYKGKQ---SICKFKRPNIVVDISSWSVLP 213
           + GC+GG + N   ++     G +  E+ YPY   +     C      +   I+    L 
Sbjct: 60  DSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVEL- 118

Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS 272
           PQDE  +   LA  GP+AV+++AS  ++  Y  G+     C S+ ++H +LLVGY    +
Sbjct: 119 PQDEAQIAAWLAVNGPVAVAVDAS--SWMTYTGGVMTS--CVSEQLDHGVLLVGYNDSAA 174

Query: 273 ---WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
              WI+KN W+  WG+ GY+ + +G+N+C +   A  A++
Sbjct: 175 VPYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 214


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score =  242 bits (620), Expect = 1e-80
 Identities = 73/218 (33%), Positives = 112/218 (51%), Gaps = 15/218 (6%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           +P+++DWR+KG +TP  +Q  CG+C+AFS  + ++G     T ++ ELS Q++VDC    
Sbjct: 1   LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCE--R 58

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQD 216
            + GC GG     L YV    G+     YPYK KQ  C+ K+    +V  S    + P +
Sbjct: 59  RSHGCKGGYPPYALEYVA-KNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNN 117

Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS--- 272
           E  L   +A   P++V + +    FQLY  GI++    T   V+HA+  VGY        
Sbjct: 118 EGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGPCGTK--VDHAVTAVGYGKSGGKGY 174

Query: 273 WILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
            ++KN W   WG+ GY+ +KR        CG+   + Y
Sbjct: 175 ILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYY 212


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score =  242 bits (620), Expect = 4e-80
 Identities = 74/228 (32%), Positives = 111/228 (48%), Gaps = 21/228 (9%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
            P+  DW +KG IT    Q  CG+ +AFS   AI+     +T  +  LS Q+++DC    
Sbjct: 2   APESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCV--D 59

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV------ 211
            + GC  G    +  +V   GG+  E DYPYK +   CK       V I ++ V      
Sbjct: 60  ESEGCYNGWHYQSFEWVVKHGGIASEADYPYKARDGKCKANEIQDKVTIDNYGVQILSNE 119

Query: 212 -LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC-TSDYVNHAMLLVGY- 268
               + E +L+  +    PI+VSI+A    F  Y+ GIYD   C +   +NH +L+VGY 
Sbjct: 120 STESEAESSLQSFVLE-QPISVSIDAK--DFHFYSGGIYDGGNCSSPYGINHFVLIVGYG 176

Query: 269 TRNS---WILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVYALI 309
           + +    WI KN W   WG +GY+ ++R        CG+  +A Y +I
Sbjct: 177 SEDGVDYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGMNYFASYPII 224


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score =  242 bits (619), Expect = 8e-80
 Identities = 71/222 (31%), Positives = 116/222 (52%), Gaps = 17/222 (7%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           +P  +DWR+KG +T   +Q  CG+C+AFS   +++G     T  +  LS Q+++DC   +
Sbjct: 4   LPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-A 62

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN----IVVDISSWSVLP 213
            N GC GG + N   Y++  GGL+ E  YPY+  +  C   R      +VV I     +P
Sbjct: 63  DNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP 122

Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----- 268
              E  L   +A   P++V++ AS   F  Y+ G++  E C ++ ++H + +VGY     
Sbjct: 123 ANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTGE-CGTE-LDHGVAVVGYGVAED 179

Query: 269 TRNSWILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
            +  W +KN W   WG+ GY+ +++ +      CGIA  A Y
Sbjct: 180 GKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASY 221


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score =  239 bits (613), Expect = 2e-79
 Identities = 78/217 (35%), Positives = 114/217 (52%), Gaps = 15/217 (6%)

Query: 99  PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
           P  +DWR KG +TP  NQ  CG+C+AFS  + ++G     T  + ELS Q++VDC     
Sbjct: 2   PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCD--KH 59

Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK-FKRPNIVVDISSWSVLPPQDE 217
           + GC GG    +L YV    G+   + YPY+ KQ  C+   +P   V I+ +  +P   E
Sbjct: 60  SYGCKGGYQTTSLQYVA-NNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCE 118

Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSW 273
            +    LA   P++V + A    FQLY SG++D   C +  ++HA+  VGY     +N  
Sbjct: 119 TSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGP-CGTK-LDHAVTAVGYGTSDGKNYI 175

Query: 274 ILKNWWSHHWGDNGYMYLKRG----NNRCGIANYAVY 306
           I+KN W  +WG+ GYM LKR        CG+   + Y
Sbjct: 176 IIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYY 212


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  239 bits (612), Expect = 1e-76
 Identities = 83/315 (26%), Positives = 127/315 (40%), Gaps = 28/315 (8%)

Query: 16  KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM- 74
                + K + +      ++ +H  +   N                   L     I+   
Sbjct: 126 YVNTAHLKNSQEKYSNRLYKYDHNFVKAINA---IQKSWTATTYMEYETLTLGDMIRRSG 182

Query: 75  -TRLTHSRIRRTLVRSPESNESVLIPDHLDWREK---GFITPDWNQEDCGACYAFSIASA 130
                  R +   + +    + + +P   DWR      F++P  NQ  CG+CY+F+    
Sbjct: 183 GHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGM 242

Query: 131 IQGQIFKST--SEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
           ++ +I   T  S+   LS Q+VV CS      GC GG             GL++E  +PY
Sbjct: 243 LEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYAQDFGLVEEACFPY 300

Query: 189 KGKQSICKFKRPNIVVDISSWSVLPP----QDEHALKVTLATVGPIAVSINASPHTFQLY 244
            G  S CK K        S +  +       +E  +K+ L   GP+AV+       F  Y
Sbjct: 301 TGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVY-DDFLHY 359

Query: 245 ASGIYD-----DEACTSDYVNHAMLLVGY-TRNS-----WILKNWWSHHWGDNGYMYLKR 293
             GIY      D     +  NHA+LLVGY T ++     WI+KN W   WG+NGY  ++R
Sbjct: 360 KKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRR 419

Query: 294 GNNRCGIANYAVYAL 308
           G + C I + AV A 
Sbjct: 420 GTDECAIESIAVAAT 434


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score =  231 bits (591), Expect = 2e-76
 Identities = 81/211 (38%), Positives = 117/211 (55%), Gaps = 9/211 (4%)

Query: 98  IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
           +P+ +DWR+KG +TP  NQ  CG+C+AFS  S ++      T  +  LS Q++VDC    
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCD--K 58

Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
            N GC GG+      Y+   GG+  + +YPYK  Q  C+      VV I  ++ +P  +E
Sbjct: 59  KNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASK--VVSIDGYNGVPFCNE 116

Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKN 277
            ALK  +A V P  V+I+AS   FQ Y+SGI+     T   +NH + +VGY  N WI++N
Sbjct: 117 XALKQAVA-VQPSTVAIDASSAQFQQYSSGIFSGPCGTK--LNHGVTIVGYQANYWIVRN 173

Query: 278 WWSHHWGDNGYMYLKR--GNNRCGIANYAVY 306
            W  +WG+ GY+ + R  G   CGIA    Y
Sbjct: 174 SWGRYWGEKGYIRMLRVGGCGLCGIARLPYY 204


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score =  231 bits (591), Expect = 8e-76
 Identities = 77/238 (32%), Positives = 112/238 (47%), Gaps = 27/238 (11%)

Query: 91  ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
           +  ++ L     DWR  G +TP  +Q  CG+C+AFS   +++ Q       +   S Q++
Sbjct: 13  KPADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQEL 72

Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSICKFKRPNIVVDISSW 209
           VDCS    N GC GG + N  + +   GGL  ++DYPY       C  KR N    I S+
Sbjct: 73  VDCS--VKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPETCNLKRCNERYTIKSY 130

Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
             +P   +   K  L  +GPI++SI AS   F  Y  G YD E C +   NHA++LVGY 
Sbjct: 131 VSIP---DDKFKEALRYLGPISISIAAS-DDFAFYRGGFYDGE-CGAA-PNHAVILVGYG 184

Query: 269 TRNS-------------WILKNWWSHHWGDNGYMYLKRG----NNRCGIANYAVYALI 309
            ++              +I+KN W   WG+ GY+ L+         C I   A   L+
Sbjct: 185 MKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPLL 242


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score =  230 bits (590), Expect = 1e-75
 Identities = 70/235 (29%), Positives = 114/235 (48%), Gaps = 27/235 (11%)

Query: 94  ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
           E        DWR    +TP  +Q++CG+C+AFS   +++ Q     +++  LS Q++VDC
Sbjct: 14  EENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDC 73

Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ-SICKFKRPNIVVDISSWSVL 212
           S    N GC GG + N    +   GG+  + DYPY     ++C   R      I ++  +
Sbjct: 74  S--FKNYGCNGGLINNAFEDMIELGGICPDGDYPYVSDAPNLCNIDRCTEKYGIKNYLSV 131

Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRN 271
           P   ++ LK  L  +GPI++S+  S   F  Y  GI+D E C    +NHA++LVG+  + 
Sbjct: 132 P---DNKLKEALRFLGPISISVAVS-DDFAFYKEGIFDGE-CGDQ-LNHAVMLVGFGMKE 185

Query: 272 S-------------WILKNWWSHHWGDNGYMYLKRGNNR----CGIANYAVYALI 309
                         +I+KN W   WG+ G++ ++   +     CG+   A   LI
Sbjct: 186 IVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPLI 240


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  209 bits (534), Expect = 6e-67
 Identities = 42/249 (16%), Positives = 71/249 (28%), Gaps = 39/249 (15%)

Query: 99  PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
               D           +Q +C   + F+    ++        E  ++S   V +C     
Sbjct: 11  NRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYKGEH 70

Query: 159 NLGCAGGSLRNT-LNYVQFAGGLMKEEDYPYKGKQSI------------------CKFKR 199
              C  GS     L  ++  G L  E +YPY   +                        +
Sbjct: 71  KDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNK 130

Query: 200 PN---------IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
                         +   +          +K  +   G +   I A       + SG   
Sbjct: 131 NEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEF-SGKKV 189

Query: 251 DEACTSDYVNHAMLLVGY-TRNS--------WILKNWWSHHWGDNGYMYLKR-GNNRCGI 300
              C  D  +HA+ +VGY    +        WI++N W  +WGD GY  +   G   C  
Sbjct: 190 KNLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHF 249

Query: 301 ANYAVYALI 309
                  + 
Sbjct: 250 NFIHSVVIF 258


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  195 bits (498), Expect = 7e-61
 Identities = 64/308 (20%), Positives = 110/308 (35%), Gaps = 43/308 (13%)

Query: 37  NHKKIHTHNQEAQQGLHGYTLREN-HLSDLHPRHYIKEM--TRLTHSRIRRTLVRSPESN 93
           +   +   N+  +     +  + +  + ++  R   +     +  ++       R  E  
Sbjct: 11  SKAFVDRVNRLNR---GIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEE 67

Query: 94  ESVLIPDHLD----WREKGFITPDWNQEDCGACYAFSIASAIQGQIF-KSTSEIEELSIQ 148
               +P   D    W     I    +Q  CG+C+A + ASA+  +       +   +S  
Sbjct: 68  ARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAG 127

Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI------ 202
            ++ C    G+ GC GG       Y   + GL+ +   PY         K  N       
Sbjct: 128 DLLACCSDCGD-GCNGGDPDRAWAYFS-STGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQ 185

Query: 203 ------------------VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
                             VV+  SW+    Q E      L   GP  V+ +     F  Y
Sbjct: 186 FNFDTPKCDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYE-DFIAY 244

Query: 245 ASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGI 300
            SG+Y   +       HA+ LVG+ T N    W + N W+  WG +GY  ++RG++ CGI
Sbjct: 245 NSGVYHHVSGQYL-GGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGI 303

Query: 301 ANYAVYAL 308
            +     +
Sbjct: 304 EDGGSAGI 311


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score =  189 bits (481), Expect = 1e-59
 Identities = 73/220 (33%), Positives = 109/220 (49%), Gaps = 17/220 (7%)

Query: 99  PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
           P  +DWR+KG +T   +Q  CG C+AF    AI+G    +T  +  +S QQ+VDC   + 
Sbjct: 2   PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCD--TX 59

Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
                GG   +   +V   GG+  + +YPY G    C      I   I  ++ + P    
Sbjct: 60  XXXXXGGDADDAFRWVITNGGIASDANYPYTGVDGTCDLN-KPIAARIDGYTNV-PNSSS 117

Query: 219 ALKVTLATVGPIAVSINASPHTFQLYAS-GIYDDEACTSDY--VNHAMLLVGYTRNS--- 272
           AL   +A   P++V+I  S  +FQLY   GI+   +C+ D   V+H +L+VGY  N    
Sbjct: 118 ALLDAVAK-QPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTNA 176

Query: 273 --WILKNWWSHHWGDNGYMYLKRGNNR----CGIANYAVY 306
             WI+KN W   WG +GY+ ++R  NR    C I  +  Y
Sbjct: 177 DYWIVKNSWGTEWGIDGYILIRRNTNRPDGVCAIDAWGSY 216


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  184 bits (470), Expect = 3e-57
 Identities = 56/275 (20%), Positives = 94/275 (34%), Gaps = 44/275 (16%)

Query: 70  YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKG---FITPDWNQED---CGACY 123
           Y            R T  R  E      +P   DWR      + +   NQ     CG+C+
Sbjct: 8   YRPLRGDGLAPLGRTTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCW 67

Query: 124 AFSIASAIQGQIF---KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
           A +  SA+  +I    K       LS+Q V+DC        C GG+  +  +Y     G+
Sbjct: 68  AHASTSAMADRINIKRKGAWPSTLLSVQNVIDCG---NAGSCEGGNDLSVWDYAH-QHGI 123

Query: 181 MKEEDYPYKGKQSICKFKRPNI---------------VVDISSWSVLPPQDEHALKVTLA 225
             E    Y+ K   C                      +  +  +          +   + 
Sbjct: 124 PDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGS--LSGREKMMAEIY 181

Query: 226 TVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSH 281
             GPI+  I A+      Y  GIY +   T+  +NH + + G+   +    WI++N W  
Sbjct: 182 ANGPISCGIMAT-ERLANYTGGIYAEYQDTTY-INHVVSVAGWGISDGTEYWIVRNSWGE 239

Query: 282 HWGDNGYMYLKRGNNRCG--------IANYAVYAL 308
            WG+ G++ +     + G        I  +  +  
Sbjct: 240 PWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGD 274


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  179 bits (457), Expect = 9e-55
 Identities = 58/313 (18%), Positives = 103/313 (32%), Gaps = 51/313 (16%)

Query: 37  NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESV 96
           + + ++  N+        +    ++  ++   +  +              V   E  +  
Sbjct: 11  SDELVNYVNKRN----TTWQA-GHNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLK-- 63

Query: 97  LIPDHLDWREK----GFITPDWNQEDCGACYAFSIASAIQGQIFKST--SEIEELSIQQV 150
            +P   D RE+      I    +Q  CG+C+AF    AI  +I   T      E+S + +
Sbjct: 64  -LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 122

Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGL------MKEEDYPYKGK------------- 191
           + C       GC GG      N+    G +            PY                
Sbjct: 123 LTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPC 182

Query: 192 ---------QSICKFKRPNIVVDISSWSVLP---PQDEHALKVTLATVGPIAVSINASPH 239
                      IC+            +          E  +   +   GP+  + +    
Sbjct: 183 TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS- 241

Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGN 295
            F LY SG+Y           HA+ ++G+   N    W++ N W+  WGDNG+  + RG 
Sbjct: 242 DFLLYKSGVYQHVTGEMM-GGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQ 300

Query: 296 NRCGIANYAVYAL 308
           + CGI +  V  +
Sbjct: 301 DHCGIESEVVAGI 313


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  175 bits (445), Expect = 1e-53
 Identities = 61/254 (24%), Positives = 91/254 (35%), Gaps = 45/254 (17%)

Query: 98  IPDHLDWREK----GFITPDWNQEDCGACYAFSIASAIQGQIFKST--SEIEELSIQQVV 151
           IP   D R+K      I    +Q  CG+C+AF    A+  +    +   +  ELS   ++
Sbjct: 3   IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62

Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE--------EDYPYKGKQSICKFKRPNIV 203
            C    G  GC GG L    +Y    G +           E YP+   +   K K P   
Sbjct: 63  SCCESCGL-GCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCG 121

Query: 204 VDISSW------------------------SVLPPQDEHALKVTLATVGPIAVSINASPH 239
             I                           S     DE A++  +   GP+         
Sbjct: 122 SKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYED 181

Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGN 295
            F  Y SGIY      +    HA+ ++G+   N    W++ N W+  WG+NGY  + RG 
Sbjct: 182 -FLNYKSGIYKHITGETL-GGHAIRIIGWGVENKAPYWLIANSWNEDWGENGYFRIVRGR 239

Query: 296 NRCGIANYAVYALI 309
           + C I +      I
Sbjct: 240 DECSIESEVTAGRI 253


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  175 bits (445), Expect = 1e-53
 Identities = 55/256 (21%), Positives = 87/256 (33%), Gaps = 43/256 (16%)

Query: 94  ESVLIPDHLDWREK----GFITPDWNQEDCGACYAFSIASAIQGQIFKST--SEIEELSI 147
           E + +P   D RE+      I    +Q  CG+ +AF    AI  +I   T      E+S 
Sbjct: 3   EDLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSA 62

Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG------LMKEEDYPYKGK---------- 191
           + ++ C       GC GG      N+    G              PY             
Sbjct: 63  EDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGAR 122

Query: 192 ------------QSICKFKRPNIVVDISSWSVLP---PQDEHALKVTLATVGPIAVSINA 236
                         IC+            +          E  +   +   GP+  + + 
Sbjct: 123 PPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV 182

Query: 237 SPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLK 292
               F LY SG+Y           HA+ ++G+   N    W++ N W+  WGDNG+  + 
Sbjct: 183 YS-DFLLYKSGVYQHVTG-EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKIL 240

Query: 293 RGNNRCGIANYAVYAL 308
           RG + CGI +  V  +
Sbjct: 241 RGQDHCGIESEVVAGI 256


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  175 bits (446), Expect = 2e-53
 Identities = 50/288 (17%), Positives = 101/288 (35%), Gaps = 34/288 (11%)

Query: 42  HTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDH 101
           H H+  +     G   R +H+  +  R        +      R    +PE +    +P  
Sbjct: 5   HHHHHHSS----GLVPRGSHMQTVLKRRKKSGYGYIPDIADIRDFSYTPEKSVIAALPPK 60

Query: 102 LDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKS--TSEIEELSIQQVVDCSIISGN 159
           +D          ++Q   G+C A ++A+AIQ +      + E     +    +   I G+
Sbjct: 61  VDLTPP---FQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIYYNERKIEGH 117

Query: 160 LGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIV---------------- 203
           +    G++      V    G+  E+++PY    +  + +                     
Sbjct: 118 VNYDSGAMIRDGIKVLHKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQ 177

Query: 204 -VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD---DEACTSDYV 259
              I+ +S +  QD   LK  LA   P     +    ++    S            +   
Sbjct: 178 NYKITEYSRV-AQDIDHLKACLAVGSPFVFGFSVYN-SWVGNNSLPVRIPLPTKNDTLEG 235

Query: 260 NHAMLLVGY--TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
            HA+L VGY      + ++N W ++ G++GY ++     +   +A+  
Sbjct: 236 GHAVLCVGYDDEIRHFRIRNSWGNNVGEDGYFWMPYEYISNTQLADDF 283


>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
          disorder P like protein, hydrolase; NMR {Drosophila
          melanogaster}
          Length = 80

 Score = 51.2 bits (123), Expect = 6e-09
 Identities = 12/52 (23%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL 65
          + K+ K+Y  +    ++++ +  +  +I  HN++ ++G   + +  NHL+DL
Sbjct: 14 KSKFDKNYEAEEDLMRRRI-YAESKARIEEHNRKFEKGEVTWKMGINHLADL 64


>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics
          of pathogenic protozoa, MSGPP, C protease, parasite,
          protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 106

 Score = 48.2 bits (115), Expect = 1e-07
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 4/72 (5%)

Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
          Q  Y K Y  +    ++   +++N   IHTHNQ+     + Y+L+ NH  DL    + ++
Sbjct: 29 QAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG----YSYSLKMNHFGDLSRDEFRRK 84

Query: 74 MTRLTHSRIRRT 85
                SR  ++
Sbjct: 85 YLGFKKSRNLKS 96


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 46.8 bits (110), Expect = 6e-06
 Identities = 52/323 (16%), Positives = 89/323 (27%), Gaps = 94/323 (29%)

Query: 31  KLHWQSNHKKIHTHNQ--EAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVR 88
           K+ W  N K  ++     E  Q L  Y +  N  S       IK       + +RR L++
Sbjct: 183 KIFW-LNLKNCNSPETVLEMLQKLL-YQIDPNWTSRSDHSSNIKLRIHSIQAELRR-LLK 239

Query: 89  SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF-------------SIASAIQGQI 135
           S      +L+  ++    K      WN        AF              +   +    
Sbjct: 240 SKPYENCLLVLLNV-QNAK-----AWN--------AFNLSCKILLTTRFKQVTDFLSAAT 285

Query: 136 FK-----------STSEIEELSIQQVVDCSI-------ISGN---LGCAGGSLR---NTL 171
                        +  E++ L + + +DC         ++ N   L     S+R    T 
Sbjct: 286 TTHISLDHHSMTLTPDEVKSL-LLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATW 344

Query: 172 NYVQFAGG-------------LMKEEDYPYKGKQSICKFKRPNIVVDISS------WSVL 212
           +  +                 L   E      + S+  F  P+    I +      W  +
Sbjct: 345 DNWKHVNCDKLTTIIESSLNVLEPAEYRKMFDRLSV--F-PPS--AHIPTILLSLIWFDV 399

Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD--YVNHAMLLVGYTR 270
              D   +   L       V       T  +    IY +     +  Y  H  ++  Y  
Sbjct: 400 IKSDVMVVVNKLHKYS--LVEKQPKESTISIP--SIYLELKVKLENEYALHRSIVDHYN- 454

Query: 271 NSWILKNWWSHHWGDN---GYMY 290
              I K + S          Y Y
Sbjct: 455 ---IPKTFDSDDLIPPYLDQYFY 474



 Score = 32.5 bits (73), Expect = 0.20
 Identities = 34/238 (14%), Positives = 69/238 (28%), Gaps = 76/238 (31%)

Query: 30  KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
            KLH  S  +K     +E+   +    L          +  ++    L H  I       
Sbjct: 409 NKLHKYSLVEK---QPKESTISIPSIYLEL--------KVKLENEYAL-HRSIV------ 450

Query: 90  PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFS-----IASAIQGQIFKSTSEI-E 143
               +   IP   D  +   + P +  +     Y +S     + +    +       +  
Sbjct: 451 ----DHYNIPKTFDSDD---LIPPYLDQ-----YFYSHIGHHLKNIEHPERMTLFRMVFL 498

Query: 144 ELS-IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSICKFKRPN 201
           +   ++Q +           A GS+ NTL  ++F    + + D  Y+    +I  F    
Sbjct: 499 DFRFLEQKI---RHDSTAWNASGSILNTLQQLKFYKPYICDNDPKYERLVNAILDF---- 551

Query: 202 IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT------FQLYASGIYDDEA 253
                     LP  +E+ +                S +T             I+++  
Sbjct: 552 ----------LPKIEENLIC---------------SKYTDLLRIALMAEDEAIFEEAH 584


>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
           photosynthetic reaction center, peripheral antenna; HET:
           CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
          Length = 154

 Score = 35.3 bits (80), Expect = 0.009
 Identities = 9/47 (19%), Positives = 17/47 (36%), Gaps = 20/47 (42%)

Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
           KQ++ K                    + +LK+      P A++I A+
Sbjct: 19  KQALKKL-------------------QASLKLYADDSAP-ALAIKAT 45


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 34.8 bits (79), Expect = 0.030
 Identities = 16/87 (18%), Positives = 26/87 (29%), Gaps = 8/87 (9%)

Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRN 271
           L   D             +           Q      YD+   T D   H M + G  ++
Sbjct: 272 LSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAYDNYETTDD---HGMQIYGIAKD 328

Query: 272 S-----WILKNWWSHHWGDNGYMYLKR 293
                 +++KN W  +   NG  Y  +
Sbjct: 329 QEGNEYYMVKNSWGTNSKYNGIWYASK 355



 Score = 30.2 bits (67), Expect = 0.80
 Identities = 15/72 (20%), Positives = 28/72 (38%), Gaps = 4/72 (5%)

Query: 110 ITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRN 169
           IT   NQ   G C+ +S  S ++ ++ +      +LS    V  + +      A  ++R 
Sbjct: 22  ITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLD----RADAAVRT 77

Query: 170 TLNYVQFAGGLM 181
             +     GG  
Sbjct: 78  HGDVSFSQGGSF 89


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 34.3 bits (78), Expect = 0.047
 Identities = 8/40 (20%), Positives = 16/40 (40%), Gaps = 7/40 (17%)

Query: 259 VNHAMLLVGYTRNS-------WILKNWWSHHWGDNGYMYL 291
           +  AML+ G   +        + ++N W    G +G   +
Sbjct: 371 MTAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLYVM 410


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
            acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
            synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 34.3 bits (78), Expect = 0.055
 Identities = 37/238 (15%), Positives = 59/238 (24%), Gaps = 104/238 (43%)

Query: 35   QSNHKKIHTHNQEAQQGLHGYTLRENHLS---------DLHPRHYIKEMTRLTHSRIRRT 85
             +N   +  H      G  G  +REN+ +          L      KE+   + S     
Sbjct: 1666 INNPVNLTIHFG----GEKGKRIRENYSAMIFETIVDGKLKTEKIFKEINEHSTSYT--- 1718

Query: 86   LVRSPES--NE-----------SVLIPDHLDWREKGFITPDWNQEDC---G--------- 120
              RS +   +                    D + KG I       D    G         
Sbjct: 1719 -FRSEKGLLSATQFTQPALTLMEKAA--FEDLKSKGLI-----PADATFAGHSLGEYAAL 1770

Query: 121  ACYA--FSIASAIQ-----GQIFKSTSEIEEL----------------------SIQQVV 151
            A  A   SI S ++     G   +     +EL                      ++Q VV
Sbjct: 1771 ASLADVMSIESLVEVVFYRGMTMQVAVPRDELGRSNYGMIAINPGRVAASFSQEALQYVV 1830

Query: 152  D---------CSIISGNLGCAG------GSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
            +           I+  N           G LR     +     ++      +   Q I
Sbjct: 1831 ERVGKRTGWLVEIV--NYNVENQQYVAAGDLRA----LDTVTNVLN-----FIKLQKI 1877


>3cam_A Cold-shock domain family protein; cold shock protein, chain SWAP,
           STRU genomics, oxford protein production facility, OPPF,
           gene RE; 2.60A {Neisseria meningitidis MC58}
          Length = 67

 Score = 31.0 bits (71), Expect = 0.059
 Identities = 19/54 (35%), Positives = 24/54 (44%), Gaps = 11/54 (20%)

Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
           GFITPD   ED  A +     SAI  + FK+  E       Q V   + +G  G
Sbjct: 16  GFITPDEGGEDLFAHF-----SAINMEGFKTLKE------GQRVSFDVTTGPKG 58


>3i2z_B RNA chaperone, negative regulator of CSPA transcription; beta
           barrel, DNA binding protein/transcription, cytoplasm,
           gene regulation; 1.10A {Salmonella typhimurium} PDB:
           2l15_A 1mjc_A 3mef_A
          Length = 71

 Score = 30.3 bits (69), Expect = 0.10
 Identities = 18/54 (33%), Positives = 25/54 (46%), Gaps = 11/54 (20%)

Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
           GFITP+   +D    +     SAIQ   FK+ +E       Q V+  I +G  G
Sbjct: 20  GFITPEDGSKDVFVHF-----SAIQTNGFKTLAE------GQRVEFEITNGAKG 62


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 32.4 bits (73), Expect = 0.18
 Identities = 9/41 (21%), Positives = 14/41 (34%), Gaps = 8/41 (19%)

Query: 259 VNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL 291
           + HAM                 W ++N W    G  GY+ +
Sbjct: 369 MTHAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCM 409


>2lss_A Cold shock-like protein; CSD, CSP, oligonucleotide binding F fold,
           RNA binding protein, DNA binding protein; NMR
           {Rickettsia rickettsii}
          Length = 70

 Score = 28.4 bits (64), Expect = 0.52
 Identities = 10/34 (29%), Positives = 12/34 (35%), Gaps = 5/34 (14%)

Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSE 141
           GFI  D   +D      F   SA+      S  E
Sbjct: 19  GFIEQDNGGKDV-----FVHKSAVDAAGLHSLEE 47


>1g6p_A Cold shock protein TMCSP; greek-KEY, beta barrel, OB-fold,
           structural genomics; NMR {Thermotoga maritima} SCOP:
           b.40.4.5
          Length = 66

 Score = 27.2 bits (61), Expect = 1.6
 Identities = 18/54 (33%), Positives = 23/54 (42%), Gaps = 12/54 (22%)

Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
           GFIT D    D    +     SAI+ + FK+  E       QVV+  I  G  G
Sbjct: 15  GFITKD-EGGDVFVHW-----SAIEMEGFKTLKE------GQVVEFEIQEGKKG 56


>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic
           vesicle, membrane, Ca structural protein; 7.94A {Bos
           taurus}
          Length = 190

 Score = 29.0 bits (64), Expect = 1.7
 Identities = 8/63 (12%), Positives = 19/63 (30%), Gaps = 23/63 (36%)

Query: 4   KEWIIIFIFPQKKYKKDYRKKATDSKKKLH------------WQSNHK----KIHTHNQE 47
           ++W       +++ +K  ++    SK                W         K   +N+ 
Sbjct: 88  RKW-------REEQRKRLQELDAASKVMEQEWREKAKKDLEEWNQRQSEQVEKNKINNRI 140

Query: 48  AQQ 50
           A +
Sbjct: 141 ADK 143


>1c9o_A CSPB, cold-shock protein; beta barrel, homodimer, transcription;
           1.17A {Bacillus caldolyticus} SCOP: b.40.4.5 PDB: 2hax_A
           1hz9_A 1hzb_A 1i5f_A 1hza_A 1hzc_A 3pf4_A 1csq_A 1nmf_A
           1nmg_A 1csp_A 2f52_A 2es2_A 3pf5_A 2i5m_X 2i5l_X
          Length = 66

 Score = 26.8 bits (60), Expect = 1.7
 Identities = 17/54 (31%), Positives = 23/54 (42%), Gaps = 12/54 (22%)

Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
           GFI  +    D    +     +AIQG+ FK+  E       Q V   I+ GN G
Sbjct: 16  GFIEVE-GGSDVFVHF-----TAIQGEGFKTLEE------GQEVSFEIVQGNRG 57


>3suk_A Cerato-platanin-like protein; double PSI beta barrel, unknown
           function; 1.34A {Moniliophthora perniciosa}
          Length = 125

 Score = 27.8 bits (61), Expect = 2.2
 Identities = 6/32 (18%), Positives = 9/32 (28%)

Query: 109 FITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
                +N   CG CY  S       +     +
Sbjct: 51  SDIGGFNSPACGNCYTISFTFQGVTRSINLVA 82


>3m3g_A EPL1 protein; fungal, plant defense, fungus, polysaccharide-binding
           protei; 1.39A {Hypocrea virens}
          Length = 120

 Score = 27.7 bits (61), Expect = 2.6
 Identities = 5/22 (22%), Positives = 7/22 (31%)

Query: 109 FITPDWNQEDCGACYAFSIASA 130
                WN   CG C+    +  
Sbjct: 50  AAVAGWNSASCGTCWKLQYSGH 71


>3szv_A Pyroglutatmate porin OPDO; beta-barrel, channel, bacterial outer
           membrane, membrane Pro; HET: C8E; 1.45A {Pseudomonas
           aeruginosa} PDB: 2y0k_A*
          Length = 401

 Score = 28.3 bits (62), Expect = 4.0
 Identities = 7/56 (12%), Positives = 15/56 (26%)

Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGN 295
              L  +   +D             L      +  +   +    GD+ Y Y+   +
Sbjct: 238 KSDLRFARASEDGGFRELDNRAFGALFSLRLGAHAVAAGYQRISGDDPYPYIAGSD 293


>3sul_A Cerato-platanin-like protein; double PSI beta barrel, unknown
           function; 1.63A {Moniliophthora perniciosa}
          Length = 122

 Score = 27.0 bits (59), Expect = 4.0
 Identities = 7/22 (31%), Positives = 9/22 (40%)

Query: 109 FITPDWNQEDCGACYAFSIASA 130
                WN E CG CY  + +  
Sbjct: 50  DTITGWNSESCGTCYQITWSGT 71


>2kqa_A Cerato-platanin; elicitor, secreted, toxin; NMR {Ceratocystis
           platani}
          Length = 129

 Score = 27.0 bits (59), Expect = 4.2
 Identities = 5/22 (22%), Positives = 9/22 (40%)

Query: 109 FITPDWNQEDCGACYAFSIASA 130
                W+   CG C+  +I + 
Sbjct: 56  PDIAGWDSPSCGTCWKVTIPNG 77


>1qht_A Protein (DNA polymerase); archaea, hyperthermostable, family B
           polymer alpha family polymerase, transferase; 2.10A
           {Thermococcus SP} SCOP: c.55.3.5 e.8.1.1 PDB: 1tgo_A
           2xhb_A* 2vwj_A* 2vwk_A* 1wns_A* 1wn7_A 1qqc_A* 4ahc_A*
           4ail_C* 3a2f_A* 2jgu_A* 1d5a_A
          Length = 775

 Score = 28.2 bits (63), Expect = 4.7
 Identities = 6/44 (13%), Positives = 15/44 (34%), Gaps = 4/44 (9%)

Query: 71  IKEMTRLTHSRIRRTLVRSP-ESNESVLIPDHLDWREKGFITPD 113
             +++RL    +      S     E  L+       ++  + P+
Sbjct: 330 EAQLSRLIGQSLWDVSRSSTGNLVEWFLLRKA---YKRNELAPN 370


>2k9m_A RNA polymerase sigma factor RPON; core binding domain,
           transcription; NMR {Aquifex aeolicus}
          Length = 130

 Score = 26.8 bits (60), Expect = 4.8
 Identities = 7/31 (22%), Positives = 12/31 (38%)

Query: 55  YTLRENHLSDLHPRHYIKEMTRLTHSRIRRT 85
             + +  L DL     +K   +   SR+R  
Sbjct: 92  EEILKKALRDLKRGKKLKPEIKGKLSRLRLF 122


>3u5c_P 40S ribosomal protein S15; translation, ribosome, ribosomal,
          ribosomal R ribosomal protein, eukaryotic ribosome,
          RNA-protein C; 3.00A {Saccharomyces cerevisiae} PDB:
          3izb_R 3o30_I 3o2z_I 3u5g_P 1s1h_S 3jyv_S*
          Length = 142

 Score = 26.9 bits (60), Expect = 4.8
 Identities = 9/45 (20%), Positives = 19/45 (42%), Gaps = 6/45 (13%)

Query: 54 GYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI 98
          G  L +  L ++      ++  +L  +R+RR   R   S  +  +
Sbjct: 19 GVDLEK--LLEMS----TEDFVKLAPARVRRRFARGMTSKPAGFM 57


>3erv_A Putative C39-like peptidase; structural genomics, unknown function,
           PSI-2, protein structure initiative; 2.10A {Bacillus
           anthracis}
          Length = 236

 Score = 27.4 bits (60), Expect = 5.2
 Identities = 7/49 (14%), Positives = 21/49 (42%)

Query: 232 VSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWS 280
            +   +P     + +   ++   +  Y  H ++L+GY + S  +++   
Sbjct: 149 TNATFAPLDEDEFTTWETNNGDVSITYNEHCVVLIGYDQESVYIRDPLK 197


>2au5_A Conserved domain protein; structural genomics, PSI, protein STR
           initiative, midwest center for structural genomics,
           MCSG, U function; 2.10A {Enterococcus faecalis} SCOP:
           a.244.1.1
          Length = 139

 Score = 27.0 bits (59), Expect = 5.7
 Identities = 7/31 (22%), Positives = 15/31 (48%)

Query: 112 PDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
           P+W +E  G      + S ++ + F S  ++
Sbjct: 55  PNWLEEAAGGMQGVIVQSLLEDENFSSVEQL 85


>2ykt_A Brain-specific angiogenesis inhibitor 1-associate protein 2;
           signaling protein, NPY motif, binding pocket; 2.11A
           {Homo sapiens} PDB: 1y2o_A 1wdz_A
          Length = 253

 Score = 27.2 bits (59), Expect = 6.1
 Identities = 12/79 (15%), Positives = 25/79 (31%), Gaps = 7/79 (8%)

Query: 4   KEWIIIFIFPQKKYKKDYR-------KKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYT 56
           +          KKY+ + R       K   + KK        K    ++ +  Q +   +
Sbjct: 113 ELDSRYLSAALKKYQTEQRSKGDALDKCQAELKKLRKKSQGSKNPQKYSDKELQYIDAIS 172

Query: 57  LRENHLSDLHPRHYIKEMT 75
            ++  L +     Y   +T
Sbjct: 173 NKQGELENYVSDGYKTALT 191


>3ok8_A Brain-specific angiogenesis inhibitor 1-associate 2-like protein 2;
           I-BAR, protein binding; 2.25A {Mus musculus}
          Length = 222

 Score = 27.1 bits (59), Expect = 7.3
 Identities = 8/51 (15%), Positives = 23/51 (45%), Gaps = 1/51 (1%)

Query: 4   KEWIIIFIFPQKKYKKDYRKKATDSKKKL-HWQSNHKKIHTHNQEAQQGLH 53
           K  +       + Y+ +YR +A + +K +       +K   + +E ++ ++
Sbjct: 111 KLDMQFIKDSCQHYEIEYRHRAANLEKCMSELWRMERKRDKNAREMKESVN 161


>2y2x_A OPDK, vanillate porin OPDK; membrane protein, outer membrane, OPRD,
           transport; HET: C8E VNL; 1.65A {Pseudomonas aeruginosa
           PA01} PDB: 2qtk_A* 3sys_A*
          Length = 390

 Score = 27.0 bits (59), Expect = 8.0
 Identities = 5/56 (8%), Positives = 11/56 (19%)

Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNN 296
             L+               +    L                  GD+G+  +   + 
Sbjct: 228 LGLFVDRDDGAARAGEIDSHTVYGLFSAGIGLHTFYLGLQKVGGDSGWQSVYGSSG 283


>2dqb_A Deoxyguanosinetriphosphate triphosphohydrolase, P; dntpase, DNTP,
          single-stranded DNA, DNA dGTPase, HD superfamily,
          structural genomics; 2.20A {Thermus thermophilus}
          Length = 376

 Score = 27.3 bits (61), Expect = 8.0
 Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 11/39 (28%)

Query: 60 NHLSDLHPRHYIKEMTRLTHS----RIRRTLVRSPESNE 94
              D + R      TRLTH+    ++ R++ R+   NE
Sbjct: 67 GWAGD-YYR------TRLTHTLEVAQVSRSIARALGLNE 98


>3szd_B Porin; beta-barrel, channel, bacterial outer membrane, membrane
           Pro; HET: C8E 3PE; 2.31A {Pseudomonas aeruginosa} PDB:
           3jty_A*
          Length = 405

 Score = 27.0 bits (59), Expect = 8.1
 Identities = 6/56 (10%), Positives = 11/56 (19%)

Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNN 296
              +                    L         L        GD+G+M +   + 
Sbjct: 231 LGGFRGRDAGSARAGKLDNRTVSALFSARYGLHTLYLGLQKVSGDDGWMRVNGTSG 286


>1cmx_A Protein (ubiquitin YUH1-UBAL); ubiquitin hydrolase,
          deubiquitinating enzyme, cysteine protease, enzyme
          specificity; 2.25A {Synthetic} SCOP: d.3.1.6
          Length = 235

 Score = 26.6 bits (58), Expect = 9.6
 Identities = 6/31 (19%), Positives = 12/31 (38%)

Query: 8  IIFIFPQKKYKKDYRKKATDSKKKLHWQSNH 38
          I+ +FP  + +K    +   S   + W    
Sbjct: 55 IVLLFPINEDRKSSTSQQITSSYDVIWFKQS 85


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.319    0.134    0.425 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,926,246
Number of extensions: 282931
Number of successful extensions: 868
Number of sequences better than 10.0: 1
Number of HSP's gapped: 625
Number of HSP's successfully gapped: 74
Length of query: 309
Length of database: 6,701,793
Length adjustment: 93
Effective length of query: 216
Effective length of database: 4,105,140
Effective search space: 886710240
Effective search space used: 886710240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (25.5 bits)