RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy2558
         (348 letters)



>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  347 bits (893), Expect = e-119
 Identities = 106/333 (31%), Positives = 156/333 (46%), Gaps = 35/333 (10%)

Query: 31  HHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEH 81
           HH HH++ +AL        +  F   + ++Y    E   R  IF   L   +   +    
Sbjct: 3   HHHHHLEGSALPSTFVAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQ 62

Query: 82  GSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-------PNITLPRAFDW 132
           G   Y  G+N F+D++  E +A   G  +        +P           ++  P +FDW
Sbjct: 63  GLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDW 122

Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV--SLSEQELIDCDQEDDGCEG 190
           R+   V+ VK+Q  CGSSWAFS+TG IE             S+SEQ+L+DC     GC G
Sbjct: 123 RDQGMVSPVKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNALGCSG 182

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAK 249
           G +++AF  +     GG++ E  YPY   D  C  +      +++GYV +   DE  +A 
Sbjct: 183 GWMNDAFTYVAQN--GGIDSEGAYPYEMADGNCHYDPNQVAARLSGYVYLSGPDENMLAD 240

Query: 250 YLVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKA 308
            +   GP+AVA +A      Y  GV +     C       +H+VLIVGYG          
Sbjct: 241 MVATKGPVAVAFDADDPFGSYSGGVYYNPT--C--ETNKFTHAVLIVGYG------NENG 290

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIN 340
             YW++KNSWG+GWG  GYF++ R  +  CGI 
Sbjct: 291 QDYWLVKNSWGDGWGLDGYFKIARNANNHCGIA 323


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  336 bits (863), Expect = e-115
 Identities = 81/320 (25%), Positives = 129/320 (40%), Gaps = 34/320 (10%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
             F  + +  NK+YAT  +  +    F  +++ +Q      +G     +N  SDLS  EF
Sbjct: 6   KTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQ-----SNGGA---INHLSDLSLDEF 57

Query: 100 QAKYLGFKLKPSYADRSVP------AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           + ++L       +            A   N   P   D R+   VT ++ Q  CGS+WAF
Sbjct: 58  KNRFLMSAEAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAF 117

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S     E  Y A   + + L+EQEL+DC     GC G +I    + I      G+ +E  
Sbjct: 118 SGVAATESAYLAYRDQSLDLAEQELVDCAS-QHGCHGDTIPRGIEYIQH---NGVVQESY 173

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVE-NGPMAVAINAY---ALQF 268
           Y Y   +++CR         I+ Y  +   +   + + L + +  +AV I      A + 
Sbjct: 174 YRYVAREQSCRRPNAQR-FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRH 232

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y              G +   H+V IVGY         + V YWI++NSW   WG+ GY 
Sbjct: 233 YDGRTIIQRD--N--GYQPNYHAVNIVGYS------NAQGVDYWIVRNSWDTNWGDNGYG 282

Query: 329 RLYRGDGSCGINDYVRSALV 348
                     I +Y    ++
Sbjct: 283 YFAANIDLMMIEEYPYVVIL 302


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score =  330 bits (849), Expect = e-113
 Identities = 105/309 (33%), Positives = 163/309 (52%), Gaps = 18/309 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLST 96
           A +  +   HN+ Y    E   R  ++  N++ I+L  Q+   G   +   +N F D+++
Sbjct: 10  AQWTKWKAMHNRLYGM-NEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTS 68

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
            EF+    GF+ +     +           PR+ DWRE   VT VK+Q  CGS WAFS T
Sbjct: 69  EEFRQVMNGFQNRKPRKGKVFQEP-LFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSAT 127

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G +EG    KT +L+SLSEQ L+DC     ++GC GG +  AF  +     GGL+ E++Y
Sbjct: 128 GALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDN--GGLDSEESY 185

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           PY   +++C+ N K +     G+V + + E  + K +   GP++VAI+A   +  FY  G
Sbjct: 186 PYEATEESCKYNPKYSVANDAGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEG 245

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           +       C   +E++ H VL+VGYG + T+  +    YW++KNSWGE WG  GY ++ +
Sbjct: 246 IYFEPD--C--SSEDMDHGVLVVGYGFESTESDNN--KYWLVKNSWGEEWGMGGYVKMAK 299

Query: 333 G-DGSCGIN 340
                CGI 
Sbjct: 300 DRRNHCGIA 308


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score =  330 bits (848), Expect = e-113
 Identities = 109/312 (34%), Positives = 163/312 (52%), Gaps = 25/312 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLST 96
             +  + + H K Y   V+  SR  I+  NL+ I +   +   G   Y   +N   D+++
Sbjct: 9   THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS 68

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            E   K  G K+  S++  +    IP      P + D+R+   VT VK+Q  CGS WAFS
Sbjct: 69  EEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS 128

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           + G +EG    KT KL++LS Q L+DC  E+DGC GG ++NAF  +      G++ E  Y
Sbjct: 129 SVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKN--RGIDSEDAY 186

Query: 215 PYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
           PY G +++C  N      K  GY  +   +E  + + +   GP++VAI+A   + QFY  
Sbjct: 187 PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSK 246

Query: 272 GVSHPIQFFCDG--GNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
           GV      + D    ++NL+H+VL VGYG+ +         +WIIKNSWGE WG KGY  
Sbjct: 247 GV------YYDESCNSDNLNHAVLAVGYGIQKGN------KHWIIKNSWGENWGNKGYIL 294

Query: 330 LYRG-DGSCGIN 340
           + R  + +CGI 
Sbjct: 295 MARNKNNACGIA 306


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  322 bits (829), Expect = e-111
 Identities = 105/222 (47%), Positives = 142/222 (63%), Gaps = 10/222 (4%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P  +DWR   AVT VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D 
Sbjct: 2   PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK 61

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
            C GG  SNA+  I +   GGLE E  Y Y+G  ++C+ + +  +V I   V +S++E  
Sbjct: 62  ACMGGLPSNAYSAIKNL--GGLETEDDYSYQGHMQSCQFSAEKAKVYIQDSVELSQNEQK 119

Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           +A +L + GP++VAINA+ +QFY  G+S P++  C      + H+VL+VGYG        
Sbjct: 120 LAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLC--SPWLIDHAVLLVGYG------QR 171

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             VP+W IKNSWG  WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 172 SDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 213


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score =  324 bits (834), Expect = e-111
 Identities = 96/313 (30%), Positives = 153/313 (48%), Gaps = 27/313 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLS 95
             ++ + + + K Y    E   R  I+  NL+ + L    EH  G++    G+N   D++
Sbjct: 10  HHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHN-LEHSMGMHSYDLGMNHLGDMT 68

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           + E  +     ++   +         PN  LP + DWRE   VT VK Q  CG++WAFS 
Sbjct: 69  SEEVMSLMSSLRVPSQWQRNITYKSNPNRILPDSVDWREKGCVTEVKYQGSCGAAWAFSA 128

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            G +E     KT KLVSLS Q L+DC  E   + GC GG ++ AF  I+     G++ + 
Sbjct: 129 VGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDN--KGIDSDA 186

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFY 269
           +YPY+  D+ C+ + K      + Y  +    E  + + +   GP++V ++A   +   Y
Sbjct: 187 SYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLY 246

Query: 270 VTGVSHPIQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
            +GV      + +    +N++H VL+VGYG            YW++KNSWG  +GE+GY 
Sbjct: 247 RSGV------YYEPSCTQNVNHGVLVVGYG------DLNGKEYWLVKNSWGHNFGEEGYI 294

Query: 329 RLYRG-DGSCGIN 340
           R+ R     CGI 
Sbjct: 295 RMARNKGNHCGIA 307


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  324 bits (833), Expect = e-111
 Identities = 109/310 (35%), Positives = 156/310 (50%), Gaps = 23/310 (7%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLST 96
            L++ +   +NK Y    +   R +I+  N++ IQ      + G   Y  GLN+F+D++ 
Sbjct: 3   DLWHQWKRMYNKEYNG-ADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 61

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            EF+AKYL    + S           N   +P   DWRE   VT VKDQ  CGS WAFST
Sbjct: 62  EEFKAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSGWAFST 121

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           TG +EG Y    +  +S SEQ+L+DC +   ++GC GG + NA+  +      GLE E +
Sbjct: 122 TGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLKQ---FGLETESS 178

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA-YALQFYVT 271
           YPY   +  CR NK+    K+ G+ +V S  E ++   +   GP AVA++       Y +
Sbjct: 179 YPYTAVEGQCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYRS 238

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+              ++H+VL VGYG            YWI+KNSWG  WGE+GY R+ 
Sbjct: 239 GIYQS----QTCSPLRVNHAVLAVGYGTQGGT------DYWIVKNSWGLSWGERGYIRMV 288

Query: 332 RG-DGSCGIN 340
           R     CGI 
Sbjct: 289 RNRGNMCGIA 298


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  321 bits (826), Expect = e-109
 Identities = 120/326 (36%), Positives = 166/326 (50%), Gaps = 28/326 (8%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGV 85
           D ++  L        ++ F   H K+Y++ +E   R  IF  N+ KI       E G   
Sbjct: 12  DLEICSLPKSLFQEQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVT 71

Query: 86  Y--GLNEFSDLSTAEFQAKYLGFKLKPSYADRS--VPAMIPNITLPRAFDWREYDAVTGV 141
           Y   +N+F D+S  EF A     K +      +  +P +     L  + DWR  +AV+ V
Sbjct: 72  YSKAMNQFGDMSKEEFLAYVNRGKAQKPKHPENLRMPYVSSKKPLAASVDWRS-NAVSEV 130

Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDT 199
           KDQ  CGSSW+FSTTG +EG  A +  +L SLSEQ LIDC     + GC+GG + +AF  
Sbjct: 131 KDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSY 190

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMA 258
           I      G+  E  YPY      CR +   +   ++GY  + S DE  +A  + + GP+A
Sbjct: 191 IHD---YGIMSESAYPYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVA 247

Query: 259 VAINA-YALQFYVTGVSHPIQFFCDG--GNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           VAI+A   LQFY  G+      F D      +L+H VL+VGYG D  +       YWI+K
Sbjct: 248 VAIDATDELQFYSGGL------FYDQTCNQSDLNHGVLVVGYGSDNGQ------DYWILK 295

Query: 316 NSWGEGWGEKGYFRLYRG-DGSCGIN 340
           NSWG GWGE GY+R  R    +CGI 
Sbjct: 296 NSWGSGWGESGYWRQVRNYGNNCGIA 321


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score =  320 bits (822), Expect = e-109
 Identities = 109/329 (33%), Positives = 152/329 (46%), Gaps = 27/329 (8%)

Query: 22  FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
           F +VG  +       +   LFN ++  HNK Y  + E   R  IF  NL  I    + ++
Sbjct: 2   FSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE-TNKKN 60

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVT 139
            S   GLNEF+DLS  EF  KY+G  +  +         I    + LP   DWR+  AVT
Sbjct: 61  NSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVT 120

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            V+ Q  CGS WAFS    +EG+   +T KLV LSEQEL+DC++   GC+GG    A + 
Sbjct: 121 PVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEY 180

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSV-SRDETDMAKYLVENGPM 257
           +      G+     YPY+     CR  +     VK +G   V   +E ++    +   P+
Sbjct: 181 VAK---NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLN-AIAKQPV 236

Query: 258 AVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           +V + +     Q Y  G+      F       +  +V  VGYG    K       Y +IK
Sbjct: 237 SVVVESKGRPFQLYKGGI------FEGPCGTKVDGAVTAVGYGKSGGK------GYILIK 284

Query: 316 NSWGEGWGEKGYFRLYRG----DGSCGIN 340
           NSWG  WGEKGY R+ R      G CG+ 
Sbjct: 285 NSWGTAWGEKGYIRIKRAPGNSPGVCGLY 313


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  315 bits (809), Expect = e-108
 Identities = 93/225 (41%), Positives = 123/225 (54%), Gaps = 15/225 (6%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D 
Sbjct: 2   PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 61

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD---KACRLNKKATQVKINGYVSVSRD 243
           GC GG ++NAF+ I+ +  G +  E +YPY   +     C  +       I G+V + +D
Sbjct: 62  GCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 121

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A +L  NGP+AVA++A +   Y  GV       C    E L H VL+VGY      
Sbjct: 122 EAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTS----CVS--EQLDHGVLLVGYN----- 170

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               AVPYWIIKNSW   WGE+GY R+ +G   C + +   SA+V
Sbjct: 171 -DSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 214


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  298 bits (765), Expect = 7e-99
 Identities = 87/317 (27%), Positives = 133/317 (41%), Gaps = 21/317 (6%)

Query: 47  EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
              N  +    +      ++  +   ++ +   +         E+  L+  +   +  G 
Sbjct: 125 VYVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGH 184

Query: 107 KL---KPSYADRSVPAMIPNITLPRAFDWREYDA---VTGVKDQTMCGSSWAFSTTGNIE 160
                +P  A  +       + LP ++DWR       V+ V++Q  CGS ++F++ G +E
Sbjct: 185 SRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLE 244

Query: 161 GVYAAKTKKLVS--LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
                 T    +  LS QE++ C Q   GCEGG               GL EE  +PY G
Sbjct: 245 ARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQD--FGLVEEACFPYTG 302

Query: 219 DDKACRLNKKATQVKINGYVSVSR-----DETDMAKYLVENGPMAVAINAYA-LQFYVTG 272
            D  C++ +   +   + Y  V       +E  M   LV +GPMAVA   Y     Y  G
Sbjct: 303 TDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKG 362

Query: 273 V-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           +  H          E  +H+VL+VGYG D    +   + YWI+KNSWG GWGE GYFR+ 
Sbjct: 363 IYHHTGLRDPFNPFELTNHAVLLVGYGTD----SASGMDYWIVKNSWGTGWGENGYFRIR 418

Query: 332 RGDGSCGINDYVRSALV 348
           RG   C I     +A  
Sbjct: 419 RGTDECAIESIAVAATP 435


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score =  289 bits (741), Expect = 6e-98
 Identities = 83/234 (35%), Positives = 118/234 (50%), Gaps = 20/234 (8%)

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
               A+DWR +  VT VKDQ +CGS WAFS+ G++E  YA + K L   SEQEL+DC  +
Sbjct: 19  LDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK 78

Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRD 243
           ++GC GG I+NAFD ++    GGL  +  YPY     + C L +   +  I  YVS+   
Sbjct: 79  NNGCYGGYITNAFDDMIDL--GGLCSQDDYPYVSNLPETCNLKRCNERYTIKSYVSI--P 134

Query: 244 ETDMAKYLVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV--- 299
           +    + L   GP++++I A     FY  G        C       +H+V++VGYG+   
Sbjct: 135 DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGE---C---GAAPNHAVILVGYGMKDI 188

Query: 300 -DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGINDYVRSALV 348
            +      +   Y+IIKNSWG  WGE GY  L         +C I       L+
Sbjct: 189 YNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPLL 242


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score =  287 bits (736), Expect = 4e-97
 Identities = 78/248 (31%), Positives = 126/248 (50%), Gaps = 20/248 (8%)

Query: 111 SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
           +Y +              A+DWR +  VT VKDQ  CGS WAFS+ G++E  YA +  KL
Sbjct: 3   NYEEVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKL 62

Query: 171 VSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKA 229
           ++LSEQEL+DC  ++ GC GG I+NAF+ ++    GG+  +  YPY       C +++  
Sbjct: 63  ITLSEQELVDCSFKNYGCNGGLINNAFEDMIEL--GGICPDGDYPYVSDAPNLCNIDRCT 120

Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENL 288
            +  I  Y+SV   +  + + L   GP+++++       FY  G+       C    + L
Sbjct: 121 EKYGIKNYLSV--PDNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGE---C---GDQL 172

Query: 289 SHSVLIVGYGVD----RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           +H+V++VG+G+           +   Y+IIKNSWG+ WGE+G+  +          CG+ 
Sbjct: 173 NHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGLG 232

Query: 341 DYVRSALV 348
                 L+
Sbjct: 233 TDAFIPLI 240


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  284 bits (730), Expect = 1e-96
 Identities = 79/219 (36%), Positives = 111/219 (50%), Gaps = 15/219 (6%)

Query: 127 PRAFDWREYDA-VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
           P + DWR+    V+ VK+Q  CGS W FSTTG +E   A  T K++SL+EQ+L+DC Q  
Sbjct: 2   PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 61

Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SR 242
            + GC+GG  S AF+ I      G+  E TYPY+G D  C+         +    ++   
Sbjct: 62  NNHGCQGGLPSQAFEYIRYN--KGIMGEDTYPYKGQDDHCKFQPDKAIAFVKDVANITMN 119

Query: 243 DETDMAKYLVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           DE  M + +    P++ A         Y  G+       C    + ++H+VL VGYG   
Sbjct: 120 DEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSS--TSCHKTPDKVNHAVLAVGYG--- 174

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
                  +PYWI+KNSWG  WG  GYF + RG   CG+ 
Sbjct: 175 ---EENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLA 210


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score =  282 bits (723), Expect = 3e-95
 Identities = 88/227 (38%), Positives = 116/227 (51%), Gaps = 24/227 (10%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P ++DW +   +T VK Q  CGS WAFS TG IE  +A  T  LVSLSEQELIDC  E 
Sbjct: 2   APESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVDES 61

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV----- 240
           +GC  G    +F+ ++    GG+  E  YPY+  D  C+ N+   +V I+ Y        
Sbjct: 62  EGCYNGWHYQSFEWVVKH--GGIASEADYPYKARDGKCKANEIQDKVTIDNYGVQILSNE 119

Query: 241 ---SRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
              S  E+ +    V   P++V+I+A    FY  G+       C      ++H VLIVGY
Sbjct: 120 STESEAESSLQS-FVLEQPISVSIDAKDFHFYSGGIYDGGN--C-SSPYGINHFVLIVGY 175

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           G      +   V YWI KNSWGE WG  GY R+ R      G CG+N
Sbjct: 176 G------SEDGVDYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGMN 216


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score =  276 bits (709), Expect = 2e-93
 Identities = 87/219 (39%), Positives = 127/219 (57%), Gaps = 13/219 (5%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-- 184
           PR+ DWRE   VT VK+Q  CGS WAFS TG +EG    KT +L+SLSEQ L+DC     
Sbjct: 2   PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61

Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDE 244
           ++GC GG +  AF  +     GGL+ E++YPY   +++C+ N K +     G+V + + E
Sbjct: 62  NEGCNGGLMDYAFQYVQDN--GGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQE 119

Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
             + K +   GP++VAI+A   +  FY  G+       C   +E++ H VL+VGYG + T
Sbjct: 120 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPD--C--SSEDMDHGVLVVGYGFEST 175

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIN 340
           +  +    YW++KNSWGE WG  GY ++ +     CGI 
Sbjct: 176 ESDNN--KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIA 212


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score =  271 bits (696), Expect = 1e-91
 Identities = 81/223 (36%), Positives = 107/223 (47%), Gaps = 28/223 (12%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           +P   DWR+  AVT VK+Q  CGS WAFS    IEG+   +T  L   SEQEL+DCD+  
Sbjct: 1   IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDRRS 60

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV-SRD 243
            GC GG   +A   +      G+    TYPY G  + CR  +K     K +G   V   +
Sbjct: 61  YGCNGGYPWSALQLVAQ---YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYN 117

Query: 244 ETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           E  +    + N P++V + A     Q Y  G+      F       + H+V  VGYG + 
Sbjct: 118 EGALLY-SIANQPVSVVLEAAGKDFQLYRGGI------FVGPCGNKVDHAVAAVGYGPN- 169

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
                    Y +IKNSWG GWGE GY R+ RG     G CG+ 
Sbjct: 170 ---------YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLY 203


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  272 bits (697), Expect = 1e-91
 Identities = 62/225 (27%), Positives = 94/225 (41%), Gaps = 20/225 (8%)

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
           N   P   D R+   VT ++ Q  CGS+WAFS     E  Y A  ++ + L+EQEL+DC 
Sbjct: 7   NGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQELVDCA 66

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
               GC G +I    + I      G+ +E  Y Y   +++CR         I+ Y  +  
Sbjct: 67  S-QHGCHGDTIPRGIEYIQH---NGVVQESYYRYVAREQSCRRPNAQR-FGISNYCQIYP 121

Query: 242 RDETDMAKYLVE-NGPMAVAINA---YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
            +   + + L + +  +AV I      A + Y              G +   H+V IVGY
Sbjct: 122 PNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRD--N--GYQPNYHAVNIVGY 177

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
              +         YWI++NSW   WG+ GY           I +Y
Sbjct: 178 SNAQGV------DYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEY 216


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score =  270 bits (693), Expect = 4e-91
 Identities = 88/218 (40%), Positives = 126/218 (57%), Gaps = 16/218 (7%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P + D+R+   VT VK+Q  CGS WAFS+ G +EG    KT KL++LS Q L+DC  E+D
Sbjct: 2   PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 61

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDET 245
           GC GG ++NAF  +      G++ E  YPY G +++C  N      K  GY  +   +E 
Sbjct: 62  GCGGGYMTNAFQYVQKN--RGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEK 119

Query: 246 DMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
            + + +   GP++VAI+A   + QFY  GV +     C   ++NL+H+VL VGYG+ +  
Sbjct: 120 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDES--C--NSDNLNHAVLAVGYGIQKGN 175

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIN 340
                  +WIIKNSWGE WG KGY  + R  + +CGI 
Sbjct: 176 ------KHWIIKNSWGENWGNKGYILMARNKNNACGIA 207


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score =  270 bits (692), Expect = 6e-91
 Identities = 79/223 (35%), Positives = 117/223 (52%), Gaps = 28/223 (12%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           +P + DWR+  AVT V++Q  CGS W FS+   +EG+    T +L+SLSEQEL+DC++  
Sbjct: 1   IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCERRS 60

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSV-SRD 243
            GC GG    A   + +    G+   + YPY G  + CR ++ K  +VK +G   V   +
Sbjct: 61  YGCRGGFPLYALQYVAN---SGIHLRQYYPYEGVQRQCRASQAKGPKVKTDGVGRVPRNN 117

Query: 244 ETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           E  + +  +   P+++ + A   A Q Y  G+      F      ++ H+V  VGYG D 
Sbjct: 118 EQALIQ-RIAIQPVSIVVEAKGRAFQNYRGGI------FAGPCGTSIDHAVAAVGYGND- 169

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
                    Y +IKNSWG GWGE GY R+ RG     G+CG+ 
Sbjct: 170 ---------YILIKNSWGTGWGEGGYIRIKRGSGNPQGACGVL 203


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score =  268 bits (687), Expect = 5e-90
 Identities = 77/223 (34%), Positives = 107/223 (47%), Gaps = 24/223 (10%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           LP   DWR+  AVT V+ Q  CGS WAFS    +EG+   +T KLV LSEQEL+DC++  
Sbjct: 1   LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS 60

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSV-SRD 243
            GC+GG    A + +      G+     YPY+     CR  +    + K +G   V   +
Sbjct: 61  HGCKGGYPPYALEYVAK---NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNN 117

Query: 244 ETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           E ++    +   P++V + +     Q Y  G+      F       + H+V  VGYG   
Sbjct: 118 EGNLLN-AIAKQPVSVVVESKGRPFQLYKGGI------FEGPCGTKVDHAVTAVGYGKSG 170

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
            K       Y +IKNSWG  WGEKGY R+ R      G CG+ 
Sbjct: 171 GK------GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLY 207


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score =  267 bits (684), Expect = 1e-89
 Identities = 92/222 (41%), Positives = 125/222 (56%), Gaps = 22/222 (9%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           LP + DWRE  AV  VK+Q  CGS WAFST   +EG+    T  L+SLSEQ+L+DC   +
Sbjct: 3   LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTAN 62

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDE 244
            GC GG ++ AF  I++   GG+  E+TYPYRG D  C     A  V I+ Y +V S +E
Sbjct: 63  HGCRGGWMNPAFQFIVNN--GGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNE 120

Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
             + K  V N P++V ++A     Q Y +G+      F    N + +H++ +VGYG +  
Sbjct: 121 QSLQK-AVANQPVSVTMDAAGRDFQLYRSGI------FTGSCNISANHALTVVGYGTEND 173

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           K       +WI+KNSWG+ WGE GY R  R     DG CGI 
Sbjct: 174 K------DFWIVKNSWGKNWGESGYIRAERNIENPDGKCGIT 209


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score =  266 bits (683), Expect = 1e-89
 Identities = 83/222 (37%), Positives = 115/222 (51%), Gaps = 28/222 (12%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P + DWRE  AVT VK+Q  CGS WAFST   IEG+    T +L+SLSEQEL+DC++   
Sbjct: 2   PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERRSH 61

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSV-SRDE 244
           GC+GG  + +   ++     G+  E+ YPY      CR    K  +V I GY  V + DE
Sbjct: 62  GCDGGYQTTSLQYVVD---NGVHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDE 118

Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
             + +  + N P++V  ++     QFY  G+      +      N  H+V  VGYG    
Sbjct: 119 ISLIQ-AIANQPVSVVTDSRGRGFQFYKGGI------YEGPCGTNTDHAVTAVGYGKT-- 169

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
                   Y ++KNSWG  WGEKGY R+ R      G+CG+ 
Sbjct: 170 --------YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVY 203


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score =  264 bits (676), Expect = 1e-88
 Identities = 88/219 (40%), Positives = 116/219 (52%), Gaps = 24/219 (10%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           LP   DWR+  AVT VK+Q  CGS WAFST   +E +   +T  L+SLSEQEL+DCD+++
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
            GC GG+   A+  I++   GG++ +  YPY+     C+   K   V I+GY  V     
Sbjct: 61  HGCLGGAFVFAYQYIINN--GGIDTQANYPYKAVQGPCQAASKV--VSIDGYNGVPFCNE 116

Query: 246 DMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
              K  V   P  VAI+A     Q Y +G+      F       L+H V IVGY  +   
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGI------FSGPCGTKLNHGVTIVGYQAN--- 167

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYR--GDGSCGIN 340
                  YWI++NSWG  WGEKGY R+ R  G G CGI 
Sbjct: 168 -------YWIVRNSWGRYWGEKGYIRMLRVGGCGLCGIA 199


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score =  263 bits (674), Expect = 6e-88
 Identities = 100/225 (44%), Positives = 125/225 (55%), Gaps = 23/225 (10%)

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           T+P + DWR+  AVT VKDQ  CGS WAFST   +EG+   KT KLVSLSEQEL+DCD +
Sbjct: 1   TVPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD 60

Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSV-S 241
            + GC GG +  AF+ I  +  GG+  E  YPY   D  C ++K+      I+G+ +V  
Sbjct: 61  QNQGCNGGLMDYAFEFIKQR--GGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPE 118

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            DE  + K  V N P++VAI+A     QFY  GV      F       L H V IVGYG 
Sbjct: 119 NDENALLK-AVANQPVSVAIDAGGSDFQFYSEGV------FTGSCGTELDHGVAIVGYGT 171

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
                      YW +KNSWG  WGEKGY R+ RG    +G CGI 
Sbjct: 172 TI-----DGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIA 211


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score =  262 bits (671), Expect = 1e-87
 Identities = 79/222 (35%), Positives = 117/222 (52%), Gaps = 20/222 (9%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
           LP + DWRE   VT VK Q  CG+ WAFS  G +E     KT KLVSLS Q L+DC  E 
Sbjct: 2   LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 61

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
             + GC GG ++ AF  I+     G++ + +YPY+  D+ C+ + K      + Y  +  
Sbjct: 62  YGNKGCNGGFMTTAFQYIIDN--KGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 119

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
             E  + + +   GP++V ++A   +   Y +GV     ++     +N++H VL+VGYG 
Sbjct: 120 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV-----YYEPSCTQNVNHGVLVVGYG- 173

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIN 340
                      YW++KNSWG  +GE+GY R+ R     CGI 
Sbjct: 174 -----DLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIA 210


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score =  261 bits (670), Expect = 2e-87
 Identities = 94/223 (42%), Positives = 119/223 (53%), Gaps = 21/223 (9%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
           LP   DWR    VT VKDQ  CGS WAFSTTG +EG + AKT KLVSLSEQEL+DC +  
Sbjct: 7   LPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAE 66

Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SR 242
            +  C GG +++AF  ++    GG+  E  YPY   D+ CR       VKI G+  V  R
Sbjct: 67  GNQSCSGGEMNDAFQYVLDS--GGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRR 124

Query: 243 DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
            E  M    +   P+++AI A     QFY  GV       C     +L H VL+VGYG D
Sbjct: 125 SEAAMKA-ALAKSPVSIAIEADQMPFQFYHEGVFDA---SC---GTDLDHGVLLVGYGTD 177

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGIN 340
           +         +WI+KNSWG GWG  GY  +      +G CG+ 
Sbjct: 178 KESKKD----FWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLL 216


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score =  260 bits (666), Expect = 5e-87
 Identities = 81/222 (36%), Positives = 117/222 (52%), Gaps = 23/222 (10%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           LP   DWR   AV  +K+Q  CGS WAFS    +E +   +T +L+SLSEQEL+DCD   
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS 60

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDE 244
            GC GG ++NAF  I++   GG++ ++ YPY     +C+  +    V ING+  V   +E
Sbjct: 61  HGCNGGWMNNAFQYIITN--GGIDTQQNYPYSAVQGSCKPYRLRV-VSINGFQRVTRNNE 117

Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           + +    V + P++V + A     Q Y +G+      F        +H V+IVGYG    
Sbjct: 118 SALQS-AVASQPVSVTVEAAGAPFQHYSSGI------FTGPCGTAQNHGVVIVGYGTQSG 170

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           K       YWI++NSWG+ WG +GY  + R      G CGI 
Sbjct: 171 K------NYWIVRNSWGQNWGNQGYIWMERNVASSAGLCGIA 206


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score =  261 bits (670), Expect = 7e-87
 Identities = 98/226 (43%), Positives = 123/226 (54%), Gaps = 24/226 (10%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           LP + DWR+  AVTGVKDQ  CGS WAFST  ++EG+ A +T  LVSLSEQELIDCD  D
Sbjct: 4   LPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTAD 63

Query: 186 D-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ----VKINGYVSV 240
           + GC+GG + NAF+ I +   GGL  E  YPYR     C + + A      V I+G+  V
Sbjct: 64  NDGCQGGLMDNAFEYIKNN--GGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDV 121

Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             +  +     V N P++VA+ A   A  FY  GV      F       L H V +VGYG
Sbjct: 122 PANSEEDLARAVANQPVSVAVEASGKAFMFYSEGV------FTGECGTELDHGVAVVGYG 175

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           V           YW +KNSWG  WGE+GY R+ +      G CGI 
Sbjct: 176 V-----AEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIA 216


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score =  258 bits (662), Expect = 3e-86
 Identities = 89/222 (40%), Positives = 112/222 (50%), Gaps = 24/222 (10%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P++ DWR   AVT VK+Q  CGS WAFST   +EG+    T  L+ LSEQEL+DCD+   
Sbjct: 2   PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSY 61

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV-SRDE 244
           GC+GG  + +   + +    G+   K YPY+     CR   K    VKI GY  V S  E
Sbjct: 62  GCKGGYQTTSLQYVAN---NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCE 118

Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           T      + N P++V + A     Q Y +GV      F       L H+V  VGYG    
Sbjct: 119 TSFLG-ALANQPLSVLVEAGGKPFQLYKSGV------FDGPCGTKLDHAVTAVGYGTSDG 171

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           K       Y IIKNSWG  WGEKGY RL R      G+CG+ 
Sbjct: 172 K------NYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVY 207


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score =  256 bits (656), Expect = 2e-85
 Identities = 85/224 (37%), Positives = 120/224 (53%), Gaps = 24/224 (10%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
           LP   DWR   AV  +KDQ  CGS+WAFST   +EG+    T  L+SLSEQEL+DC +  
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60

Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSV-S 241
              GC+GG +++ F  I++   GG+  E  YPY  ++  C L+ +  +   I+ Y +V  
Sbjct: 61  NTRGCDGGFMTDGFQFIINN--GGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPY 118

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            +E  +    V   P++VA+ A  Y  Q Y +G+      F       + H+V IVGYG 
Sbjct: 119 NNEWALQT-AVAYQPVSVALEAAGYNFQHYSSGI------FTGPCGTAVDHAVTIVGYGT 171

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGIN 340
           +        + YWI+KNSWG  WGE+GY R+ R     G CGI 
Sbjct: 172 E------GGIDYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGIA 209


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  253 bits (649), Expect = 9e-84
 Identities = 59/255 (23%), Positives = 94/255 (36%), Gaps = 38/255 (14%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
             R  D     +   V+DQ  C +SW F++  ++E +   K  +   +S   + +C +  
Sbjct: 10  CNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYKGE 69

Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG------------------DDKACRL 225
             D C+ GS    F  I+    G L  E  YPY                    D+     
Sbjct: 70  HKDRCDEGSSPMEFLQIIED-YGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILH 128

Query: 226 NKKATQ-VKINGYVSV---------SRDETDMAKYLVENGPMAVAINAY-ALQFYVTGVS 274
           NK     +   GY +                +   ++  G +   I A   + +  +G  
Sbjct: 129 NKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSGKK 188

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-G 333
                 C  G++   H+V IVGYG        K   YWI++NSWG  WG++GYF++   G
Sbjct: 189 VK--NLC--GDDTADHAVNIVGYGNYVNSEGEK-KSYWIVRNSWGPYWGDEGYFKVDMYG 243

Query: 334 DGSCGINDYVRSALV 348
              C  N      + 
Sbjct: 244 PTHCHFNFIHSVVIF 258


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score =  237 bits (606), Expect = 5e-78
 Identities = 83/221 (37%), Positives = 113/221 (51%), Gaps = 18/221 (8%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P + DWR+  AVT VKDQ  CG  WAF  TG IEG+ A  T +L+S+SEQ+++DCD    
Sbjct: 2   PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCDTXXX 61

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
              GG   +AF  +++   GG+  +  YPY G D  C LNK     +I+GY +V    + 
Sbjct: 62  XXXGGDADDAFRWVITN--GGIASDANYPYTGVDGTCDLNKPIA-ARIDGYTNVPNSSSA 118

Query: 247 MAKYLVENGPMAVAINA--YALQFYVT-GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           +    V   P++V I     + Q Y   G+       C      + H+VLIVGYG +   
Sbjct: 119 LLD-AVAKQPVSVNIYTSSTSFQLYTGPGIFAGSS--CSDDPATVDHTVLIVGYGSNG-- 173

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
                  YWI+KNSWG  WG  GY  + R     DG C I+
Sbjct: 174 ---TNADYWIVKNSWGTEWGIDGYILIRRNTNRPDGVCAID 211


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  230 bits (589), Expect = 7e-74
 Identities = 71/292 (24%), Positives = 103/292 (35%), Gaps = 53/292 (18%)

Query: 89  NEFSDLSTAEFQAKYLGFKLKPSYA----DRSVPAMIPNITLPRAFD----WREYDAVTG 140
               +++  E + +  G   K + A     R          LP +FD    W     +  
Sbjct: 32  GVMQNITLREAK-RLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQ 90

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKT-KKLVSLSEQELIDCDQE-DDGCEGGSISNAFD 198
           + DQ+ CGS WA +    +   +      + V +S  +L+ C  +  DGC GG    A+ 
Sbjct: 91  IADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWA 150

Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRL------------------------NKKATQVKI 234
              S    GL  +   PY     +                           +     V  
Sbjct: 151 YFSST---GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNY 207

Query: 235 NGYVSVS-RDETDMAKYLVENGPMAVAINAYA-LQFYVTGV-SHPIQFFCDGGNENLSHS 291
             + S + + E D  + L   GP  VA + Y     Y +GV  H        G     H+
Sbjct: 208 RSWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHV------SGQYLGGHA 261

Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
           V +VG+G          VPYW I NSW   WG  GYF + RG   CGI D  
Sbjct: 262 VRLVGWGTSNG------VPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGG 307


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  216 bits (552), Expect = 6e-69
 Identities = 67/277 (24%), Positives = 109/277 (39%), Gaps = 51/277 (18%)

Query: 99  FQAKYLGFKLKPSYADRSVPA--------MIPNITLPRAFDWREYDAV---TGVKDQTM- 146
           F+     ++         +           +    LP+++DWR  D V   +  ++Q + 
Sbjct: 1   FRRGQTCYRPLRGDGLAPLGRTTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIP 60

Query: 147 --CGSSWAFSTTGNIEGVYAAKTK---KLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
             CGS WA ++T  +      K K       LS Q +IDC      CEGG+  + +D   
Sbjct: 61  QYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCG-NAGSCEGGNDLSVWDYAH 119

Query: 202 SKLGGGLEEEKTYPYRGDD---------------KACRLNKKATQVKINGYVSVSRDETD 246
                G+ +E    Y+  D               K C   +  T  ++  Y S+S  E  
Sbjct: 120 QH---GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKM 176

Query: 247 MAKYLVENGPMAVAINAY-ALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
           MA+ +  NGP++  I A   L  Y  G+ +             ++H V + G+G+     
Sbjct: 177 MAE-IYANGPISCGIMATERLANYTGGIYAEY------QDTTYINHVVSVAGWGIS---- 225

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
                 YWI++NSWGE WGE+G+ R+       G   
Sbjct: 226 --DGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGA 260


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  214 bits (548), Expect = 7e-68
 Identities = 75/299 (25%), Positives = 113/299 (37%), Gaps = 59/299 (19%)

Query: 87  GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY----DAVTGVK 142
           G N F ++  +  + +  G  L              ++ LP +FD RE       +  ++
Sbjct: 28  GHN-FYNVDMSYLK-RLCGTFLGGP-KPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIR 84

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVS--LSEQELIDC--DQEDDGCEGGSISNAFD 198
           DQ  CGS WAF     I       T   VS  +S ++L+ C      DGC GG  + A++
Sbjct: 85  DQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN 144

Query: 199 TIMSKLGGGLEEEKTY-------PYRGDDKACRLNKKATQVKING--------------- 236
               K   GL     Y       PY        +N         G               
Sbjct: 145 FWTRK---GLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSP 201

Query: 237 -----------YVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGG 284
                        SVS  E D+   + +NGP+  A + Y+    Y +GV     +    G
Sbjct: 202 TYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV-----YQHVTG 256

Query: 285 NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
                H++ I+G+GV+         PYW++ NSW   WG+ G+F++ RG   CGI   V
Sbjct: 257 EMMGGHAIRILGWGVENG------TPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEV 309


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  212 bits (541), Expect = 1e-67
 Identities = 65/266 (24%), Positives = 110/266 (41%), Gaps = 54/266 (20%)

Query: 124 ITLPRAFDWRE----YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKT--KKLVSLSEQE 177
           + +P +FD R+      ++  ++DQ+ CGS WAF     +      ++  K+ V LS  +
Sbjct: 1   VEIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVD 60

Query: 178 LIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE--------KTYPYRGDDKACRLN-- 226
           L+ C +    GCEGG +  A+D  + +  G +           + YP+   +   +    
Sbjct: 61  LLSCCESCGLGCEGGILGPAWDYWVKE--GIVTGSSKENHAGCEPYPFPKCEHHTKGKYP 118

Query: 227 ------------------KKATQVKINGY-----VSVSRDETDMAKYLVENGPMAVAINA 263
                             K  T    + +      +V  DE  + K +++ GP+      
Sbjct: 119 PCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTV 178

Query: 264 YA-LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
           Y     Y +G+     +    G     H++ I+G+GV+         PYW+I NSW E W
Sbjct: 179 YEDFLNYKSGI-----YKHITGETLGGHAIRIIGWGVENK------APYWLIANSWNEDW 227

Query: 323 GEKGYFRLYRGDGSCGINDYVRSALV 348
           GE GYFR+ RG   C I   V +  +
Sbjct: 228 GENGYFRIVRGRDECSIESEVTAGRI 253


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  208 bits (532), Expect = 4e-66
 Identities = 70/263 (26%), Positives = 105/263 (39%), Gaps = 56/263 (21%)

Query: 123 NITLPRAFDWREY----DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS--LSEQ 176
           ++ LP +FD RE       +  ++DQ  CGS+WAF     I       T   VS  +S +
Sbjct: 4   DLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAE 63

Query: 177 ELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY-------PYRGDDKACRLNK 227
           +L+ C      DGC GG  + A++    K   GL     Y       PY        +N 
Sbjct: 64  DLLTCCGSMCGDGCNGGYPAEAWNFWTRK---GLVSGGLYESHVGCRPYSIPPCEAHVNG 120

Query: 228 --------------------------KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
                                     K  +       SVS  E D+   + +NGP+  A 
Sbjct: 121 ARPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 180

Query: 262 NAYA-LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
           + Y+    Y +GV   +      G     H++ I+G+GV+         PYW++ NSW  
Sbjct: 181 SVYSDFLLYKSGVYQHVT-----GEMMGGHAIRILGWGVENG------TPYWLVANSWNT 229

Query: 321 GWGEKGYFRLYRGDGSCGINDYV 343
            WG+ G+F++ RG   CGI   V
Sbjct: 230 DWGDNGFFKILRGQDHCGIESEV 252


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  202 bits (514), Expect = 4e-63
 Identities = 55/286 (19%), Positives = 85/286 (29%), Gaps = 43/286 (15%)

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYAD----RSVPAMIPNITLPRAFDWREYD 136
           H SG+              +    G+   P  AD       P       LP   D     
Sbjct: 10  HSSGLVPRGSHMQTVLKRRKKSGYGYI--PDIADIRDFSYTPEKSVIAALPPKVDLTPP- 66

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----DDGCEGGS 192
               V DQ   GS  A +    I+       +    +  +  I  ++         + G+
Sbjct: 67  --FQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIYYNERKIEGHVNYDSGA 124

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN-----------------KKATQVKIN 235
           +      ++ K   G+  EK +PY       R                   K A   KI 
Sbjct: 125 MIRDGIKVLHK--LGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKIT 182

Query: 236 GYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
            Y  V++D   +   L    P     + Y       +     I            H+VL 
Sbjct: 183 EYSRVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVR-IPLPTKNDTLEGGHAVLC 241

Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL-YRGDGSCGI 339
           VGY           + ++ I+NSWG   GE GYF + Y    +  +
Sbjct: 242 VGYD--------DEIRHFRIRNSWGNNVGEDGYFWMPYEYISNTQL 279


>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 106

 Score = 78.2 bits (193), Expect = 2e-18
 Identities = 31/95 (32%), Positives = 39/95 (41%), Gaps = 9/95 (9%)

Query: 31  HHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
           HH HH              F+ F   + K+YAT  E   R  IF  NL  I       + 
Sbjct: 6   HHHHHGSIWEWKEAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY- 64

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
           S    +N F DLS  EF+ KYLGFK   +     +
Sbjct: 65  SYSLKMNHFGDLSRDEFRRKYLGFKKSRNLKSHHL 99


>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
           disorder P like protein, hydrolase; NMR {Drosophila
           melanogaster}
          Length = 80

 Score = 58.9 bits (143), Expect = 1e-11
 Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 5/75 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLST 96
             +  +  + +K Y    E   R  I++ +  +I+   +  E G   +  G+N  +DL+ 
Sbjct: 8   EEWVEYKSKFDKNYEAE-EDLMRRRIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTP 66

Query: 97  AEFQAKYLGFKLKPS 111
            EF  +  G K+ P+
Sbjct: 67  EEFAQRS-GKKVPPN 80


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 53.7 bits (128), Expect = 3e-08
 Identities = 31/166 (18%), Positives = 49/166 (29%), Gaps = 16/166 (9%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED- 185
              F   + + +T VK+Q   G+ W +S+   +E       K    LSE   +     D 
Sbjct: 11  GFVFTTVKENPITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLDR 70

Query: 186 -----------DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI 234
                         +GGS  +A   + +    GL  E+             N        
Sbjct: 71  ADAAVRTHGDVSFSQGGSFYDALYGMETF---GLVPEEEMRPGMMYADTLSNHTELSALT 127

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           +  V+             EN  M       A+     GV  P +F 
Sbjct: 128 DAMVAAIAKGKLRKLQSDENNAMLWKKAVAAVHQIYLGVP-PEKFT 172



 Score = 46.8 bits (110), Expect = 5e-06
 Identities = 18/86 (20%), Positives = 29/86 (33%), Gaps = 7/86 (8%)

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
           +DMA +L               Q + T     + +  D       H + I G   D+   
Sbjct: 275 SDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAY--DNYETTDDHGMQIYGIAKDQEG- 331

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRL 330
                 Y+++KNSWG      G +  
Sbjct: 332 ----NEYYMVKNSWGTNSKYNGIWYA 353


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
            acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
            synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 47.4 bits (112), Expect = 6e-06
 Identities = 48/274 (17%), Positives = 85/274 (31%), Gaps = 94/274 (34%)

Query: 60   YSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAEFQAKYLGFK----LKPSY 112
            +S L I   N   + +    E G  +   Y    F  +   + + + + FK       SY
Sbjct: 1659 FSILDIVINNPVNLTIHFGGEKGKRIRENYSAMIFETIVDGKLKTEKI-FKEINEHSTSY 1717

Query: 113  ---ADRSV--------PAMIPNITLPRAF--DWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
               +++ +        PA+     + +A   D +    +    D T  G S      G  
Sbjct: 1718 TFRSEKGLLSATQFTQPALT---LMEKAAFEDLKSKGLI--PADATFAGHS-----LG-- 1765

Query: 160  EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAF-DTIMSKLGGGLEEEKTYPYRG 218
            E  YAA    L                      +S      +  ++   +       YRG
Sbjct: 1766 E--YAA----LA----------------SLADVMS--IESLV--EV---VF------YRG 1790

Query: 219  DDKACRLNKKATQVKINGYVSVSRDETDMAKYL---VENGPMAVAINAYALQFYVTGVSH 275
                        QV      +V RDE   + Y    +  G +A + +  ALQ+ V  V  
Sbjct: 1791 ---------MTMQV------AVPRDELGRSNYGMIAINPGRVAASFSQEALQYVVERVGK 1835

Query: 276  PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
               +  +  N N+ +   +   G        +A+
Sbjct: 1836 RTGWLVEIVNYNVENQQYVAA-G------DLRAL 1862



 Score = 45.8 bits (108), Expect = 2e-05
 Identities = 50/301 (16%), Positives = 96/301 (31%), Gaps = 96/301 (31%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALF-------NYFLE-QH-NKTYATLVEYYS 61
           AL     +V      G+ +L         A+F       +YF E +   +TY  LV    
Sbjct: 144 ALFR---AVGE----GNAQLV--------AIFGGQGNTDDYFEELRDLYQTYHVLVGDL- 187

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQ--AKYLGFKLKPSYADRSV 117
            +   +  L   +L++ T     V+  GLN    L          YL           S+
Sbjct: 188 -IKFSAETLS--ELIRTTLDAEKVFTQGLNILEWLENPSNTPDKDYL----------LSI 234

Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL--SE 175
           P   P I + +   +     + G     +   S+    TG+ +G+  A     ++   S 
Sbjct: 235 PISCPLIGVIQLAHYVVTAKLLGFTPGEL--RSYLKGATGHSQGLVTA---VAIAETDSW 289

Query: 176 QELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP-------------YRGDDK- 221
           +      ++       +I+  F  I      G+   + YP                +   
Sbjct: 290 ESFFVSVRK-------AITVLF-FI------GVRCYEAYPNTSLPPSILEDSLENNEGVP 335

Query: 222 ----ACR-LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVA-INAYALQFYVTGVSH 275
               +   L ++  Q  +        ++T+   +L     + ++ +N  A    V+G   
Sbjct: 336 SPMLSISNLTQEQVQDYV--------NKTN--SHLPAGKQVEISLVNG-AKNLVVSG--P 382

Query: 276 P 276
           P
Sbjct: 383 P 383



 Score = 33.9 bits (77), Expect = 0.089
 Identities = 32/188 (17%), Positives = 50/188 (26%), Gaps = 57/188 (30%)

Query: 4   FYFFAGVA------LLSLTVSVS-----------SFM-VVGDEKLHHLHHVKHTALFNYF 45
             FF GV         SL  S+            S M  + +     +    +    N  
Sbjct: 302 VLFFIGVRCYEAYPNTSLPPSILEDSLENNEGVPSPMLSISNLTQEQVQ--DYVNKTNSH 359

Query: 46  LEQHNKTYATLVE---------YYSRLHIFSGNLRKIQ----LLQDTEHGSG--VYGLNE 90
           L    +   +LV              L+  +  LRK +    L Q     S   +   N 
Sbjct: 360 LPAGKQVEISLVNGAKNLVVSGPPQSLYGLNLTLRKAKAPSGLDQSRIPFSERKLKFSNR 419

Query: 91  FSDLS-TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWRE-------YDAVTGVK 142
           F  L   + F +  L               +I    +     +         YD   G  
Sbjct: 420 F--LPVASPFHSHLL----------VPASDLINKDLVKNNVSFNAKDIQIPVYDTFDG-S 466

Query: 143 D-QTMCGS 149
           D + + GS
Sbjct: 467 DLRVLSGS 474



 Score = 33.5 bits (76), Expect = 0.13
 Identities = 47/314 (14%), Positives = 86/314 (27%), Gaps = 107/314 (34%)

Query: 46  LEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAK 102
           L   +  +  LV   +     +  L++     L + TE   G    +E +  + AE   K
Sbjct: 11  LSHGSLEHVLLVP--TASFFIASQLQEQFNKILPEPTE---GFAADDEPT--TPAELVGK 63

Query: 103 YLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS---SWAFSTT 156
           +LG+    ++PS   +    +  N+ L   F+   Y          + G+   + A    
Sbjct: 64  FLGYVSSLVEPSKVGQFDQVL--NLCL-TEFE-NCY----------LEGNDIHALAAKLL 109

Query: 157 GNIEGV-----------YAAKT---KKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMS 202
              +               A+    +     S   L     E +             +++
Sbjct: 110 QENDTTLVKTKELIKNYITARIMAKRPFDKKSNSALFRAVGEGNA-----------QLVA 158

Query: 203 KLGG-G-----LEE--E--KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
             GG G      EE  +  +TY     D    L K + +        + R   D  K   
Sbjct: 159 IFGGQGNTDDYFEELRDLYQTYHVLVGD----LIKFSAET----LSELIRTTLDAEKVFT 210

Query: 253 E--------NGP--------MAVA------INAYAL-QFYVT----GVSHPIQFFCDGGN 285
           +          P        +         I    L  + VT    G   P +       
Sbjct: 211 QGLNILEWLENPSNTPDKDYLLSIPISCPLIGVIQLAHYVVTAKLLGF-TPGELR----- 264

Query: 286 ENLSHSVLIVGYGV 299
            +          G+
Sbjct: 265 -SYLKGATGHSQGL 277


>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
           photosynthetic reaction center, peripheral antenna; HET:
           CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
          Length = 154

 Score = 43.4 bits (101), Expect = 2e-05
 Identities = 14/30 (46%), Positives = 16/30 (53%), Gaps = 2/30 (6%)

Query: 98  EFQA-KYLGFKLKPSYADRSVPAMIPNITL 126
           E QA K L   LK  YAD S PA+    T+
Sbjct: 18  EKQALKKLQASLKL-YADDSAPALAIKATM 46



 Score = 38.4 bits (88), Expect = 8e-04
 Identities = 6/25 (24%), Positives = 14/25 (56%), Gaps = 1/25 (4%)

Query: 239 SVSRDETDMAKYLVENGPMAVAINA 263
           ++ + +  +  Y  ++ P A+AI A
Sbjct: 21  ALKKLQASLKLYADDSAP-ALAIKA 44


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 43.6 bits (102), Expect = 7e-05
 Identities = 19/84 (22%), Positives = 33/84 (39%), Gaps = 3/84 (3%)

Query: 246 DMAKYL-VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
           D+ K+   + G   + +  + L F V+  +         G   ++H++            
Sbjct: 326 DVGKHFNSKLGLSDMNLYDHELVFGVSLKNMNKAERLTFGESLMTHAMTFTAVSEK--DD 383

Query: 305 THKAVPYWIIKNSWGEGWGEKGYF 328
              A   W ++NSWGE  G KGY 
Sbjct: 384 QDGAFTKWRVENSWGEDHGHKGYL 407


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 41.6 bits (97), Expect = 2e-04
 Identities = 18/84 (21%), Positives = 34/84 (40%), Gaps = 6/84 (7%)

Query: 246 DMAKYL-VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
              K++  + G M + +  Y    Y        +         ++ ++LI G  VD T  
Sbjct: 330 HTPKFMDKKTGVMDIELWNYPAIGYNLPQQKASRI--RYHESLMTAAMLITGCHVDET-- 385

Query: 305 THKAVPYWIIKNSWGEGWGEKGYF 328
             K    + ++NSWG+  G+ G +
Sbjct: 386 -SKLPLRYRVENSWGKDSGKDGLY 408


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 34.1 bits (77), Expect = 0.088
 Identities = 45/343 (13%), Positives = 88/343 (25%), Gaps = 102/343 (29%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLV--EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           L +V++   +N F    N +   L+   +       S        L              
Sbjct: 250 LLNVQNAKAWNAF----NLSCKILLTTRFKQVTDFLSAATTTHISLDHHSMT-------- 297

Query: 91  FSDLSTAEFQAKYLGFKLK--PSYADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQT 145
            +         KYL  + +  P     + P   ++I          W  +  V   K  T
Sbjct: 298 LTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTT 357

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI----------DCDQEDDGCEGGSISN 195
           +  SS           ++     +L        I          D  + D          
Sbjct: 358 IIESSLNVLEPAEYRKMF----DRLSVFPPSAHIPTILLSLIWFDVIKSDV--------- 404

Query: 196 AFDTIMSKL-GGGLEEE--KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
               +++KL    L E+  K            L  K              +E  + + +V
Sbjct: 405 --MVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKL------------ENEYALHRSIV 450

Query: 253 ENGPMAVAINAYALQ--FYVTGVSHPI--QFFCDGGNENLSHSVLIVGYGVDRTKFTHKA 308
           +          Y +   F    +  P   Q+F        SH    +G+ +   +   + 
Sbjct: 451 D---------HYNIPKTFDSDDLIPPYLDQYFY-------SH----IGHHLKNIEHPERM 490

Query: 309 VPY--------WI---IKNSWGEGWGEKGY-------FRLYRG 333
             +        ++   I++     W   G         + Y+ 
Sbjct: 491 TLFRMVFLDFRFLEQKIRHD-STAWNASGSILNTLQQLKFYKP 532



 Score = 29.1 bits (64), Expect = 2.9
 Identities = 41/331 (12%), Positives = 95/331 (28%), Gaps = 112/331 (33%)

Query: 19  VSSFM--VVGDEKLHHLHHVKHTA-----LFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           V      ++  E++ H+   K        LF   L +  +     VE             
Sbjct: 38  VQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWTLLSKQEEMVQKFVE------------- 84

Query: 72  KIQLLQDTEHGSGVYG--LNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIP-NIT 125
             ++L+        Y   ++        E +   +  ++     DR           N++
Sbjct: 85  --EVLRIN------YKFLMSPIK----TEQRQPSMMTRMYIEQRDRLYNDNQVFAKYNVS 132

Query: 126 LPRAFDWREYDAVTGVKDQT------M--CGSSW-AFSTTGNIE-------GVY------ 163
             + +  +   A+  ++         +   G +W A     + +        ++      
Sbjct: 133 RLQPY-LKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKMDFKIFWLNLKN 191

Query: 164 ----AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG- 218
                   + L  L  Q  ID +         +I     +I ++L   L + K Y     
Sbjct: 192 CNSPETVLEMLQKLLYQ--IDPNWTSRSDHSSNIKLRIHSIQAEL-RRLLKSKPYE-NCL 247

Query: 219 ---DD-------KA----CRL--------------NKKATQVKINGYV-SVSRDETD--M 247
               +        A    C++                  T + ++ +  +++ DE    +
Sbjct: 248 LVLLNVQNAKAWNAFNLSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLL 307

Query: 248 AKYL-----------VENGPMAVAINAYALQ 267
            KYL           +   P  ++I A +++
Sbjct: 308 LKYLDCRPQDLPREVLTTNPRRLSIIAESIR 338



 Score = 27.1 bits (59), Expect = 10.0
 Identities = 17/105 (16%), Positives = 33/105 (31%), Gaps = 16/105 (15%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           HH HH+      ++   +H   Y  ++  +     F  N    + +QD      +    E
Sbjct: 2   HHHHHM------DFETGEHQYQYKDILSVF--EDAFVDNF-DCKDVQDMP--KSILSKEE 50

Query: 91  FSDL---STAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
              +     A      L F    S  +  V   +  + L   + +
Sbjct: 51  IDHIIMSKDAVSGTLRL-FWTLLSKQEEMVQKFVEEV-LRINYKF 93


>3t4l_A Histidine kinase 4; PAS domain, hormone receptor, endop reticulum;
           HET: ZEA; 1.53A {Arabidopsis thaliana} PDB: 3t4k_A*
           3t4j_A* 3t4o_A* 3t4q_A* 3t4s_A* 3t4t_A*
          Length = 270

 Score = 32.1 bits (72), Expect = 0.22
 Identities = 14/90 (15%), Positives = 27/90 (30%), Gaps = 4/90 (4%)

Query: 72  KIQLLQDTEHG----SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLP 127
             +LL+    G      VY  +   + +  E  A   G+       +  V  ++  +   
Sbjct: 155 PFRLLETHHLGVVLTFPVYKSSLPENPTVEERIAATAGYLGGAFDVESLVENLLGQLAGN 214

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           +A     YD         M G+    +   
Sbjct: 215 QAIVVHVYDITNASDPLVMYGNQDEEADRS 244


>3n89_A Defective in GERM LINE development protein 3, ISO; KH domains, RNA
           binding, cell cycle; 2.79A {Caenorhabditis elegans}
          Length = 376

 Score = 29.1 bits (64), Expect = 2.2
 Identities = 10/62 (16%), Positives = 18/62 (29%)

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSK 203
               G+ +     GNI+ V  A+   +  L      +    D              I+ +
Sbjct: 230 NETRGNIYEIKVVGNIDNVLKARRYIMDLLPISMCFNIKNTDMAEPSRVSDRNIHMIIDE 289

Query: 204 LG 205
            G
Sbjct: 290 SG 291


>3eye_A PTS system N-acetylgalactosamine-specific IIB component 1;
           structural genomics, phosphotransferase, PSI-2, protein
           structure initiative; 1.45A {Escherichia coli O157}
          Length = 168

 Score = 27.4 bits (61), Expect = 4.6
 Identities = 4/30 (13%), Positives = 13/30 (43%)

Query: 226 NKKATQVKINGYVSVSRDETDMAKYLVENG 255
           +    + +I+  V V   +    +++ + G
Sbjct: 113 HFSEGKKQISSKVYVDDQDLTDLRFIKQRG 142


>1ble_A Fructose permease; phosphotransferase, sugar transport; 2.90A
           {Bacillus subtilis} SCOP: c.38.1.1
          Length = 163

 Score = 27.0 bits (60), Expect = 5.7
 Identities = 6/30 (20%), Positives = 13/30 (43%)

Query: 226 NKKATQVKINGYVSVSRDETDMAKYLVENG 255
             +  + +I   VSV+  +    + L + G
Sbjct: 109 RFENHRRQITKSVSVTEQDIKAFETLSDKG 138


>1nrz_A PTS system, sorbose-specific IIB component; beta sheet core,
           flanking helices, right handed beta-alpha-B crossover,
           transferase; 1.75A {Klebsiella pneumoniae} SCOP:
           c.38.1.1
          Length = 164

 Score = 27.0 bits (60), Expect = 5.7
 Identities = 4/30 (13%), Positives = 12/30 (40%)

Query: 226 NKKATQVKINGYVSVSRDETDMAKYLVENG 255
             +  + ++   VS+   +    + L + G
Sbjct: 108 AWRPGKKQLTKAVSLDPQDIQAFRELDKLG 137


>3bde_A MLL5499 protein; stress responsive A/B barrel domain, structural
           genomics, JO center for structural genomics, JCSG; 1.79A
           {Mesorhizobium loti}
          Length = 120

 Score = 26.8 bits (59), Expect = 6.0
 Identities = 24/125 (19%), Positives = 44/125 (35%), Gaps = 17/125 (13%)

Query: 25  VGDEKLHHLHH---------VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
           +G +K+HH HH         ++HT +F        K  +  +E    L      L  I+ 
Sbjct: 1   MGSDKIHHHHHHENLYFQGMIRHTVVFTL------KHASHSLEEKRFLVDAKKILSAIRG 54

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEF--QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWR 133
           +   E    +    ++    + EF  QA Y  +   P +        +P +      D+ 
Sbjct: 55  VTHFEQLRQISPKIDYHFGFSMEFADQAAYTRYNDHPDHVAFVRDRWVPEVEKFLEIDYV 114

Query: 134 EYDAV 138
              +V
Sbjct: 115 PLGSV 119


>1vsq_C Mannose-specific phosphotransferase enzyme IIB component; sugar
           transport, complex (transferase/phosphocarrier,
           cytoplasm, membrane; HET: NEP; NMR {Escherichia coli}
           PDB: 2jzn_C 2jzo_D 2jzh_A
          Length = 165

 Score = 27.0 bits (60), Expect = 6.5
 Identities = 7/30 (23%), Positives = 13/30 (43%)

Query: 226 NKKATQVKINGYVSVSRDETDMAKYLVENG 255
             +  + ++N  VSV   + +  K L   G
Sbjct: 111 AFRQGKTQVNNAVSVDEKDIEAFKKLNARG 140


>3ic6_A Putative methylase family protein; putative methylase family Pro
           structural genomics, PSI-2, protein structure
           initiative; 2.59A {Neisseria gonorrhoeae fa 1090}
          Length = 223

 Score = 26.9 bits (60), Expect = 8.4
 Identities = 5/11 (45%), Positives = 8/11 (72%)

Query: 287 NLSHSVLIVGY 297
           NL+ +V +V Y
Sbjct: 178 NLAQAVQVVCY 188


>3kty_A Probable methyltransferase; alpha-beta-alpha sandwich, structural
           genomics, PSI-2, prote structure initiative; 2.30A
           {Bordetella pertussis}
          Length = 173

 Score = 26.7 bits (60), Expect = 8.5
 Identities = 1/11 (9%), Positives = 7/11 (63%)

Query: 287 NLSHSVLIVGY 297
           N++ ++ +  +
Sbjct: 156 NVAQALQLAAW 166


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.319    0.136    0.414 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 5,411,979
Number of extensions: 323407
Number of successful extensions: 940
Number of sequences better than 10.0: 1
Number of HSP's gapped: 727
Number of HSP's successfully gapped: 66
Length of query: 348
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 254
Effective length of database: 4,077,219
Effective search space: 1035613626
Effective search space used: 1035613626
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (25.9 bits)