RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy8713
         (309 letters)



>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  246 bits (630), Expect = 1e-80
 Identities = 94/330 (28%), Positives = 121/330 (36%), Gaps = 105/330 (31%)

Query: 40  QAEKNS-LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           +A+ +  + NI     K   GV    N  +          E    LP++FDS   WPNCP
Sbjct: 27  KAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCP 86

Query: 99  TIREIRDQGSCGSCW-------------------------------------GC------ 115
           TI +I DQ +CGSCW                                     GC      
Sbjct: 87  TIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPD 146

Query: 116 ----------------RPYEIAPCEHHVNGTR--PSCDASKGHTPKCVRECQENYDVPYK 157
                           +PY    C HH       P C      TPKC   C +       
Sbjct: 147 RAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPT---IP 203

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
                   SY++   E   M+E++  GP E AF V++D I Y SG               
Sbjct: 204 VVNYRSWTSYAL-QGEDDYMRELFFRGPFEVAFDVYEDFIAYNSG--------------- 247

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                             V+       SG+ LGGHA+R++GWG        YW IANSWN
Sbjct: 248 ------------------VYHH----VSGQYLGGHAVRLVGWGTSN--GVPYWKIANSWN 283

Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPK 307
           T+WG +G F I RG  ECGIE   +AG+P 
Sbjct: 284 TEWGMDGYFLIRRGSSECGIEDGGSAGIPL 313



 Score = 54.7 bits (132), Expect = 1e-08
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG GCNGG P  AW Y+  +G+VS    
Sbjct: 136 CGDGCNGGDPDRAWAYFSSTGLVSDYCQ 163


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  223 bits (571), Expect = 7e-72
 Identities = 100/203 (49%), Positives = 124/203 (61%), Gaps = 40/203 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G   S  GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  S
Sbjct: 155 GLYESHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNS 213

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSVS++EK IM EIY++GPVEGAF+V+ D +LYKSG                        
Sbjct: 214 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSG------------------------ 249

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                    V+       +G+ +GGHAIRILGWG +      YWL+ANSWNTDWGDNG F
Sbjct: 250 ---------VYQH----VTGEMMGGHAIRILGWGVENG--TPYWLVANSWNTDWGDNGFF 294

Query: 287 KILRGKDECGIESSITAGVPKLD 309
           KILRG+D CGIES + AG+P+ D
Sbjct: 295 KILRGQDHCGIESEVVAGIPRTD 317



 Score = 77.8 bits (192), Expect = 1e-16
 Identities = 35/74 (47%), Positives = 47/74 (63%), Gaps = 6/74 (8%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           QA  N   N+  ++LK   G      L   + P+ + ++E D  LPA+FD+R +WP CPT
Sbjct: 26  QAGHNF-YNVDMSYLKRLCGTF----LGGPKPPQRVMFTE-DLKLPASFDAREQWPQCPT 79

Query: 100 IREIRDQGSCGSCW 113
           I+EIRDQGSCGSCW
Sbjct: 80  IKEIRDQGSCGSCW 93



 Score = 57.8 bits (140), Expect = 1e-09
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGG+P  AW +W + G+VSGG Y S  
Sbjct: 130 CGDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 161


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  219 bits (561), Expect = 4e-71
 Identities = 99/205 (48%), Positives = 122/205 (59%), Gaps = 40/205 (19%)

Query: 105 DQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGA 164
             G   S  GCRPY I PCE HVNG RP C   +G TPKC + C+  Y   YK+D ++G 
Sbjct: 96  SGGLYESHVGCRPYSIPPCEAHVNGARPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGY 154

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
            SYSVS++EK IM EIY++GPVEGAF+V+ D +LYKSG                      
Sbjct: 155 NSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSG---------------------- 192

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                      V+       +G+ +GGHAIRILGWG +      YWL+ANSWNTDWGDNG
Sbjct: 193 -----------VYQH----VTGEMMGGHAIRILGWGVENG--TPYWLVANSWNTDWGDNG 235

Query: 285 LFKILRGKDECGIESSITAGVPKLD 309
            FKILRG+D CGIES + AG+P+ D
Sbjct: 236 FFKILRGQDHCGIESEVVAGIPRTD 260



 Score = 72.7 bits (179), Expect = 4e-15
 Identities = 25/37 (67%), Positives = 31/37 (83%), Gaps = 1/37 (2%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           ++E D  LPA+FD+R +WP CPTI+EIRDQGSCGS W
Sbjct: 1   FTE-DLKLPASFDAREQWPQCPTIKEIRDQGSCGSAW 36



 Score = 57.3 bits (139), Expect = 9e-10
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGG+P  AW +W + G+VSGG Y S  
Sbjct: 73  CGDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 104


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  216 bits (553), Expect = 6e-70
 Identities = 83/202 (41%), Positives = 111/202 (54%), Gaps = 39/202 (19%)

Query: 105 DQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGA 164
              S  +  GC PY    CEHH  G  P C +    TP+C + CQ+ Y  PY +D + G 
Sbjct: 91  TGSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGK 150

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
            SY+V ++EK+I KEI ++GPVE  FTV++D + YKSG                      
Sbjct: 151 SSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSG---------------------- 188

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                      ++       +G+ LGGHAIRI+GWG +  +K  YWLIANSWN DWG+NG
Sbjct: 189 -----------IYKH----ITGETLGGHAIRIIGWGVE--NKAPYWLIANSWNEDWGENG 231

Query: 285 LFKILRGKDECGIESSITAGVP 306
            F+I+RG+DEC IES +TAG  
Sbjct: 232 YFRIVRGRDECSIESEVTAGRI 253



 Score = 71.9 bits (177), Expect = 6e-15
 Identities = 19/31 (61%), Positives = 24/31 (77%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           ++P++FDSR KWP C +I  IRDQ  CGSCW
Sbjct: 2   EIPSSFDSRKKWPRCKSIATIRDQSRCGSCW 32



 Score = 58.0 bits (141), Expect = 5e-10
 Identities = 17/35 (48%), Positives = 21/35 (60%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG GC GG  G AW YWVK GIV+G +  +    +
Sbjct: 68  CGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCE 102


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  186 bits (475), Expect = 5e-58
 Identities = 48/298 (16%), Positives = 75/298 (25%), Gaps = 123/298 (41%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQ---GSCGSCW------------------------ 113
             DLP ++D R            R+Q     CGSCW                        
Sbjct: 33  PADLPKSWDWRNVDG-VNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTL 91

Query: 114 -------------GC----------------------RPYEIAPCEHHVNGTRPSCDASK 138
                         C                        Y+    E        +C    
Sbjct: 92  LSVQNVIDCGNAGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTC---- 147

Query: 139 GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
                  +EC    +    +        Y   S  + +M EIY +GP+       + L  
Sbjct: 148 ----NEFKECHAIRNYTLWRV-----GDYGSLSGREKMMAEIYANGPISCGIMATERLAN 198

Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
           Y  G                                 ++ +            H + + G
Sbjct: 199 YTGG---------------------------------IYAE----YQDTTYINHVVSVAG 221

Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG--------IESSITAGVPKL 308
           WG  +    +YW++ NSW   WG+ G  +I+    + G        IE   T G P +
Sbjct: 222 WGISD--GTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV 277



 Score = 50.8 bits (122), Expect = 1e-07
 Identities = 7/28 (25%), Positives = 8/28 (28%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
               C GG     W Y  + GI      
Sbjct: 102 NAGSCEGGNDLSVWDYAHQHGIPDETCN 129


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  183 bits (467), Expect = 5e-55
 Identities = 69/335 (20%), Positives = 96/335 (28%), Gaps = 118/335 (35%)

Query: 34  GAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTK 93
                          +    +    G H          P      +    LP ++D R  
Sbjct: 157 IQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNV 216

Query: 94  WPNCPTIREIRDQGSCGSCW-------------------------------------GC- 115
                 +  +R+Q SCGSC+                                     GC 
Sbjct: 217 HG-INFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCE 275

Query: 116 ----------------------RPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYD 153
                                  PY          GT   C        K   +C   Y 
Sbjct: 276 GGFPYLIAGKYAQDFGLVEEACFPYT---------GTDSPC--------KMKEDCFRYYS 318

Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
             Y          +    NE  +  E+  HGP+  AF V+DD + YK G           
Sbjct: 319 SEYHYV-----GGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKG----------- 362

Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDD--LILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
                                 ++    L    +   L  HA+ ++G+G D  S   YW+
Sbjct: 363 ----------------------IYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWI 400

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           + NSW T WG+NG F+I RG DEC IES   A  P
Sbjct: 401 VKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 435



 Score = 44.0 bits (104), Expect = 3e-05
 Identities = 9/29 (31%), Positives = 13/29 (44%), Gaps = 1/29 (3%)

Query: 9   CGFGCNGGFPGMA-WRYWVKSGIVSGGAY 36
              GC GGFP +   +Y    G+V    +
Sbjct: 270 YAQGCEGGFPYLIAGKYAQDFGLVEEACF 298


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  112 bits (282), Expect = 2e-29
 Identities = 45/309 (14%), Positives = 76/309 (24%), Gaps = 121/309 (39%)

Query: 55  KSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQ---GS--- 108
           KS  G  PD  +   R         V   LP   D    +        + DQ   GS   
Sbjct: 30  KSGYGYIPD--IADIRDFSYTPEKSVIAALPPKVDLTPPFQ-------VYDQGRIGSCTA 80

Query: 109 ------------------------------------CGSCWG------------------ 114
                                                 +                     
Sbjct: 81  NALAAAIQFERIHDKQSPEFIPSRLFIYYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCP 140

Query: 115 --CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYS-VSS 171
               PY   P +       P   ASK  + +C           YK   N+    YS V+ 
Sbjct: 141 EKEWPYGDTPADPRTEEFPPGAPASKKPSDQC-----------YKDAQNYKITEYSRVAQ 189

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   +   +    P    F+V++  +   S    +P                        
Sbjct: 190 DIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTK-------------------- 229

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                        +    GGHA+  +G+ ++ +    ++ I NSW  + G++G F +   
Sbjct: 230 -------------NDTLEGGHAVLCVGYDDEIR----HFRIRNSWGNNVGEDGYFWMPYE 272

Query: 292 KDE-CGIES 299
                 +  
Sbjct: 273 YISNTQLAD 281



 Score = 35.2 bits (81), Expect = 0.016
 Identities = 3/28 (10%), Positives = 7/28 (25%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
                +G       +   K G+     +
Sbjct: 117 HVNYDSGAMIRDGIKVLHKLGVCPEKEW 144


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score = 73.9 bits (182), Expect = 2e-15
 Identities = 34/199 (17%), Positives = 57/199 (28%), Gaps = 43/199 (21%)

Query: 117 PYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYS--VSSNEK 174
           PY                     +  K +    E   +  K    + ++ +   + +  K
Sbjct: 100 PYNYVKVGEQCPKVEDHWMNLWDNG-KILHNKNEPNSLDGKGYTAYESERFHDNMDAFVK 158

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
            I  E+   G V       + +    SG+                               
Sbjct: 159 IIKTEVMNKGSVIAYIKAENVMGYEFSGKKVKN--------------------------- 191

Query: 235 TVFDDLILYKSGKALGGHAIRILGWG---EDEKSKEKYWLIANSWNTDWGDNGLFKILR- 290
                      G     HA+ I+G+G     E  K+ YW++ NSW   WGD G FK+   
Sbjct: 192 ---------LCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMY 242

Query: 291 GKDECGIESSITAGVPKLD 309
           G   C      +  +  +D
Sbjct: 243 GPTHCHFNFIHSVVIFNVD 261



 Score = 37.3 bits (87), Expect = 0.003
 Identities = 9/27 (33%), Positives = 16/27 (59%), Gaps = 1/27 (3%)

Query: 88  FDSRTK-WPNCPTIREIRDQGSCGSCW 113
           + +R K   NC +  ++ DQG+C + W
Sbjct: 9   YCNRLKDENNCISNLQVEDQGNCDTSW 35


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score = 68.6 bits (169), Expect = 7e-14
 Identities = 31/128 (24%), Positives = 57/128 (44%), Gaps = 36/128 (28%)

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
            ++E+++++ +  + PV  AF V +D ++Y+ G                           
Sbjct: 118 MNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKG--------------------------- 150

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                 ++     +K+   +  HA+  +G+GE+      YW++ NSW   WG NG F I 
Sbjct: 151 ------IYSSTSCHKTPDKVN-HAVLAVGYGEENG--IPYWIVKNSWGPQWGMNGYFLIE 201

Query: 290 RGKDECGI 297
           RGK+ CG+
Sbjct: 202 RGKNMCGL 209



 Score = 34.0 bits (79), Expect = 0.032
 Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 3/29 (10%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           P + D R K  N  +   +++QGSCGSCW
Sbjct: 2   PPSMDWRKK-GNFVS--PVKNQGSCGSCW 27


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score = 64.8 bits (159), Expect = 1e-12
 Identities = 30/132 (22%), Positives = 50/132 (37%), Gaps = 37/132 (28%)

Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
           S  +S NE+ +   + + GP+  A      +  Y+ G                       
Sbjct: 110 SVELSQNEQKLAAWLAKRGPISVAINA-FGMQFYRHG----------------------- 145

Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
                     +   L    S   +  HA+ ++G+G+   S   +W I NSW TDWG+ G 
Sbjct: 146 ----------ISRPLRPLCSPWLID-HAVLLVGYGQR--SDVPFWAIKNSWGTDWGEKGY 192

Query: 286 FKILRGKDECGI 297
           + + RG   CG+
Sbjct: 193 YYLHRGSGACGV 204



 Score = 34.8 bits (81), Expect = 0.021
 Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 4/29 (13%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           P  +D R+K     T  +++DQG CGSCW
Sbjct: 2   PPEWDWRSK--GAVT--KVKDQGMCGSCW 26


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score = 63.7 bits (156), Expect = 3e-12
 Identities = 25/129 (19%), Positives = 44/129 (34%), Gaps = 41/129 (31%)

Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
           +  +E  I   +  +GPV  A       + Y  G                          
Sbjct: 118 LPQDEAQIAAWLAVNGPVAVAVDA-SSWMTYTGG-------------------------- 150

Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
                  V          + L  H + ++G+ +   +   YW+I NSW T WG+ G  +I
Sbjct: 151 -------VMTS----CVSEQLD-HGVLLVGYNDS--AAVPYWIIKNSWTTQWGEEGYIRI 196

Query: 289 LRGKDECGI 297
            +G ++C +
Sbjct: 197 AKGSNQCLV 205



 Score = 34.4 bits (80), Expect = 0.026
 Identities = 13/29 (44%), Positives = 16/29 (55%), Gaps = 4/29 (13%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           PA  D R +     T   ++DQG CGSCW
Sbjct: 2   PAAVDWRAR--GAVT--AVKDQGQCGSCW 26


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score = 63.8 bits (156), Expect = 7e-12
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 2/48 (4%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
           HA+ I+G+         YW++ NSW+T+WGDNG        D   IE 
Sbjct: 250 HAVNIVGYSNA--QGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 295



 Score = 39.9 bits (94), Expect = 7e-04
 Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 4/37 (10%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
              ++ + PA  D R       T   IR QG CGS W
Sbjct: 83  ACSINGNAPAEIDLRQM--RTVT--PIRMQGGCGSAW 115



 Score = 29.5 bits (67), Expect = 1.3
 Identities = 7/26 (26%), Positives = 10/26 (38%)

Query: 11  FGCNGGFPGMAWRYWVKSGIVSGGAY 36
            GC+G        Y   +G+V    Y
Sbjct: 149 HGCHGDTIPRGIEYIQHNGVVQESYY 174


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score = 62.5 bits (153), Expect = 8e-12
 Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
           HA+ I+G+   +     YW++ NSW+T+WGDNG        D   IE 
Sbjct: 170 HAVNIVGYSNAQG--VDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 215



 Score = 40.6 bits (96), Expect = 2e-04
 Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 4/37 (10%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
              ++ + PA  D R       T   IR QG CGS W
Sbjct: 3   ACSINGNAPAEIDLRQM--RTVT--PIRMQGGCGSAW 35



 Score = 30.6 bits (70), Expect = 0.55
 Identities = 7/26 (26%), Positives = 10/26 (38%)

Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
           GC+G        Y   +G+V    Y
Sbjct: 69 HGCHGDTIPRGIEYIQHNGVVQESYY 94


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score = 61.5 bits (150), Expect = 6e-11
 Identities = 33/129 (25%), Positives = 48/129 (37%), Gaps = 39/129 (30%)

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
             +E  +   +   GPV  AF   D    Y  G ++ P  ET   +              
Sbjct: 232 GPDENMLADMVATKGPVAVAFDADDPFGSYSGGVYYNPTCETNKFT-------------- 277

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                                 HA+ I+G+G +  + + YWL+ NSW   WG +G FKI 
Sbjct: 278 ----------------------HAVLIVGYGNE--NGQDYWLVKNSWGDGWGLDGYFKIA 313

Query: 290 RGKD-ECGI 297
           R  +  CGI
Sbjct: 314 RNANNHCGI 322



 Score = 39.2 bits (92), Expect = 0.001
 Identities = 13/63 (20%), Positives = 23/63 (36%), Gaps = 4/63 (6%)

Query: 51  RAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCG 110
           +A+    +     +                    PA+FD R +     +   +++QGSCG
Sbjct: 83  KAYTHGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQ--GMVS--PVKNQGSCG 138

Query: 111 SCW 113
           S W
Sbjct: 139 SSW 141


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score = 58.3 bits (142), Expect = 2e-10
 Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE---CGI 297
           H + ++G+G D++SK+ +W++ NSW T WG +G   +   K E   CG+
Sbjct: 167 HGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGL 215



 Score = 37.5 bits (88), Expect = 0.002
 Identities = 14/31 (45%), Positives = 19/31 (61%), Gaps = 4/31 (12%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +LPA  D R++   C T   ++DQ  CGSCW
Sbjct: 6   ELPAGVDWRSR--GCVT--PVKDQRDCGSCW 32


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score = 57.6 bits (139), Expect = 4e-10
 Identities = 27/138 (19%), Positives = 51/138 (36%), Gaps = 11/138 (7%)

Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL-----IKWTIRD 223
           V +N        Y +  V+G   +   +     G   VP + +  +       +   I  
Sbjct: 75  VITNGGIASDANYPYTGVDGTCDLNKPIAARIDGYTNVPNSSSALLDAVAKQPVSVNIYT 134

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
           +++         +F           +  H + I+G+G +  +   YW++ NSW T+WG +
Sbjct: 135 SSTSFQLYTGPGIFAGSSCSDDPATVD-HTVLIVGYGSNGTNA-DYWIVKNSWGTEWGID 192

Query: 284 GLFKILRGKDE----CGI 297
           G   I R  +     C I
Sbjct: 193 GYILIRRNTNRPDGVCAI 210



 Score = 34.8 bits (80), Expect = 0.018
 Identities = 13/29 (44%), Positives = 17/29 (58%), Gaps = 4/29 (13%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           PA+ D R K     T   ++DQG+CG CW
Sbjct: 2   PASIDWRKK--GAVT--SVKDQGACGMCW 26


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score = 56.8 bits (138), Expect = 8e-10
 Identities = 31/136 (22%), Positives = 53/136 (38%), Gaps = 40/136 (29%)

Query: 169 VSSNEKSIMKEIYEHGPVEGAFTV-FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
           +   EK++MK +   GP+  A     +  + YK G +F P                ++  
Sbjct: 115 IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEP--------------DCSSED 160

Query: 228 LGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDNGL 285
           +          D            H + ++G+G +       KYWL+ NSW  +WG  G 
Sbjct: 161 M----------D------------HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGY 198

Query: 286 FKILRGKD-ECGIESS 300
            K+ + +   CGI S+
Sbjct: 199 VKMAKDRRNHCGIASA 214



 Score = 35.2 bits (82), Expect = 0.015
 Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 4/29 (13%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           P + D R K     T   +++QG CGSCW
Sbjct: 2   PRSVDWREK--GYVT--PVKNQGQCGSCW 26


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score = 56.8 bits (138), Expect = 1e-09
 Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 6/50 (12%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE----CGI 297
           H + I+G+G +      YW+  NSW  DWG +G  +I R        CG+
Sbjct: 168 HFVLIVGYGSE--DGVDYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGM 215



 Score = 39.1 bits (92), Expect = 0.001
 Identities = 11/31 (35%), Positives = 16/31 (51%), Gaps = 4/31 (12%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           D P ++D   K     T  +++ QG CGS W
Sbjct: 1   DAPESWDWSKK--GVIT--KVKFQGQCGSGW 27


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score = 56.3 bits (137), Expect = 1e-09
 Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 6/50 (12%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD----ECGI 297
           HA+  +G+G+     + Y LI NSW T WG+ G  +I R        CG+
Sbjct: 159 HAVTAVGYGKS--GGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 206



 Score = 35.5 bits (83), Expect = 0.011
 Identities = 16/30 (53%), Positives = 17/30 (56%), Gaps = 4/30 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP N D R K     T   +R QGSCGSCW
Sbjct: 1   LPENVDWRKK--GAVT--PVRHQGSCGSCW 26



 Score = 30.1 bits (69), Expect = 0.74
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
           GC GG+P  A  Y  K+GI     Y
Sbjct: 61 HGCKGGYPPYALEYVAKNGIHLRSKY 86


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score = 57.3 bits (139), Expect = 1e-09
 Identities = 28/132 (21%), Positives = 52/132 (39%), Gaps = 39/132 (29%)

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
           S +E S+   + + GPV  A    D+L  Y  G F+      + ++              
Sbjct: 230 SGDENSLADAVGQAGPVAVAIDATDELQFYSGGLFYDQTCNQSDLN-------------- 275

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                                 H + ++G+G D    + YW++ NSW + WG++G ++ +
Sbjct: 276 ----------------------HGVLVVGYGSDNG--QDYWILKNSWGSGWGESGYWRQV 311

Query: 290 RGKD-ECGIESS 300
           R     CGI ++
Sbjct: 312 RNYGNNCGIATA 323



 Score = 38.4 bits (90), Expect = 0.002
 Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 5/37 (13%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           Y    + L A+ D R+      +  E++DQG CGS W
Sbjct: 109 YVSSKKPLAASVDWRSN---AVS--EVKDQGQCGSSW 140



 Score = 28.4 bits (64), Expect = 3.0
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 11  FGCNGGFPGMAWRYWVKSGIVSGGAY 36
            GC+GG+   A+ Y    GI+S  AY
Sbjct: 177 AGCDGGWMDSAFSYIHDYGIMSESAY 202


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score = 56.9 bits (138), Expect = 2e-09
 Identities = 31/138 (22%), Positives = 51/138 (36%), Gaps = 40/138 (28%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTV-FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
             +   EK++MK +   GP+  A     +  + YK G +F P   +  M           
Sbjct: 209 VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD---------- 258

Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDN 283
                                     H + ++G+G +       KYWL+ NSW  +WG  
Sbjct: 259 --------------------------HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMG 292

Query: 284 GLFKILRGKD-ECGIESS 300
           G  K+ + +   CGI S+
Sbjct: 293 GYVKMAKDRRNHCGIASA 310



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 4/37 (10%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
              +  + P + D R K     T   +++QG CGSCW
Sbjct: 90  QEPLFYEAPRSVDWREK--GYVT--PVKNQGQCGSCW 122



 Score = 27.2 bits (61), Expect = 7.7
 Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 11  FGCNGGFPGMAWRYWVKS-GIVSGGAY 36
            GCNGG    A++Y   + G+ S  +Y
Sbjct: 159 EGCNGGLMDYAFQYVQDNGGLDSEESY 185


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score = 55.7 bits (135), Expect = 2e-09
 Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 20/84 (23%)

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDE--------KSKEKYWLIANSWN 277
           +    DD   Y+ G        A   HA+ ++G+G  +          K  Y++I NSW 
Sbjct: 151 SIAASDDFAFYRGGFYDGECGAAPN-HAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWG 209

Query: 278 TDWGDNGLFKILRGKDE----CGI 297
           +DWG+ G   +   ++     C I
Sbjct: 210 SDWGEGGYINLETDENGYKKTCSI 233



 Score = 37.2 bits (87), Expect = 0.004
 Identities = 10/35 (28%), Positives = 14/35 (40%), Gaps = 4/35 (11%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
              +     +D R       T   ++DQ  CGSCW
Sbjct: 15  ADAKLDRIAYDWRLH--GGVT--PVKDQALCGSCW 45


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score = 56.1 bits (136), Expect = 3e-09
 Identities = 26/129 (20%), Positives = 45/129 (34%), Gaps = 39/129 (30%)

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
           S +E  +   +   GP   A  V  D ++Y+SG +                         
Sbjct: 207 SGSEVELKNLVGAEGPAAVAVDVESDFMMYRSGIYQSQ---------------------- 244

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
              +    +             HA+  +G+G        YW++ NSW   WG+ G  +++
Sbjct: 245 -TCSPLRVN-------------HAVLAVGYGTQ--GGTDYWIVKNSWGLSWGERGYIRMV 288

Query: 290 RGKD-ECGI 297
           R +   CGI
Sbjct: 289 RNRGNMCGI 297



 Score = 40.7 bits (96), Expect = 3e-04
 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 4/47 (8%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
            ++ L   + Y   +  +P   D R       T  E++DQG+CGS W
Sbjct: 75  ASDILSHGVPYEANNRAVPDKIDWRES--GYVT--EVKDQGNCGSGW 117



 Score = 28.0 bits (63), Expect = 4.3
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 11  FGCNGGFPGMAWRYWVKSGIVSGGAY 36
            GC GG    A++Y  + G+ +  +Y
Sbjct: 154 NGCGGGLMENAYQYLKQFGLETESSY 179


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score = 54.8 bits (133), Expect = 4e-09
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 10/50 (20%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD----ECGI 297
           HA+  +G+G +      Y LI NSW T WG+NG  +I RG       CG+
Sbjct: 159 HAVAAVGYGPN------YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGL 202



 Score = 36.3 bits (85), Expect = 0.007
 Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 4/30 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +P   D R K     T   +++QGSCGSCW
Sbjct: 1   IPEYVDWRQK--GAVT--PVKNQGSCGSCW 26



 Score = 29.0 bits (66), Expect = 1.5
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
          +GCNGG+P  A +   + GI     Y
Sbjct: 61 YGCNGGYPWSALQLVAQYGIHYRNTY 86


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score = 55.3 bits (134), Expect = 4e-09
 Identities = 24/84 (28%), Positives = 34/84 (40%), Gaps = 20/84 (23%)

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDE--------KSKEKYWLIANSWN 277
           +  V DD   YK G         L  HA+ ++G+G  E          K  Y++I NSW 
Sbjct: 149 SVAVSDDFAFYKEGIFDGECGDQLN-HAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWG 207

Query: 278 TDWGDNGLFKILRGKDE----CGI 297
             WG+ G   I   +      CG+
Sbjct: 208 QQWGERGFINIETDESGLMRKCGL 231



 Score = 39.5 bits (93), Expect = 7e-04
 Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 4/37 (10%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           Y   +    A +D R    +  T   ++DQ +CGSCW
Sbjct: 11  YRGEENFDHAAYDWRLH--SGVT--PVKDQKNCGSCW 43


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score = 55.0 bits (133), Expect = 5e-09
 Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 13/67 (19%)

Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
            Y  G         L  H + ++G+G  E  K  YW + NSW   WG+ G  ++ +    
Sbjct: 151 FYSEGVFTGECGTELD-HGVAVVGYGVAEDGK-AYWTVKNSWGPSWGEQGYIRVEKDSGA 208

Query: 295 ----CGI 297
               CGI
Sbjct: 209 SGGLCGI 215



 Score = 38.4 bits (90), Expect = 0.001
 Identities = 15/32 (46%), Positives = 18/32 (56%), Gaps = 4/32 (12%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
            DLP + D R K     T   ++DQG CGSCW
Sbjct: 2   SDLPPSVDWRQK--GAVT--GVKDQGKCGSCW 29


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score = 54.4 bits (132), Expect = 6e-09
 Identities = 22/67 (32%), Positives = 31/67 (46%), Gaps = 18/67 (26%)

Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
            Y+ G        ++  HA+  +G+G D      Y LI NSW T WG+ G  +I RG   
Sbjct: 143 NYRGGIFAGPCGTSID-HAVAAVGYGND------YILIKNSWGTGWGEGGYIRIKRGSGN 195

Query: 295 ----CGI 297
               CG+
Sbjct: 196 PQGACGV 202



 Score = 35.9 bits (84), Expect = 0.008
 Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 4/30 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +P + D R K     T   +R+QG CGSCW
Sbjct: 1   IPTSIDWRQK--GAVT--PVRNQGGCGSCW 26



 Score = 28.6 bits (65), Expect = 1.9
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
          +GC GGFP  A +Y   SGI     Y
Sbjct: 61 YGCRGGFPLYALQYVANSGIHLRQYY 86


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score = 55.3 bits (134), Expect = 6e-09
 Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 3/50 (6%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD-ECGIESS 300
           HA+  +G+G  +    K+W+I NSW  +WG+ G   + R K+  CGI + 
Sbjct: 261 HAVLAVGYGIQKG--NKHWIIKNSWGENWGNKGYILMARNKNNACGIANL 308



 Score = 38.8 bits (91), Expect = 0.002
 Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
            L  +R  + +   E +   P + D R K     T   +++QG CGSCW
Sbjct: 81  PLSHSRSNDTLYIPEWEGRAPDSVDYRKK--GYVT--PVKNQGQCGSCW 125


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score = 53.7 bits (130), Expect = 9e-09
 Identities = 16/47 (34%), Positives = 29/47 (61%), Gaps = 3/47 (6%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD-ECGI 297
           H + ++G+G+     ++YWL+ NSW  ++G+ G  ++ R K   CGI
Sbjct: 165 HGVLVVGYGDLNG--KEYWLVKNSWGHNFGEEGYIRMARNKGNHCGI 209



 Score = 36.7 bits (86), Expect = 0.005
 Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 4/30 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP + D R K   C T  E++ QGSCG+CW
Sbjct: 2   LPDSVDWREK--GCVT--EVKYQGSCGACW 27


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score = 53.7 bits (130), Expect = 9e-09
 Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 5/49 (10%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE---CGI 297
           HA+ I+G+G +      YW++ NSW T WG+ G  +I R       CGI
Sbjct: 162 HAVTIVGYGTE--GGIDYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGI 208



 Score = 35.6 bits (83), Expect = 0.010
 Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 4/30 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP   D R+         +I+DQG CGS W
Sbjct: 1   LPDYVDWRSS--GAVV--DIKDQGQCGSAW 26


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score = 54.2 bits (131), Expect = 1e-08
 Identities = 20/64 (31%), Positives = 34/64 (53%), Gaps = 10/64 (15%)

Query: 242 LYKSGKALGG-------HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD- 293
           LY+SG            H + ++G+G+     ++YWL+ NSW  ++G+ G  ++ R K  
Sbjct: 245 LYRSGVYYEPSCTQNVNHGVLVVGYGDLNG--KEYWLVKNSWGHNFGEEGYIRMARNKGN 302

Query: 294 ECGI 297
            CGI
Sbjct: 303 HCGI 306



 Score = 41.1 bits (97), Expect = 2e-04
 Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 4/50 (8%)

Query: 64  YNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
             +P+     +   S  +  LP + D R K   C T  E++ QGSCG+ W
Sbjct: 79  LRVPSQWQRNITYKSNPNRILPDSVDWREK--GCVT--EVKYQGSCGAAW 124


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score = 52.9 bits (128), Expect = 2e-08
 Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD-ECGI 297
           HA+  +G+G  +    K+W+I NSW  +WG+ G   + R K+  CGI
Sbjct: 162 HAVLAVGYGIQKG--NKHWIIKNSWGENWGNKGYILMARNKNNACGI 206



 Score = 34.8 bits (81), Expect = 0.021
 Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 4/29 (13%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           P + D R K     T   +++QG CGSCW
Sbjct: 2   PDSVDYRKK--GYVT--PVKNQGQCGSCW 26


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score = 52.8 bits (128), Expect = 2e-08
 Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 10/50 (20%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE----CGI 297
           HA+  +G+G+       Y L+ NSW  +WG+ G  +I R        CG+
Sbjct: 159 HAVTAVGYGKT------YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGV 202



 Score = 34.0 bits (79), Expect = 0.034
 Identities = 11/29 (37%), Positives = 15/29 (51%), Gaps = 4/29 (13%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           P + D R K     T   +++Q  CGSCW
Sbjct: 2   PESIDWREK--GAVT--PVKNQNPCGSCW 26



 Score = 29.4 bits (67), Expect = 1.1
 Identities = 8/26 (30%), Positives = 15/26 (57%)

Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
           GC+GG+   + +Y V +G+ +   Y
Sbjct: 61 HGCDGGYQTTSLQYVVDNGVHTEREY 86


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score = 53.8 bits (130), Expect = 2e-08
 Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 6/50 (12%)

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD----ECGI 297
            A+  +G+G+     + Y LI NSW T WG+ G  +I R        CG+
Sbjct: 265 GAVTAVGYGKS--GGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 312



 Score = 38.0 bits (89), Expect = 0.002
 Identities = 17/37 (45%), Positives = 20/37 (54%), Gaps = 4/37 (10%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
            +E   +LP N D R K     T   +R QGSCGSCW
Sbjct: 100 INEDIVNLPENVDWRKK--GAVT--PVRHQGSCGSCW 132



 Score = 29.9 bits (68), Expect = 1.0
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 11  FGCNGGFPGMAWRYWVKSGIVSGGAY 36
            GC GG+P  A  Y  K+GI     Y
Sbjct: 167 HGCKGGYPPYALEYVAKNGIHLRSKY 192


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score = 52.3 bits (126), Expect = 3e-08
 Identities = 22/99 (22%), Positives = 35/99 (35%), Gaps = 8/99 (8%)

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
            G   VP     A+           +   +   F  +   I          H + I+G+ 
Sbjct: 106 DGYNGVPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQ 165

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE--CGI 297
            +      YW++ NSW   WG+ G  ++LR      CGI
Sbjct: 166 AN------YWIVRNSWGRYWGEKGYIRMLRVGGCGLCGI 198



 Score = 36.1 bits (84), Expect = 0.006
 Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 4/30 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP   D R K     T   +++QGSCGSCW
Sbjct: 1   LPEQIDWRKK--GAVT--PVKNQGSCGSCW 26


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score = 51.8 bits (125), Expect = 4e-08
 Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 14/67 (20%)

Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
           LY+SG        +   HA+ ++G+G +  + + +W++ NSW  +WG++G  +  R  + 
Sbjct: 145 LYRSGIFTGSCNISAN-HALTVVGYGTE--NDKDFWIVKNSWGKNWGESGYIRAERNIEN 201

Query: 295 ----CGI 297
               CGI
Sbjct: 202 PDGKCGI 208



 Score = 39.4 bits (93), Expect = 6e-04
 Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 4/32 (12%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +DLP + D R           +++QG CGSCW
Sbjct: 1   DDLPDSIDWREN--GAVV--PVKNQGGCGSCW 28


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score = 51.7 bits (125), Expect = 4e-08
 Identities = 21/67 (31%), Positives = 30/67 (44%), Gaps = 14/67 (20%)

Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD- 293
            Y SG        A   H + I+G+G    S + YW++ NSW  +WG+ G   + R    
Sbjct: 142 HYSSGIFTGPCGTAQN-HGVVIVGYGTQ--SGKNYWIVRNSWGQNWGNQGYIWMERNVAS 198

Query: 294 ---ECGI 297
               CGI
Sbjct: 199 SAGLCGI 205



 Score = 35.6 bits (83), Expect = 0.010
 Identities = 12/30 (40%), Positives = 16/30 (53%), Gaps = 4/30 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP+  D R+K         I++Q  CGSCW
Sbjct: 1   LPSFVDWRSK--GAVN--SIKNQKQCGSCW 26


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score = 51.8 bits (125), Expect = 5e-08
 Identities = 21/67 (31%), Positives = 30/67 (44%), Gaps = 13/67 (19%)

Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
            Y  G         L  H + I+G+G       KYW + NSW  +WG+ G  ++ RG  +
Sbjct: 146 FYSEGVFTGSCGTELD-HGVAIVGYGTTIDGT-KYWTVKNSWGPEWGEKGYIRMERGISD 203

Query: 295 ----CGI 297
               CGI
Sbjct: 204 KEGLCGI 210



 Score = 35.6 bits (83), Expect = 0.010
 Identities = 14/30 (46%), Positives = 18/30 (60%), Gaps = 4/30 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +PA+ D R K     T   ++DQG CGSCW
Sbjct: 2   VPASVDWRKK--GAVT--SVKDQGQCGSCW 27


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score = 51.0 bits (123), Expect = 7e-08
 Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 14/67 (20%)

Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD- 293
           LYKSG         L  HA+  +G+G      + Y +I NSW  +WG+ G  ++ R    
Sbjct: 143 LYKSGVFDGPCGTKLD-HAVTAVGYGTS--DGKNYIIIKNSWGPNWGEKGYMRLKRQSGN 199

Query: 294 ---ECGI 297
               CG+
Sbjct: 200 SQGTCGV 206



 Score = 34.0 bits (79), Expect = 0.039
 Identities = 7/12 (58%), Positives = 11/12 (91%)

Query: 102 EIRDQGSCGSCW 113
            +++QG+CGSCW
Sbjct: 15  PVKNQGACGSCW 26



 Score = 29.0 bits (66), Expect = 1.7
 Identities = 7/26 (26%), Positives = 14/26 (53%)

Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
          +GC GG+   + +Y   +G+ +   Y
Sbjct: 61 YGCKGGYQTTSLQYVANNGVHTSKVY 86


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 42.9 bits (100), Expect = 7e-05
 Identities = 15/83 (18%), Positives = 39/83 (46%), Gaps = 1/83 (1%)

Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS 265
           + G++      +K   +   ++   +   T  +  + Y + +    H ++I G  +D++ 
Sbjct: 272 LSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAYDNYETTDDHGMQIYGIAKDQE- 330

Query: 266 KEKYWLIANSWNTDWGDNGLFKI 288
             +Y+++ NSW T+   NG++  
Sbjct: 331 GNEYYMVKNSWGTNSKYNGIWYA 353



 Score = 27.1 bits (59), Expect = 7.8
 Identities = 4/11 (36%), Positives = 8/11 (72%)

Query: 103 IRDQGSCGSCW 113
           +++Q   G+CW
Sbjct: 25  VKNQNRAGTCW 35


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 36.8 bits (84), Expect = 0.009
 Identities = 34/251 (13%), Positives = 61/251 (24%), Gaps = 89/251 (35%)

Query: 22  WRYWVKSGIVSGGAYGSKQAE--KNSLSNIP----RAHLKSWMGVHP-DYNLPANRLPEL 74
           W  W             K     ++SL+ +     R      + V P   ++P   L  +
Sbjct: 344 WDNWKHVNC-------DKLTTIIESSLNVLEPAEYRKMFDR-LSVFPPSAHIPTILLSLI 395

Query: 75  IGYS--EVDEDLPANFDSRT---KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN- 128
                      +       +   K P   T                    I      +  
Sbjct: 396 WFDVIKSDVMVVVNKLHKYSLVEKQPKEST------------------ISI----PSIYL 433

Query: 129 GTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAK----SYSVSSNEKSIMKEIYEH 183
             +   +     H     R   ++Y++P   D +         Y             Y H
Sbjct: 434 ELKVKLENEYALH-----RSIVDHYNIPKTFDSDDLIPPYLDQY------------FYSH 476

Query: 184 --------GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
                      E   T+F  +  +   RF            ++  IR +++   A G+  
Sbjct: 477 IGHHLKNIEHPE-RMTLFRMV--FLDFRF------------LEQKIRHDSTAWNASGSIL 521

Query: 236 -VFDDLILYKS 245
                L  YK 
Sbjct: 522 NTLQQLKFYKP 532



 Score = 32.1 bits (72), Expect = 0.23
 Identities = 20/144 (13%), Positives = 42/144 (29%), Gaps = 40/144 (27%)

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYE--HGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
           + +  F          + S+M  +Y                  LY   + F   N +   
Sbjct: 88  RINYKFLMSPIKTEQRQPSMMTRMYIEQRDR------------LYNDNQVFAKYNVSRLQ 135

Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILY---KSGK-ALGGHAIRILGWGEDEKSKEK-- 268
             +K  +R    +L          ++++     SGK  +              K + K  
Sbjct: 136 PYLK--LRQALLELRPA------KNVLIDGVLGSGKTWVALDVCL------SYKVQCKMD 181

Query: 269 ---YWLIANSWNTDWGDNGLFKIL 289
              +WL   + N+      + ++L
Sbjct: 182 FKIFWLNLKNCNS---PETVLEML 202


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 33.2 bits (75), Expect = 0.11
 Identities = 10/37 (27%), Positives = 13/37 (35%), Gaps = 2/37 (5%)

Query: 252 HAIRILGWGED--EKSKEKYWLIANSWNTDWGDNGLF 286
           HA+      E   +      W + NSW  D G  G  
Sbjct: 371 HAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYL 407


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 32.0 bits (72), Expect = 0.26
 Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 252 HAIRILGWGEDEKSKE-KYWLIANSWNTDWGDNGLF 286
            A+ I G   DE SK    + + NSW  D G +GL+
Sbjct: 373 AAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLY 408


>2e1b_A PH0108, 216AA long hypothetical alanyl-tRNA synthetase;
           zinc-binding motif, trans-editing enzyme, structural
           genomics, NPPSFA; 2.70A {Pyrococcus horikoshii} SCOP:
           b.43.3.6 d.67.1.2
          Length = 216

 Score = 28.7 bits (65), Expect = 1.9
 Identities = 7/29 (24%), Positives = 10/29 (34%), Gaps = 9/29 (31%)

Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGV 305
           +    + G  K L+        SSI  G 
Sbjct: 189 DI--KEIGHIKKLK-------RSSIGRGK 208


>2dtg_E Insulin receptor; IR ectodomain, X-RAY crystallography, hormone
           receptor/immune system complex; 3.80A {Homo sapiens}
           SCOP: b.1.2.1 b.1.2.1 b.1.2.1 c.10.2.5 c.10.2.5 g.3.9.1
           PDB: 3loh_E
          Length = 897

 Score = 28.7 bits (63), Expect = 2.9
 Identities = 13/58 (22%), Positives = 16/58 (27%), Gaps = 4/58 (6%)

Query: 95  PNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY 152
             CP          C +   C+           N  R  C     H  KC+ EC   Y
Sbjct: 239 ETCPPPYYHFQDWRCVNFSFCQ----DLHHKCKNSRRQGCHQYVIHNNKCIPECPSGY 292



 Score = 27.2 bits (59), Expect = 9.3
 Identities = 9/64 (14%), Positives = 12/64 (18%), Gaps = 2/64 (3%)

Query: 87  NFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSC--DASKGHTPKC 144
             D+      CP   + +         G          H        C           C
Sbjct: 149 KDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCC 208

Query: 145 VREC 148
             EC
Sbjct: 209 HSEC 212


>1v4p_A Alanyl-tRNA synthetase; alanine-tRNA ligase, riken structural
           genomics/proteomics initiative RSGI, structural
           genomics; 1.45A {Pyrococcus horikoshii} SCOP: d.67.1.2
           PDB: 1wxo_A 1v7o_A 1wnu_A 3rhu_A 3rfn_A
          Length = 157

 Score = 27.6 bits (62), Expect = 3.4
 Identities = 6/29 (20%), Positives = 11/29 (37%), Gaps = 8/29 (27%)

Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGV 305
            T  G+ G  KI +      +    + G+
Sbjct: 123 TT--GEIGPIKIRK------VRFRKSKGL 143


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 28.5 bits (63), Expect = 4.0
 Identities = 35/207 (16%), Positives = 60/207 (28%), Gaps = 74/207 (35%)

Query: 139 GHTPKCVRECQENYDV--PYKKDL-NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195
           G+T     E ++ Y        DL  F A++ S         ++++  G          +
Sbjct: 164 GNTDDYFEELRDLYQTYHVLVGDLIKFSAETLSELIRTTLDAEKVFTQG-----L----N 214

Query: 196 LILYKSGRFFVPGNE---TTAMS--LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALG 250
           ++ +       P  +   +  +S  LI         QL     + V          K LG
Sbjct: 215 ILEWLENPSNTPDKDYLLSIPISCPLIGVI------QL---AHYVVT--------AKLLG 257

Query: 251 ---GHAIRIL----GWGEDEKSKEKYWLI-------ANSWNTDWGDNG------LFKI-L 289
              G     L    G  +         L+        +SW   +  +       LF I +
Sbjct: 258 FTPGELRSYLKGATGHSQG--------LVTAVAIAETDSWE-SFFVSVRKAITVLFFIGV 308

Query: 290 RGKD---ECGIESSITA-------GVP 306
           R  +      +  SI         GVP
Sbjct: 309 RCYEAYPNTSLPPSILEDSLENNEGVP 335


>2ztg_A Alanyl-tRNA synthetase; class-II aminoacyl-tRNA synthetase,
           aminoacyl-tRNA synthetase, ATP-binding, cytoplasm,
           ligase; HET: A5A; 2.20A {Archaeoglobus fulgidus}
          Length = 739

 Score = 28.0 bits (63), Expect = 5.0
 Identities = 10/29 (34%), Positives = 14/29 (48%), Gaps = 9/29 (31%)

Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGV 305
           +T  G+ G+ KIL+         SI  GV
Sbjct: 710 ST--GEIGMLKILK-------VESIQDGV 729


>3fe4_A Carbonic anhydrase 6; secretion, metal binding, structural GEN
           structural genomics consortium, SGC, glycoprotein,
           lyase, M binding, secreted; 1.90A {Homo sapiens}
          Length = 278

 Score = 27.4 bits (61), Expect = 5.9
 Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 113 WGCRPYEIAPCEHHVNGTR 131
           WG    EI+  EH V+G R
Sbjct: 95  WGGASSEISGSEHTVDGIR 113


>3hh2_C Follistatin; protein-protein complex, TB domain, cystine knot
           motif, TGF- fold, disulfide linked dimer, CLE PAIR of
           basic residues, cytokine; HET: CIT; 2.15A {Homo sapiens}
           PDB: 2b0u_C* 2p6a_D
          Length = 288

 Score = 27.3 bits (59), Expect = 6.0
 Identities = 13/65 (20%), Positives = 23/65 (35%), Gaps = 6/65 (9%)

Query: 95  PNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDV 154
            +    + +   G   +C  C+      CE+   G    C  +K + P+CV  C  +   
Sbjct: 42  NDNTLFKWMIFNGGAPNCIPCK----ETCENVDCGPGKKCRMNKKNKPRCV--CAPDCSN 95

Query: 155 PYKKD 159
              K 
Sbjct: 96  ITWKG 100


>3k1f_M Transcription initiation factor IIB; RNA polymerase II, TFIIB,
           transcription factor, DNA-binding, DNA-directed RNA
           polymerase; 4.30A {Saccharomyces cerevisiae}
          Length = 197

 Score = 26.9 bits (59), Expect = 6.9
 Identities = 9/42 (21%), Positives = 12/42 (28%), Gaps = 6/42 (14%)

Query: 77  YSEVDEDLPANFDSRTKWPNC----PTIREIRDQGS--CGSC 112
             +       N +     P C    P I E   +G   C  C
Sbjct: 7   IDKRAGRRGPNLNIVLTCPECKVYPPKIVERFSEGDVVCALC 48


>2hr7_A Insulin receptor; hormone receptor, leucine rich repeat,
           transferase; HET: NAG BMA MAN FUC P33; 2.32A {Homo
           sapiens}
          Length = 486

 Score = 27.1 bits (59), Expect = 8.2
 Identities = 13/58 (22%), Positives = 16/58 (27%), Gaps = 4/58 (6%)

Query: 95  PNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY 152
             CP          C +   C+           N  R  C     H  KC+ EC   Y
Sbjct: 239 ETCPPPYYHFQDWRCVNFSFCQD----LHHKCKNSRRQGCHQYVIHNNKCIPECPSGY 292


>2f68_X Collagen adhesin; beta barrel, domain SWAP, cell adhesion; 1.95A
           {Staphylococcus aureus} PDB: 2f6a_A 1amx_A
          Length = 313

 Score = 26.7 bits (58), Expect = 9.1
 Identities = 18/130 (13%), Positives = 38/130 (29%), Gaps = 15/130 (11%)

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-------EHGPVEGAFTVFDDLILYKSGRFFVPGN 209
                F  +  +++    S  K           +  V  +      +  YK+G   +P +
Sbjct: 105 SGFAEFEVQGRNLTQTNTSDDKVATITSGNKSTNVTVHKSEAGTSSVFYYKTGDM-LPED 163

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKY 269
            T     ++W +  N  +       T+ D +   + G+ L    + I   G         
Sbjct: 164 TTH----VRWFLNINNEKSYVSKDITIKDQI---QGGQQLDLSTLNINVTGTHSNYYSGQ 216

Query: 270 WLIANSWNTD 279
             I +     
Sbjct: 217 SAITDFEKAF 226


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.317    0.137    0.447 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,958,293
Number of extensions: 291432
Number of successful extensions: 834
Number of sequences better than 10.0: 1
Number of HSP's gapped: 798
Number of HSP's successfully gapped: 163
Length of query: 309
Length of database: 6,701,793
Length adjustment: 93
Effective length of query: 216
Effective length of database: 4,105,140
Effective search space: 886710240
Effective search space used: 886710240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.5 bits)