RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy5095
         (192 letters)



>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  160 bits (407), Expect = 3e-50
 Identities = 60/178 (33%), Positives = 96/178 (53%), Gaps = 8/178 (4%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA-GLEAEADYPFRNQN 60
           +E Q+ +  GTLL LS+ +L++C+  ++ C GG  + A   +K+  GLE E DY ++   
Sbjct: 34  VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 90

Query: 61  GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
           G    C + A K KV + D + +          L   GP+   +N   +Q Y +G     
Sbjct: 91  GHMQSCQFSAEKAKVYIQDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 150

Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
             +C    ++HAV++VGYG R  VP W ++NSWG  WG + GY+ + RG+ ACG+ + 
Sbjct: 151 RPLCSPWLIDHAVLLVGYGQRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNTM 207


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  151 bits (385), Expect = 8e-47
 Identities = 47/179 (26%), Positives = 85/179 (47%), Gaps = 9/179 (5%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA---GLEAEADYPFRN 58
           +E Q+ +    L  LS+  L+ C+  + GC GG  N A +++       +  E  YP+ +
Sbjct: 34  VECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYAS 93

Query: 59  QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
             G++  C      V   ++  + +          L   GP+   ++ +    Y G ++ 
Sbjct: 94  GEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVM- 152

Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
               C SE L+H V++VGY     VP WI++NSW  +WG ++GY  + +G+N C ++  
Sbjct: 153 --TSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWG-EEGYIRIAKGSNQCLVKEE 208


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  145 bits (368), Expect = 3e-43
 Identities = 46/181 (25%), Positives = 73/181 (40%), Gaps = 14/181 (7%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
            ES Y       L L++ +L++C     GC G    + I+Y++H G+  E+ Y +     
Sbjct: 123 TESAYLAYRDQSLDLAEQELVDCA-SQHGCHGDTIPRGIEYIQHNGVVQESYYRYV---A 178

Query: 62  VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYH-YGPLVAGMNGALLQD---YNGKL 115
               C     +    +S++  +    ++  R  L   +  +   +    L     Y+G+ 
Sbjct: 179 REQSCRRPNAQR-FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGR- 236

Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
                    +   HAV IVGY     V  WIVRNSW   WG D+GY       +   IE 
Sbjct: 237 TIIQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWG-DNGYGYFAANIDLMMIEE 295

Query: 175 Y 175
           Y
Sbjct: 296 Y 296


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  143 bits (362), Expect = 3e-43
 Identities = 46/181 (25%), Positives = 73/181 (40%), Gaps = 14/181 (7%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
            ES Y       L L++ +L++C     GC G    + I+Y++H G+  E+ Y +     
Sbjct: 43  TESAYLAYRQQSLDLAEQELVDCA-SQHGCHGDTIPRGIEYIQHNGVVQESYYRYV---A 98

Query: 62  VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYH-YGPLVAGMNGALLQD---YNGKL 115
               C     +    +S++  +    ++  R  L   +  +   +    L     Y+G+ 
Sbjct: 99  REQSCRRPNAQR-FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGR- 156

Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
                    +   HAV IVGY     V  WIVRNSW   WG D+GY       +   IE 
Sbjct: 157 TIIQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWG-DNGYGYFAANIDLMMIEE 215

Query: 175 Y 175
           Y
Sbjct: 216 Y 216


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  140 bits (354), Expect = 4e-40
 Identities = 57/192 (29%), Positives = 85/192 (44%), Gaps = 21/192 (10%)

Query: 1   MLESQYAIKHGTLL--PLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFR 57
           MLE++  I         LS  +++ C+ Y QGC+GG  +  A +Y +  GL  EA +P+ 
Sbjct: 242 MLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYT 301

Query: 58  NQNGVTGRCAYDARKVKVRVSDF------LVFNGSDTFRRMLYHYGPLVAGMN-GALLQD 110
              G    C       +   S++               +  L H+GP+            
Sbjct: 302 ---GTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLH 358

Query: 111 YNG----KLIRKNDVCPSENLNHAVVIVGYGMRHQ--VPVWIVRNSWG-RWGPDDGYFTV 163
           Y          ++   P E  NHAV++VGYG      +  WIV+NSWG  WG ++GYF +
Sbjct: 359 YKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWG-ENGYFRI 417

Query: 164 ERGTNACGIESY 175
            RGT+ C IES 
Sbjct: 418 RRGTDECAIESI 429


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score =  135 bits (342), Expect = 6e-40
 Identities = 53/188 (28%), Positives = 94/188 (50%), Gaps = 18/188 (9%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
           +E+ +AI  G L+ LS+ +LI+C   ++GC  G   ++ ++ +KH G+ +EADYP++   
Sbjct: 35  IEAAHAIATGNLVSLSEQELIDCVDESEGCYNGWHYQSFEWVVKHGGIASEADYPYK--- 91

Query: 61  GVTGRCAYDARKVKVRVSDFLVFN--------GSDTFRRMLYHYGPLVAGMNGALLQDYN 112
              G+C  +  + KV + ++ V           +++  +      P+   ++      Y+
Sbjct: 92  ARDGKCKANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVLEQPISVSIDAKDFHFYS 151

Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA-- 169
           G +    +      +NH V+IVGYG    V  WI +NSWG  WG  DGY  ++R T    
Sbjct: 152 GGIYDGGNCSSPYGINHFVLIVGYGSEDGVDYWIAKNSWGEDWG-IDGYIRIQRNTGNLL 210

Query: 170 --CGIESY 175
             CG+  +
Sbjct: 211 GVCGMNYF 218


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  134 bits (340), Expect = 6e-40
 Identities = 61/182 (33%), Positives = 92/182 (50%), Gaps = 12/182 (6%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
           LES  AI  G +L L++ QL++C  N  N GCQGG  ++A +Y++ + G+  E  YP++ 
Sbjct: 35  LESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYK- 93

Query: 59  QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN-GALLQDY-NGK 114
             G    C +   K    V D   +  N  +     +  Y P+            Y  G 
Sbjct: 94  --GQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGI 151

Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
               +     + +NHAV+ VGYG  + +P WIV+NSWG +WG  +GYF +ERG N CG+ 
Sbjct: 152 YSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKNMCGLA 210

Query: 174 SY 175
           + 
Sbjct: 211 AC 212


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  137 bits (347), Expect = 6e-40
 Identities = 61/182 (33%), Positives = 89/182 (48%), Gaps = 13/182 (7%)

Query: 2   LESQYAIKHGTLL--PLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
           +ESQ  I +G      +S+ QL++C     GC GG  N A  Y+  + G+++E  YP+  
Sbjct: 149 IESQMKIANGAGYDSSVSEQQLVDCVPNALGCSGGWMNDAFTYVAQNGGIDSEGAYPYE- 207

Query: 59  QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMN-GALLQDYNGKL 115
                G C YD  +V  R+S ++  +G D      M+   GP+    +       Y+G  
Sbjct: 208 --MADGNCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDADDPFGSYSGG- 264

Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIE 173
           +  N  C +    HAV+IVGYG  +    W+V+NSWG  WG  DGYF + R   N CGI 
Sbjct: 265 VYYNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWG-LDGYFKIARNANNHCGIA 323

Query: 174 SY 175
             
Sbjct: 324 GV 325


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score =  136 bits (345), Expect = 1e-39
 Identities = 61/182 (33%), Positives = 92/182 (50%), Gaps = 14/182 (7%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
           LE Q   K G LL LS   L++C   N GC GG    A QY++ + G+++E  YP+    
Sbjct: 133 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV--- 189

Query: 61  GVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGAL--LQDY-NGKL 115
           G    C Y+      +   +      +    +R +   GP+   ++ +L   Q Y  G  
Sbjct: 190 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKG-- 247

Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIE 173
           +  ++ C S+NLNHAV+ VGYG++     WI++NSWG  WG + GY  + R   NACGI 
Sbjct: 248 VYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGIA 306

Query: 174 SY 175
           + 
Sbjct: 307 NL 308


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  135 bits (342), Expect = 3e-39
 Identities = 55/181 (30%), Positives = 92/181 (50%), Gaps = 12/181 (6%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
           +E Q A++ G L  LS+  LI+C  +  N GC GG  + A  Y+   G+ +E+ YP+   
Sbjct: 148 VEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIHDYGIMSESAYPYE-- 205

Query: 60  NGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
                 C +D+ +    +S +  + +G  ++    +   GP+   ++    LQ Y+G  +
Sbjct: 206 -AQGDYCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAIDATDELQFYSGG-L 263

Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIES 174
             +  C   +LNH V++VGYG  +    WI++NSWG  WG + GY+   R   N CGI +
Sbjct: 264 FYDQTCNQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWG-ESGYWRQVRNYGNNCGIAT 322

Query: 175 Y 175
            
Sbjct: 323 A 323


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  134 bits (340), Expect = 5e-39
 Identities = 63/181 (34%), Positives = 90/181 (49%), Gaps = 12/181 (6%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
           +E QY     T +  S+ QL++C     N GC GG    A QYLK  GLE E+ YP+   
Sbjct: 125 MEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLKQFGLETESSYPYT-- 182

Query: 60  NGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
             V G+C Y+ +    +V+ F  V +GS    + ++   GP    ++  +    Y    I
Sbjct: 183 -AVEGQCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYRSG-I 240

Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIES 174
            ++  C    +NHAV+ VGYG +     WIV+NSWG  WG + GY  + R   N CGI S
Sbjct: 241 YQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWG-ERGYIRMVRNRGNMCGIAS 299

Query: 175 Y 175
            
Sbjct: 300 L 300


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score =  134 bits (339), Expect = 7e-39
 Identities = 54/186 (29%), Positives = 85/186 (45%), Gaps = 17/186 (9%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
           LE Q   K G L+ LS+  L++C     N+GC GG  + A QY++ + GL++E  YP+  
Sbjct: 130 LEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYE- 188

Query: 59  QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGAL--LQDYNGKL 115
                  C Y+ +      + F+ +        + +   GP+   ++        Y    
Sbjct: 189 --ATEESCKYNPKYSVANDAGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEG- 245

Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPV----WIVRNSWG-RWGPDDGYFTVERGT-NA 169
           I     C SE+++H V++VGYG           W+V+NSWG  WG   GY  + +   N 
Sbjct: 246 IYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWG-MGGYVKMAKDRRNH 304

Query: 170 CGIESY 175
           CGI S 
Sbjct: 305 CGIASA 310


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score =  131 bits (331), Expect = 1e-38
 Identities = 62/182 (34%), Positives = 94/182 (51%), Gaps = 14/182 (7%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
           LE Q   K G LL LS   L++C   N GC GG    A QY++ + G+++E  YP+    
Sbjct: 34  LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV--- 90

Query: 61  GVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGAL--LQDY-NGKL 115
           G    C Y+      +   +  +  G+    +R +   GP+   ++ +L   Q Y  G  
Sbjct: 91  GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKG-- 148

Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIE 173
           +  ++ C S+NLNHAV+ VGYG++     WI++NSWG  WG + GY  + R   NACGI 
Sbjct: 149 VYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGIA 207

Query: 174 SY 175
           + 
Sbjct: 208 NL 209


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score =  129 bits (326), Expect = 1e-37
 Identities = 53/191 (27%), Positives = 82/191 (42%), Gaps = 24/191 (12%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
           +ESQYAI+   L   S+ +L++C++ N GC GG    A    +   GL ++ DYP+ +  
Sbjct: 53  VESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNL 112

Query: 61  GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKN 119
                C       +  +  + V    D F+  L + GP+   +        Y G      
Sbjct: 113 P--ETCNLKRCNERYTIKSY-VSIPDDKFKEALRYLGPISISIAASDDFAFYRGGFYDGE 169

Query: 120 DVCPSENLNHAVVIVGYGM----------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
             C     NHAV++VGYGM            +   +I++NSWG  WG + GY  +E   N
Sbjct: 170 --C-GAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWG-EGGYINLETDEN 225

Query: 169 A----CGIESY 175
                C I + 
Sbjct: 226 GYKKTCSIGTE 236


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score =  128 bits (324), Expect = 1e-37
 Identities = 55/187 (29%), Positives = 86/187 (45%), Gaps = 19/187 (10%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
           LE Q   K G L+ LS+  L++C     N+GC GG  + A QY++ + GL++E  YP+  
Sbjct: 34  LEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYE- 92

Query: 59  QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGAL--LQDY-NGK 114
                  C Y+ +      + F+ +        + +   GP+   ++        Y  G 
Sbjct: 93  --ATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEG- 149

Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQ----VPVWIVRNSWG-RWGPDDGYFTVERGT-N 168
            I     C SE+++H V++VGYG           W+V+NSWG  WG   GY  + +   N
Sbjct: 150 -IYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWG-MGGYVKMAKDRRN 207

Query: 169 ACGIESY 175
            CGI S 
Sbjct: 208 HCGIASA 214


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score =  127 bits (322), Expect = 4e-37
 Identities = 51/188 (27%), Positives = 90/188 (47%), Gaps = 24/188 (12%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
           +ESQYAI+   L+ LS+ +L++C+  N GC GG  N A +  ++  G+  + DYP+ +  
Sbjct: 51  VESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICPDGDYPYVSDA 110

Query: 61  GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKN 119
                C  D    K  + ++ +    +  +  L   GP+   +        Y   +   +
Sbjct: 111 P--NLCNIDRCTEKYGIKNY-LSVPDNKLKEALRFLGPISISVAVSDDFAFYKEGIF--D 165

Query: 120 DVCPSENLNHAVVIVGYGMRHQVPV----------WIVRNSWG-RWGPDDGYFTVERGTN 168
             C  + LNHAV++VG+GM+  V            +I++NSWG +WG + G+  +E   +
Sbjct: 166 GEC-GDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWG-ERGFINIETDES 223

Query: 169 A----CGI 172
                CG+
Sbjct: 224 GLMRKCGL 231


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score =  129 bits (327), Expect = 4e-37
 Identities = 56/184 (30%), Positives = 96/184 (52%), Gaps = 16/184 (8%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC---NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFR 57
           LE+Q  +K G L+ LS   L++C      N+GC GG    A QY+  + G++++A YP++
Sbjct: 132 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYK 191

Query: 58  NQNGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGAL--LQDYNG 113
               +  +C YD++      S +  +  G  D  +  + + GP+  G++        Y  
Sbjct: 192 ---AMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRS 248

Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACG 171
             +     C ++N+NH V++VGYG  +    W+V+NSWG  +G ++GY  + R   N CG
Sbjct: 249 G-VYYEPSC-TQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFG-EEGYIRMARNKGNHCG 305

Query: 172 IESY 175
           I S+
Sbjct: 306 IASF 309


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score =  124 bits (314), Expect = 4e-36
 Identities = 57/185 (30%), Positives = 98/185 (52%), Gaps = 18/185 (9%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC---NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFR 57
           LE+Q  +K G L+ LS   L++C      N+GC GG    A QY + + G++++A YP++
Sbjct: 35  LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYK 94

Query: 58  NQNGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGAL--LQDY-N 112
               +  +C YD++      S +  +  G  D  +  + + GP+  G++        Y +
Sbjct: 95  ---AMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRS 151

Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NAC 170
           G  +     C ++N+NH V++VGYG  +    W+V+NSWG  +G ++GY  + R   N C
Sbjct: 152 G--VYYEPSC-TQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFG-EEGYIRMARNKGNHC 207

Query: 171 GIESY 175
           GI S+
Sbjct: 208 GIASF 212


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score =  123 bits (310), Expect = 1e-35
 Identities = 56/181 (30%), Positives = 86/181 (47%), Gaps = 20/181 (11%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
           +ES   I+ G L+ LS+ +L++C+  N GC GG F  A QY + + G++ +A+YP++   
Sbjct: 34  VESINQIRTGNLISLSEQELVDCDKKNHGCLGGAFVFAYQYIINNGGIDTQANYPYK--- 90

Query: 61  GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIR 117
            V G C   ++ V   +  +  V   ++   +      P    ++   A  Q Y+  +  
Sbjct: 91  AVQGPCQAASKVVS--IDGYNGVPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIF- 147

Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVER--GTNACGIES 174
                    LNH V IVGY   +    WIVRNSWG  WG + GY  + R  G   CGI  
Sbjct: 148 --SGPCGTKLNHGVTIVGYQANY----WIVRNSWGRYWG-EKGYIRMLRVGGCGLCGIAR 200

Query: 175 Y 175
            
Sbjct: 201 L 201


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score =  123 bits (311), Expect = 1e-35
 Identities = 44/202 (21%), Positives = 74/202 (36%), Gaps = 54/202 (26%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
           +E    I+ G L+ LS+ +L++C   + GC+GG    A++Y+   G+   + YP++    
Sbjct: 34  VEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYK---A 90

Query: 62  VTGRCAYDARKVKV-RVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
             G C        + + S                +               F   LY  G 
Sbjct: 91  KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPF--QLYKGG- 147

Query: 99  LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
            +          + G        C    ++HAV  VGYG        +++NSWG  WG +
Sbjct: 148 -I----------FEGP-------C-GTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWG-E 187

Query: 158 DGYFTVERGTNA----CGIESY 175
            GY  ++R        CG+   
Sbjct: 188 KGYIRIKRAPGNSPGVCGLYKS 209


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score =  122 bits (310), Expect = 2e-35
 Identities = 52/185 (28%), Positives = 85/185 (45%), Gaps = 24/185 (12%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
           +E    I  G LL LS+ +L++C   + GC+GG    A+QY+ ++G+     YP+    G
Sbjct: 34  VEGINKIVTGQLLSLSEQELLDCERRSYGCRGGFPLYALQYVANSGIHLRQYYPYE---G 90

Query: 62  VTGRCAYDARK-VKVRVSDFLVFNGSDTFRRMLYH---YGPLVAGMN--GALLQDYNGKL 115
           V  +C     K  KV+         ++   + L       P+   +   G   Q+Y G +
Sbjct: 91  VQRQCRASQAKGPKVKTDGVGRVPRNNE--QALIQRIAIQPVSIVVEAKGRAFQNYRGGI 148

Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA----C 170
                 C   +++HAV  VGYG        +++NSWG  WG + GY  ++RG+      C
Sbjct: 149 F--AGPC-GTSIDHAVAAVGYG----NDYILIKNSWGTGWG-EGGYIRIKRGSGNPQGAC 200

Query: 171 GIESY 175
           G+ S 
Sbjct: 201 GVLSD 205


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score =  122 bits (309), Expect = 2e-35
 Identities = 57/202 (28%), Positives = 81/202 (40%), Gaps = 54/202 (26%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
           +E    I  G L+ LS+ QL++C   N GC+GG  N A Q+ + + G+ +E  YP+R   
Sbjct: 36  VEGINQIVTGDLISLSEQQLVDCTTANHGCRGGWMNPAFQFIVNNGGINSEETYPYR--- 92

Query: 61  GVTGRCAYDARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
           G  G C        V +  +              V N             F   LY  G 
Sbjct: 93  GQDGICNSTVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDF--QLYRSG- 149

Query: 99  LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
            +          + G        C + + NHA+ +VGYG  +    WIV+NSWG  WG +
Sbjct: 150 -I----------FTGS-------C-NISANHALTVVGYGTENDKDFWIVKNSWGKNWG-E 189

Query: 158 DGYFTVERGTNA----CGIESY 175
            GY   ER        CGI  +
Sbjct: 190 SGYIRAERNIENPDGKCGITRF 211


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  124 bits (314), Expect = 4e-35
 Identities = 50/206 (24%), Positives = 78/206 (37%), Gaps = 38/206 (18%)

Query: 1   MLESQYAIKHGT-LLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFR- 57
            +  ++    G   + +S   L+ C +    GC GG  ++A  Y    GL ++   P+  
Sbjct: 108 AMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPF 167

Query: 58  -----------------NQNGVTGRCAYDARKVKVRVSDFLVF-----NGSDTFRRMLYH 95
                              N  T +C Y      + V ++  +      G D + R L+ 
Sbjct: 168 PHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFF 227

Query: 96  YGPLVAGMN-GALLQDYNGKLIRK----NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNS 150
            GP     +       Y           + V       HAV +VG+G  + VP W + NS
Sbjct: 228 RGPFEVAFDVYEDFIAY------NSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 281

Query: 151 WGR-WGPDDGYFTVERGTNACGIESY 175
           W   WG  DGYF + RG++ CGIE  
Sbjct: 282 WNTEWG-MDGYFLIRRGSSECGIEDG 306


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score =  120 bits (304), Expect = 1e-34
 Identities = 50/202 (24%), Positives = 80/202 (39%), Gaps = 58/202 (28%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
           +E    I  G L+ LS+ +L++C   + GC GG    ++QY+   G+  E +YP+     
Sbjct: 34  IEGINKIITGQLISLSEQELLDCERRSHGCDGGYQTTSLQYVVDNGVHTEREYPYE---K 90

Query: 62  VTGRCAYDARK-VKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
             GRC    +K  KV ++ +              + N             F    Y  G 
Sbjct: 91  KQGRCRAKDKKGPKVYITGYKYVPANDEISLIQAIANQPVSVVTDSRGRGF--QFYKGG- 147

Query: 99  LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
            +          Y G        C   N +HAV  VGYG  +     +++NSWG  WG +
Sbjct: 148 -I----------YEGP-------C-GTNTDHAVTAVGYGKTY----LLLKNSWGPNWG-E 183

Query: 158 DGYFTVERGT----NACGIESY 175
            GY  ++R +      CG+ + 
Sbjct: 184 KGYIRIKRASGRSKGTCGVYTS 205


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score =  123 bits (311), Expect = 1e-34
 Identities = 43/202 (21%), Positives = 73/202 (36%), Gaps = 54/202 (26%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
           +E    I+ G L+ LS+ +L++C   + GC+GG    A++Y+   G+   + YP++    
Sbjct: 140 VEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYK---A 196

Query: 62  VTGRCAYDARKVKV-RVSDFL-------------VFNG---------SDTFRRMLYHYGP 98
             G C        + + S                +               F   LY  G 
Sbjct: 197 KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPF--QLYKGG- 253

Query: 99  LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
            +          + G        C    ++ AV  VGYG        +++NSWG  WG +
Sbjct: 254 -I----------FEGP-------C-GTKVDGAVTAVGYGKSGGKGYILIKNSWGTAWG-E 293

Query: 158 DGYFTVERGTNA----CGIESY 175
            GY  ++R        CG+   
Sbjct: 294 KGYIRIKRAPGNSPGVCGLYKS 315


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score =  120 bits (304), Expect = 1e-34
 Identities = 51/202 (25%), Positives = 83/202 (41%), Gaps = 54/202 (26%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
           +E    I  G LL LS+ +L++C+ ++ GC+GG    ++QY+ + G+     YP++    
Sbjct: 34  VEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVANNGVHTSKVYPYQ---A 90

Query: 62  VTGRC-AYDARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
              +C A D    KV+++ +              + N             F   LY  G 
Sbjct: 91  KQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPF--QLYKSG- 147

Query: 99  LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
            V          ++G        C    L+HAV  VGYG        I++NSWG  WG +
Sbjct: 148 -V----------FDGP-------C-GTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWG-E 187

Query: 158 DGYFTVERGTNA----CGIESY 175
            GY  ++R +      CG+   
Sbjct: 188 KGYMRLKRQSGNSQGTCGVYKS 209


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score =  120 bits (303), Expect = 1e-34
 Identities = 46/202 (22%), Positives = 74/202 (36%), Gaps = 58/202 (28%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
           +E    I+ G L   S+ +L++C+  + GC GG    A+Q +   G+     YP+    G
Sbjct: 34  IEGIIKIRTGNLNQYSEQELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYE---G 90

Query: 62  VTGRCAY-DARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
           V   C   +      +                  + N             F   LY  G 
Sbjct: 91  VQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDF--QLYRGG- 147

Query: 99  LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
            +          + G        C    ++HAV  VGYG        +++NSWG  WG +
Sbjct: 148 -I----------FVGP-------C-GNKVDHAVAAVGYG----PNYILIKNSWGTGWG-E 183

Query: 158 DGYFTVERGTNA----CGIESY 175
           +GY  ++RGT      CG+ + 
Sbjct: 184 NGYIRIKRGTGNSYGVCGLYTS 205


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  121 bits (306), Expect = 2e-34
 Identities = 47/212 (22%), Positives = 79/212 (37%), Gaps = 42/212 (19%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKA-IQYLKHAG-LEAEADYP-- 55
           LE+   +K      +S   +  C    +   C  G      +Q ++  G L AE++YP  
Sbjct: 43  LETIRCMKGYEPTKISALYVANCYKGEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYN 102

Query: 56  -------------FRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDTFR-----------R 91
                                G+  ++  +             S+ F             
Sbjct: 103 YVKVGEQCPKVEDHWMNLWDNGKILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKT 162

Query: 92  MLYHYGPLVAGMN--GALLQDYNGKLIRKNDVCPSENLNHAVVIVGYG-----MRHQVPV 144
            + + G ++A +     +  +++GK +   ++C  +  +HAV IVGYG        +   
Sbjct: 163 EVMNKGSVIAYIKAENVMGYEFSGKKV--KNLCGDDTADHAVNIVGYGNYVNSEGEKKSY 220

Query: 145 WIVRNSWG-RWGPDDGYFTVER-GTNACGIES 174
           WIVRNSWG  WG D+GYF V+  G   C    
Sbjct: 221 WIVRNSWGPYWG-DEGYFKVDMYGPTHCHFNF 251


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score =  117 bits (296), Expect = 2e-33
 Identities = 58/202 (28%), Positives = 84/202 (41%), Gaps = 55/202 (27%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
           +ES   I+ G L+ LS+ +L++C+  + GC GG  N A QY+  + G++ + +YP+    
Sbjct: 34  VESINKIRTGQLISLSEQELVDCDTASHGCNGGWMNNAFQYIITNGGIDTQQNYPYS--- 90

Query: 61  GVTGRCAYDARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
            V G C     +V V ++ F              V +             F    Y  G 
Sbjct: 91  AVQGSCKPYRLRV-VSINGFQRVTRNNESALQSAVASQPVSVTVEAAGAPF--QHYSSG- 146

Query: 99  LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
            +          + G        C     NH VVIVGYG +     WIVRNSWG  WG +
Sbjct: 147 -I----------FTGP-------C-GTAQNHGVVIVGYGTQSGKNYWIVRNSWGQNWG-N 186

Query: 158 DGYFTVERGTNA----CGIESY 175
            GY  +ER   +    CGI   
Sbjct: 187 QGYIWMERNVASSAGLCGIAQL 208


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score =  118 bits (297), Expect = 2e-33
 Identities = 52/205 (25%), Positives = 78/205 (38%), Gaps = 57/205 (27%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
           LE  +  K G L+ LS+ +L++C+    NQ C GG  N A QY L   G+ +E  YP+  
Sbjct: 40  LEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYL- 98

Query: 59  QNGVTGRCAYDARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHY 96
                  C   + +  V++  F              +               F    YH 
Sbjct: 99  --ARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPF--QFYHE 154

Query: 97  GPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGM--RHQVPVWIVRNSWG-R 153
           G  V          ++         C   +L+H V++VGYG     +   WI++NSWG  
Sbjct: 155 G--V----------FDAS-------C-GTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTG 194

Query: 154 WGPDDGYFTVERGT---NACGIESY 175
           WG  DGY  +         CG+   
Sbjct: 195 WG-RDGYMYMAMHKGEEGQCGLLLD 218


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  118 bits (299), Expect = 3e-33
 Identities = 44/197 (22%), Positives = 72/197 (36%), Gaps = 23/197 (11%)

Query: 1   MLESQYAIKHGTLLP---LSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFR 57
            +  +  IK     P   LS   +I+C      C+GG       Y    G+  E    ++
Sbjct: 74  AMADRINIKRKGAWPSTLLSVQNVIDCG-NAGSCEGGNDLSVWDYAHQHGIPDETCNNYQ 132

Query: 58  NQNG------------VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN- 104
            ++                 C         RV D+   +G +     +Y  GP+  G+  
Sbjct: 133 AKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMA 192

Query: 105 GALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTV 163
              L +Y G +    +   +  +NH V + G+G+      WIVRNSWG  WG + G+  +
Sbjct: 193 TERLANYTGGIYA--EYQDTTYINHVVSVAGWGISDGTEYWIVRNSWGEPWG-ERGWLRI 249

Query: 164 ERGTNACGIESYG--GI 178
              T   G  +     I
Sbjct: 250 VTSTYKDGKGARYNLAI 266


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score =  114 bits (287), Expect = 5e-32
 Identities = 51/204 (25%), Positives = 79/204 (38%), Gaps = 56/204 (27%)

Query: 2   LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
           +E    I  G L+ LS+ +L++C      +GC GG      Q++  + G+  EA+YP+  
Sbjct: 34  VEGINKIATGDLISLSEQELVDCGRTQNTRGCDGGFMTDGFQFIINNGGINTEANYPYT- 92

Query: 59  QNGVTGRCAYDARKVKV-RVSDF-------------LVFNG---------SDTFRRMLYH 95
                G+C  D ++ K   +  +              V               F    Y 
Sbjct: 93  --AEEGQCNLDLQQEKYVSIDTYENVPYNNEWALQTAVAYQPVSVALEAAGYNF--QHYS 148

Query: 96  YGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RW 154
            G  +          + G        C    ++HAV IVGYG    +  WIV+NSWG  W
Sbjct: 149 SG--I----------FTGP-------C-GTAVDHAVTIVGYGTEGGIDYWIVKNSWGTTW 188

Query: 155 GPDDGYFTVERG---TNACGIESY 175
           G ++GY  ++R       CGI   
Sbjct: 189 G-EEGYMRIQRNVGGVGQCGIAKK 211


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score =  114 bits (287), Expect = 6e-32
 Identities = 55/205 (26%), Positives = 75/205 (36%), Gaps = 57/205 (27%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQ 59
           +E    IK   L+ LS+ +L++C+   NQGC GG  + A +++K   G+  EA+YP+   
Sbjct: 35  VEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYE-- 92

Query: 60  NGVTGRCAYDARKVKV-RVSDF-------------LVFNG---------SDTFRRMLYHY 96
               G C           +                 V N             F    Y  
Sbjct: 93  -AYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDF--QFYSE 149

Query: 97  GPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGM-RHQVPVWIVRNSWG-RW 154
           G  V          + G        C    L+H V IVGYG        W V+NSWG  W
Sbjct: 150 G--V----------FTGS-------C-GTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEW 189

Query: 155 GPDDGYFTVERGT----NACGIESY 175
           G + GY  +ERG       CGI   
Sbjct: 190 G-EKGYIRMERGISDKEGLCGIAME 213


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score =  113 bits (285), Expect = 8e-32
 Identities = 51/185 (27%), Positives = 87/185 (47%), Gaps = 16/185 (8%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
           +E   AI  G L+ +S+ Q+++C+       GG  + A +++  + G+ ++A+YP+    
Sbjct: 34  IEGIDAITTGRLISVSEQQIVDCDTXXXXXXGGDADDAFRWVITNGGIASDANYPYT--- 90

Query: 61  GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGAL--LQDYNGKLIRK 118
           GV G C  + + +  R+  +     S +         P+   +  +    Q Y G  I  
Sbjct: 91  GVDGTCDLN-KPIAARIDGYTNVPNSSSALLDAVAKQPVSVNIYTSSTSFQLYTGPGIFA 149

Query: 119 NDVCPSE--NLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNA----C 170
              C  +   ++H V+IVGYG        WIV+NSWG  WG  DGY  + R TN     C
Sbjct: 150 GSSCSDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWG-IDGYILIRRNTNRPDGVC 208

Query: 171 GIESY 175
            I+++
Sbjct: 209 AIDAW 213


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score =  111 bits (281), Expect = 7e-31
 Identities = 62/208 (29%), Positives = 85/208 (40%), Gaps = 60/208 (28%)

Query: 2   LESQYAIKHGTLLPLSKSQLIECNIY-NQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQ 59
           +E   AI+ G+L+ LS+ +LI+C+   N GCQGG  + A +Y+K + GL  EA YP+R  
Sbjct: 37  VEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYR-- 94

Query: 60  NGVTGRCAYD----ARKVKVRVSDF-------------LVFNG---------SDTFRRML 93
               G C          V V +                 V N             F  M 
Sbjct: 95  -AARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAF--MF 151

Query: 94  YHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG 152
           Y  G  V          + G+       C    L+H V +VGYG+       W V+NSWG
Sbjct: 152 YSEG--V----------FTGE-------C-GTELDHGVAVVGYGVAEDGKAYWTVKNSWG 191

Query: 153 -RWGPDDGYFTVERGTNA----CGIESY 175
             WG + GY  VE+ + A    CGI   
Sbjct: 192 PSWG-EQGYIRVEKDSGASGGLCGIAME 218


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  111 bits (280), Expect = 1e-30
 Identities = 48/217 (22%), Positives = 75/217 (34%), Gaps = 50/217 (23%)

Query: 1   MLESQYAIKHGTL--LPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF 56
            +  +  I       + +S   L+ C  ++   GC GG   +A  +    GL +   Y  
Sbjct: 43  AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 102

Query: 57  R-------------NQNGVTGRC-------------------AYDARKVKVRVSDFLVFN 84
                         + NG    C                    Y   K     + + V N
Sbjct: 103 HVGCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKICEPGYSPTYKQDKHYG-YNSYSVSN 161

Query: 85  GSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRK----NDVCPSENLNHAVVIVGYGMR 139
                   +Y  GP+    +  +    Y      K      V       HA+ I+G+G+ 
Sbjct: 162 SEKDIMAEIYKNGPVEGAFSVYSDFLLY------KSGVYQHVTGEMMGGHAIRILGWGVE 215

Query: 140 HQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
           +  P W+V NSW   WG D+G+F + RG + CGIES 
Sbjct: 216 NGTPYWLVANSWNTDWG-DNGFFKILRGQDHCGIESE 251


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  111 bits (279), Expect = 2e-30
 Identities = 49/217 (22%), Positives = 79/217 (36%), Gaps = 50/217 (23%)

Query: 1   MLESQYAIKHG--TLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFR 57
            +  +  I+ G    + LS   L+ C      GC+GG    A  Y    G+   +     
Sbjct: 39  AMSDRSCIQSGGKQNVELSAVDLLSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENH 98

Query: 58  -----------------------NQNGVTGRC----------AYDARKVKVRVSDFLVFN 84
                                  ++   T RC           Y   K +   S + V N
Sbjct: 99  AGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRG-KSSYNVKN 157

Query: 85  GSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRK----NDVCPSENLNHAVVIVGYGMR 139
                ++ +  YGP+ AG        +Y      K      +       HA+ I+G+G+ 
Sbjct: 158 DEKAIQKEIMKYGPVEAGFTVYEDFLNY------KSGIYKHITGETLGGHAIRIIGWGVE 211

Query: 140 HQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
           ++ P W++ NSW   WG ++GYF + RG + C IES 
Sbjct: 212 NKAPYWLIANSWNEDWG-ENGYFRIVRGRDECSIESE 247


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  112 bits (281), Expect = 3e-30
 Identities = 48/217 (22%), Positives = 75/217 (34%), Gaps = 50/217 (23%)

Query: 1   MLESQYAIKHGTLL--PLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF 56
            +  +  I     +   +S   L+ C  ++   GC GG   +A  +    GL +   Y  
Sbjct: 100 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 159

Query: 57  R-------------NQNGVTGRC-------------------AYDARKVKVRVSDFLVFN 84
                         + NG    C                    Y   K     + + V N
Sbjct: 160 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYG-YNSYSVSN 218

Query: 85  GSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRK----NDVCPSENLNHAVVIVGYGMR 139
                   +Y  GP+    +  +    Y      K      V       HA+ I+G+G+ 
Sbjct: 219 SEKDIMAEIYKNGPVEGAFSVYSDFLLY------KSGVYQHVTGEMMGGHAIRILGWGVE 272

Query: 140 HQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
           +  P W+V NSW   WG D+G+F + RG + CGIES 
Sbjct: 273 NGTPYWLVANSWNTDWG-DNGFFKILRGQDHCGIESE 308


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score = 98.8 bits (246), Expect = 2e-25
 Identities = 36/200 (18%), Positives = 66/200 (33%), Gaps = 30/200 (15%)

Query: 1   MLESQYAIKHGTLLPLSKSQLIECNI----YNQGCQGGGF-NKAIQYLKHAGLEAEADYP 55
            ++ +      +   +     I  N      +     G      I+ L   G+  E ++P
Sbjct: 86  AIQFERIHDKQSPEFIPSRLFIYYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCPEKEWP 145

Query: 56  FRNQNG----------------VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPL 99
           + +                    + +C  DA+  K+      V    D  +  L    P 
Sbjct: 146 YGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKI-TEYSRVAQDIDHLKACLAVGSPF 204

Query: 100 VAGMN-GALLQDYNGKLIRKNDVCPSEN--LNHAVVIVGYGMRHQVPVWIVRNSWG-RWG 155
           V G +        N   +R      ++     HAV+ VGY    ++  + +RNSWG   G
Sbjct: 205 VFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYDD--EIRHFRIRNSWGNNVG 262

Query: 156 PDDGYFTVERG-TNACGIES 174
            +DGYF +     +   +  
Sbjct: 263 -EDGYFWMPYEYISNTQLAD 281


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 37.5 bits (86), Expect = 0.001
 Identities = 9/42 (21%), Positives = 17/42 (40%), Gaps = 1/42 (2%)

Query: 122 CPSENLNHAVVIVGYGM-RHQVPVWIVRNSWGRWGPDDGYFT 162
                 +H + I G    +     ++V+NSWG     +G + 
Sbjct: 311 NYETTDDHGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWY 352



 Score = 27.5 bits (60), Expect = 3.3
 Identities = 13/73 (17%), Positives = 22/73 (30%), Gaps = 12/73 (16%)

Query: 1   MLESQYAIKHGTLLPLSKSQLIECNIYNQG------------CQGGGFNKAIQYLKHAGL 48
            LES+          LS+   +     ++              QGG F  A+  ++  GL
Sbjct: 42  FLESELLRMGKGEYDLSEMFTVYNTYLDRADAAVRTHGDVSFSQGGSFYDALYGMETFGL 101

Query: 49  EAEADYPFRNQNG 61
             E +        
Sbjct: 102 VPEEEMRPGMMYA 114


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 37.0 bits (85), Expect = 0.002
 Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 4/90 (4%)

Query: 78  SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENL-NHAVVIVGY 136
           ++  VF GS T + M    G +   +       YN    + + +   E+L   A++I G 
Sbjct: 321 NNKAVFFGSHTPKFMDKKTGVMDIELWNYPAIGYNLPQQKASRIRYHESLMTAAMLITGC 380

Query: 137 GM---RHQVPVWIVRNSWGRWGPDDGYFTV 163
            +         + V NSWG+    DG + +
Sbjct: 381 HVDETSKLPLRYRVENSWGKDSGKDGLYVM 410


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 35.5 bits (81), Expect = 0.008
 Identities = 10/40 (25%), Positives = 13/40 (32%), Gaps = 4/40 (10%)

Query: 128 NHAVVIVGYGMRHQVP----VWIVRNSWGRWGPDDGYFTV 163
            HA+       +         W V NSWG      GY  +
Sbjct: 370 THAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCM 409


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 31.9 bits (72), Expect = 0.11
 Identities = 21/119 (17%), Positives = 36/119 (30%), Gaps = 37/119 (31%)

Query: 38  KAIQYLKHAGLEAEADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYG 97
           KAI  L   G+     YP           +     ++  +      N       ML    
Sbjct: 298 KAITVLFFIGVRCYEAYP---------NTSLPPSILEDSLE-----NNEGVPSPML---- 339

Query: 98  PLVAGMNGALLQDYNGKLIRK-NDVCPSE--------NLNHAVVIVG-----YGMRHQV 142
             ++ +    +QDY    + K N   P+         N    +V+ G     YG+   +
Sbjct: 340 -SISNLTQEQVQDY----VNKTNSHLPAGKQVEISLVNGAKNLVVSGPPQSLYGLNLTL 393



 Score = 31.2 bits (70), Expect = 0.26
 Identities = 18/98 (18%), Positives = 31/98 (31%), Gaps = 20/98 (20%)

Query: 12  TLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADY--------PFRNQNGVT 63
           TL  L ++ L    ++ QG         +++L++     + DY        P     GV 
Sbjct: 194 TLSELIRTTLDAEKVFTQGLN------ILEWLENPSNTPDKDYLLSIPISCPL---IGVI 244

Query: 64  GRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVA 101
               Y    V  ++  F         +    H   LV 
Sbjct: 245 QLAHY---VVTAKLLGFTPGELRSYLKGATGHSQGLVT 279


>1nz9_A Transcription antitermination protein NUSG; transcription
          elongation, riken structural genomics/proteomics
          initiative, RSGI; NMR {Thermus thermophilus} SCOP:
          b.34.5.4
          Length = 58

 Score = 27.5 bits (62), Expect = 0.48
 Identities = 9/28 (32%), Positives = 12/28 (42%)

Query: 53 DYPFRNQNGVTGRCAYDARKVKVRVSDF 80
            PF +  G       +  KVKV V+ F
Sbjct: 15 SGPFADFTGTVTEINPERGKVKVMVTIF 42


>2z0t_A Putative uncharacterized protein PH0355; alpha/beta protein, RNA
           binding protein, structural genomics, NPPSFA; 1.80A
           {Pyrococcus horikoshii} PDB: 1s04_A
          Length = 109

 Score = 27.8 bits (62), Expect = 1.00
 Identities = 10/61 (16%), Positives = 23/61 (37%), Gaps = 13/61 (21%)

Query: 62  VTGRCAYDARKVKVRVSDFLVFNGS------------DTFRRMLYHYGPLVAGMNGALLQ 109
           + GR   + R+ +++  D ++F G              +F+ ML   G          ++
Sbjct: 22  IEGRLYDEKRR-QIKPGDIIIFEGGKLKVKVKGIRVYSSFKEMLEKEGIENVLPGVKSIE 80

Query: 110 D 110
           +
Sbjct: 81  E 81


>3r4c_A Hydrolase, haloacid dehalogenase-like hydrolase; haloalkanoate
           dehalogenase enzyme superfamily, phosphohydrol
           hydrolase; 1.82A {Bacteroides thetaiotaomicron}
          Length = 268

 Score = 27.5 bits (62), Expect = 2.8
 Identities = 22/142 (15%), Positives = 41/142 (28%), Gaps = 53/142 (37%)

Query: 11  GTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGV-----TGR 65
           GTLL     ++ + +I            A++ +                +G+     TGR
Sbjct: 21  GTLLSFETHKVSQSSI-----------DALKKVH--------------DSGIKIVIATGR 55

Query: 66  CAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSE 125
            A D  ++                      Y  ++A +NGA     +G +IRK  +    
Sbjct: 56  AASDLHEID------------------AVPYDGVIA-LNGAECVLRDGSVIRKVAI---- 92

Query: 126 NLNHAVVIVGYGMRHQVPVWIV 147
                   +         V + 
Sbjct: 93  PAQDFRKSMELAREFDFAVALE 114


>3mpo_A Predicted hydrolase of the HAD superfamily; SGX, PSI, structural
           genomics, protein structure initiative; 2.90A
           {Lactobacillus brevis}
          Length = 279

 Score = 27.1 bits (61), Expect = 3.8
 Identities = 8/76 (10%), Positives = 24/76 (31%), Gaps = 12/76 (15%)

Query: 93  LYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG 152
           +          NG++ Q  +GK++  + +         + +  +  + +    I      
Sbjct: 61  IDGDDQYAITFNGSVAQTISGKVLTNHSL----TYEDYIDLEAWARKVRAHFQIET---- 112

Query: 153 RWGPDDGYFTVERGTN 168
                D  +T  +  +
Sbjct: 113 ----PDYIYTANKDIS 124


>2b30_A Pvivax hypothetical protein; SGPP, structural genomics, PSI,
           protein structure initiative; 2.70A {Plasmodium vivax}
           SCOP: c.108.1.10
          Length = 301

 Score = 26.8 bits (60), Expect = 5.1
 Identities = 8/71 (11%), Positives = 22/71 (30%), Gaps = 10/71 (14%)

Query: 95  HYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRW 154
            YG     +NG ++ D  G  +    +      +    ++ Y +   +    + +     
Sbjct: 89  FYGMPGVYINGTIVYDQIGYTLLDETI----ETDVYAELISYLVEKNLVNQTIFHR---- 140

Query: 155 GPDDGYFTVER 165
              +  +  E 
Sbjct: 141 --GESNYVTED 149


>2jvv_A Transcription antitermination protein NUSG; transcription factor,
           transcription regulation, transcription termination; NMR
           {Escherichia coli} PDB: 2k06_A 2kvq_G
          Length = 181

 Score = 26.3 bits (59), Expect = 5.1
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 55  PFRNQNGVTGRCAYDARKVKVRVSDF 80
           PF + NGV     Y+  ++KV VS F
Sbjct: 140 PFADFNGVVEEVDYEKSRLKVSVSIF 165


>3n6r_B Propionyl-COA carboxylase, beta subunit; protein complex,
           biotin-dependent carboxylase, ligase; HET: BTI; 3.20A
           {Roseobacter denitrificans}
          Length = 531

 Score = 26.4 bits (59), Expect = 6.2
 Identities = 11/25 (44%), Positives = 12/25 (48%)

Query: 62  VTGRCAYDARKVKVRVSDFLVFNGS 86
           VTG    + R V V   DF V  GS
Sbjct: 96  VTGWGTINGRVVYVFSQDFTVLGGS 120


>1nrw_A Hypothetical protein, haloacid dehalogenase-like hydrolase;
           structural genomics, PSI, protein structure initiative;
           1.70A {Bacillus subtilis} SCOP: c.108.1.10
          Length = 288

 Score = 26.5 bits (59), Expect = 6.6
 Identities = 6/16 (37%), Positives = 9/16 (56%)

Query: 104 NGALLQDYNGKLIRKN 119
           NGA++ D  G+L    
Sbjct: 68  NGAVIHDPEGRLYHHE 83


>1vrg_A Propionyl-COA carboxylase, beta subunit; TM0716, structural joint
           center for structural genomics, JCSG, protein structu
           initiative, PSI; HET: MSE; 2.30A {Thermotoga maritima}
           SCOP: c.14.1.4 c.14.1.4
          Length = 527

 Score = 26.4 bits (59), Expect = 7.8
 Identities = 11/25 (44%), Positives = 13/25 (52%)

Query: 62  VTGRCAYDARKVKVRVSDFLVFNGS 86
           +TG    + RKV V   DF V  GS
Sbjct: 89  ITGVGEINGRKVAVFSQDFTVMGGS 113


>1m1h_A Transcription antitermination protein NUSG; transcription
           termination, RNP motif, immunoglobulin fold, nucleic
           acid interaction; 1.95A {Aquifex aeolicus} SCOP:
           b.114.1.1 d.58.42.1 PDB: 1m1g_A 1npp_A 1npr_A
          Length = 248

 Score = 25.8 bits (57), Expect = 8.1
 Identities = 9/26 (34%), Positives = 12/26 (46%)

Query: 55  PFRNQNGVTGRCAYDARKVKVRVSDF 80
           PF N  G       + RK+ V +S F
Sbjct: 207 PFMNFTGTVEEVHPEKRKLTVMISIF 232


>1x0u_A Hypothetical methylmalonyl-COA decarboxylase ALPH; lyase; 2.20A
           {Sulfolobus tokodaii}
          Length = 522

 Score = 26.0 bits (58), Expect = 9.7
 Identities = 11/25 (44%), Positives = 11/25 (44%)

Query: 62  VTGRCAYDARKVKVRVSDFLVFNGS 86
           VTG    D R V     DF V  GS
Sbjct: 82  VTGWGKVDGRTVFAYAQDFTVLGGS 106


>2xhc_A Transcription antitermination protein NUSG; 2.45A {Thermotoga
           maritima}
          Length = 352

 Score = 25.8 bits (57), Expect = 9.9
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 55  PFRNQNGVTGRCAYDARKVKVRVSDF 80
           PF +  GV      + +++KV V+ F
Sbjct: 311 PFEDFAGVIKEIDPERQELKVNVTIF 336


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.322    0.141    0.450 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 3,116,359
Number of extensions: 184801
Number of successful extensions: 496
Number of sequences better than 10.0: 1
Number of HSP's gapped: 398
Number of HSP's successfully gapped: 75
Length of query: 192
Length of database: 6,701,793
Length adjustment: 88
Effective length of query: 104
Effective length of database: 4,244,745
Effective search space: 441453480
Effective search space used: 441453480
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (24.3 bits)