RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy12185
         (317 letters)



>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  207 bits (530), Expect = 1e-65
 Identities = 74/285 (25%), Positives = 129/285 (45%), Gaps = 37/285 (12%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEE 93
           +L+  +++ Y K Y+ ++   R   +EK++  I+E N ++     +   G+ +F+D++ E
Sbjct: 3   DLWHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFE 62

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EFK ++L        ++SH   ++ ++                 +P K DWRE+G + +V
Sbjct: 63  EFKAKYLTEMSRASDILSHGVPYEANN---------------RAVPDKIDWRESGYVTEV 107

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
           ++Q  CG+ WAFST  T E  +     T    S Q+++DC+   GN GC GG        
Sbjct: 108 KDQGNCGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGG-------L 160

Query: 213 MD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
           M+     + +  LE ES YP    +  C+         K+  +     + S  E  +   
Sbjct: 161 MENAYQYLKQFGLETESSYPYTAVEGQCRYNK-QLGVAKVTGF---YTVHSGSEVELKNL 216

Query: 266 IATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +   GP   AV+    +  Y  G+   +   S   +NHAV  VGY
Sbjct: 217 VGAEGPAAVAVDVESDFMMYRSGIY-QSQTCSPLRVNHAVLAVGY 260


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score =  200 bits (510), Expect = 2e-62
 Identities = 73/281 (25%), Positives = 121/281 (43%), Gaps = 30/281 (10%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEE 93
             ++ ++  + + Y  +E   R   +EK++ +IE  N + R+   S    +  F D++ E
Sbjct: 10  AQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSE 69

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EF+         K       +                        P   DWRE G +  V
Sbjct: 70  EFRQVMNGFQNRKPRKGKVFQEPLF-----------------YEAPRSVDWREKGYVTPV 112

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
           +NQ  CG+CWAFS     E     K G L  LS Q ++DC+G  GN GC+GG       +
Sbjct: 113 KNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQY 172

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATHGP 271
           +  N   L+ E  YP    + +CK      +      +     IP  E +++  +AT GP
Sbjct: 173 VQDNG-GLDSEESYPYEATEESCKYNP-KYSVANDAGF---VDIPKQEKALMKAVATVGP 227

Query: 272 VIAAVNA--LTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
           +  A++A   ++ +Y  G+    +C  S  +++H V +VGY
Sbjct: 228 ISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGY 266


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score =  198 bits (506), Expect = 5e-62
 Identities = 74/284 (26%), Positives = 121/284 (42%), Gaps = 33/284 (11%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
             +  +++ ++K Y +K +   R   +EK+L  I   N +      +    +    D++ 
Sbjct: 9   THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS 68

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EE   +     V      S+   +                      P   D+R+ G +  
Sbjct: 69  EEVVQKMTGLKVPLSHSRSNDTLYIPEWE--------------GRAPDSVDYRKKGYVTP 114

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ  CG+CWAFS+V   E     K G L  LS Q ++DC    N GC GG       +
Sbjct: 115 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS-ENDGCGGGYMTNAFQY 173

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHG 270
           +  N+  ++ E  YP + ++ +C    T     K + Y     IP  +E ++   +A  G
Sbjct: 174 VQKNR-GIDSEDAYPYVGQEESCMYNPTGK-AAKCRGY---REIPEGNEKALKRAVARVG 228

Query: 271 PVIAAVNA--LTWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
           PV  A++A   ++Q+Y  GV  Y    C  +  N+NHAV  VGY
Sbjct: 229 PVSVAIDASLTSFQFYSKGV--YYDESC--NSDNLNHAVLAVGY 268


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  199 bits (507), Expect = 7e-62
 Identities = 77/283 (27%), Positives = 122/283 (43%), Gaps = 25/283 (8%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
           E + +F+  Y +SY +  E   R + F+K L+  EE N K RQ   S   G+  F+D++ 
Sbjct: 20  EKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP 79

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EE K       +      +    +       +   +   +  P       DWR+ G++  
Sbjct: 80  EEMKAYTHGLIMP-----ADLHKNGIPIKTREDLGLNASVRYPASF----DWRDQGMVSP 130

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSL--LSVQEVIDCAGNGNMGCSGGDFCALL 210
           V+NQ +CG+ WAFS+    ES   + NG      +S Q+++DC     +GCSGG      
Sbjct: 131 VKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVP-NALGCSGGWMNDAF 189

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTDIAT 268
            ++  N   ++ E  YP  + D  C     +    ++  Y     +    E+ +   +AT
Sbjct: 190 TYVAQNG-GIDSEGAYPYEMADGNCHYDP-NQVAARLSGY---VYLSGPDENMLADMVAT 244

Query: 269 HGPVIAAVNAL-TWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            GPV  A +A   +  Y GGV     C        HAV IVGY
Sbjct: 245 KGPVAVAFDADDPFGSYSGGVYYNPTC--ETNKFTHAVLIVGY 285


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score =  197 bits (502), Expect = 2e-61
 Identities = 83/298 (27%), Positives = 136/298 (45%), Gaps = 33/298 (11%)

Query: 18  FLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQ 75
           F  +          ++L +LF+S+   + K Y +  E   RF+ F+ +L+ I+E NK   
Sbjct: 2   FSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKN- 60

Query: 76  SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
              S   G+ EF+DLS +EF  +++   ++  +  S+ +   +                 
Sbjct: 61  --NSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDI-------------- 104

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
             +P   DWR+ G +  VR+Q +CG+CWAFS V T E ++ ++ G L  LS QE++DC  
Sbjct: 105 VNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCER 164

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
             + GC GG     L+++  N   +   S+YP   K   C+ K      VK         
Sbjct: 165 -RSHGCKGGYPPYALEYVAKNG--IHLRSKYPYKAKQGTCRAKQVGGPIVKTSGV---GR 218

Query: 256 IPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +    E ++L  IA   PV   V +    +Q Y GG+ +  C   +   + AV  VGY
Sbjct: 219 VQPNNEGNLLNAIA-KQPVSVVVESKGRPFQLYKGGIFEGPCGTKV---DGAVTAVGY 272


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  194 bits (495), Expect = 2e-60
 Identities = 68/293 (23%), Positives = 114/293 (38%), Gaps = 46/293 (15%)

Query: 27  KPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGIT 85
           +P+     + F  +++ + KSY +  + +   KNF +S+  ++               I 
Sbjct: 1   RPSSI---KTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQSNGG----------AIN 47

Query: 86  EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR 145
             SDLS +EFK R L  +     L +    +       +  + +     P  I    D R
Sbjct: 48  HLSDLSLDEFKNRFLMSAEAFEHLKTQFDLNA------ETNACSINGNAPAEI----DLR 97

Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
           +   +  +R Q  CG+ WAFS V   ES +         L+ QE++DCA     GC G  
Sbjct: 98  QMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRDQSLDLAEQELVDCA--SQHGCHGD- 154

Query: 206 FCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
                  +      +    +  ES Y  + ++ +C+R         I +Y      P+ +
Sbjct: 155 ------TIPRGIEYIQHNGVVQESYYRYVAREQSCRRPNA--QRFGISNYC-QIYPPNAN 205

Query: 261 SILTDIA-THGPV---IAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            I   +A TH  +   I   +   +++Y G  I    D       HAV IVGY
Sbjct: 206 KIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI-IQRDNGYQPNYHAVNIVGY 257


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score =  194 bits (495), Expect = 3e-60
 Identities = 77/284 (27%), Positives = 122/284 (42%), Gaps = 33/284 (11%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
             +  +++ Y K Y  K+E  +R   +EK+L  +   N ++     S   G+    D++ 
Sbjct: 10  HHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTS 69

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EE  +      V                    +R+IT        +P   DWRE G + +
Sbjct: 70  EEVMSLMSSLRVPSQ----------------WQRNITYKSNPNRILPDSVDWREKGCVTE 113

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V+ Q +CGA WAFS V   E+   LK G L  LS Q ++DC+    GN GC+GG      
Sbjct: 114 VKYQGSCGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 173

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTDIAT 268
            ++  NK  ++ ++ YP    D  C+  +          Y   T +P   E  +   +A 
Sbjct: 174 QYIIDNK-GIDSDASYPYKAMDQKCQYDS-KYRAATCSKY---TELPYGREDVLKEAVAN 228

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            GPV   V+A   ++  Y  GV    +C  ++   NH V +VGY
Sbjct: 229 KGPVSVGVDARHPSFFLYRSGVYYEPSCTQNV---NHGVLVVGY 269


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  190 bits (486), Expect = 7e-59
 Identities = 80/289 (27%), Positives = 127/289 (43%), Gaps = 44/289 (15%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
           E +S F+  +KKSY S  E   R   F+ ++  I E N K  +   +    + +F D+S+
Sbjct: 25  EQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSK 84

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF     R    K     + +                       +    DWR   +   
Sbjct: 85  EEFLAYVNRGKAQKPKHPENLRMPYVSSK--------------KPLAASVDWRSNAVSE- 129

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLD 211
           V++Q  CG+ W+FST    E   AL+ G L+ LS Q +IDC+ + GN GC GG       
Sbjct: 130 VKDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGG------- 182

Query: 212 WMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILT 264
           WMD     ++   +  ES YP   +   C+  + S +   +  Y     +P   E+S+  
Sbjct: 183 WMDSAFSYIHDYGIMSESAYPYEAQGDYCRFDS-SQSVTTLSGY---YDLPSGDENSLAD 238

Query: 265 DIATHGPVIAAVNAL-TWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
            +   GPV  A++A    Q+Y GG+  +    C  + +++NH V +VGY
Sbjct: 239 AVGQAGPVAVAIDATDELQFYSGGL--FYDQTC--NQSDLNHGVLVVGY 283


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  159 bits (404), Expect = 1e-45
 Identities = 57/297 (19%), Positives = 103/297 (34%), Gaps = 43/297 (14%)

Query: 32  QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
           +K+   S            S+     + ++   + ++ +N  ++S  +  Y   E+  L+
Sbjct: 116 KKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTY--MEYETLT 173

Query: 92  EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE---AG 148
             +   R   HS             +     +              +P   DWR      
Sbjct: 174 LGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILF-------------LPTSWDWRNVHGIN 220

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL--LSVQEVIDCAGNGNMGCSGGDF 206
            +  VRNQ +CG+C++F+++   E+   +         LS QEV+ C+     GC GG  
Sbjct: 221 FVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQ-YAQGCEGG-- 277

Query: 207 CALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT---LIP 257
                +             L  E+ +P    D+ CK K           Y          
Sbjct: 278 -----FPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCF-RYYSSEYHYVGGFYGGC 331

Query: 258 SESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLAN----INHAVQIVGY 309
           +E+ +  ++  HGP+  A      + +Y  G+  +       N     NHAV +VGY
Sbjct: 332 NEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 388


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  151 bits (383), Expect = 7e-45
 Identities = 56/179 (31%), Positives = 79/179 (44%), Gaps = 20/179 (11%)

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P + DWR  G + KV++Q  CG+CWAFS     E    L  GTL  LS QE++DC    +
Sbjct: 2   PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK-MD 60

Query: 199 MGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
             C GG                 N   LE E +Y       +C+  A     V I+    
Sbjct: 61  KACMGG-------LPSNAYSAIKNLGGLETEDDYSYQGHMQSCQFSAEKA-KVYIQDS-- 110

Query: 253 DTLIP-SESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
              +  +E  +   +A  GP+  A+NA   Q+Y  G+ +      S   I+HAV +VGY
Sbjct: 111 -VELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGY 168


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score =  149 bits (378), Expect = 5e-44
 Identities = 58/182 (31%), Positives = 89/182 (48%), Gaps = 24/182 (13%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +P   DWR+ G +  V++Q  CG+CWAFST+   E ++ +K   L  LS QE++DC  + 
Sbjct: 2   VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61

Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           N GC+GG        MD        +  +  E+ YP    D  C     +   V I  + 
Sbjct: 62  NQGCNGG-------LMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGH- 113

Query: 252 CDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
               +P   E+++L  +A + PV  A++A    +Q+Y  GV   +C   L   +H V IV
Sbjct: 114 --ENVPENDENALLKAVA-NQPVSVAIDAGGSDFQFYSEGVFTGSCGTEL---DHGVAIV 167

Query: 308 GY 309
           GY
Sbjct: 168 GY 169


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score =  145 bits (369), Expect = 1e-42
 Identities = 58/182 (31%), Positives = 86/182 (47%), Gaps = 24/182 (13%)

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-G 197
           P   DWRE G +  V+NQ  CG+CWAFS     E     K G L  LS Q ++DC+G  G
Sbjct: 2   PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61

Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           N GC+GG        MD       +   L+ E  YP    + +CK             + 
Sbjct: 62  NEGCNGG-------LMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFV 113

Query: 252 CDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQ-YNCDGSLANINHAVQIV 307
               IP  E +++  +AT GP+  A++A   ++ +Y  G+    +C  S  +++H V +V
Sbjct: 114 D---IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVV 168

Query: 308 GY 309
           GY
Sbjct: 169 GY 170


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score =  144 bits (367), Expect = 2e-42
 Identities = 56/181 (30%), Positives = 77/181 (42%), Gaps = 24/181 (13%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           IP   DWR+ G +  VRNQ  CG+CW FS+V   E ++ +  G L  LS QE++DC    
Sbjct: 1   IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCER-R 59

Query: 198 NMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
           + GC GG       +       V    +     YP       C+        VK      
Sbjct: 60  SYGCRGG-------FPLYALQYVANSGIHLRQYYPYEGVQRQCRASQAKGPKVKTDGV-- 110

Query: 253 DTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
              +P   E +++  IA   PV   V A    +Q Y GG+    C  S+   +HAV  VG
Sbjct: 111 -GRVPRNNEQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGPCGTSI---DHAVAAVG 165

Query: 309 Y 309
           Y
Sbjct: 166 Y 166


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score =  144 bits (367), Expect = 2e-42
 Identities = 54/181 (29%), Positives = 82/181 (45%), Gaps = 24/181 (13%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           IP   DWR+ G +  V+NQ +CG+CWAFS V T E +  ++ G L+  S QE++DC    
Sbjct: 1   IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDR-R 59

Query: 198 NMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
           + GC+GG       +       V +  +   + YP       C+ +   P   K      
Sbjct: 60  SYGCNGG-------YPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQ 112

Query: 253 DTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
              +    E ++L  IA + PV   + A    +Q Y GG+    C   +   +HAV  VG
Sbjct: 113 ---VQPYNEGALLYSIA-NQPVSVVLEAAGKDFQLYRGGIFVGPCGNKV---DHAVAAVG 165

Query: 309 Y 309
           Y
Sbjct: 166 Y 166


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score =  144 bits (367), Expect = 2e-42
 Identities = 61/180 (33%), Positives = 86/180 (47%), Gaps = 24/180 (13%)

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P   DWRE G +  V+NQ  CG+CWAFSTV T E ++ +  G L  LS QE++DC    +
Sbjct: 2   PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCER-RS 60

Query: 199 MGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
            GC GG       +       V    +  E EYP   K   C+ K      V I  Y   
Sbjct: 61  HGCDGG-------YQTTSLQYVVDNGVHTEREYPYEKKQGRCRAKDKKGPKVYITGY--- 110

Query: 254 TLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +P+  E S++  IA + PV    ++    +Q+Y GG+ +  C  +    +HAV  VGY
Sbjct: 111 KYVPANDEISLIQAIA-NQPVSVVTDSRGRGFQFYKGGIYEGPCGTNT---DHAVTAVGY 166


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  144 bits (367), Expect = 2e-42
 Identities = 53/183 (28%), Positives = 78/183 (42%), Gaps = 23/183 (12%)

Query: 139 PVKKDWREAG-IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN- 196
           P   DWR+ G  +  V+NQ +CG+CW FST    ES  A+  G +  L+ Q+++DCA N 
Sbjct: 2   PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 61

Query: 197 GNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
            N GC GG                     +  E  YP   +D  CK +        +K  
Sbjct: 62  NNHGCQGG-------LPSQAFEYIRYNKGIMGEDTYPYKGQDDHCKFQPDKA-IAFVKDV 113

Query: 251 TCDTLIPS--ESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ-YNCDGSLANINHAVQI 306
                I    E +++  +A + PV  A      +  Y  G+    +C  +   +NHAV  
Sbjct: 114 ---ANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLA 170

Query: 307 VGY 309
           VGY
Sbjct: 171 VGY 173


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  144 bits (366), Expect = 2e-42
 Identities = 53/181 (29%), Positives = 79/181 (43%), Gaps = 23/181 (12%)

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P   DWR  G +  V++Q  CG+CWAFS +   E    L    L+ LS Q ++ C    +
Sbjct: 2   PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCD-KTD 60

Query: 199 MGCSGGDFCALLDWMD-----VNKVV---LEPESEYPLLLKDAACK--RKATSPNGVKIK 248
            GCSGG        M+     + +     +  E  YP    +        +    G  I 
Sbjct: 61  SGCSGG-------LMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATIT 113

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
            +    L   E+ I   +A +GPV  AV+A +W  Y GGV+  +C      ++H V +VG
Sbjct: 114 GHV--ELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE--QLDHGVLLVG 168

Query: 309 Y 309
           Y
Sbjct: 169 Y 169


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score =  144 bits (366), Expect = 3e-42
 Identities = 58/181 (32%), Positives = 85/181 (46%), Gaps = 24/181 (13%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +P   DWR+ G +  VR+Q +CG+CWAFS V T E ++ ++ G L  LS QE++DC    
Sbjct: 1   LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCER-R 59

Query: 198 NMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
           + GC GG       +       V K  +   S+YP   K   C+ K      VK      
Sbjct: 60  SHGCKGG-------YPPYALEYVAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGR 112

Query: 253 DTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
              +   +E ++L  IA   PV   V +    +Q Y GG+ +  C   +   +HAV  VG
Sbjct: 113 ---VQPNNEGNLLNAIA-KQPVSVVVESKGRPFQLYKGGIFEGPCGTKV---DHAVTAVG 165

Query: 309 Y 309
           Y
Sbjct: 166 Y 166


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score =  144 bits (366), Expect = 3e-42
 Identities = 52/182 (28%), Positives = 82/182 (45%), Gaps = 23/182 (12%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN- 196
           +P   DWR +G +  +++Q  CG+ WAFST+   E ++ +  G L  LS QE++DC    
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60

Query: 197 GNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
              GC GG       +M       +N   +  E+ YP   ++  C         V I +Y
Sbjct: 61  NTRGCDGG-------FMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTY 113

Query: 251 TCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
                +P  +   L     + PV  A+ A    +Q+Y  G+    C  ++   +HAV IV
Sbjct: 114 ---ENVPYNNEWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAV---DHAVTIV 167

Query: 308 GY 309
           GY
Sbjct: 168 GY 169


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score =  144 bits (365), Expect = 3e-42
 Identities = 60/180 (33%), Positives = 80/180 (44%), Gaps = 24/180 (13%)

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P   DWR  G +  V+NQ  CG+CWAFST+ T E ++ +  G L  LS QE++DC    +
Sbjct: 2   PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDK-HS 60

Query: 199 MGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
            GC GG       +       V    +     YP   K   C+        VKI  Y   
Sbjct: 61  YGCKGG-------YQTTSLQYVANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGY--- 110

Query: 254 TLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +PS  E+S L  +A + P+   V A    +Q Y  GV    C   L   +HAV  VGY
Sbjct: 111 KRVPSNCETSFLGALA-NQPLSVLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 166


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score =  144 bits (365), Expect = 4e-42
 Identities = 59/184 (32%), Positives = 86/184 (46%), Gaps = 25/184 (13%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN- 196
           +P   DWRE G + +V+ Q +CGACWAFS V   E+   LK G L  LS Q ++DC+   
Sbjct: 2   LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 61

Query: 197 -GNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
            GN GC+GG       +M       ++   ++ ++ YP    D  C+  +          
Sbjct: 62  YGNKGCNGG-------FMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYR-AATCSK 113

Query: 250 YTCDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
           Y   T +P   E  +   +A  GPV   V+A   ++  Y  GV  Y       N+NH V 
Sbjct: 114 Y---TELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV--YYEPSCTQNVNHGVL 168

Query: 306 IVGY 309
           +VGY
Sbjct: 169 VVGY 172


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score =  145 bits (368), Expect = 4e-42
 Identities = 62/186 (33%), Positives = 86/186 (46%), Gaps = 25/186 (13%)

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
           + +P   DWR+ G +  V++Q  CG+CWAFSTV + E ++A++ G+L  LS QE+IDC  
Sbjct: 2   SDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT 61

Query: 196 NGNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAAC---KRKATSPNGVK 246
             N GC GG        MD       N   L  E+ YP       C   +    SP  V 
Sbjct: 62  ADNDGCQGG-------LMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVH 114

Query: 247 IKSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
           I  +     +P+ S   L     + PV  AV A    + +Y  GV    C   L   +H 
Sbjct: 115 IDGH---QDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTEL---DHG 168

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 169 VAVVGY 174


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score =  144 bits (365), Expect = 5e-42
 Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 24/185 (12%)

Query: 135 PTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA 194
           P+ +P   DWR  G +  V++Q+ CG+CWAFST    E  H  K G L  LS QE++DC+
Sbjct: 4   PSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCS 63

Query: 195 GN-GNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
              GN  CSGG        M+      ++   +  E  YP L +D  C+ ++     VKI
Sbjct: 64  RAEGNQSCSGG-------EMNDAFQYVLDSGGICSEDAYPYLARDEECRAQSCEK-VVKI 115

Query: 248 KSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
             +     +P  S + +       PV  A+ A  + +Q+Y  GV   +C   L   +H V
Sbjct: 116 LGF---KDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDL---DHGV 169

Query: 305 QIVGY 309
            +VGY
Sbjct: 170 LLVGY 174


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score =  143 bits (363), Expect = 6e-42
 Identities = 59/181 (32%), Positives = 86/181 (47%), Gaps = 23/181 (12%)

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P   D+R+ G +  V+NQ  CG+CWAFS+V   E     K G L  LS Q ++DC    N
Sbjct: 2   PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS-EN 60

Query: 199 MGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
            GC GG       +M            ++ E  YP + ++ +C    T     K + Y  
Sbjct: 61  DGCGGG-------YMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYRE 112

Query: 253 DTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
              IP   E ++   +A  GPV  A++A   ++Q+Y  GV  Y+   +  N+NHAV  VG
Sbjct: 113 ---IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY-YDESCNSDNLNHAVLAVG 168

Query: 309 Y 309
           Y
Sbjct: 169 Y 169


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score =  144 bits (365), Expect = 9e-42
 Identities = 61/184 (33%), Positives = 83/184 (45%), Gaps = 22/184 (11%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
            P   DW + G+I KV+ Q  CG+ WAFS     E+ HA+  G L  LS QE+IDC    
Sbjct: 2   APESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVD-E 60

Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           + GC  G       W        V    +  E++YP   +D  CK      + V I +Y 
Sbjct: 61  SEGCYNG-------WHYQSFEWVVKHGGIASEADYPYKARDGKCKANE-IQDKVTIDNYG 112

Query: 252 CDTL----IPSES-SILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQ 305
              L      SE+ S L       P+  +++A  + +Y GG+    NC  S   INH V 
Sbjct: 113 VQILSNESTESEAESSLQSFVLEQPISVSIDAKDFHFYSGGIYDGGNC-SSPYGINHFVL 171

Query: 306 IVGY 309
           IVGY
Sbjct: 172 IVGY 175


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score =  142 bits (360), Expect = 2e-41
 Identities = 64/182 (35%), Positives = 90/182 (49%), Gaps = 26/182 (14%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +P   DWRE G +  V+NQ  CG+CWAFSTV   E ++ +  G L  LS Q+++DC    
Sbjct: 3   LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-A 61

Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           N GC GG       WM+      VN   +  E  YP   +D  C     +P  V I SY 
Sbjct: 62  NHGCRGG-------WMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAP-VVSIDSY- 112

Query: 252 CDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
               +PS  E S+   +A + PV   ++A    +Q Y  G+   +C+ S    NHA+ +V
Sbjct: 113 --ENVPSHNEQSLQKAVA-NQPVSVTMDAAGRDFQLYRSGIFTGSCNISA---NHALTVV 166

Query: 308 GY 309
           GY
Sbjct: 167 GY 168


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  143 bits (362), Expect = 4e-41
 Identities = 44/209 (21%), Positives = 66/209 (31%), Gaps = 44/209 (21%)

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMG 200
           KD        +V +Q  C   W F++    E++  +K    + +S   V +C        
Sbjct: 14  KDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYKGEHKDR 73

Query: 201 CSGGDFCALLDWMD-------VNKVVLEPESEYP------------------LLLKDAAC 235
           C  G                  +   L  ES YP                   L  +   
Sbjct: 74  CDEG-------SSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKI 126

Query: 236 KRKATSPNGVKIKSYT-------CDTLIPSESSILTDIATHGPVIAAVNALTW--QYYLG 286
                 PN +  K YT        D +      I T++   G VIA + A       + G
Sbjct: 127 LHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSG 186

Query: 287 GVIQYNCDGSLANINHAVQIVGYDNYSRT 315
             ++  C       +HAV IVGY NY  +
Sbjct: 187 KKVKNLCGDD--TADHAVNIVGYGNYVNS 213


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score =  139 bits (354), Expect = 1e-40
 Identities = 55/182 (30%), Positives = 85/182 (46%), Gaps = 27/182 (14%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +P   DWR  G +  ++NQ+ CG+CWAFS V   ES++ ++ G L  LS QE++DC    
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT-A 59

Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           + GC+GG       WM+      +    ++ +  YP      +CK        V I  + 
Sbjct: 60  SHGCNGG-------WMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRL--RVVSINGF- 109

Query: 252 CDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
               +    ES++ + +A   PV   V A    +Q+Y  G+    C  +    NH V IV
Sbjct: 110 --QRVTRNNESALQSAVA-SQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQ---NHGVVIV 163

Query: 308 GY 309
           GY
Sbjct: 164 GY 165


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score =  140 bits (356), Expect = 2e-40
 Identities = 54/182 (29%), Positives = 84/182 (46%), Gaps = 24/182 (13%)

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
               +  DWR  G +  V++Q  CG+CWAFS+V + ES +A++   L L S QE++DC+ 
Sbjct: 18  KLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSV 77

Query: 196 NGNMGCSGGDFCALLDWMD------VNKVVLEPESEYP-LLLKDAACKRKATSPNGVKIK 248
             N GC GG       ++       ++   L  + +YP +      C  K  +     IK
Sbjct: 78  -KNNGCYGG-------YITNAFDDMIDLGGLCSQDDYPYVSNLPETCNLKRCNE-RYTIK 128

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
           SY     IP +      +   GP+  ++ A   + +Y GG     C  +    NHAV +V
Sbjct: 129 SY---VSIP-DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGECGAAP---NHAVILV 181

Query: 308 GY 309
           GY
Sbjct: 182 GY 183


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  139 bits (352), Expect = 3e-40
 Identities = 49/190 (25%), Positives = 76/190 (40%), Gaps = 22/190 (11%)

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
           T   +I    P + D R+   +  +R Q  CG+ WAFS V   ES +         L+ Q
Sbjct: 1   TNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQ 60

Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPN 243
           E++DCA     GC G         +      +    +  ES Y  + ++ +C+R      
Sbjct: 61  ELVDCA--SQHGCHGD-------TIPRGIEYIQHNGVVQESYYRYVAREQSCRRPNA--Q 109

Query: 244 GVKIKSYTCDTLIPSESSILTDIA-THGPV---IAAVNALTWQYYLGGVIQYNCDGSLAN 299
              I +Y      P+ + I   +A TH  +   I   +   +++Y G  I    D     
Sbjct: 110 RFGISNYC-QIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI-IQRDNGYQP 167

Query: 300 INHAVQIVGY 309
             HAV IVGY
Sbjct: 168 NYHAVNIVGY 177


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score =  139 bits (352), Expect = 6e-40
 Identities = 47/179 (26%), Positives = 82/179 (45%), Gaps = 22/179 (12%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
                DWR    +  V++Q+ CG+CWAFS++ + ES +A++   L  LS QE++DC+   
Sbjct: 18  DHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCS-FK 76

Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           N GC+GG        ++      +    + P+ +YP +                 IK+Y 
Sbjct: 77  NYGCNGG-------LINNAFEDMIELGGICPDGDYPYVSDAPNLCNIDRCTEKYGIKNY- 128

Query: 252 CDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               +P ++ +   +   GP+  +V     + +Y  G+    C   L   NHAV +VG+
Sbjct: 129 --LSVP-DNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGECGDQL---NHAVMLVGF 181


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  139 bits (352), Expect = 4e-39
 Identities = 42/270 (15%), Positives = 84/270 (31%), Gaps = 40/270 (14%)

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
             ++ +N+  +    A+Y      +++  E K  +     N +  +   +          
Sbjct: 13  AFVDRVNRLNRGIWKAKYD-GVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEEARAP 71

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT-LS 183
                    +P+     + W     I ++ +Q  CG+CWA +             G    
Sbjct: 72  ---------LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDV 122

Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYP---------------- 227
            +S  +++ C  +   GC+GGD      +     +V   +   P                
Sbjct: 123 HISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLV--SDYCQPYPFPHCSHHSKSKNGY 180

Query: 228 -----LLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNALT 280
                       C      P  + + +Y   T      E   + ++   GP   A +   
Sbjct: 181 PPCSQFNFDTPKCDYTCDDPT-IPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYE 239

Query: 281 -WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +  Y  GV  +     L    HAV++VG+
Sbjct: 240 DFIAYNSGVYHHVSGQYL--GGHAVRLVGW 267


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score =  130 bits (330), Expect = 4e-37
 Identities = 56/181 (30%), Positives = 83/181 (45%), Gaps = 26/181 (14%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +P + DWR+ G +  V+NQ +CG+CWAFSTV T ES++ ++ G L  LS QE++DC    
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK-K 59

Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           N GC GG                +N   ++ ++ YP       C+    +   V I  Y 
Sbjct: 60  NHGCLGG-------AFVFAYQYIINNGGIDTQANYPYKAVQGPCQ---AASKVVSIDGY- 108

Query: 252 CDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
               +P  +   L       P   A++A    +Q Y  G+    C      +NH V IVG
Sbjct: 109 --NGVPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCG---TKLNHGVTIVG 163

Query: 309 Y 309
           Y
Sbjct: 164 Y 164


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  124 bits (313), Expect = 1e-33
 Identities = 53/290 (18%), Positives = 100/290 (34%), Gaps = 62/290 (21%)

Query: 56  RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKH 115
            F       +++  +NK   + ++       F ++     K R     +           
Sbjct: 6   SFHPLSD--ELVNYVNKRNTTWQAGHN----FYNVDMSYLK-RLCGTFLGGPKPPQRVMF 58

Query: 116 HDHHHNHVKKRSITTGITIPTGIPVKKDWREA----GIIGKVRNQQTCGACWAFSTVETA 171
            +                    +P   D RE       I ++R+Q +CG+CWAF  VE  
Sbjct: 59  TE-----------------DLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAI 101

Query: 172 ESMHALK-NGTLS-LLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLE------- 221
                +  N  +S  +S ++++ C G+    GC+GG      ++     +V         
Sbjct: 102 SDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 161

Query: 222 -------PESEYPLLLKDAACKRKATSPN--------------GVKIKSYTCDTLIPSES 260
                  P  E+ +      C  +  +P                 K   Y   ++  SE 
Sbjct: 162 GCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEK 221

Query: 261 SILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            I+ +I  +GPV  A +  + +  Y  GV Q+     +    HA++I+G+
Sbjct: 222 DIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMM--GGHAIRILGW 269


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score =  118 bits (298), Expect = 2e-32
 Identities = 53/175 (30%), Positives = 76/175 (43%), Gaps = 11/175 (6%)

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P   DWR+ G +  V++Q  CG CWAF      E + A+  G L  +S Q+++DC     
Sbjct: 2   PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCD-TXX 60

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
               GGD      W+  N   +  ++ YP    D  C      P   +I  Y   T +P+
Sbjct: 61  XXXXGGDADDAFRWVITNG-GIASDANYPYTGVDGTCDLN--KPIAARIDGY---TNVPN 114

Query: 259 ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY--NCDGSLANINHAVQIVGY 309
            SS L D     PV   +     ++Q Y G  I    +C    A ++H V IVGY
Sbjct: 115 SSSALLDAVAKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGY 169


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  117 bits (296), Expect = 1e-31
 Identities = 45/208 (21%), Positives = 74/208 (35%), Gaps = 38/208 (18%)

Query: 138 IPVKKDWREA----GIIGKVRNQQTCGACWAFSTVETAESMHALKNG--TLSLLSVQEVI 191
           IP   D R+       I  +R+Q  CG+CWAF  VE       +++G      LS  +++
Sbjct: 3   IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62

Query: 192 DCAGNGNMGCSGGDFCALLDWMDVNKVVLE-------PESEYPLLLKDAACKRKATSPNG 244
            C  +  +GC GG      D+     +V             YP    +   K K      
Sbjct: 63  SCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGS 122

Query: 245 VKIKSYTCDT----------------------LIPSESSILTDIATHGPVIAAVNALT-W 281
              K+  C                        +   E +I  +I  +GPV A       +
Sbjct: 123 KIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDF 182

Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGY 309
             Y  G+ ++    +L    HA++I+G+
Sbjct: 183 LNYKSGIYKHITGETL--GGHAIRIIGW 208


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  118 bits (298), Expect = 1e-31
 Identities = 32/202 (15%), Positives = 55/202 (27%), Gaps = 30/202 (14%)

Query: 133 TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
           ++   +P K D        +V +Q   G+C A +     +        +   +  +  I 
Sbjct: 52  SVIAALPPKVDLTPPF---QVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIY 108

Query: 193 CA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP------ 242
                        SG      +  +    V   PE E+P     A  + +   P      
Sbjct: 109 YNERKIEGHVNYDSGAMIRDGIKVLHKLGVC--PEKEWPYGDTPADPRTEEFPPGAPASK 166

Query: 243 ----------NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGV--I 289
                        KI  Y+   +      +   +A   P +   +    W         I
Sbjct: 167 KPSDQCYKDAQNYKITEYS--RVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRI 224

Query: 290 QYNCDGSLANINHAVQIVGYDN 311
                       HAV  VGYD+
Sbjct: 225 PLPTKNDTLEGGHAVLCVGYDD 246


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  117 bits (294), Expect = 4e-31
 Identities = 44/208 (21%), Positives = 77/208 (37%), Gaps = 38/208 (18%)

Query: 138 IPVKKDWREA----GIIGKVRNQQTCGACWAFSTVETAESMHALK-NGTLS-LLSVQEVI 191
           +P   D RE       I ++R+Q +CG+ WAF  VE       +  N  +S  +S ++++
Sbjct: 7   LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66

Query: 192 DCAGN-GNMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKDAACKRKATSP--- 242
            C G+    GC+GG      ++     +V            P  +           P   
Sbjct: 67  TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCT 126

Query: 243 --------------------NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-W 281
                                  K   Y   ++  SE  I+ +I  +GPV  A +  + +
Sbjct: 127 GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF 186

Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGY 309
             Y  GV Q+     +    HA++I+G+
Sbjct: 187 LLYKSGVYQHVTGEMM--GGHAIRILGW 212


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  115 bits (289), Expect = 2e-30
 Identities = 46/201 (22%), Positives = 69/201 (34%), Gaps = 37/201 (18%)

Query: 135 PTGIPVKKDWREAG---IIGKVRNQ---QTCGACWAFSTVETAESMHALKNG---TLSLL 185
           P  +P   DWR           RNQ   Q CG+CWA ++         +K       +LL
Sbjct: 33  PADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLL 92

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKAT 240
           SVQ VIDC   G   C GG                ++  +  E+      KD  C +   
Sbjct: 93  SVQNVIDCGNAG--SCEGG-------NDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQ 143

Query: 241 SPNGVKIKSYTCDT-----------LIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGV 288
                + K                  +     ++ +I  +GP+   + A      Y GG+
Sbjct: 144 CGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGI 203

Query: 289 IQYNCDGSLANINHAVQIVGY 309
                D +   INH V + G+
Sbjct: 204 YAEYQDTTY--INHVVSVAGW 222


>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 106

 Score = 67.5 bits (165), Expect = 2e-14
 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 4/74 (5%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           + FSSFQ  Y KSY ++ E   R+  F+ +L  I   N+   S       +  F DLS +
Sbjct: 23  DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYS---YSLKMNHFGDLSRD 79

Query: 94  EFKTRHLRHSVNKH 107
           EF+ ++L    +++
Sbjct: 80  EFRRKYLGFKKSRN 93


>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
           disorder P like protein, hydrolase; NMR {Drosophila
           melanogaster}
          Length = 80

 Score = 65.8 bits (161), Expect = 4e-14
 Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 1/67 (1%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEE 93
           E +  ++ ++ K+Y   E  +R + + +S   IEE N K  +   + + GI   +DL+ E
Sbjct: 8   EEWVEYKSKFDKNYEAEEDLMRRRIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTPE 67

Query: 94  EFKTRHL 100
           EF  R  
Sbjct: 68  EFAQRSG 74


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 52.7 bits (126), Expect = 8e-08
 Identities = 51/347 (14%), Positives = 95/347 (27%), Gaps = 119/347 (34%)

Query: 24  KVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPE----- 78
           +     +    +L     +       ++  D   K F + L+I+E L     +P+     
Sbjct: 178 QTYHVLVG---DLIKFSAETLS-ELIRTTLDAE-KVFTQGLNILEWLENPSNTPDKDYLL 232

Query: 79  SARY-----GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK-RSITT-- 130
           S        G+ + +                  H +++               +  T   
Sbjct: 233 SIPISCPLIGVIQLA------------------HYVVTAKLLGFTPGELRSYLKGATGHS 274

Query: 131 -GITIPTGIPVKKDWREAG-----------IIGKVRNQQTCGACWAFSTVETAESMH--A 176
            G+     I     W                IG VR  +      A+       S+   +
Sbjct: 275 QGLVTAVAIAETDSWESFFVSVRKAITVLFFIG-VRCYE------AYPNTSLPPSILEDS 327

Query: 177 LKNGT-----------LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVN---KVVL-- 220
           L+N             L+   VQ+ ++   N ++        +L     VN    +V+  
Sbjct: 328 LENNEGVPSPMLSISNLTQEQVQDYVN-KTNSHLPAGKQVEISL-----VNGAKNLVVSG 381

Query: 221 EPESEYPL---LLKDAA-----------CKRKA---------TSPNGVKIKSYTCDTLIP 257
            P+S Y L   L K  A            +RK           SP       +    L+P
Sbjct: 382 PPQSLYGLNLTLRKAKAPSGLDQSRIPFSERKLKFSNRFLPVASP-------FHSHLLVP 434

Query: 258 SESSILTDIATHG----------PVIAAVNALTWQYYLGGVIQYNCD 294
           +   I  D+  +           PV    +    +   G + +   D
Sbjct: 435 ASDLINKDLVKNNVSFNAKDIQIPVYDTFDGSDLRVLSGSISERIVD 481



 Score = 38.5 bits (89), Expect = 0.003
 Identities = 53/345 (15%), Positives = 93/345 (26%), Gaps = 125/345 (36%)

Query: 22  PVKVSKPNLEQKL----ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
           P+ +S  +LE  L      F    Q  ++             F K L    E       P
Sbjct: 8   PLTLSHGSLEHVLLVPTASFFIASQLQEQ-------------FNKILPEPTEGFAADDEP 54

Query: 78  ES-----ARY-----------GITEFSDLSE---EEFKTRHLR----HSVNKHVLMSHHK 114
            +      ++            + +F  +      EF+  +L     H++   +L  +  
Sbjct: 55  TTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLCLTEFENCYLEGNDIHALAAKLLQENDT 114

Query: 115 HHDHHHNHVK---KRSITTGITIPTGIP------VKKDWREAGII----GKVRNQQTCGA 161
                   +K      I                 V +    A ++    G    Q     
Sbjct: 115 TLVKTKELIKNYITARIMAKRPFDKKSNSALFRAVGEG--NAQLVAIFGG----QGNTDD 168

Query: 162 CWA-----FST----VETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
            +      + T    V       A    TLS L  +  +D       G +      +L+W
Sbjct: 169 YFEELRDLYQTYHVLVGDLIKFSA---ETLSELI-RTTLDAEKVFTQGLN------ILEW 218

Query: 213 MDVNKVVLEPESEY--------PLL-LKDAACKRKATSPNGVKIKSY--TCDTL--IPSE 259
           ++       P+ +Y        PL+ +   A               Y  T   L   P E
Sbjct: 219 LE--NPSNTPDKDYLLSIPISCPLIGVIQLAH--------------YVVTAKLLGFTPGE 262

Query: 260 -SSILTDIATHGP-VIAAV----------------NALTWQYYLG 286
             S L     H   ++ AV                 A+T  +++G
Sbjct: 263 LRSYLKGATGHSQGLVTAVAIAETDSWESFFVSVRKAITVLFFIG 307



 Score = 38.5 bits (89), Expect = 0.003
 Identities = 51/310 (16%), Positives = 98/310 (31%), Gaps = 105/310 (33%)

Query: 34   LELFSSFQ----------QRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
            ++L+ + +            +K +Y  S  DI   N   +L I     K ++  E   Y 
Sbjct: 1633 MDLYKTSKAAQDVWNRADNHFKDTYGFSILDI-VINNPVNLTIHFGGEKGKRIRE--NYS 1689

Query: 84   ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143
               F  + + + KT  +   +N+H                     +T  T          
Sbjct: 1690 AMIFETIVDGKLKTEKIFKEINEH---------------------STSYT---------- 1718

Query: 144  WR-EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCS 202
            +R E G++   +  Q      A + +E A +   LK+  L    +      AG+     S
Sbjct: 1719 FRSEKGLLSATQFTQP-----ALTLMEKA-AFEDLKSKGL----IPADATFAGH-----S 1763

Query: 203  GGDFCAL------LDWMDVNKVV---------LEPESEYPLLLKDAACKRKATSPNGVKI 247
             G++ AL      +    + +VV           P  E             A +P  V  
Sbjct: 1764 LGEYAALASLADVMSIESLVEVVFYRGMTMQVAVPRDELGRSNYGMI----AINPGRV-A 1818

Query: 248  KSYTCDTLIPSESSILTDIATH-GPVIAAVNALTWQYYLGGVIQYNCD-------GSLAN 299
             S++ + L      ++  +    G ++  VN             YN +       G L  
Sbjct: 1819 ASFSQEAL----QYVVERVGKRTGWLVEIVN-------------YNVENQQYVAAGDLRA 1861

Query: 300  INHAVQIVGY 309
            ++    ++ +
Sbjct: 1862 LDTVTNVLNF 1871


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 42.1 bits (98), Expect = 1e-04
 Identities = 21/89 (23%), Positives = 27/89 (30%), Gaps = 13/89 (14%)

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-----------AGNGN 198
           I  V+NQ   G CW +S+    ES           LS    +                  
Sbjct: 22  ITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLDRADAAVRTHGDV 81

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYP 227
               GG F   L  M+   +V  PE E  
Sbjct: 82  SFSQGGSFYDALYGMETFGLV--PEEEMR 108


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 39.1 bits (90), Expect = 0.002
 Identities = 53/346 (15%), Positives = 96/346 (27%), Gaps = 106/346 (30%)

Query: 26  SKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGIT 85
           SK  +   L LF +   + ++   K   ++   N++  +  I+   +   S  +  Y I 
Sbjct: 57  SKDAVSGTLRLFWTLLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQP-SMMTRMY-IE 114

Query: 86  EFSDL--SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143
           +   L    + F   ++          S  + +      + +      + I  G+     
Sbjct: 115 QRDRLYNDNQVFAKYNV----------SRLQPYLKLRQALLELRPAKNVLI-DGV----- 158

Query: 144 WREAGIIGK------------VRNQQTCGACWA-FSTVETAESMHALKNGTLSLLSVQEV 190
               G  GK            V+ +      W       + E++  L+   L  L  Q  
Sbjct: 159 ---LG-SGKTWVALDVCLSYKVQCKMDFKIFWLNLKNCNSPETV--LEM--LQKLLYQ-- 208

Query: 191 IDCAGNGNMGCSGGDFCA----LLDWMDVNKVVLEPESEYP--LL----------LK--D 232
           ID         S  D  +     +  +      L     Y   LL              +
Sbjct: 209 IDP-----NWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENCLLVLLNVQNAKAWNAFN 263

Query: 233 AACK-----RKATSPNGVKIKSYT-------CDTLIPSES-SILTDIA------------ 267
            +CK     R     + +   + T         TL P E  S+L                
Sbjct: 264 LSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDLPREVL 323

Query: 268 THGP----VIAAV---NALTWQYYLGGVIQYNCDGSLANINHAVQI 306
           T  P    +IA        TW  +       NCD     +   ++ 
Sbjct: 324 TTNPRRLSIIAESIRDGLATWDNWK----HVNCD----KLTTIIES 361



 Score = 29.8 bits (66), Expect = 1.3
 Identities = 16/125 (12%), Positives = 37/125 (29%), Gaps = 32/125 (25%)

Query: 21  IPVKV--------SKPNLEQKLELF--SSFQQRYKKSYSKSEHDIRFKNFEKSLD----- 65
           IP  +         K ++   +      S  ++  K  + S   I  +   K  +     
Sbjct: 387 IPTILLSLIWFDVIKSDVMVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYALH 446

Query: 66  --IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
             I++  N  +       +   +      +++   H+ H         H K+ +H     
Sbjct: 447 RSIVDHYNIPK------TFDSDDLIPPYLDQYFYSHIGH---------HLKNIEHPERMT 491

Query: 124 KKRSI 128
             R +
Sbjct: 492 LFRMV 496


>3t8b_A 1,4-dihydroxy-2-naphthoyl-COA synthase; crotonase superfamily,
           lyase; 1.65A {Mycobacterium tuberculosis} PDB: 3t8a_A
           1rjm_A* 1rjn_A* 1q52_A 1q51_A
          Length = 334

 Score = 31.7 bits (72), Expect = 0.25
 Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 6/36 (16%)

Query: 271 PVIAAVNALTWQYYLGG--VIQYNCDGSLANINHAV 304
            VI  VN     +  GG   +   CD +LA+  +A 
Sbjct: 169 VVICLVNG----WAAGGGHSLHVVCDLTLASREYAR 200


>2bec_A Calcineurin B homologous protein 2; calcineurin-homologous protein,
           calcium-binding protein, NHE1 regulating protein; 2.70A
           {Homo sapiens}
          Length = 202

 Score = 30.9 bits (70), Expect = 0.40
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +S  EF     +  V + + +   KHH HHH
Sbjct: 172 VSFVEFTKSLEKMDVEQKMSIRILKHHHHHH 202


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 30.5 bits (68), Expect = 0.76
 Identities = 7/31 (22%), Positives = 11/31 (35%)

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNG 180
              + NQ++ G  W FS +         K  
Sbjct: 60  GKPITNQKSSGRSWIFSCLNVMRLPFMKKLN 90


>1q0p_A Complement factor B; VON willebrand factor, MAC-1, I domain, A
           domain, hydrolase; 1.80A {Homo sapiens} SCOP: c.62.1.1
          Length = 223

 Score = 29.9 bits (67), Expect = 0.96
 Identities = 11/56 (19%), Positives = 17/56 (30%), Gaps = 1/56 (1%)

Query: 55  IRFKNFEKSLDIIEEL-NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVL 109
           I   NF  +   +  L  K        RYG+  ++   +   K      S    V 
Sbjct: 28  IGASNFTGAKKSLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADWVT 83


>3cio_A ETK, tyrosine-protein kinase ETK; WZC, escherichia coli tyrosine
           kinase domain, signaling protein, transferase, inner
           membrane, membrane; 2.50A {Escherichia coli}
          Length = 299

 Score = 29.6 bits (67), Expect = 1.3
 Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 8/39 (20%)

Query: 110 MSHHKHHDHHHNH---VKKRSITTGITIP-----TGIPV 140
           M HH HH HHH+    ++ R I +G+  P      GI V
Sbjct: 1   MGHHHHHHHHHHSSGHIEGRHIGSGVEAPEQLEEHGISV 39


>1pq4_A Periplasmic binding protein component of AN ABC T uptake
           transporter; ZNUA, loop, metal-binding, metal binding
           protein; 1.90A {Synechocystis SP} SCOP: c.92.2.2 PDB:
           2ov3_A 2ov1_A
          Length = 291

 Score = 29.3 bits (66), Expect = 1.6
 Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 5/68 (7%)

Query: 59  NFEKSL-DIIEELNKNRQSPESARYGIT--EFSDLSEEEF-KTRHLRHSVNKHVLMSHHK 114
            FE+   + ++  N N +  +SA+ GIT  E          +  H  HS + H   S  +
Sbjct: 61  GFEQPWLEKLKAANANMKLIDSAQ-GITPLEMEKHDHSHGEEEGHDDHSHDGHDHGSESE 119

Query: 115 HHDHHHNH 122
                   
Sbjct: 120 KEKAKGAL 127


>2prs_A High-affinity zinc uptake system protein ZNUA; protein consists of
           two (beta/ALFA)4 domains, metal transport; 1.70A
           {Escherichia coli} PDB: 2osv_A 2ps0_A 2ps3_A 2ps9_A
           2ogw_A 2xy4_A* 2xqv_A* 2xh8_A
          Length = 284

 Score = 28.9 bits (65), Expect = 2.0
 Identities = 9/66 (13%), Positives = 17/66 (25%), Gaps = 12/66 (18%)

Query: 58  KNFEKSLD-IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
              E  +   + +L   +Q   +    +      S       H            H +  
Sbjct: 57  PEMEAFMQKPVSKLPGAKQVTIAQLEDVKPLLMKSIHGDDDDH-----------DHAEKS 105

Query: 117 DHHHNH 122
           D  H+H
Sbjct: 106 DEDHHH 111


>1s7o_A Hypothetical UPF0122 protein
           SPY1201/SPYM3_0842/SPS1042/SPYM18_1152; putative DNA
           binding protein, structural genomics; 2.31A
           {Streptococcus pyogenes serotype M3} SCOP: a.4.13.3
          Length = 113

 Score = 27.5 bits (61), Expect = 2.3
 Identities = 7/45 (15%), Positives = 18/45 (40%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKN 73
             E KL ++S +  R +       H    +  ++ + I+  ++  
Sbjct: 68  TYEMKLHMYSDYVVRSEIFDDMIAHYPHDEYLQEKISILTSIDNR 112


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 28.9 bits (64), Expect = 2.3
 Identities = 7/33 (21%), Positives = 11/33 (33%)

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
             V NQ++ G CW F+           +     
Sbjct: 66  TPVTNQKSSGRCWLFAATNQLRLNVLSELNLKE 98


>3zqk_A VON willebrand factor; blood clotting, adamts-13, force sensor, VON
           willebrand DISE domain, haemostasis; HET: NAG; 1.70A
           {Homo sapiens} PDB: 3ppv_A 3ppx_A 3ppw_A 3ppy_A 3gxb_A*
          Length = 199

 Score = 28.4 bits (64), Expect = 2.6
 Identities = 9/59 (15%), Positives = 25/59 (42%), Gaps = 4/59 (6%)

Query: 55  IRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
           I   +F +S + +EE+ +      +S    + ++S +   E+       + +K  ++  
Sbjct: 34  IGEADFNRSKEFMEEVIQRMDVGQDSIHVTVLQYSYMVTVEY---PFSEAQSKGDILQR 89


>1mjn_A Integrin alpha-L; rossmann fold, immune system; 1.30A {Homo
          sapiens} SCOP: c.62.1.1 PDB: 3hi6_A 1mq8_B* 3eoa_I
          3eob_I 1rd4_A* 1lfa_A 1zon_A 1zoo_A 1zop_A 1dgq_A
          1xdd_A* 1xdg_A* 1xuo_A* 3e2m_A* 3bqn_B* 1cqp_A* 3bqm_B*
          2ica_A* 2o7n_A* 3m6f_A* ...
          Length = 179

 Score = 28.1 bits (63), Expect = 2.8
 Identities = 10/41 (24%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
          ++   F+K LD ++++   + S  S ++   +FS   + EF
Sbjct: 15 LQPDEFQKILDFMKDV-MKKCSNTSYQFAAVQFSTSYKTEF 54


>3nb2_A Secreted effector protein; pentapeptide, HECT domain, HECT E
           ubiquitin ligase, ligase; HET: MES; 2.10A {Escherichia
           coli} PDB: 3naw_A* 3sqv_A
          Length = 613

 Score = 28.7 bits (64), Expect = 2.8
 Identities = 10/82 (12%), Positives = 21/82 (25%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
            LFS     +   Y K+            L    EL +      ++     +     ++ 
Sbjct: 417 NLFSESFPIFSIPYHKAFSQNFVSGILDILISDNELKERFIEALNSNKSDYKMIADDQQR 476

Query: 95  FKTRHLRHSVNKHVLMSHHKHH 116
                    ++   L + H   
Sbjct: 477 KLACVWNPFLDGWELNAQHVDM 498


>1wuf_A Hypothetical protein LIN2664; structural genomics, unknown
           function, nysgxrc target T2186, superfamily, protein
           structure initiative, PSI; 2.90A {Listeria innocua}
           SCOP: c.1.11.2 d.54.1.1
          Length = 393

 Score = 28.4 bits (64), Expect = 2.9
 Identities = 27/133 (20%), Positives = 45/133 (33%), Gaps = 34/133 (25%)

Query: 112 HHKHHDHHHNHVKKRS----ITTGITIPTGIPVKKDWREAGIIGKVRNQQTC-------- 159
           HH HH HHH+ +  R           I   +P+   ++ +   G+++++           
Sbjct: 2   HHHHHHHHHHGLVPRGSHMYFQKARLIHAELPLLAPFKTS--YGELKSKDFYIIELINEE 59

Query: 160 -----GACWAFS----TVETAESM-HALKNGTLSLL---------SVQEVIDCAGNGNMG 200
                G   AF     T ET  S    +K   L LL          +QE+        M 
Sbjct: 60  GIHGYGELEAFPLPDYTEETLSSAILIIKEQLLPLLAQRKIRKPEEIQELFSWIQGNEMA 119

Query: 201 CSGGDFCALLDWM 213
            +  +  A+ D  
Sbjct: 120 KAAVE-LAVWDAF 131


>1svj_A Potassium-transporting ATPase B chain; alpha-beta sandwich,
           hydrolase; NMR {Escherichia coli} SCOP: d.220.1.1 PDB:
           1u7q_A 2a00_A* 2a29_A*
          Length = 156

 Score = 27.9 bits (62), Expect = 3.0
 Identities = 9/26 (34%), Positives = 12/26 (46%), Gaps = 8/26 (30%)

Query: 110 MSHHKHHDHHHNHVKKRSITTG-ITI 134
           M HH HH HHH+       ++G    
Sbjct: 1   MGHHHHHHHHHH-------SSGHGGR 19


>3oka_C N-terminal His-affinity TAG; GT-B fold, alpha-mannosyltransferase,
           GDP-MAN binding, trans; HET: GDD; 2.20A {Escherichia
           coli}
          Length = 26

 Score = 25.5 bits (55), Expect = 3.1
 Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 3/20 (15%)

Query: 110 MSHHKHHDHHHN---HVKKR 126
           M HH HH HHH+   H++ R
Sbjct: 1   MGHHHHHHHHHHSSGHIEGR 20


>3t89_A 1,4-dihydroxy-2-naphthoyl-COA synthase; crotonase superfamily,
           lyase; 1.95A {Escherichia coli} PDB: 3t88_A 3h02_A
           2iex_A
          Length = 289

 Score = 28.3 bits (64), Expect = 3.2
 Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 6/30 (20%)

Query: 271 PVIAAVNALTWQYYLGG--VIQYNCDGSLA 298
           PV+A V      Y +GG  V+   CD ++A
Sbjct: 125 PVVAMVAG----YSIGGGHVLHMMCDLTIA 150


>2uzf_A Naphthoate synthase; lyase, menaquinone biosynthesis; HET: CAA;
           2.9A {Staphylococcus aureus}
          Length = 273

 Score = 28.3 bits (64), Expect = 3.3
 Identities = 12/30 (40%), Positives = 16/30 (53%), Gaps = 6/30 (20%)

Query: 271 PVIAAVNALTWQYYLGG--VIQYNCDGSLA 298
           PVIA V      Y +GG  V+   CD ++A
Sbjct: 109 PVIAMVKG----YAVGGGNVLNVVCDLTIA 134


>1zx5_A Mannosephosphate isomerase, putative; STRU genomics, PSI, protein
           structure initiative, midwest center structural
           genomics, MCSG; HET: LFR; 2.30A {Archaeoglobus fulgidus}
           SCOP: b.82.1.3
          Length = 300

 Score = 28.3 bits (63), Expect = 3.6
 Identities = 12/59 (20%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCS---GGDFCALLDWMDVNK 217
            W FS   +  S   +K   LS+    E+     +  +G +      F  L+  +D   
Sbjct: 39  SWEFSAHTSRPSTVLVKGQQLSM---IELFSKHRDELLGRAAEKFSKFPILVRLIDAAS 94


>1wue_A Mandelate racemase/muconate lactonizing enzyme FA protein;
           structural genomics, unknown function, nysgxrc target
           T2185; 2.10A {Enterococcus faecalis} SCOP: c.1.11.2
           d.54.1.1
          Length = 386

 Score = 28.0 bits (63), Expect = 3.8
 Identities = 26/132 (19%), Positives = 44/132 (33%), Gaps = 33/132 (25%)

Query: 112 HHKHHDHHHNHVKKRS---ITTGITIPTGIPVKKDWREAGIIGKVRNQQTC--------- 159
           HH HH HHH  V + S   I +  T    +P+K  +  +   G++  +            
Sbjct: 3   HHHHHHHHHGLVPRGSHMNIQSIETYQVRLPLKTPFVTS--YGRLEEKAFDLFVITDEQG 60

Query: 160 ----GACWAFS----TVETAESM-HALKNGTLSLL---------SVQEVIDCAGNGNMGC 201
               G   AF       ET  +    ++   + LL          V  + +      MG 
Sbjct: 61  NQGFGELVAFEQPDYVQETLVTERFIIQQHLIPLLLTEAIEQPQEVSTIFEEVKGHWMGK 120

Query: 202 SGGDFCALLDWM 213
           +  +  A+ D  
Sbjct: 121 AALE-TAIWDLY 131


>3ebl_A Gibberellin receptor GID1; alpha/beta hydrolase, lipase,
           gibberellin signaling pathway, hydrolase, nucleus,
           hydrolase receptor; HET: GA4; 1.90A {Oryza sativa subsp}
           PDB: 3ed1_A*
          Length = 365

 Score = 28.1 bits (63), Expect = 4.0
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query: 104 VNKHVLMSHHKHHDHHHNH 122
           +N ++    H HH HHH+H
Sbjct: 347 LNANLYYGSHHHHHHHHHH 365


>2xgg_A Microneme protein 2; A/I domain, cell adhesion, hydrolase; 2.05A
          {Toxoplasma gondii}
          Length = 178

 Score = 27.7 bits (62), Expect = 4.3
 Identities = 6/42 (14%), Positives = 12/42 (28%), Gaps = 1/42 (2%)

Query: 55 IRFKNFEKSLDIIEELNKNRQ-SPESARYGITEFSDLSEEEF 95
          I  +NF      +          PE     +  +S     ++
Sbjct: 30 IGIQNFRLVKQFLHTFLMVLPIGPEEVNNAVVTYSTDVHLQW 71


>4eml_A Naphthoate synthase; 1,4-dihydroxy-2-naphthoyl-coenzyme A, lyase;
           2.04A {Synechocystis SP}
          Length = 275

 Score = 27.9 bits (63), Expect = 4.4
 Identities = 11/30 (36%), Positives = 15/30 (50%), Gaps = 6/30 (20%)

Query: 271 PVIAAVNALTWQYYLGG--VIQYNCDGSLA 298
            VIA V      Y +GG  V+   CD ++A
Sbjct: 111 VVIALVAG----YAIGGGHVLHLVCDLTIA 136


>3i71_A Ethanolamine utilization protein EUTK; helix-turn-helix, unknown
           function; HET: FLC; 2.10A {Escherichia coli}
          Length = 68

 Score = 25.9 bits (56), Expect = 4.4
 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 6/66 (9%)

Query: 61  EKSLDIIEELNKNRQSPE----SARYG--ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
           E + +++  L   RQ       +A +G  + +  +  E+ F    LR   +++ L  H +
Sbjct: 3   ESADELLALLTSVRQGMTAGEVAAHFGWPLEKARNALEQLFSAGTLRKRSSRYRLKPHLE 62

Query: 115 HHDHHH 120
           HH HHH
Sbjct: 63  HHHHHH 68


>3kd6_A Carbohydrate kinase, PFKB family; nucleoside kinase, AMP, PSI-II,
           NYSGXRC, struc genomics, protein structure initiative;
           HET: AMP; 1.88A {Chlorobaculum tepidum}
          Length = 313

 Score = 27.9 bits (62), Expect = 4.7
 Identities = 11/42 (26%), Positives = 17/42 (40%), Gaps = 7/42 (16%)

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH--HKHHDHHH 120
           ++G   ++DL   E   R         + +S     HH HHH
Sbjct: 277 QFGPYRYNDLDLLEVDDR-----YQSFLELSRIEEGHHHHHH 313


>3pqa_A Lactaldehyde dehydrogenase; structural genomics, protein structure
           initiative, nysgrc, P biology, oxidoreductase; 1.50A
           {Methanocaldococcus jannaschii} PDB: 3rhd_A*
          Length = 486

 Score = 27.9 bits (63), Expect = 4.9
 Identities = 13/48 (27%), Positives = 18/48 (37%), Gaps = 6/48 (12%)

Query: 78  ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
           E  +Y + E S       KT  +  +       SHH HH   H   +K
Sbjct: 445 EGVKYAMEEMS-----NIKTIIISKA-ENLYFQSHHHHHHWSHPQFEK 486


>3erv_A Putative C39-like peptidase; structural genomics, unknown function,
           PSI-2, protein structure initiative; 2.10A {Bacillus
           anthracis}
          Length = 236

 Score = 27.4 bits (60), Expect = 6.0
 Identities = 12/62 (19%), Positives = 20/62 (32%), Gaps = 3/62 (4%)

Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI---NHAVQIVGY 309
           D    S   +   +    PV+   NA            +  +    +I    H V ++GY
Sbjct: 126 DLTGKSIEELYKSVKAGQPVVIITNATFAPLDEDEFTTWETNNGDVSITYNEHCVVLIGY 185

Query: 310 DN 311
           D 
Sbjct: 186 DQ 187


>1ijb_A VON willebrand factor; dinucleotide-binding fold, blood clotting;
          1.80A {Homo sapiens} SCOP: c.62.1.1 PDB: 1ijk_A 1auq_A
          1u0n_A 3hxo_A 1uex_C 3hxq_A 1sq0_A 1m10_A 1fns_A 1oak_A
          1u0o_C
          Length = 202

 Score = 27.3 bits (61), Expect = 6.0
 Identities = 9/45 (20%), Positives = 15/45 (33%), Gaps = 7/45 (15%)

Query: 55 IRFKNFEKSLD----IIEELNKNRQSPESARYGITEFSDLSEEEF 95
          +    FE        ++E L     S +  R  + E+ D S    
Sbjct: 26 LSEAEFEVLKAFVVDMMERLRV---SQKWVRVAVVEYHDGSHAYI 67


>1xsv_A Hypothetical UPF0122 protein SAV1236; helix-turn-helix, putative
           DNA-binding protein, signal recognition particle,
           unknown function; 1.70A {Staphylococcus aureus subsp}
           SCOP: a.4.13.3
          Length = 113

 Score = 26.3 bits (58), Expect = 6.3
 Identities = 11/42 (26%), Positives = 23/42 (54%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           + E+KLEL+  F+QR +      +H    +  ++ +  +E+L
Sbjct: 71  DYEKKLELYQKFEQRREIYDEMKQHLSNPEQIQRYIQQLEDL 112


>2k8i_A SLYD, peptidyl-prolyl CIS-trans isomerase; ppiase, chaperone,
           rotamase; NMR {Escherichia coli}
          Length = 171

 Score = 26.9 bits (60), Expect = 6.7
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 6/40 (15%)

Query: 84  ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKH-HDHHHNH 122
           +    + +EEE    H+ H  + H    HH H HDHHH+H
Sbjct: 136 VVAIREATEEELAHGHV-HGAHDH----HHDHDHDHHHHH 170


>3odm_A Pepcase, PEPC, phosphoenolpyruvate carboxylase; beta-barrel, lyase;
           2.95A {Clostridium perfringens}
          Length = 560

 Score = 27.4 bits (60), Expect = 7.1
 Identities = 9/25 (36%), Positives = 11/25 (44%)

Query: 112 HHKHHDHHHNHVKKRSITTGITIPT 136
           HH HH HHH+          + IP 
Sbjct: 4   HHHHHHHHHSSGHIDDDDKHMKIPC 28


>1uw4_B UPF2, regulator of nonsense transcripts 2; nonsense mediated mRNA
           decay protein, RNA-binding protein, N domain, MIF4G
           domain; 1.95A {Homo sapiens} SCOP: a.118.1.14
          Length = 248

 Score = 27.2 bits (60), Expect = 7.3
 Identities = 21/100 (21%), Positives = 37/100 (37%), Gaps = 9/100 (9%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF 60
             D    LF + L+            + + ++KL+ F  + QRY   + K   ++  K+ 
Sbjct: 142 SLDPPEHLFRIRLVCTILDTCGQYFDRGSSKRKLDCFLVYFQRY--VWWKKSLEVWTKDH 199

Query: 61  EKSLDI-------IEELNKNRQSPESARYGITEFSDLSEE 93
              +DI       +E L    +   S    I +  DL  E
Sbjct: 200 PFPIDIDYMISDTLELLRPKIKLCNSLEESIRQVQDLERE 239


>1atz_A VON willebrand factor; collagen-binding, hemostasis, dinucleotide
           binding fold; 1.80A {Homo sapiens} SCOP: c.62.1.1 PDB:
           4dmu_B 2adf_A 1fe8_A 1ao3_A
          Length = 189

 Score = 26.9 bits (60), Expect = 7.5
 Identities = 5/59 (8%), Positives = 16/59 (27%), Gaps = 4/59 (6%)

Query: 55  IRFKNFEKSLDIIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
                F++     +         P   +  + ++  ++  +           K  L+S 
Sbjct: 18  FPASYFDEMKSFAKAFISKANIGPRLTQVSVLQYGSITTIDV---PWNVVPEKAHLLSL 73


>1pt6_A Integrin alpha-1; cell adhesion; 1.87A {Homo sapiens} SCOP:
           c.62.1.1 PDB: 4a0q_A 1qcy_A 1qc5_A 1qc5_B 1ck4_A 1mhp_A
          Length = 213

 Score = 27.0 bits (60), Expect = 7.8
 Identities = 8/55 (14%), Positives = 21/55 (38%), Gaps = 4/55 (7%)

Query: 59  NFEKSLDIIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
            ++     + +L K     P+  + GI ++ +    EF   +L    +   ++  
Sbjct: 22  PWDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEF---NLNKYSSTEEVLVA 73


>2d7u_A Adenylosuccinate synthetase; structural genomics, conserved
           hypothetical protein, NPPSFA; 2.50A {Pyrococcus
           horikoshii}
          Length = 339

 Score = 27.2 bits (61), Expect = 7.9
 Identities = 10/48 (20%), Positives = 18/48 (37%), Gaps = 1/48 (2%)

Query: 84  ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
             E   L +   K R +       ++   HK  D  + ++  +  TTG
Sbjct: 82  FHELEQLKDFNVKDR-VGIDYRCAIIEEKHKQLDRTNGYLHGKIGTTG 128


>2hza_A Nickel-responsive regulator; nickel-binding, ribbon-helix-helix,
           transcription factor, ME binding protein; HET: 3CM;
           2.10A {Escherichia coli} SCOP: a.43.1.3 d.58.18.4 PDB:
           1q5v_A* 2hzv_A 3od2_A*
          Length = 133

 Score = 26.4 bits (58), Expect = 8.2
 Identities = 4/25 (16%), Positives = 10/25 (40%)

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHH 120
           +    +H  +   + + H H +H  
Sbjct: 70  RIVSTQHHHHDLSVATLHVHINHDD 94


>1mc0_A 3',5'-cyclic nucleotide phosphodiesterase 2A; GAF domain, 3',5'
           guanosine monophosphate, hydrolase; HET: PCG; 2.86A {Mus
           musculus} SCOP: d.110.2.1 d.110.2.1
          Length = 368

 Score = 27.3 bits (60), Expect = 8.3
 Identities = 7/34 (20%), Positives = 17/34 (50%), Gaps = 6/34 (17%)

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +  ++E ++++           +M + +HH HHH
Sbjct: 341 YKKVNEAQYRSHLANE------MMMYLEHHHHHH 368


>1n3y_A Integrin alpha-X; alpha/beta rossmann fold, cell adhesion; 1.65A
           {Homo sapiens} SCOP: c.62.1.1
          Length = 198

 Score = 26.6 bits (59), Expect = 8.9
 Identities = 8/58 (13%), Positives = 21/58 (36%), Gaps = 4/58 (6%)

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
           I  +NF   ++ +  +   +    S ++ + +FS+  +  F              +S 
Sbjct: 22  ISSRNFATMMNFVRAVIS-QFQRPSTQFSLMQFSNKFQTHF---TFEEFRRSSNPLSL 75


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.319    0.133    0.406 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,964,434
Number of extensions: 303477
Number of successful extensions: 1769
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1512
Number of HSP's successfully gapped: 107
Length of query: 317
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 223
Effective length of database: 4,077,219
Effective search space: 909219837
Effective search space used: 909219837
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (25.5 bits)