BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy18108
         (102 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
           GN=CG12163 PE=2 SV=2
          Length = 614

 Score = 56.6 bits (135), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   F + Y +  E   R  +F  NLK IE+LN  E G+A YGI   +D+T 
Sbjct: 305 DHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 362

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 363 SEYKERTGL 371


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score = 52.8 bits (125), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 39/64 (60%)

Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          N   +K+FE+++ ++ + Y   +E  +RF +F++N+K IE  N     + T GIN  +D+
Sbjct: 30 NDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDM 89

Query: 87 TREE 90
          T+ E
Sbjct: 90 TKSE 93


>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
          Length = 363

 Score = 52.8 bits (125), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 36/83 (43%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+  N E    N EH   F  F   FSKSY TKEE   RF VF+ NL +   L++    T
Sbjct: 32  QVVDNEEDHLLNAEH--HFTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDPT 88

Query: 76  ATYGINHLSDLTREEMKSR-LGL 97
           A +GI   SDLT  E + + LGL
Sbjct: 89  AEHGITKFSDLTASEFRRQFLGL 111


>sp|Q91GE3|CATV_NPVEP Viral cathepsin OS=Epiphyas postvittana nucleopolyhedrovirus
           GN=VCATH PE=3 SV=1
          Length = 323

 Score = 52.0 bits (123), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 26/70 (37%), Positives = 44/70 (62%), Gaps = 3/70 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE+F+R ++K Y ++ E  +R+ +F+ NL  I  + K  + TA Y IN  SDL+++E  +
Sbjct: 28  FEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDI--ITKNRNDTAVYKINKFSDLSKDETIA 85

Query: 94  RL-GLNLSKH 102
           +  GL+L  H
Sbjct: 86  KYTGLSLPLH 95


>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
           polyhedrosis virus GN=VCATH PE=3 SV=1
          Length = 356

 Score = 52.0 bits (123), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 46/69 (66%), Gaps = 3/69 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI--EDLNKGEHGTATYGINHLSDLTREEM 91
           FE F+ +++K+Y +  E  KR+++F+DNL  I  ++ N  +  TATY IN  SDL++ E+
Sbjct: 56  FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115

Query: 92  KSRL-GLNL 99
            ++  GL++
Sbjct: 116 IAKFTGLSI 124


>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
          virus GN=VCATH PE=3 SV=1
          Length = 324

 Score = 51.2 bits (121), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 43/67 (64%), Gaps = 2/67 (2%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+  F+K+Y ++ E   RF +F+ NL+ I + N+ +  TA Y IN  SDL++EE  S
Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQND-STAQYEINKFSDLSKEEAIS 86

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 87 KYTGLSL 93


>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
          Length = 484

 Score = 51.2 bits (121), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246


>sp|P41721|CATV_NPVBM Viral cathepsin OS=Bombyx mori nuclear polyhedrosis virus
          GN=VCATH PE=1 SV=1
          Length = 323

 Score = 51.2 bits (121), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>sp|Q91CL9|CATV_NPVAP Viral cathepsin OS=Antheraea pernyi nuclear polyhedrosis virus
          GN=VCATH PE=3 SV=1
          Length = 324

 Score = 50.4 bits (119), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 46/68 (67%), Gaps = 4/68 (5%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
          FE+F+  F+K+Y ++ E  +RF +F+ NL+  E +NK ++ T A Y IN  SDL+++E  
Sbjct: 28 FEEFLHKFNKNYSSESEKLRRFKIFQHNLE--EIINKNQNDTSAQYEINKFSDLSKDETI 85

Query: 93 SRL-GLNL 99
          S+  GL+L
Sbjct: 86 SKYTGLSL 93


>sp|P25783|CATV_NPVAC Viral cathepsin OS=Autographa californica nuclear polyhedrosis
          virus GN=VCATH PE=1 SV=1
          Length = 323

 Score = 50.4 bits (119), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 44/67 (65%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
          Length = 323

 Score = 49.7 bits (117), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 4/74 (5%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
           ++E F   F K Y   EE + R +VF D LK I++ N + + G  TY   IN+ SDLT E
Sbjct: 19  EWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHE 78

Query: 90  E-MKSRLGLNLSKH 102
           E + ++ G+   +H
Sbjct: 79  EVLATKTGMTRRRH 92


>sp|Q9TST1|CATW_FELCA Cathepsin W OS=Felis catus GN=CTSW PE=2 SV=2
          Length = 374

 Score = 49.7 bits (117), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LKQ F  F   +++SY   EE A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKQAFTLFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGRLYG 104


>sp|P41715|CATV_NPVCF Viral cathepsin OS=Choristoneura fumiferana nuclear polyhedrosis
          virus GN=Vcath PE=3 SV=1
          Length = 324

 Score = 49.3 bits (116), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 42/67 (62%), Gaps = 2/67 (2%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+  F+KSY ++ E  +RF +F  NL+ I + N  +  TA Y IN  +DL+++E  S
Sbjct: 28 FEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHND-STAQYEINKFADLSKDETIS 86

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 87 KYTGLSL 93


>sp|P25780|PEPT1_EURMA Peptidase 1 OS=Euroglyphus maynei GN=EURM1 PE=1 SV=2
          Length = 321

 Score = 49.3 bits (116), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 12/73 (16%)

Query: 28 PEHLKQFEKFIRDFSKSY--PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y  P KEEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKTYATPEKEEVARK--NFLESLKYVES-NKG-------AINHLSD 69

Query: 86 LTREEMKSRLGLN 98
          L+ +E K++  +N
Sbjct: 70 LSLDEFKNQFLMN 82


>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
          Length = 345

 Score = 49.3 bits (116), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 41/70 (58%), Gaps = 1/70 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +KQFE+++ ++ + Y   +E   RF +F++N+  IE  N     + T GIN  +D+T  E
Sbjct: 34  MKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNE 93

Query: 91  MKSRL-GLNL 99
             ++  GL+L
Sbjct: 94  FVAQYTGLSL 103


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score = 48.9 bits (115), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 44/69 (63%), Gaps = 4/69 (5%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY-GINHLSDLTRE 89
           ++ FE +I +F K+Y T EE   RF VF+DNLK I++ NK   G + + G+N  +DL+ E
Sbjct: 48  IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNK--KGKSYWLGLNEFADLSHE 105

Query: 90  EMKS-RLGL 97
           E K   LGL
Sbjct: 106 EFKKMYLGL 114


>sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana defective
          polyhedrosis virus GN=Vcath PE=3 SV=1
          Length = 324

 Score = 48.9 bits (115), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 4/68 (5%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
          FE F+ +F+K+Y +K E   RF +F+ NL+  E +NK  + T A Y IN  SDL+++E  
Sbjct: 28 FEDFLHNFNKNYSSKSEKLHRFKIFQHNLE--EIINKNLNDTSAQYEINKFSDLSKDETI 85

Query: 93 SRL-GLNL 99
          S+  GL+L
Sbjct: 86 SKYTGLSL 93


>sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1
          Length = 462

 Score = 48.5 bits (114), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE   R  VF  N+   + +   + GTA YGI   SDLT EE  +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224


>sp|P56202|CATW_HUMAN Cathepsin W OS=Homo sapiens GN=CTSW PE=1 SV=2
          Length = 376

 Score = 47.8 bits (112), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>sp|Q9WGE0|CATV_NPVHC Viral cathepsin OS=Hyphantria cunea nuclear polyhedrosis virus
          GN=VCATH PE=3 SV=1
          Length = 324

 Score = 47.8 bits (112), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+  F+K Y ++ E  +RF +F+ NL+ I   N+ +  TA Y IN  SDL+++E  S
Sbjct: 28 FEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQND-TTAQYEINKFSDLSKDETIS 86

Query: 94 R 94
          +
Sbjct: 87 K 87


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score = 47.4 bits (111), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 9/78 (11%)

Query: 28  PEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           PEHL         FE ++ + SK+Y + EE   RF VF +NL  I+  N  E  +   G+
Sbjct: 38  PEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNN-EINSYWLGL 96

Query: 81  NHLSDLTREEMKSR-LGL 97
           N  +DLT EE K R LGL
Sbjct: 97  NEFADLTHEEFKGRYLGL 114


>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
          Length = 467

 Score = 47.0 bits (110), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>sp|Q8B9D5|CATV_NPVR1 Viral cathepsin OS=Rachiplusia ou multiple nucleopolyhedrovirus
          (strain R1) GN=VCATH PE=3 SV=1
          Length = 323

 Score = 46.6 bits (109), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 43/67 (64%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K Y ++ E  +RF +F+ NL   E + K ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIIIKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score = 46.6 bits (109), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + N  +  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N   + +   
Sbjct: 27  AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 87  GLNQFADLTDEEFRS 101


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
           GN=At3g43960 PE=2 SV=1
          Length = 376

 Score = 46.6 bits (109), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E L  +E+++ +  K+Y    E  +RF +F+DNLK IE+ N   + +   G+N  SDLT 
Sbjct: 36  EVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTA 95

Query: 89  EEMK-SRLGLNLSK 101
           +E + S LG  + K
Sbjct: 96  DEFQASYLGGKMEK 109


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score = 46.6 bits (109), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + N  +  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N   + +   
Sbjct: 27  AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 87  GLNQFADLTDEEFRS 101


>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
          Length = 358

 Score = 46.6 bits (109), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +DLT +
Sbjct: 55  HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 114 EFQRTKLG 121


>sp|Q8QLK1|CATV_NPVMC Viral cathepsin OS=Mamestra configurata nucleopolyhedrovirus
          GN=VCATH PE=3 SV=1
          Length = 337

 Score = 46.6 bits (109), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 37/61 (60%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FEKFI  ++K Y +++E   R+ +F  N++ I   N   + +A Y IN  +D+T+ E+ +
Sbjct: 40 FEKFISQYNKQYSSEDEKKYRYNIFRHNIESINAKNS-RNDSAVYKINRFADMTKNEVVN 98

Query: 94 R 94
          R
Sbjct: 99 R 99


>sp|P16311|PEPT1_DERFA Peptidase 1 OS=Dermatophagoides farinae GN=DERF1 PE=1 SV=2
          Length = 321

 Score = 45.8 bits (107), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 70 LSLDEFKNR 78


>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
          Length = 360

 Score = 45.4 bits (106), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F +F   + KSY +  EV KRF +F ++L+L+   N+ +  +   GIN  +D++ EE +
Sbjct: 58  RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116

Query: 93  -SRLG 96
            +RLG
Sbjct: 117 ATRLG 121


>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1
          PE=1 SV=2
          Length = 322

 Score = 45.4 bits (106), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 29/74 (39%), Positives = 37/74 (50%), Gaps = 7/74 (9%)

Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--G 79
          L   NP     +E+F   F + Y   EE   R  VF DNL+ IE+ NK  E G  TY   
Sbjct: 13 LAAANP----SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLA 68

Query: 80 INHLSDLTREEMKS 93
          IN  SD+T E+  +
Sbjct: 69 INQFSDMTNEKFNA 82


>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
           SV=1
          Length = 368

 Score = 45.1 bits (105), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F  F R F K Y + EE   RF+VF+ NL+      K +  +AT+G+   SDLTR E +
Sbjct: 50  HFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLD-PSATHGVTQFSDLTRSEFR 108

Query: 93  SR 94
            +
Sbjct: 109 KK 110


>sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei brucei PE=1 SV=1
          Length = 450

 Score = 44.7 bits (104), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus
          GN=VCATH PE=3 SV=1
          Length = 337

 Score = 44.3 bits (103), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE FI +++K YP  +    RF +F+ NL+ I + NK  + +A Y IN  SDL++ E+ +
Sbjct: 32 FETFIINYNKQYPDTKTKNYRFKIFKQNLEDINEKNK-LNDSAIYNINKFSDLSKNELLT 90

Query: 94 R 94
          +
Sbjct: 91 K 91


>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
          Length = 356

 Score = 43.9 bits (102), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 2/64 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-K 92
           F +F     K Y + EE+ +RF +F DNLK+I   N+ +  +   GIN  +DLT +E  K
Sbjct: 57  FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNR-KGLSYKLGINEFTDLTWDEFRK 115

Query: 93  SRLG 96
            +LG
Sbjct: 116 HKLG 119


>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
          Length = 345

 Score = 43.9 bits (102), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 24/76 (31%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           S N+L T     ++ FE ++   +K Y   +E   RF +F+DNLK I++ NK ++ +   
Sbjct: 34  SQNDL-TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNK-KNNSYWL 91

Query: 79  GINHLSDLTREEMKSR 94
           G+N  +D++ +E K +
Sbjct: 92  GLNVFADMSNDEFKEK 107


>sp|P56203|CATW_MOUSE Cathepsin W OS=Mus musculus GN=Ctsw PE=2 SV=2
          Length = 371

 Score = 43.9 bits (102), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)

Query: 11  LALFGQMKSNNELKTE---NPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
           L L GQ  S++ L  +    P  LK+ F+ F   F++SY    E  +R ++F  NL   +
Sbjct: 13  LLLAGQGLSDSLLTKDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQ 72

Query: 67  DLNKGEHGTATYGINHLSDLTREEMKSRLG 96
            L + + GTA +G    SDLT EE     G
Sbjct: 73  RLQQEDLGTAEFGETPFSDLTEEEFGQLYG 102


>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
           PE=2 SV=1
          Length = 358

 Score = 43.9 bits (102), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y + EE+  RF+VF++NL LI   NK +  +    +N  +DLT +
Sbjct: 55  HVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113

Query: 90  EMK 92
           E +
Sbjct: 114 EFQ 116


>sp|Q9YWK4|CATV_NPVBS Viral cathepsin OS=Buzura suppressaria nuclear polyhedrosis virus
          GN=VCATH PE=3 SV=1
          Length = 331

 Score = 43.9 bits (102), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 23/67 (34%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+ +++K Y    E  +RF++F+  L+ I   N+  + +A Y IN  +DL++ E+ S
Sbjct: 31 FETFLANYNKMYNDTSEKERRFSIFQQTLEEINYKNR-LNDSAVYQINKFADLSKNEIIS 89

Query: 94 RL-GLNL 99
          +  GLN+
Sbjct: 90 KYTGLNM 96


>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
          Length = 348

 Score = 43.5 bits (101), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 13/95 (13%)

Query: 11  LALFGQMK-----------SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFE 59
           + LFG M            S ++L T     ++ F  ++   +K+Y   +E   RF +F+
Sbjct: 15  ICLFGHMSLSYCDFSIVGYSQDDL-TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFK 73

Query: 60  DNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           DNLK I++ NK  +G    G+N  SDL+ +E K +
Sbjct: 74  DNLKYIDERNKMINGY-WLGLNEFSDLSNDEFKEK 107


>sp|Q9J8B9|CATV_NPVSE Viral cathepsin OS=Spodoptera exigua nuclear polyhedrosis virus
           (strain US) GN=VCATH PE=3 SV=1
          Length = 337

 Score = 43.1 bits (100), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 9/74 (12%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FEKFI  ++K Y +++E   R+ +F  N++ I   N   + +A Y IN  +D+ + E+  
Sbjct: 40  FEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQKNS-RNDSAVYKINRFADMPKNEIVI 98

Query: 94  R--------LGLNL 99
           R        LGLN 
Sbjct: 99  RHTGLASGELGLNF 112


>sp|Q8V5U0|CATV_NPVHZ Viral cathepsin OS=Heliothis zea nuclear polyhedrosis virus
           GN=VCATH PE=3 SV=1
          Length = 367

 Score = 43.1 bits (100), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 14/83 (16%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-----------HGTATYGINH 82
           F+ F++ ++KSY   +E   R+ VF+DNL  I   N+               +A +G+N 
Sbjct: 57  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116

Query: 83  LSDLTREE-MKSRLG--LNLSKH 102
            SD T +E + S  G  LNLS+H
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQH 139


>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1
          SV=2
          Length = 343

 Score = 42.7 bits (99), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 4/68 (5%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK---GEHGTATYGINHLSD 85
          E   QF +F   F+K Y + EE  +RF +F+ NL  IE+LN           +G+N  +D
Sbjct: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFAD 82

Query: 86 LTREEMKS 93
          L+ +E K+
Sbjct: 83 LSSDEFKN 90


>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
          Length = 362

 Score = 42.7 bits (99), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H  +F +F   + KSY +  EV +RF +F ++L+ +   N+ +      GIN  SD++ E
Sbjct: 57  HALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNR-KGLPYRLGINRFSDMSWE 115

Query: 90  EMK-SRLG 96
           E + +RLG
Sbjct: 116 EFQATRLG 123


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score = 42.4 bits (98), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 21/75 (28%), Positives = 40/75 (53%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           +  E++    E    +E+++ +  K+Y    E  +RF +F+DNLK +++ N     T   
Sbjct: 29  TETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEV 88

Query: 79  GINHLSDLTREEMKS 93
           G+   +DLT EE ++
Sbjct: 89  GLTRFADLTNEEFRA 103


>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
          Length = 443

 Score = 42.0 bits (97), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96

Query: 94 R 94
          R
Sbjct: 97 R 97


>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
          Length = 444

 Score = 42.0 bits (97), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96

Query: 94 R 94
          R
Sbjct: 97 R 97


>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
          Length = 371

 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 45/83 (54%), Gaps = 5/83 (6%)

Query: 20  NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
           +N+L+     H   F  F++ F KSY   +E A R +VF+DNL+     ++    +A +G
Sbjct: 37  DNDLELNAESH---FLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARR-HQLLDPSAEHG 92

Query: 80  INHLSDLTREEM-KSRLGLNLSK 101
           +   SDLT  E  ++ LGL  S+
Sbjct: 93  VTKFSDLTPAEFRRTYLGLRKSR 115


>sp|P08176|PEPT1_DERPT Peptidase 1 OS=Dermatophagoides pteronyssinus GN=DERP1 PE=1 SV=2
          Length = 320

 Score = 40.4 bits (93), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 8/68 (11%)

Query: 28 PEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
          P  +K FE++ + F+KSY T E+       F +++K ++  N G        INHLSDL+
Sbjct: 20 PSSIKTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQS-NGG-------AINHLSDLS 71

Query: 88 REEMKSRL 95
           +E K+R 
Sbjct: 72 LDEFKNRF 79


>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
           SV=1
          Length = 323

 Score = 40.4 bits (93), Expect = 0.003,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 3/71 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--GINHLSDLTREE 90
           +E F   + + Y   EE + R  +FE N K IE+ NK  E+G  T+   +N   D+T EE
Sbjct: 20  WEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEE 79

Query: 91  MKSRLGLNLSK 101
             + +  N+ +
Sbjct: 80  FNAVMKGNIPR 90


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.311    0.128    0.347 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 35,710,904
Number of Sequences: 539616
Number of extensions: 1306554
Number of successful extensions: 4164
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 60
Number of HSP's successfully gapped in prelim test: 60
Number of HSP's that attempted gapping in prelim test: 4070
Number of HSP's gapped (non-prelim): 124
length of query: 102
length of database: 191,569,459
effective HSP length: 71
effective length of query: 31
effective length of database: 153,256,723
effective search space: 4750958413
effective search space used: 4750958413
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 55 (25.8 bits)