BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy18108
(102 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
GN=CG12163 PE=2 SV=2
Length = 614
Score = 56.6 bits (135), Expect = 5e-08, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF F + Y + E R +F NLK IE+LN E G+A YGI +D+T
Sbjct: 305 DHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 362
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 363 SEYKERTGL 371
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 52.8 bits (125), Expect = 6e-07, Method: Composition-based stats.
Identities = 21/64 (32%), Positives = 39/64 (60%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N +K+FE+++ ++ + Y +E +RF +F++N+K IE N + T GIN +D+
Sbjct: 30 NDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDM 89
Query: 87 TREE 90
T+ E
Sbjct: 90 TKSE 93
>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
Length = 363
Score = 52.8 bits (125), Expect = 6e-07, Method: Composition-based stats.
Identities = 36/83 (43%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+ N E N EH F F FSKSY TKEE RF VF+ NL + L++ T
Sbjct: 32 QVVDNEEDHLLNAEH--HFTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDPT 88
Query: 76 ATYGINHLSDLTREEMKSR-LGL 97
A +GI SDLT E + + LGL
Sbjct: 89 AEHGITKFSDLTASEFRRQFLGL 111
>sp|Q91GE3|CATV_NPVEP Viral cathepsin OS=Epiphyas postvittana nucleopolyhedrovirus
GN=VCATH PE=3 SV=1
Length = 323
Score = 52.0 bits (123), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/70 (37%), Positives = 44/70 (62%), Gaps = 3/70 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+R ++K Y ++ E +R+ +F+ NL I + K + TA Y IN SDL+++E +
Sbjct: 28 FEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDI--ITKNRNDTAVYKINKFSDLSKDETIA 85
Query: 94 RL-GLNLSKH 102
+ GL+L H
Sbjct: 86 KYTGLSLPLH 95
>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
polyhedrosis virus GN=VCATH PE=3 SV=1
Length = 356
Score = 52.0 bits (123), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 46/69 (66%), Gaps = 3/69 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI--EDLNKGEHGTATYGINHLSDLTREEM 91
FE F+ +++K+Y + E KR+++F+DNL I ++ N + TATY IN SDL++ E+
Sbjct: 56 FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115
Query: 92 KSRL-GLNL 99
++ GL++
Sbjct: 116 IAKFTGLSI 124
>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
virus GN=VCATH PE=3 SV=1
Length = 324
Score = 51.2 bits (121), Expect = 2e-06, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 43/67 (64%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ F+K+Y ++ E RF +F+ NL+ I + N+ + TA Y IN SDL++EE S
Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQND-STAQYEINKFSDLSKEEAIS 86
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 87 KYTGLSL 93
>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
Length = 484
Score = 51.2 bits (121), Expect = 2e-06, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
>sp|P41721|CATV_NPVBM Viral cathepsin OS=Bombyx mori nuclear polyhedrosis virus
GN=VCATH PE=1 SV=1
Length = 323
Score = 51.2 bits (121), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>sp|Q91CL9|CATV_NPVAP Viral cathepsin OS=Antheraea pernyi nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 324
Score = 50.4 bits (119), Expect = 3e-06, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 46/68 (67%), Gaps = 4/68 (5%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
FE+F+ F+K+Y ++ E +RF +F+ NL+ E +NK ++ T A Y IN SDL+++E
Sbjct: 28 FEEFLHKFNKNYSSESEKLRRFKIFQHNLE--EIINKNQNDTSAQYEINKFSDLSKDETI 85
Query: 93 SRL-GLNL 99
S+ GL+L
Sbjct: 86 SKYTGLSL 93
>sp|P25783|CATV_NPVAC Viral cathepsin OS=Autographa californica nuclear polyhedrosis
virus GN=VCATH PE=1 SV=1
Length = 323
Score = 50.4 bits (119), Expect = 3e-06, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 44/67 (65%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 49.7 bits (117), Expect = 5e-06, Method: Composition-based stats.
Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 4/74 (5%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
++E F F K Y EE + R +VF D LK I++ N + + G TY IN+ SDLT E
Sbjct: 19 EWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHE 78
Query: 90 E-MKSRLGLNLSKH 102
E + ++ G+ +H
Sbjct: 79 EVLATKTGMTRRRH 92
>sp|Q9TST1|CATW_FELCA Cathepsin W OS=Felis catus GN=CTSW PE=2 SV=2
Length = 374
Score = 49.7 bits (117), Expect = 5e-06, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LKQ F F +++SY EE A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKQAFTLFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGRLYG 104
>sp|P41715|CATV_NPVCF Viral cathepsin OS=Choristoneura fumiferana nuclear polyhedrosis
virus GN=Vcath PE=3 SV=1
Length = 324
Score = 49.3 bits (116), Expect = 7e-06, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 42/67 (62%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ F+KSY ++ E +RF +F NL+ I + N + TA Y IN +DL+++E S
Sbjct: 28 FEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHND-STAQYEINKFADLSKDETIS 86
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 87 KYTGLSL 93
>sp|P25780|PEPT1_EURMA Peptidase 1 OS=Euroglyphus maynei GN=EURM1 PE=1 SV=2
Length = 321
Score = 49.3 bits (116), Expect = 7e-06, Method: Composition-based stats.
Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 12/73 (16%)
Query: 28 PEHLKQFEKFIRDFSKSY--PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y P KEEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKTYATPEKEEVARK--NFLESLKYVES-NKG-------AINHLSD 69
Query: 86 LTREEMKSRLGLN 98
L+ +E K++ +N
Sbjct: 70 LSLDEFKNQFLMN 82
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 49.3 bits (116), Expect = 7e-06, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+KQFE+++ ++ + Y +E RF +F++N+ IE N + T GIN +D+T E
Sbjct: 34 MKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNE 93
Query: 91 MKSRL-GLNL 99
++ GL+L
Sbjct: 94 FVAQYTGLSL 103
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 48.9 bits (115), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 44/69 (63%), Gaps = 4/69 (5%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY-GINHLSDLTRE 89
++ FE +I +F K+Y T EE RF VF+DNLK I++ NK G + + G+N +DL+ E
Sbjct: 48 IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNK--KGKSYWLGLNEFADLSHE 105
Query: 90 EMKS-RLGL 97
E K LGL
Sbjct: 106 EFKKMYLGL 114
>sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana defective
polyhedrosis virus GN=Vcath PE=3 SV=1
Length = 324
Score = 48.9 bits (115), Expect = 9e-06, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 4/68 (5%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
FE F+ +F+K+Y +K E RF +F+ NL+ E +NK + T A Y IN SDL+++E
Sbjct: 28 FEDFLHNFNKNYSSKSEKLHRFKIFQHNLE--EIINKNLNDTSAQYEINKFSDLSKDETI 85
Query: 93 SRL-GLNL 99
S+ GL+L
Sbjct: 86 SKYTGLSL 93
>sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1
Length = 462
Score = 48.5 bits (114), Expect = 1e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE R VF N+ + + + GTA YGI SDLT EE +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
>sp|P56202|CATW_HUMAN Cathepsin W OS=Homo sapiens GN=CTSW PE=1 SV=2
Length = 376
Score = 47.8 bits (112), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>sp|Q9WGE0|CATV_NPVHC Viral cathepsin OS=Hyphantria cunea nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 324
Score = 47.8 bits (112), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ F+K Y ++ E +RF +F+ NL+ I N+ + TA Y IN SDL+++E S
Sbjct: 28 FEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQND-TTAQYEINKFSDLSKDETIS 86
Query: 94 R 94
+
Sbjct: 87 K 87
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 47.4 bits (111), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 9/78 (11%)
Query: 28 PEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
PEHL FE ++ + SK+Y + EE RF VF +NL I+ N E + G+
Sbjct: 38 PEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNN-EINSYWLGL 96
Query: 81 NHLSDLTREEMKSR-LGL 97
N +DLT EE K R LGL
Sbjct: 97 NEFADLTHEEFKGRYLGL 114
>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
Length = 467
Score = 47.0 bits (110), Expect = 3e-05, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>sp|Q8B9D5|CATV_NPVR1 Viral cathepsin OS=Rachiplusia ou multiple nucleopolyhedrovirus
(strain R1) GN=VCATH PE=3 SV=1
Length = 323
Score = 46.6 bits (109), Expect = 4e-05, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 43/67 (64%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K Y ++ E +RF +F+ NL E + K ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIIIKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 46.6 bits (109), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ N + N E +E ++ + KSY + E +RF +F++ L+ I++ N + +
Sbjct: 27 AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 87 GLNQFADLTDEEFRS 101
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 46.6 bits (109), Expect = 4e-05, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E L +E+++ + K+Y E +RF +F+DNLK IE+ N + + G+N SDLT
Sbjct: 36 EVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTA 95
Query: 89 EEMK-SRLGLNLSK 101
+E + S LG + K
Sbjct: 96 DEFQASYLGGKMEK 109
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 46.6 bits (109), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ N + N E +E ++ + KSY + E +RF +F++ L+ I++ N + +
Sbjct: 27 AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 87 GLNQFADLTDEEFRS 101
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 46.6 bits (109), Expect = 5e-05, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +DLT +
Sbjct: 55 HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 114 EFQRTKLG 121
>sp|Q8QLK1|CATV_NPVMC Viral cathepsin OS=Mamestra configurata nucleopolyhedrovirus
GN=VCATH PE=3 SV=1
Length = 337
Score = 46.6 bits (109), Expect = 5e-05, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 37/61 (60%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y +++E R+ +F N++ I N + +A Y IN +D+T+ E+ +
Sbjct: 40 FEKFISQYNKQYSSEDEKKYRYNIFRHNIESINAKNS-RNDSAVYKINRFADMTKNEVVN 98
Query: 94 R 94
R
Sbjct: 99 R 99
>sp|P16311|PEPT1_DERFA Peptidase 1 OS=Dermatophagoides farinae GN=DERF1 PE=1 SV=2
Length = 321
Score = 45.8 bits (107), Expect = 7e-05, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 70 LSLDEFKNR 78
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 45.4 bits (106), Expect = 1e-04, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F +F + KSY + EV KRF +F ++L+L+ N+ + + GIN +D++ EE +
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116
Query: 93 -SRLG 96
+RLG
Sbjct: 117 ATRLG 121
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1
PE=1 SV=2
Length = 322
Score = 45.4 bits (106), Expect = 1e-04, Method: Composition-based stats.
Identities = 29/74 (39%), Positives = 37/74 (50%), Gaps = 7/74 (9%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--G 79
L NP +E+F F + Y EE R VF DNL+ IE+ NK E G TY
Sbjct: 13 LAAANP----SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLA 68
Query: 80 INHLSDLTREEMKS 93
IN SD+T E+ +
Sbjct: 69 INQFSDMTNEKFNA 82
>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
SV=1
Length = 368
Score = 45.1 bits (105), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F R F K Y + EE RF+VF+ NL+ K + +AT+G+ SDLTR E +
Sbjct: 50 HFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLD-PSATHGVTQFSDLTRSEFR 108
Query: 93 SR 94
+
Sbjct: 109 KK 110
>sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei brucei PE=1 SV=1
Length = 450
Score = 44.7 bits (104), Expect = 2e-04, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus
GN=VCATH PE=3 SV=1
Length = 337
Score = 44.3 bits (103), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE FI +++K YP + RF +F+ NL+ I + NK + +A Y IN SDL++ E+ +
Sbjct: 32 FETFIINYNKQYPDTKTKNYRFKIFKQNLEDINEKNK-LNDSAIYNINKFSDLSKNELLT 90
Query: 94 R 94
+
Sbjct: 91 K 91
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 43.9 bits (102), Expect = 3e-04, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 2/64 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-K 92
F +F K Y + EE+ +RF +F DNLK+I N+ + + GIN +DLT +E K
Sbjct: 57 FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNR-KGLSYKLGINEFTDLTWDEFRK 115
Query: 93 SRLG 96
+LG
Sbjct: 116 HKLG 119
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 43.9 bits (102), Expect = 3e-04, Method: Composition-based stats.
Identities = 24/76 (31%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
S N+L T ++ FE ++ +K Y +E RF +F+DNLK I++ NK ++ +
Sbjct: 34 SQNDL-TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNK-KNNSYWL 91
Query: 79 GINHLSDLTREEMKSR 94
G+N +D++ +E K +
Sbjct: 92 GLNVFADMSNDEFKEK 107
>sp|P56203|CATW_MOUSE Cathepsin W OS=Mus musculus GN=Ctsw PE=2 SV=2
Length = 371
Score = 43.9 bits (102), Expect = 3e-04, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)
Query: 11 LALFGQMKSNNELKTE---NPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
L L GQ S++ L + P LK+ F+ F F++SY E +R ++F NL +
Sbjct: 13 LLLAGQGLSDSLLTKDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQ 72
Query: 67 DLNKGEHGTATYGINHLSDLTREEMKSRLG 96
L + + GTA +G SDLT EE G
Sbjct: 73 RLQQEDLGTAEFGETPFSDLTEEEFGQLYG 102
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 43.9 bits (102), Expect = 3e-04, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y + EE+ RF+VF++NL LI NK + + +N +DLT +
Sbjct: 55 HVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113
Query: 90 EMK 92
E +
Sbjct: 114 EFQ 116
>sp|Q9YWK4|CATV_NPVBS Viral cathepsin OS=Buzura suppressaria nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 331
Score = 43.9 bits (102), Expect = 3e-04, Method: Composition-based stats.
Identities = 23/67 (34%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ +++K Y E +RF++F+ L+ I N+ + +A Y IN +DL++ E+ S
Sbjct: 31 FETFLANYNKMYNDTSEKERRFSIFQQTLEEINYKNR-LNDSAVYQINKFADLSKNEIIS 89
Query: 94 RL-GLNL 99
+ GLN+
Sbjct: 90 KYTGLNM 96
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 43.5 bits (101), Expect = 4e-04, Method: Composition-based stats.
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 13/95 (13%)
Query: 11 LALFGQMK-----------SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFE 59
+ LFG M S ++L T ++ F ++ +K+Y +E RF +F+
Sbjct: 15 ICLFGHMSLSYCDFSIVGYSQDDL-TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFK 73
Query: 60 DNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
DNLK I++ NK +G G+N SDL+ +E K +
Sbjct: 74 DNLKYIDERNKMINGY-WLGLNEFSDLSNDEFKEK 107
>sp|Q9J8B9|CATV_NPVSE Viral cathepsin OS=Spodoptera exigua nuclear polyhedrosis virus
(strain US) GN=VCATH PE=3 SV=1
Length = 337
Score = 43.1 bits (100), Expect = 4e-04, Method: Composition-based stats.
Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 9/74 (12%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y +++E R+ +F N++ I N + +A Y IN +D+ + E+
Sbjct: 40 FEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQKNS-RNDSAVYKINRFADMPKNEIVI 98
Query: 94 R--------LGLNL 99
R LGLN
Sbjct: 99 RHTGLASGELGLNF 112
>sp|Q8V5U0|CATV_NPVHZ Viral cathepsin OS=Heliothis zea nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 367
Score = 43.1 bits (100), Expect = 5e-04, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 14/83 (16%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-----------HGTATYGINH 82
F+ F++ ++KSY +E R+ VF+DNL I N+ +A +G+N
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116
Query: 83 LSDLTREE-MKSRLG--LNLSKH 102
SD T +E + S G LNLS+H
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQH 139
>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1
SV=2
Length = 343
Score = 42.7 bits (99), Expect = 6e-04, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 4/68 (5%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK---GEHGTATYGINHLSD 85
E QF +F F+K Y + EE +RF +F+ NL IE+LN +G+N +D
Sbjct: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFAD 82
Query: 86 LTREEMKS 93
L+ +E K+
Sbjct: 83 LSSDEFKN 90
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 42.7 bits (99), Expect = 7e-04, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H +F +F + KSY + EV +RF +F ++L+ + N+ + GIN SD++ E
Sbjct: 57 HALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNR-KGLPYRLGINRFSDMSWE 115
Query: 90 EMK-SRLG 96
E + +RLG
Sbjct: 116 EFQATRLG 123
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 42.4 bits (98), Expect = 9e-04, Method: Composition-based stats.
Identities = 21/75 (28%), Positives = 40/75 (53%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ E++ E +E+++ + K+Y E +RF +F+DNLK +++ N T
Sbjct: 29 TETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEV 88
Query: 79 GINHLSDLTREEMKS 93
G+ +DLT EE ++
Sbjct: 89 GLTRFADLTNEEFRA 103
>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
Length = 443
Score = 42.0 bits (97), Expect = 0.001, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96
Query: 94 R 94
R
Sbjct: 97 R 97
>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
Length = 444
Score = 42.0 bits (97), Expect = 0.001, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96
Query: 94 R 94
R
Sbjct: 97 R 97
>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
Length = 371
Score = 41.6 bits (96), Expect = 0.002, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 45/83 (54%), Gaps = 5/83 (6%)
Query: 20 NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
+N+L+ H F F++ F KSY +E A R +VF+DNL+ ++ +A +G
Sbjct: 37 DNDLELNAESH---FLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARR-HQLLDPSAEHG 92
Query: 80 INHLSDLTREEM-KSRLGLNLSK 101
+ SDLT E ++ LGL S+
Sbjct: 93 VTKFSDLTPAEFRRTYLGLRKSR 115
>sp|P08176|PEPT1_DERPT Peptidase 1 OS=Dermatophagoides pteronyssinus GN=DERP1 PE=1 SV=2
Length = 320
Score = 40.4 bits (93), Expect = 0.003, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 8/68 (11%)
Query: 28 PEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
P +K FE++ + F+KSY T E+ F +++K ++ N G INHLSDL+
Sbjct: 20 PSSIKTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQS-NGG-------AINHLSDLS 71
Query: 88 REEMKSRL 95
+E K+R
Sbjct: 72 LDEFKNRF 79
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 40.4 bits (93), Expect = 0.003, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 3/71 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--GINHLSDLTREE 90
+E F + + Y EE + R +FE N K IE+ NK E+G T+ +N D+T EE
Sbjct: 20 WEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEE 79
Query: 91 MKSRLGLNLSK 101
+ + N+ +
Sbjct: 80 FNAVMKGNIPR 90
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.311 0.128 0.347
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 35,710,904
Number of Sequences: 539616
Number of extensions: 1306554
Number of successful extensions: 4164
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 60
Number of HSP's successfully gapped in prelim test: 60
Number of HSP's that attempted gapping in prelim test: 4070
Number of HSP's gapped (non-prelim): 124
length of query: 102
length of database: 191,569,459
effective HSP length: 71
effective length of query: 31
effective length of database: 153,256,723
effective search space: 4750958413
effective search space used: 4750958413
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 55 (25.8 bits)