BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy18108
(102 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
Length = 434
Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 50/80 (62%)
Query: 18 KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
K +NE+ +N L+ F+ F+ F+K Y +KEE KRF +F N+K I LNK E GTA
Sbjct: 118 KIDNEIINKNEYLLQSFKDFVLKFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQ 177
Query: 78 YGINHLSDLTREEMKSRLGL 97
YGI SDL+ E K+ LGL
Sbjct: 178 YGITEFSDLSVTEFKNYLGL 197
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 12/95 (12%)
Query: 15 GQMKSNNELKTEN--------PEHLKQ---FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
GQ +S L+ +N LK+ F +F+ + K Y KEE RF +F+DNL
Sbjct: 701 GQNRSKRSLRGQNYSQKMLQQSRQLKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLN 760
Query: 64 LIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
LIE+L + E GT YG+ +DLT+ E K+R LGL
Sbjct: 761 LIEELQRNEMGTGRYGVTQFTDLTKAEFKARHLGL 795
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 30/75 (40%), Positives = 47/75 (62%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
S ++ E+ E L QF++F+ ++K Y ++EEV +R +F +NLK E L + G+A Y
Sbjct: 162 STSQPLEESVELLGQFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEY 221
Query: 79 GINHLSDLTREEMKS 93
G+ SDLT EE +S
Sbjct: 222 GVTKFSDLTEEEFRS 236
>gi|17543258|ref|NP_502836.1| Protein Y40H7A.10 [Caenorhabditis elegans]
gi|3880920|emb|CAA22062.1| Protein Y40H7A.10 [Caenorhabditis elegans]
Length = 343
Score = 64.3 bits (155), Expect = 9e-09, Method: Composition-based stats.
Identities = 29/80 (36%), Positives = 45/80 (56%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+ + + T + ++ F+ F+ + + YP + E+ KRF +F NL L+E NK + G
Sbjct: 33 QILQRHHIPTPDVKYTNAFQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGK 92
Query: 76 ATYGINHLSDLTREEMKSRL 95
TY +N SDLT EE K L
Sbjct: 93 VTYELNDFSDLTEEEWKKYL 112
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F++ ++K+Y + +E A R+ VF NLK+IE L K E GTA YG+ +DLT EE K+
Sbjct: 579 FEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYGVTMFADLTPEEFKT 638
Query: 94 R-LGL 97
+ LGL
Sbjct: 639 KYLGL 643
>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 389
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 36/87 (41%), Positives = 49/87 (56%), Gaps = 2/87 (2%)
Query: 10 TLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDL 68
++A+ Q N K N +KQ F KF + K Y EE +RF +F NL +I +L
Sbjct: 15 SMAIINQNFHYNSTKQLNLTQVKQLFSKFKAEHKKFYNFLEE-QRRFEIFRQNLDIISEL 73
Query: 69 NKGEHGTATYGINHLSDLTREEMKSRL 95
N+ E GTA YGI SD+T EE KS++
Sbjct: 74 NQVEEGTAEYGITQFSDMTTEEFKSQI 100
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EHL F KF F ++Y T E RF +F+ NL+LIE+LN+ E G+A YGI +D+T
Sbjct: 163 EHL--FAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITEFADMTS 220
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 221 PEYKQRTGL 229
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 32/65 (49%), Positives = 44/65 (67%), Gaps = 1/65 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE FI+ F K+Y + +E RF +F+ NLK+IE+L E GTA YG+ +DLT +E K+
Sbjct: 579 FEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFADLTPKEFKA 638
Query: 94 R-LGL 97
R LGL
Sbjct: 639 RYLGL 643
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EHL F KF F ++Y T E RF +F+ NL+LIE+LN+ E G+A YGI +D+T
Sbjct: 163 EHL--FAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITEFADMTS 220
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 221 PEYKQRTGL 229
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/75 (37%), Positives = 46/75 (61%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+N++ E+ + L QF+ F+ + K Y ++EE +R +F++NLK E L + G+A Y
Sbjct: 160 TNSQPVEESVQLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEY 219
Query: 79 GINHLSDLTREEMKS 93
G+ SDLT EE +S
Sbjct: 220 GVTKFSDLTEEEFRS 234
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 8/86 (9%)
Query: 16 QMKSNNELK--------TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED 67
Q+K NE++ E+ E L QF++F+ ++K Y +++E +R ++F +NLK E
Sbjct: 151 QVKETNEVEDLSINPPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEK 210
Query: 68 LNKGEHGTATYGINHLSDLTREEMKS 93
L + G+A YG+ SDLT EE +S
Sbjct: 211 LQSLDQGSAEYGVTKFSDLTEEEFRS 236
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 8/86 (9%)
Query: 16 QMKSNNELK--------TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED 67
Q+K NE++ E+ E L QF++F+ ++K Y +++E +R ++F +NLK E
Sbjct: 151 QVKETNEVEDLSINPPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEK 210
Query: 68 LNKGEHGTATYGINHLSDLTREEMKS 93
L + G+A YG+ SDLT EE +S
Sbjct: 211 LQSLDQGSAEYGVTKFSDLTEEEFRS 236
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 42/60 (70%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF++F++ F K Y +++E+ KR+ +F+ N+K +E L K E GTA YG+ +DLT EE +
Sbjct: 195 QFKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFR 254
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 33/72 (45%), Positives = 43/72 (59%), Gaps = 2/72 (2%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
++ HL FEKF S+ Y + E RF +F++NL IE LNK E GTA YGI H +D
Sbjct: 852 DHARHL--FEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFAD 909
Query: 86 LTREEMKSRLGL 97
+T E + R GL
Sbjct: 910 MTSAEYRQRTGL 921
>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
Length = 474
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/72 (38%), Positives = 45/72 (62%)
Query: 22 ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
E ++ E L QF++F+ ++++Y ++EE +R VF +NLK E L + GTA YG+
Sbjct: 164 EESVDSVELLGQFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVT 223
Query: 82 HLSDLTREEMKS 93
SDLT EE ++
Sbjct: 224 KFSDLTEEEFRT 235
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 49/77 (63%), Gaps = 3/77 (3%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P++ ++ +E+F RD+ K+Y E+ KRFA+F+DNL + E GTA YG+ SDL
Sbjct: 25 PDNARELYEQFKRDYGKAY-ANEDDQKRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDL 83
Query: 87 TREEMKSR-LGLNLSKH 102
T EE +++ LGL + +
Sbjct: 84 TPEEFEAKYLGLRIDEQ 100
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 1/99 (1%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
A T A + +S LK ++ H+++ F+KF + Y + E RF +F +NL
Sbjct: 613 APTPVTTAPAVKRRSVRSLKIDDDAHVRRMFDKFRHHHRRQYASSMEHEMRFNIFRNNLF 672
Query: 64 LIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLSKH 102
IE LNK E GTA YG+ +D+T E ++ GL + KH
Sbjct: 673 KIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVPKH 711
>gi|308454071|ref|XP_003089699.1| hypothetical protein CRE_27946 [Caenorhabditis remanei]
gi|308269278|gb|EFP13231.1| hypothetical protein CRE_27946 [Caenorhabditis remanei]
Length = 316
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 44/75 (58%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
+ + T + ++ F+ F+ + + Y T++E+ KRF +F N+ L+E NK + G TY +
Sbjct: 10 HHIPTPDAKYTNAFQDFLVKYLRKYKTEDELVKRFTIFSRNMDLVERFNKEDLGKVTYEL 69
Query: 81 NHLSDLTREEMKSRL 95
N SDL+ EE K L
Sbjct: 70 NDFSDLSDEEWKKFL 84
>gi|268534724|ref|XP_002632495.1| Hypothetical protein CBG13738 [Caenorhabditis briggsae]
Length = 341
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 45/75 (60%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
+++ T + ++ + F+ F+ + + Y ++EE+ KRF +F N+ L+E NK G TY +
Sbjct: 37 HQIPTPDVKYTEAFQNFLVKYLREYKSEEEIVKRFTIFSRNMDLVERYNKEGAGKVTYEL 96
Query: 81 NHLSDLTREEMKSRL 95
N SDL+ EE K L
Sbjct: 97 NDFSDLSDEEWKQFL 111
>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
Length = 881
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 28/69 (40%), Positives = 44/69 (63%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
+N ++ FE FI F+K++ + E RF +F+ NLK+I++L E GTA YG+ +D
Sbjct: 568 QNIKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVTMFAD 627
Query: 86 LTREEMKSR 94
LT +E K+R
Sbjct: 628 LTPKEFKTR 636
>gi|341879557|gb|EGT35492.1| hypothetical protein CAEBREN_11857 [Caenorhabditis brenneri]
Length = 340
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 27/75 (36%), Positives = 44/75 (58%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
+++ T + ++ F+ F+ + + Y ++EE+ KRF +F N L+E NK + G TY +
Sbjct: 36 HQIPTPDAKYTNAFQDFLVKYMREYKSEEEMVKRFTIFSRNADLVERYNKEDAGKVTYEL 95
Query: 81 NHLSDLTREEMKSRL 95
N SDLT EE K L
Sbjct: 96 NDFSDLTDEEWKQFL 110
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 1/62 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+F RD+ KSY ++ KRFA+F+DNL ++ E GTA YG+ SDLT EE +
Sbjct: 32 YEQFKRDYGKSYANDDD-EKRFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTPEEFAA 90
Query: 94 RL 95
+
Sbjct: 91 KF 92
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 1/99 (1%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
A T A + +S LK ++ H+++ F+KF + Y + E RF +F +NL
Sbjct: 1494 APTPVTTAPAVKRRSVRSLKIDDDAHVRRMFDKFRHHHRRQYASSMEHEMRFNIFRNNLF 1553
Query: 64 LIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLSKH 102
IE LNK E GTA YG+ +D+T E ++ GL + KH
Sbjct: 1554 KIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVPKH 1592
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 1/99 (1%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
A T A + +S LK ++ H+++ F+KF + Y + E RF +F +NL
Sbjct: 1470 APTPVTTAPAVKRRSVRSLKIDDDAHVRRMFDKFRHHHRRQYASSMEHEMRFNIFRNNLF 1529
Query: 64 LIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLSKH 102
IE LNK E GTA YG+ +D+T E ++ GL + KH
Sbjct: 1530 KIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVPKH 1568
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 42/67 (62%), Gaps = 2/67 (2%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P++ ++ +E+F RD+ K+Y E+ KRFA+F+DNL + E GTA YG+ SDL
Sbjct: 25 PDNARELYEQFKRDYGKAY-ANEDDQKRFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDL 83
Query: 87 TREEMKS 93
T EE +
Sbjct: 84 TNEEFAA 90
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 58.9 bits (141), Expect = 3e-07, Method: Composition-based stats.
Identities = 28/78 (35%), Positives = 46/78 (58%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q ++EL+ E + L F+ F+ ++K Y +EE A+R +F NLK + + + + GT
Sbjct: 148 QNVPSSELEDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQGT 207
Query: 76 ATYGINHLSDLTREEMKS 93
A YG+ SDLT +E +S
Sbjct: 208 AEYGVTKYSDLTEDEFRS 225
>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
Length = 350
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 41/68 (60%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F + K Y T +E+ +RF +F +NL+LIE NK G T G+NH +D T E
Sbjct: 47 HAVSFARFANRYGKRYDTVDEMKRRFKIFSENLQLIESTNKKRLGY-TLGVNHFADWTWE 105
Query: 90 EMKS-RLG 96
E +S RLG
Sbjct: 106 EFRSHRLG 113
>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
Length = 610
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 32/80 (40%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 18 KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
K +N + EHL F KF F + Y E R +F NL++IE LN E G+A
Sbjct: 289 KKHNHHSLDKVEHL--FHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAK 346
Query: 78 YGINHLSDLTREEMKSRLGL 97
YGI +D+T E K R GL
Sbjct: 347 YGITEFADMTSTEYKERTGL 366
>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
mellifera]
Length = 881
Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 39/61 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE FI F+K++ + E RF +F+ NLK+I +L E GTA YG+ +DLT +E K+
Sbjct: 576 FEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVTMFADLTPKEFKT 635
Query: 94 R 94
R
Sbjct: 636 R 636
>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
Length = 620
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EHL F KF F + Y + E R +F NLK IE+LN E G+A YGI +D+T
Sbjct: 311 EHL--FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 368
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 369 TEYKERTGL 377
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+KF +++Y + E RF +F++NL IE LNK E GTA YGI H +D+T E ++
Sbjct: 1146 FDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADMTSAEYRA 1205
Query: 94 RLGL 97
R GL
Sbjct: 1206 RTGL 1209
>gi|308447426|ref|XP_003087427.1| hypothetical protein CRE_22755 [Caenorhabditis remanei]
gi|308256596|gb|EFP00549.1| hypothetical protein CRE_22755 [Caenorhabditis remanei]
Length = 324
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 25/75 (33%), Positives = 45/75 (60%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
+ + T + ++ F+ F+ + + Y T++E+ KRF +F N+ L+E NK + G TY +
Sbjct: 18 HHIPTPDAKYTNAFQDFLVKYLREYKTEDELVKRFTIFSRNMDLVETYNKEDLGKVTYEL 77
Query: 81 NHLSDLTREEMKSRL 95
N SDL+ +E K+ L
Sbjct: 78 NDFSDLSDKEWKTFL 92
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 42/67 (62%), Gaps = 2/67 (2%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P++ ++ +E+F RD+ K+Y E+ KRFA+F+DNL + E GTA YG+ SDL
Sbjct: 25 PDNARELYEQFKRDYGKAY-ANEDDQKRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDL 83
Query: 87 TREEMKS 93
T EE +
Sbjct: 84 TPEEFAA 90
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 41/67 (61%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+ E L F+ F+ ++++Y ++EE KR +F+ N+K + L E G+A YGI SD
Sbjct: 167 ESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSD 226
Query: 86 LTREEMK 92
LT +E +
Sbjct: 227 LTEDEFR 233
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 41/67 (61%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+ E L F+ F+ ++++Y ++EE KR +F+ N+K + L E G+A YGI SD
Sbjct: 167 ESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSD 226
Query: 86 LTREEMK 92
LT +E +
Sbjct: 227 LTEDEFR 233
>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
Length = 460
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+KF+R ++++Y +KEE R +VF N+ + + + GTA YGI SDLT EE ++
Sbjct: 163 FKKFVRTYNRTYESKEEAQWRLSVFASNMVRAQKIQSLDRGTAQYGITKFSDLTEEEFRT 222
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 25/65 (38%), Positives = 42/65 (64%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E L QF++F+ ++++Y ++E+ +R +F +NLK E L + GTA YG+ SDLT
Sbjct: 172 ELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGVTKFSDLTE 231
Query: 89 EEMKS 93
EE ++
Sbjct: 232 EEFRT 236
>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
Length = 615
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF F + Y + E R +F NLK IE+LN E G+A YGI +DLT
Sbjct: 306 DHL--FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADLTS 363
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 364 SEYKERTGL 372
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 57.8 bits (138), Expect = 9e-07, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+F RD+ K Y E+ KRFA+F+DNL + L + GTA YG+ SDLT EE +
Sbjct: 27 YEQFKRDYGKVY-ANEDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85
Query: 94 R 94
+
Sbjct: 86 K 86
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 57.4 bits (137), Expect = 9e-07, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+F RD+ K Y E+ KRFA+F+DNL + L + GTA YG+ SDLT EE +
Sbjct: 27 YEQFKRDYGKVY-ANEDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85
Query: 94 R 94
+
Sbjct: 86 K 86
>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
Length = 320
Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L FG + SN ++EN L +E+F + KSY ++ RF VF+DNL I+
Sbjct: 11 LGFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQN 66
Query: 71 GEHGTATYGINHLSDLTREEMKSR 94
E GTA YG+ SDLT +E K R
Sbjct: 67 MERGTAKYGVTQFSDLTAQEFKVR 90
>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
Length = 327
Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L FG + SN ++EN L +E+F + KSY ++ RF VF+DNL I+
Sbjct: 11 LGFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQN 66
Query: 71 GEHGTATYGINHLSDLTREEMKSR 94
E GTA YG+ SDLT +E K R
Sbjct: 67 MERGTAKYGVTQFSDLTAQEFKVR 90
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 43/68 (63%), Gaps = 2/68 (2%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P++ ++ +E+F RD+ K Y ++ KRFA+F+DNL + L + GTA YG+ SDL
Sbjct: 25 PDNARELYEQFKRDYGKVYANDDD-QKRFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDL 83
Query: 87 TREEMKSR 94
T EE ++
Sbjct: 84 TPEEFAAK 91
>gi|322801532|gb|EFZ22193.1| hypothetical protein SINV_14496 [Solenopsis invicta]
Length = 781
Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 42/65 (64%), Gaps = 1/65 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y + +E R +F +NL +IE L K E T YG+N +D++REE ++
Sbjct: 524 FDDFVATYNRTYSSPDERNLRLQIFRENLGIIELLQKTEQATGRYGVNMFADMSREEFRT 583
Query: 94 R-LGL 97
R LGL
Sbjct: 584 RYLGL 588
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+F RD+ K Y E+ KRFA+F+DNL + L + GTA YG+ SDLT EE +
Sbjct: 27 YEQFKRDYGKVY-ANEDDQKRFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTPEEFAA 85
Query: 94 R 94
+
Sbjct: 86 K 86
>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
Length = 327
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L FG + SN ++EN L +E+F + KSY ++ RF VF+DNL I+
Sbjct: 11 LGFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQN 66
Query: 71 GEHGTATYGINHLSDLTREEMKSR 94
E GTA YG+ SDLT +E K R
Sbjct: 67 MERGTAKYGVTQFSDLTAQEFKVR 90
>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
Length = 327
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L FG + SN ++EN L +E+F + KSY ++ RF VF+DNL I+
Sbjct: 11 LGFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQN 66
Query: 71 GEHGTATYGINHLSDLTREEMKSR 94
E GTA YG+ SDLT +E K R
Sbjct: 67 MERGTAKYGVTQFSDLTAQEFKVR 90
>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
Length = 331
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 38/64 (59%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F++ + KSY + EE +RFA+F NL LN G +GI +D+++EE
Sbjct: 32 EQFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEF 91
Query: 92 KSRL 95
+SR+
Sbjct: 92 QSRV 95
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 29/62 (46%), Positives = 34/62 (54%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F FI K Y K EV KRF VF+ N K+I +L K E GTA YG SD+T E K
Sbjct: 176 FLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKE 235
Query: 94 RL 95
+
Sbjct: 236 TM 237
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 31/72 (43%), Positives = 44/72 (61%), Gaps = 3/72 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F R ++K+Y KEE RF +F++NLK I N+ E GTA YG+ SDL+ E +
Sbjct: 34 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 93
Query: 94 R-LGL--NLSKH 102
LGL +L++H
Sbjct: 94 HYLGLKKDLAEH 105
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 42/67 (62%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+ E L F++F+ ++K Y ++EE +R +F++NLK E + + G+A YG+ SD
Sbjct: 170 ESVELLGLFKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSD 229
Query: 86 LTREEMK 92
LT EE +
Sbjct: 230 LTEEEFR 236
>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 477
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF F + Y E R +F NLK IE+LN E G+A YGI +D+T
Sbjct: 168 DHL--FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTS 225
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 226 TEYKERTGL 234
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 29/62 (46%), Positives = 34/62 (54%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F FI K Y K EV KRF VF+ N K+I +L K E GTA YG SD+T E K
Sbjct: 176 FLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKE 235
Query: 94 RL 95
+
Sbjct: 236 TM 237
>gi|308465858|ref|XP_003095186.1| hypothetical protein CRE_22071 [Caenorhabditis remanei]
gi|308246042|gb|EFO89994.1| hypothetical protein CRE_22071 [Caenorhabditis remanei]
Length = 326
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/75 (33%), Positives = 43/75 (57%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
+ + T + ++ F+ F+ + + Y T++E+ RF +F N+ L+E NK + G TY +
Sbjct: 20 HHIPTPDAKYTNAFQDFLVKYLREYKTEDELVMRFTIFSRNMDLVERYNKEDLGKVTYEL 79
Query: 81 NHLSDLTREEMKSRL 95
N SDL+ EE K L
Sbjct: 80 NDFSDLSDEEWKKFL 94
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 33/59 (55%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F+ K Y K EV KRF VF+ N K+I +L K E GTA YG SD+T E K
Sbjct: 174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFK 232
>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
Precursor
gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
Length = 614
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF F + Y + E R +F NLK IE+LN E G+A YGI +D+T
Sbjct: 305 DHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 362
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 363 SEYKERTGL 371
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+F RD+ K Y E+ KRFA+F+DNL + L + GTA YG+ SDLT EE +
Sbjct: 27 YEQFKRDYGKVY-ANEDDQKRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85
Query: 94 R 94
+
Sbjct: 86 K 86
>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
Length = 597
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y TKEE R +VF N+ + + +HGTA YG+ SDLT EE ++
Sbjct: 300 FKNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQALDHGTAQYGVTKFSDLTEEEFRT 359
>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
Length = 615
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 31/80 (38%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 18 KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
K ++ + +HL F KF F + Y + E R +F NLK IE LN E G+A
Sbjct: 295 KKHSHRALDKADHL--FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAK 352
Query: 78 YGINHLSDLTREEMKSRLGL 97
YGI +D+T E K R GL
Sbjct: 353 YGITEFADMTSSEYKERTGL 372
>gi|146386354|gb|ABQ23965.1| cathepsin F [Oryctolagus cuniculus]
Length = 248
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+KF+R ++++Y +KEE R +VF N+ + + + GTA YGI SDLT EE ++
Sbjct: 91 FKKFVRTYNRTYESKEEAQWRLSVFASNMVRAQKIQSLDRGTAQYGITKFSDLTEEEFRT 150
>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
Length = 615
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF F + Y + E R +F NLK IE+LN E G+A YGI +D+T
Sbjct: 306 DHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 363
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 364 SEYKERTGL 372
>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 629
Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF F + Y E R +F NLK IE+LN E G+A YGI +D+T
Sbjct: 320 DHL--FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTS 377
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 378 TEYKERTGL 386
>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
Length = 627
Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF F + Y E R +F NLK IE+LN E G+A YGI +D+T
Sbjct: 318 DHL--FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTS 375
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 376 TEYKERTGL 384
>gi|330842703|ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
gi|325076376|gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
Length = 352
Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats.
Identities = 29/74 (39%), Positives = 41/74 (55%)
Query: 20 NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
NN +T + F + + K Y T EE KRF+ F+ NLK IE+LN G A++G
Sbjct: 25 NNAYRTIDGPSKDLFHHWTKQNGKIYETSEEFEKRFSNFKTNLKKIENLNNLHKGKASFG 84
Query: 80 INHLSDLTREEMKS 93
+N SDL+ EE +
Sbjct: 85 MNKYSDLSEEEFSN 98
>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
Length = 475
Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF F + Y + E R +F NLK IE+LN E G+A YGI +D+T
Sbjct: 166 DHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 223
Query: 89 EEMKSRLGL 97
E K R GL
Sbjct: 224 SEYKERTGL 232
>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
Length = 473
Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y TKEE R +VF +N+ + L + GTA YGI SDLT EE ++
Sbjct: 176 FKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGITKFSDLTEEEFRT 235
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
pulchellus]
Length = 475
Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats.
Identities = 31/72 (43%), Positives = 44/72 (61%), Gaps = 3/72 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F R ++K+Y KEE RF +F++NLK I N+ E GTA YG+ SDL+ E +
Sbjct: 166 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 225
Query: 94 R-LGL--NLSKH 102
LGL +L++H
Sbjct: 226 HYLGLKKDLAEH 237
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats.
Identities = 26/65 (40%), Positives = 41/65 (63%), Gaps = 1/65 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ ++++Y T+EE R ++F +NL +I L K E GT YG+N +D++ EE +
Sbjct: 727 FENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQYGVNQFADVSTEEFHA 786
Query: 94 -RLGL 97
LGL
Sbjct: 787 FYLGL 791
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 42/68 (61%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F + KSY T EE+ +RF++F D+LK+I NK + + T G+N +DLT E
Sbjct: 56 HSLAFARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNK-KGLSYTLGVNEFADLTWE 114
Query: 90 EM-KSRLG 96
E K RLG
Sbjct: 115 EFRKHRLG 122
>gi|38048171|gb|AAR09988.1| similar to Drosophila melanogaster CG12163, partial [Drosophila
yakuba]
Length = 213
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 38/72 (52%), Gaps = 2/72 (2%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
+ +HL F KF F + Y + E R +F NLK IE LN E G+A YGI +D
Sbjct: 31 DKADHL--FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFAD 88
Query: 86 LTREEMKSRLGL 97
+T E K R GL
Sbjct: 89 MTSSEYKERTGL 100
>gi|348564702|ref|XP_003468143.1| PREDICTED: cathepsin F-like [Cavia porcellus]
Length = 462
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+KF+ ++++Y +KEE R +VF N+ L + + + GTA YG+ SDLT EE ++
Sbjct: 165 FKKFVATYNRTYESKEETQWRLSVFTRNMILAQKIQALDRGTAQYGVTKFSDLTEEEFRT 224
>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
Length = 459
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 29/60 (48%), Positives = 34/60 (56%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F+ K Y +K + KRF VF+ NLK I + E GTA YGI SDLT EE K
Sbjct: 156 QFVDFMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPEEFK 215
>gi|407859260|gb|EKG06954.1| cysteine protease, putative [Trypanosoma cruzi]
Length = 422
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 25 TENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
T N ++L Q FEK+I DF K Y EE KR A+F++NL + N + GIN
Sbjct: 16 TSNEDYLAQYTFEKYIADFGKRYADPEEHRKRAAIFKENLAEVRAFNGVLGRSYRLGINK 75
Query: 83 LSDLTREEMKSRL 95
SD+T+EE ++
Sbjct: 76 FSDMTKEEFNAKF 88
>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
Length = 336
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 41/59 (69%), Gaps = 2/59 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
FE FIR+++K Y +KE+ +RF +F +NLK I DLN + A +GIN +DL++EE K
Sbjct: 41 FENFIREYNKKYDSKEK-EERFKIFVNNLKRINDLNH-KSTNAVHGINKFTDLSKEEFK 97
>gi|305434754|gb|ADM53739.1| cathepsin L2 precursor [Lepeophtheirus salmonis]
Length = 382
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 29/82 (35%), Positives = 44/82 (53%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
LFG + +++FE F++++SKSY + + + VF DNL+ IE+ N
Sbjct: 15 LFGLAALAAGTSSPTQREIQEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHNANP 74
Query: 73 HGTATYGINHLSDLTREEMKSR 94
T GIN SDLT EE +S+
Sbjct: 75 KRTWDMGINEFSDLTDEEFESK 96
>gi|47212989|emb|CAF92720.1| unnamed protein product [Tetraodon nigroviridis]
Length = 142
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 40/64 (62%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E L QF++F+ +SK Y ++EE R +F++NLK E + + G+A YGI SDLT
Sbjct: 2 ELLGQFKEFMMKYSKVYNSQEEADHRLKIFKENLKTAEKIQSLDEGSAEYGITKFSDLTE 61
Query: 89 EEMK 92
EE +
Sbjct: 62 EEFR 65
>gi|155966155|gb|ABU41032.1| cysteine proteinase [Lepeophtheirus salmonis]
Length = 372
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 29/82 (35%), Positives = 44/82 (53%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
LFG + +++FE F++++SKSY + + + VF DNL+ IE+ N
Sbjct: 6 LFGLAALAAGTSSPTQREIQEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHNANP 65
Query: 73 HGTATYGINHLSDLTREEMKSR 94
T GIN SDLT EE +S+
Sbjct: 66 KRTWDMGINEFSDLTDEEFESK 87
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 25/60 (41%), Positives = 34/60 (56%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F FI K Y + E KRF +F+ NL++I + + GTA YGIN +DL+ EE K
Sbjct: 63 HFTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEEFK 122
>gi|195997891|ref|XP_002108814.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
gi|190589590|gb|EDV29612.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
Length = 333
Score = 55.5 bits (132), Expect = 5e-06, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 3/66 (4%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L +F+ FI D++++Y TKEE RF F+ N + I N ATYG+N +D T EE
Sbjct: 33 LARFKSFITDYNRNYTTKEEHEFRFQTFKKNFRRIASTNA---NGATYGVNKFADWTDEE 89
Query: 91 MKSRLG 96
K LG
Sbjct: 90 FKELLG 95
>gi|71662527|ref|XP_818269.1| cysteine protease [Trypanosoma cruzi strain CL Brener]
gi|70883510|gb|EAN96418.1| cysteine protease, putative [Trypanosoma cruzi]
Length = 434
Score = 55.5 bits (132), Expect = 5e-06, Method: Composition-based stats.
Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 25 TENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
T + ++L Q FEK+I DF K Y EE KR A+F++NL + N + GIN
Sbjct: 28 TSDEDYLAQYTFEKYIADFGKRYADPEEHRKRAAIFKENLAKVRAFNGALGRSYRLGINK 87
Query: 83 LSDLTREEMKSRL 95
SD+T+EE ++
Sbjct: 88 FSDMTKEEFNAKF 100
>gi|351710879|gb|EHB13798.1| Cathepsin F [Heterocephalus glaber]
Length = 482
Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +K+E R +VF N+ L + + +HGTA YG+ SDLT EE ++
Sbjct: 185 FKNFVATYNRTYESKKEAQWRLSVFTRNMVLAQRIQALDHGTAQYGVTKFSDLTEEEFRT 244
>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
Length = 316
Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats.
Identities = 33/83 (39%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 12 ALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG 71
FG + SN ++EN L +E+F + KSY ++ RF VF+DNL I+
Sbjct: 1 GFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNM 56
Query: 72 EHGTATYGINHLSDLTREEMKSR 94
E GTA YG+ SDLT +E K R
Sbjct: 57 ERGTAKYGVTQFSDLTAQEFKVR 79
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 48/71 (67%), Gaps = 3/71 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI--EDLNKGEHGTATYGINHLSDLTREEM 91
FE F+ +++K+Y + E KR+++F+DNL I ++ N + TATYGIN SDL++ E+
Sbjct: 35 FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGINKFSDLSKSEL 94
Query: 92 KSRL-GLNLSK 101
++ GL++ +
Sbjct: 95 IAKFTGLSIPQ 105
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
L+TE E + +E+++ K Y E KRF +F+DNL+ I+D N E T G+N
Sbjct: 49 LRTEE-ELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNR 107
Query: 83 LSDLTREEMKSR 94
+DLT EE +++
Sbjct: 108 FADLTNEEYRAK 119
>gi|407424636|gb|EKF39072.1| cysteine protease, putative [Trypanosoma cruzi marinkellei]
Length = 438
Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats.
Identities = 30/73 (41%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 25 TENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
T N ++L Q FEK+I DF K Y EE KR A+F +NL I N + GIN
Sbjct: 32 TYNEDYLAQYTFEKYISDFGKRYADPEEHRKRNAIFNENLAKIRAFNGVLGRSYRLGINK 91
Query: 83 LSDLTREEMKSRL 95
SD+T+EE ++
Sbjct: 92 FSDMTKEEFNAKF 104
>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
Length = 358
Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats.
Identities = 30/71 (42%), Positives = 40/71 (56%), Gaps = 2/71 (2%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
+ H F +F + K Y T EE RFA+F +NLKLI NK + + T G+NH +D
Sbjct: 52 DSRHALSFARFAHRYGKRYETAEETKLRFAIFSENLKLIRSHNK-KGLSYTLGVNHFADW 110
Query: 87 TREEMKS-RLG 96
T EE + RLG
Sbjct: 111 TWEEFRRHRLG 121
>gi|242074968|ref|XP_002447420.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
gi|241938603|gb|EES11748.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
Length = 381
Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 39/64 (60%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++F ++ +SYPT EE +RF ++ DN+K IE +N+ T T G N +DLT +E
Sbjct: 56 MERFHAWMAAHGRSYPTAEEKLRRFQIYRDNVKFIEAINRDTTKTFTCGENQFTDLTHQE 115
Query: 91 MKSR 94
+R
Sbjct: 116 FLAR 119
>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
familiaris]
Length = 490
Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F+ ++++Y TKEE R +VF +N+ + + + GTA YGI SDLT EE ++
Sbjct: 192 FKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRT 251
>gi|308454069|ref|XP_003089698.1| hypothetical protein CRE_27947 [Caenorhabditis remanei]
gi|308269277|gb|EFP13230.1| hypothetical protein CRE_27947 [Caenorhabditis remanei]
Length = 243
Score = 55.1 bits (131), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 43/73 (58%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
+ T + ++ F+ F+ + + Y T++E+ KRF +F N+ L+E NK + G TY +N
Sbjct: 39 IPTPDAKYTNAFQDFLVKYLREYKTEDELVKRFTIFSRNMDLVERYNKEDLGKVTYELND 98
Query: 83 LSDLTREEMKSRL 95
SDL+ EE K L
Sbjct: 99 FSDLSDEEWKKFL 111
>gi|403293523|ref|XP_003937763.1| PREDICTED: cathepsin W [Saimiri boliviensis boliviensis]
Length = 373
Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats.
Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F R F++SY T EE A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKFFQRQFNRSYLTPEEHARRLDIFAHNLAQAQQLQEEDFGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 54.7 bits (130), Expect = 6e-06, Method: Composition-based stats.
Identities = 27/62 (43%), Positives = 33/62 (53%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F FI K Y K EV KRF F+ N K+I +L K E G+A YG SD+T E K
Sbjct: 174 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQGSAVYGFTKFSDMTTMEFKQ 233
Query: 94 RL 95
+
Sbjct: 234 TM 235
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKE-EVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F FI + Y E+ KRF +F++N+K I +LN E GT Y + +DLT EE K
Sbjct: 231 FFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRFTDLTYEEFK 290
Query: 93 SR-LGLN 98
S+ LGLN
Sbjct: 291 SKYLGLN 297
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats.
Identities = 28/62 (45%), Positives = 32/62 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F FI K Y K EV KRF F+ N K I +L K E GTA YG SD+T E K
Sbjct: 172 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQGTAVYGFTKFSDMTTMEFKQ 231
Query: 94 RL 95
+
Sbjct: 232 TM 233
>gi|152926446|gb|ABS32280.1| cathepsin L protease inhibitor 2 [Diaprepes abbreviatus]
Length = 91
Score = 54.3 bits (129), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 50/76 (65%), Gaps = 6/76 (7%)
Query: 25 TENPEHL---KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY-- 78
T+ P +L +++EKF F+++Y + +E AKRF +F+ NL+ I + N K E G T+
Sbjct: 5 TKAPSYLSDQEEWEKFKTGFNRNYDSSDEEAKRFNIFQQNLQSIREHNEKFERGETTFTQ 64
Query: 79 GINHLSDLTREEMKSR 94
GIN +DLT+EE K+R
Sbjct: 65 GINQFTDLTKEEFKAR 80
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats.
Identities = 26/61 (42%), Positives = 37/61 (60%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+F R + K Y E+ KRFA+F+DNL + L + GTA YG+ SDLT EE +
Sbjct: 32 YEQFKRGYGKVY-ANEDDQKRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 90
Query: 94 R 94
+
Sbjct: 91 K 91
>gi|114679921|ref|YP_758371.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
gi|39598652|gb|AAR28838.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
Length = 359
Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 47/75 (62%), Gaps = 4/75 (5%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P+ ++ FE+F+RD++++Y E +R+ F NLK I LN + A+Y IN SDL
Sbjct: 47 PDRMRDYFERFVRDYNRTYIDSVEREQRYETFVQNLKNINRLN--QKSQASYDINKFSDL 104
Query: 87 TREEMKSRL-GLNLS 100
T++E+ +R GL+ S
Sbjct: 105 TKDEVVARFTGLDPS 119
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 30/82 (36%), Positives = 45/82 (54%), Gaps = 1/82 (1%)
Query: 20 NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
+ EL +++ E+++ + K Y E KRF +F+DN++ IE N + G
Sbjct: 27 SRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAAGNKPYKLG 86
Query: 80 INHLSDLTREEMK-SRLGLNLS 100
+NHL+DLT EE K SR GL S
Sbjct: 87 VNHLADLTIEEFKASRNGLKRS 108
>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
Length = 427
Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 40/63 (63%), Gaps = 2/63 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R F KSY + + AKR+A+F+ NL ++ + + E GTA YGI SDL+ EE +
Sbjct: 127 FEEFQRKFRKSYSS--DTAKRYALFKYNLLKMQLIQRLEKGTANYGITKFSDLSAEEFRH 184
Query: 94 RLG 96
L
Sbjct: 185 SLA 187
>gi|294661899|gb|ADF28790.1| RE01479p [Drosophila melanogaster]
Length = 334
Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F KF F + Y + E R +F NLK IE+LN E G+A YGI +D+T E K
Sbjct: 169 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 228
Query: 94 RLGL 97
R GL
Sbjct: 229 RTGL 232
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFTLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|357624871|gb|EHJ75484.1| putative 26,29kDa proteinase [Danaus plexippus]
Length = 553
Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats.
Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 6/94 (6%)
Query: 14 FGQMKSNNELKT----ENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDL 68
F M + N +K + EH+ +F++F+ +K Y ++ E KR +F NL+LI
Sbjct: 223 FRHMATFNPMKEFVHPASDEHVHHEFDRFVNKHNKQYASEVEKTKRINIFRQNLRLIHSH 282
Query: 69 NKGEHGTATYGINHLSDLTREEMKSRLGLNLSKH 102
N+ G + +NHL+D T EE+ +R G + H
Sbjct: 283 NRAHRGF-SLAVNHLADHTDEELAARRGRRYTGH 315
>gi|332373716|gb|AEE61999.1| unknown [Dendroctonus ponderosae]
Length = 346
Score = 53.5 bits (127), Expect = 1e-05, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 37/60 (61%), Gaps = 1/60 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-HGTATYGINHLSDLTREE 90
+QF +++ DF+KSYP + E RFA F+ +L IE LN + +A YG+ SD T EE
Sbjct: 39 EQFHEYLSDFNKSYPQEAEFQFRFAAFKKSLANIEQLNANKTKSSAQYGLTKFSDFTAEE 98
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 53.5 bits (127), Expect = 1e-05, Method: Composition-based stats.
Identities = 28/88 (31%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L L + N ++ N + + +E ++ + KSY + +E RF +F++NL++I+D N
Sbjct: 19 LILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRIIDDHNA 78
Query: 71 GEHGTATYGINHLSDLTREEMKSR-LGL 97
+ + + G+N +DLT EE +S LGL
Sbjct: 79 DANRSYSLGLNRFADLTDEEYRSTYLGL 106
>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
Length = 394
Score = 53.5 bits (127), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F+ ++++Y +KEE R +VF +N+ + + + GTA YGI SDLT EE ++
Sbjct: 97 FKEFVTTYNRTYESKEEAEWRMSVFSNNVMRAQKIQALDRGTAQYGITKFSDLTEEEFRT 156
>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
Length = 408
Score = 53.5 bits (127), Expect = 1e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 112 FKEFVTTYNRTYESKEETQWRMSVFSNNMMRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 171
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE FI +K Y + EE ++RF +F N+K ++ L E G+A YG +DLT+ E K
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339
Query: 94 R-LGLNLS 100
+ LGL+ S
Sbjct: 340 KYLGLDSS 347
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 36/64 (56%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + F+ F+ ++KSY E +R +F NL+L L + + G+A YG+ SDLT
Sbjct: 265 ELISLFKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGSAQYGVTKFSDLTE 324
Query: 89 EEMK 92
EE +
Sbjct: 325 EEFR 328
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 38/87 (43%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
L Q+ N E N EH F F FSKSY TKEE RF VF+ NL + L++
Sbjct: 24 LIRQVVDNEEDHLLNAEH--HFTSFKSKFSKSYATKEEHDYRFGVFKANL-IKAKLHQKL 80
Query: 73 HGTATYGINHLSDLTREEMKSR-LGLN 98
TA +GI SDLT E + + LGLN
Sbjct: 81 DPTAEHGITKFSDLTASEFRRQFLGLN 107
>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
Length = 343
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 2/62 (3%)
Query: 36 KFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS-R 94
+F + K Y T +E+ +RF +F +NL+LI+ NK G T G+NH +D T EE +S R
Sbjct: 46 RFANRYGKRYDTVDEMKRRFKIFSENLQLIKSTNKKRLGY-TLGVNHFADWTWEEFRSHR 104
Query: 95 LG 96
LG
Sbjct: 105 LG 106
>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
Length = 459
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F+ ++++Y T+EE R +VF +N+ + + + GTA YGI SDLT EE ++
Sbjct: 162 FKEFVTTYNRTYGTQEEAQWRLSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRA 221
>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
Length = 353
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F R K Y + +E+ RF +F DNLKLI N+ T T G+NH +D T E
Sbjct: 50 HALSFARFARRHGKRYRSVDEIRNRFRIFSDNLKLIRSTNR-RSLTYTLGVNHFADWTWE 108
Query: 90 EM-KSRLG 96
E + +LG
Sbjct: 109 EFTRHKLG 116
>gi|323454466|gb|EGB10336.1| hypothetical protein AURANDRAFT_22962 [Aureococcus anophagefferens]
Length = 416
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 29/83 (34%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
Query: 12 ALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG 71
A F ++KS + + F++FI ++SKSY T E RF +F NL I+ LN
Sbjct: 67 ATFAKLKSVTYGTLDTRDQKSLFDQFIDEYSKSYDTTHEYNDRFTIFSKNLNYIDALNT- 125
Query: 72 EHGTATYGINHLSDLTREEMKSR 94
++ A +G+N +D T EE R
Sbjct: 126 QNPHALFGLNVFADQTEEERSKR 148
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
KQFE FI++F K Y T EE RF VF+ NL L ++ TA++G+ SDLT EE
Sbjct: 54 KQFESFIKEFGKVYHTVEEYEHRFKVFKSNL-LRALKHQALDPTASHGVTMFSDLTEEEF 112
Query: 92 KSR-LGL 97
++ LGL
Sbjct: 113 ATQYLGL 119
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE FI +K Y + EE ++RF +F N+K ++ L E G+A YG +DLT+ E K
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339
Query: 94 R-LGLNLS 100
+ LGL+ S
Sbjct: 340 KYLGLDSS 347
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/83 (30%), Positives = 48/83 (57%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L L + N ++ N + + +E ++ + KSY + +E RF +F++NL++I+D N
Sbjct: 21 LILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRIIDDHNA 80
Query: 71 GEHGTATYGINHLSDLTREEMKS 93
+ + + G+N +DLT EE +S
Sbjct: 81 DANRSYSLGLNRFADLTDEEYRS 103
>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+F + K+Y ++ RF +F+DNL+ + L E GTA YG+ SDLT EE K+
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTSEEFKT 90
Query: 94 R 94
R
Sbjct: 91 R 91
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 2/84 (2%)
Query: 14 FGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH 73
+ S+ L T + +H+ F F+ FSK+Y +KEE R ++ N+ I + N
Sbjct: 23 LSETSSSQSLYTADQDHI-DFVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQND 81
Query: 74 GTA-TYGINHLSDLTREEMKSRLG 96
GT+ T G NHL+D T +E K LG
Sbjct: 82 GTSFTLGPNHLADYTHDEYKKMLG 105
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/65 (41%), Positives = 41/65 (63%), Gaps = 1/65 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F+ ++++Y T EE R +F +NL +I+ L K E GTA Y +N +D++ EE +S
Sbjct: 582 FNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERGTAHYDVNMFADMSPEEFRS 641
Query: 94 R-LGL 97
R LGL
Sbjct: 642 RYLGL 646
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 40/67 (59%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
NPE +K FE+++ + K+Y E KRF +F DNLK +++ N + + G+ +DL
Sbjct: 30 NPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADL 89
Query: 87 TREEMKS 93
T EE ++
Sbjct: 90 TNEEFRA 96
>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
Length = 490
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F+ ++++Y TKEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 193 FKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEFRT 252
>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
Length = 260
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 37/61 (60%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+F R + K Y E+ KRFA+F+DNL + L + GTA YG+ SDLT EE +
Sbjct: 6 YEQFKRXYGKVY-ANEDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 64
Query: 94 R 94
+
Sbjct: 65 K 65
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
Length = 328
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 30/79 (37%), Positives = 47/79 (59%), Gaps = 8/79 (10%)
Query: 27 NPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
+PE L Q FE+++ + K+Y + EE +RF VF+DNL I++ N+ E + G
Sbjct: 72 SPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLG 131
Query: 80 INHLSDLTREEMKSR-LGL 97
+N +DLT +E K+ LGL
Sbjct: 132 LNAFADLTHDEFKATYLGL 150
>gi|326435242|gb|EGD80812.1| hypothetical protein PTSG_11722 [Salpingoeca sp. ATCC 50818]
Length = 372
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 26/63 (41%), Positives = 36/63 (57%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F +F K+Y + EE R ++FE L ++ N+ E T G+NH+SD T EE K
Sbjct: 34 FEDFKLEFGKTYASHEEHEYRRSIFEQTLATVKAHNRDESKTWKQGVNHMSDWTDEEFKR 93
Query: 94 RLG 96
LG
Sbjct: 94 LLG 96
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ ++ K Y E KRF +F+DN++ IE N + G+NHL+DLT EE
Sbjct: 36 ERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEF 95
Query: 92 K-SRLGL 97
K SR GL
Sbjct: 96 KDSRNGL 102
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats.
Identities = 21/64 (32%), Positives = 39/64 (60%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N +K+FE+++ ++ + Y +E +RF +F++N+K IE N + T GIN +D+
Sbjct: 30 NDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDM 89
Query: 87 TREE 90
T+ E
Sbjct: 90 TKSE 93
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats.
Identities = 26/80 (32%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
N ++ N + + +E ++ + KSY + +E RF +F++NL++I+D N + + + G+
Sbjct: 29 NSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGL 88
Query: 81 NHLSDLTREEMKSR-LGLNL 99
N +DLT EE +S LGL +
Sbjct: 89 NRFADLTDEEYRSTYLGLKM 108
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats.
Identities = 30/79 (37%), Positives = 47/79 (59%), Gaps = 8/79 (10%)
Query: 27 NPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
+PE L Q FE+++ + K+Y + EE +RF VF+DNL I++ N+ E + G
Sbjct: 58 SPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLG 117
Query: 80 INHLSDLTREEMKSR-LGL 97
+N +DLT +E K+ LGL
Sbjct: 118 LNAFADLTHDEFKATYLGL 136
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 34/59 (57%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F+ F++ + K Y T+EE R+ +F+DNL E L + E T YG+ DL+ EE +
Sbjct: 54 FQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSEEEFR 112
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 36/83 (43%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+ N E N EH F F FSKSY TKEE RF VF+ NL + L++ T
Sbjct: 32 QVVDNEEDHLLNAEH--HFTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDPT 88
Query: 76 ATYGINHLSDLTREEMKSR-LGL 97
A +GI SDLT E + + LGL
Sbjct: 89 AEHGITKFSDLTASEFRRQFLGL 111
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ ++ K Y E KRF +F+DN++ IE N + G+NHL+DLT EE
Sbjct: 36 ERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEF 95
Query: 92 K-SRLGL 97
K SR GL
Sbjct: 96 KDSRNGL 102
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 11/105 (10%)
Query: 1 MAEDASAEATLALF-------GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAK 53
MA + + T+ALF QM S +T E + E+++ ++ K Y E K
Sbjct: 1 MAFTSQKQYTIALFLLLALGIPQMMSRKLHETSMRE---RHEQWMAEYGKVYKDAAEKEK 57
Query: 54 RFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK-SRLGL 97
RF +F+ N++ IE N + G+NHL+DLT EE K SR GL
Sbjct: 58 RFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKASRNGL 102
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ ++ K Y E KRF +F+DN++ IE N + G+NHL+DLT EE
Sbjct: 36 ERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEF 95
Query: 92 K-SRLGL 97
K SR GL
Sbjct: 96 KDSRNGL 102
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 36/63 (57%), Gaps = 5/63 (7%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN----KGEHGTATYGINHLSDLTRE 89
F+ F F K Y + EE A+RFA+F DNL I N +G H T T G+N +DLT E
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLH-THTVGVNQFADLTNE 78
Query: 90 EMK 92
E +
Sbjct: 79 EYR 81
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/79 (34%), Positives = 45/79 (56%), Gaps = 7/79 (8%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
G ++S +E+K + FE ++ KSY +E KRF +F DNLK I++ N E+
Sbjct: 38 GLVRSEDEVK-------EMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENR 90
Query: 75 TATYGINHLSDLTREEMKS 93
+ G+N +D+T EE ++
Sbjct: 91 SYKLGLNRFADITNEEYRT 109
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 39/68 (57%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
+ E + +E+++ K Y E KRF +F+DNL+ I+D N E T G+N +DL
Sbjct: 72 DEELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADL 131
Query: 87 TREEMKSR 94
T EE +++
Sbjct: 132 TNEEYRAK 139
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
K+FE F++DF K Y + EE RF VF+ NL L ++ TA++G+ SDLT EE
Sbjct: 54 KRFESFMKDFGKVYHSVEEYEHRFGVFKSNL-LKALKHQALDPTASHGVTMFSDLTEEEF 112
Query: 92 KSR-LGL 97
S+ LGL
Sbjct: 113 TSKYLGL 119
>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
Length = 491
Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/65 (33%), Positives = 39/65 (60%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+ L F+ F+ ++++Y +KEE R ++F +N+ + + + GTA YGI SDLT
Sbjct: 189 QMLSVFKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTARYGITKFSDLTE 248
Query: 89 EEMKS 93
EE ++
Sbjct: 249 EEFRT 253
>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
Length = 459
Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 36/60 (60%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y TKEE R ++F N+ + + + GTA YG+ SDLT EE ++
Sbjct: 162 FKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 221
>gi|426252044|ref|XP_004019728.1| PREDICTED: cathepsin W [Ovis aries]
Length = 375
Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P+ LK+ F F +++SYP E A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PQELKEVFRLFQMQYNRSYPNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDL 94
Query: 87 TREE 90
T EE
Sbjct: 95 TEEE 98
>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
Length = 617
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 36/69 (52%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EHL F KF + + Y E R +F NL+ IE+LN E G+A YGI +D+T
Sbjct: 308 EHL--FHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQFADMTS 365
Query: 89 EEMKSRLGL 97
E K GL
Sbjct: 366 TEYKLHAGL 374
>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
Length = 458
Score = 52.4 bits (124), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F+ ++++Y TKEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 161 FKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKFSDLTEEEFRT 220
>gi|334347644|ref|XP_001379528.2| PREDICTED: cathepsin W-like [Monodelphis domestica]
Length = 619
Score = 52.4 bits (124), Expect = 4e-05, Method: Composition-based stats.
Identities = 25/73 (34%), Positives = 38/73 (52%)
Query: 18 KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
+S +L + + QF+ F ++KSY E +RF +F DNL + L + G A
Sbjct: 250 QSFEDLPPATQDLMDQFKAFQIQYNKSYADPAEQERRFEIFADNLAWAQQLTEKHGGMAQ 309
Query: 78 YGINHLSDLTREE 90
+G+ SDLT EE
Sbjct: 310 FGVTQFSDLTEEE 322
>gi|438000427|ref|YP_007250532.1| v-cath protein [Thysanoplusia orichalcea NPV]
gi|429842964|gb|AGA16276.1| v-cath protein [Thysanoplusia orichalcea NPV]
Length = 323
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSETEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|118397739|ref|XP_001031201.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89285525|gb|EAR83538.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 352
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 45/76 (59%), Gaps = 1/76 (1%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
GQ + ++ +H ++FE+F + FSK+Y ++E RFA F +NL I+ LN E
Sbjct: 20 GQSNFDKNTFSQKHQHHQKFEQFKKSFSKAYESEEVQQFRFATFVENLNEIDRLN-AEVT 78
Query: 75 TATYGINHLSDLTREE 90
TA + I+ SD T+EE
Sbjct: 79 TAQFDISFFSDYTKEE 94
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 43/65 (66%), Gaps = 2/65 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEK++ K+Y + EE RF VF+DNLKLI+++N+ E + G+N +DLT +E K+
Sbjct: 44 FEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINR-EVTSYWLGLNEFADLTHDEFKT 102
Query: 94 R-LGL 97
LGL
Sbjct: 103 TYLGL 107
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 21/64 (32%), Positives = 39/64 (60%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N +K+FE+++ ++ + Y +E +RF +F++N+K IE N + T GIN +D+
Sbjct: 3 NDPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDM 62
Query: 87 TREE 90
T+ E
Sbjct: 63 TKSE 66
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 35/60 (58%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++KSY E +R +F NL+L + + + G+A YG+ SDLT EE ++
Sbjct: 154 FKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSAEYGVTKFSDLTEEEFRT 213
>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
Length = 323
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 26/70 (37%), Positives = 44/70 (62%), Gaps = 3/70 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+R ++K Y ++ E +R+ +F+ NL I + K + TA Y IN SDL+++E +
Sbjct: 28 FEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDI--ITKNRNDTAVYKINKFSDLSKDETIA 85
Query: 94 RL-GLNLSKH 102
+ GL+L H
Sbjct: 86 KYTGLSLPLH 95
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 46/69 (66%), Gaps = 3/69 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI--EDLNKGEHGTATYGINHLSDLTREEM 91
FE F+ +++K+Y + E KR+++F+DNL I ++ N + TATY IN SDL++ E+
Sbjct: 56 FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115
Query: 92 KSRL-GLNL 99
++ GL++
Sbjct: 116 IAKFTGLSI 124
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 38/64 (59%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ F++ Y E RF +F +NLK +E +N + T T +N SDLT EE
Sbjct: 32 VEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEE 91
Query: 91 MKSR 94
K+R
Sbjct: 92 FKAR 95
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 41/68 (60%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+P ++ E+++ ++ K Y E KRF +F+DN++ IE N ++ +NHL+D
Sbjct: 32 ESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLAD 91
Query: 86 LTREEMKS 93
LT +E K+
Sbjct: 92 LTLDEFKA 99
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 41/68 (60%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+P ++ E+++ ++ K Y E KRF +F+DN++ IE N ++ +NHL+D
Sbjct: 32 ESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLAD 91
Query: 86 LTREEMKS 93
LT +E K+
Sbjct: 92 LTLDEFKA 99
>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
str. Neff]
Length = 330
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 26/62 (41%), Positives = 37/62 (59%), Gaps = 2/62 (3%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF +F + KSY + EE +R +F DNL I+ LN G A YG+N +DLT +E
Sbjct: 30 QQFRQFAAQYGKSYAS-EEFGERLRIFRDNLDRIDALNSANTG-ARYGVNKFADLTPKEF 87
Query: 92 KS 93
K+
Sbjct: 88 KA 89
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
K N E L +E+++ KSY E KRF +F+DNLK I++ N G + T G+
Sbjct: 45 KRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHN-GLNSTYRLGLTRF 103
Query: 84 SDLTREEMKSRL 95
+DLT EE +S+
Sbjct: 104 ADLTNEEYRSKF 115
>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
Length = 477
Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE + R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 180 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 239
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
K N E L +E+++ KSY E KRF +F+DNLK I++ N G + T G+
Sbjct: 45 KRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHN-GLNSTYRLGLTRF 103
Query: 84 SDLTREEMKSRL 95
+DLT EE +S+
Sbjct: 104 ADLTNEEYRSKF 115
>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
Length = 459
Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ FI ++++Y T+EE R ++F +N+ +++ + GTA YG+ SDLT EE ++
Sbjct: 162 FKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEIQALDRGTAQYGVTKFSDLTEEEFRT 221
>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
Length = 599
Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 36/69 (52%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F KF + + Y E R +F +LK I++LN E G+A YGI +D+T
Sbjct: 290 DHL--FHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFADMTS 347
Query: 89 EEMKSRLGL 97
E R GL
Sbjct: 348 TEYAQRAGL 356
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
K FE + ++ K+Y +KE+ RF +FE+N + ++ N + + T +N +DLT E
Sbjct: 30 KLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEF 89
Query: 92 K-SRLGLN 98
K SRLGL+
Sbjct: 90 KASRLGLS 97
>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
Length = 460
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE + R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 3/76 (3%)
Query: 29 EHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
EH + F F + KSY T+EE KR+A+F++NL I N+ + + + +NH DL+
Sbjct: 113 EHFQNAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGY-SYSLKMNHFGDLS 171
Query: 88 REEMKSR-LGLNLSKH 102
REE + + LG N S++
Sbjct: 172 REEFRRKYLGYNKSRN 187
>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
Length = 460
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 38/60 (63%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE + R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRT 222
>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
Length = 354
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 2/65 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F +F+ F KSY ++EE+ +R+ +F NL+ I NK + T +NH +D T EE K
Sbjct: 54 KFARFVSRFGKSYQSEEEMKERYEIFSQNLRFIRSHNK-KRLPYTLSVNHFADWTWEEFK 112
Query: 93 S-RLG 96
RLG
Sbjct: 113 RHRLG 117
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/62 (37%), Positives = 38/62 (61%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E+++ ++ K Y E KRF +F+DN++ IE N + G+NHL+DLT EE
Sbjct: 36 ERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLADLTVEEF 95
Query: 92 KS 93
K+
Sbjct: 96 KA 97
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 26/82 (31%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
++ + +++ +N L +E+F + K+Y ++ RF +F+DNL + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69
Query: 73 HGTATYGINHLSDLTREEMKSR 94
GTA YG+ SDLT EE ++R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFETR 91
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+ EK++ F KSY E KRF +F++N++ IE N + INH +DLT EE K
Sbjct: 36 KHEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNEEFK 95
Query: 93 SRLGLNLSKH 102
+ L N H
Sbjct: 96 ASLNGNKKLH 105
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 1/80 (1%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ +E T + + + EK++ + ++Y +EE A+R VF N KLI+ N E T
Sbjct: 29 AGDEAITVDSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRL 88
Query: 79 GINHLSDLTREEMK-SRLGL 97
N +DLT EE + +R GL
Sbjct: 89 ATNRFADLTDEEFRAARTGL 108
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 51.6 bits (122), Expect = 7e-05, Method: Composition-based stats.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 1/80 (1%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ +E T + + + EK++ + ++Y +EE A+R VF N KLI+ N E T
Sbjct: 29 AGDEAITVDAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRL 88
Query: 79 GINHLSDLTREEMK-SRLGL 97
N +DLT EE + +R GL
Sbjct: 89 ATNRFADLTDEEFRAARTGL 108
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 36/83 (43%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+ N E N EH F F FSKSY TKEE RF VF+ NL + L++ T
Sbjct: 32 QVVDNEEDHLLNAEH--HFTSFKSKFSKSYSTKEEHDYRFGVFKSNL-IKAKLHQKLDPT 88
Query: 76 ATYGINHLSDLTREEMKSR-LGL 97
A +GI SDLT E + + LGL
Sbjct: 89 AEHGITKFSDLTASEFRRQFLGL 111
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 1/86 (1%)
Query: 18 KSNNELKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA 76
+S LK ++ +++ QFEKF + Y + E R+ +F +NL I+ LN+ E GT
Sbjct: 1461 RSVRSLKIDDEAYVRRQFEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTG 1520
Query: 77 TYGINHLSDLTREEMKSRLGLNLSKH 102
YG+ +D+T E ++ GL + K
Sbjct: 1521 KYGVTKFADMTTAEYRAHTGLIVPKQ 1546
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 36/83 (43%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+ N E N EH F F FSKSY TKEE RF VF+ NL + L++ T
Sbjct: 32 QVVDNEEDHLLNAEH--HFTSFKSKFSKSYSTKEEHDYRFGVFKSNL-IKAKLHQKLDPT 88
Query: 76 ATYGINHLSDLTREEMKSR-LGL 97
A +GI SDLT E + + LGL
Sbjct: 89 AEHGITKFSDLTASEFRRQFLGL 111
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 25/74 (33%), Positives = 47/74 (63%), Gaps = 2/74 (2%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
+E+ + + F F+ ++++Y + E RF +F +NL IE+L + E GT YG+N +
Sbjct: 461 SEDMKAERLFNNFMTTYNRTYSSLER-NLRFKIFRENLNFIEELRETEQGTGIYGVNMFA 519
Query: 85 DLTREEMKSR-LGL 97
D++++E ++R LGL
Sbjct: 520 DMSQKEFRTRYLGL 533
>gi|197258084|gb|ACH56226.1| cathepsin L-like cysteine proteinase [Radopholus similis]
Length = 417
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 28/73 (38%), Positives = 42/73 (57%), Gaps = 3/73 (4%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA---TYGINH 82
E PE +++F++ R FS+ + ++ E +RF +FE NL I LN T TYG+N
Sbjct: 89 ELPEVVREFDQIQRTFSREWNSERERWERFKLFERNLAEIARLNAEAKRTGRNMTYGVNG 148
Query: 83 LSDLTREEMKSRL 95
++D T EEM L
Sbjct: 149 MADWTEEEMGRML 161
>gi|17978639|gb|AAL48318.1| berghepain-2 [Plasmodium berghei]
Length = 468
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F+++++K Y + EE+ +RF +F +NLK IE NK H T GIN SD+
Sbjct: 145 NLESVNIFYNFMKEYNKQYNSAEEIQERFYIFSENLKKIEKHNKENH-LYTKGINAFSDM 203
Query: 87 TREEMKSR 94
EE K +
Sbjct: 204 RHEEFKMK 211
>gi|441611591|ref|XP_003273955.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Nomascus leucogenys]
Length = 548
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 257 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 316
>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
Length = 517
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 220 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 279
>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
Length = 485
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
Length = 484
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
Length = 324
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 43/67 (64%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ F+K+Y ++ E RF +F+ NL+ I + N+ + TA Y IN SDL++EE S
Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQND-STAQYEINKFSDLSKEEAIS 86
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 87 KYTGLSL 93
>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
Length = 548
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 251 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 310
>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
Length = 460
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 163 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222
>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
Length = 379
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 82 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141
>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
Length = 484
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
>gi|440907378|gb|ELR57532.1| Cathepsin W [Bos grunniens mutus]
Length = 382
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F F +++SYP E A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDL 94
Query: 87 TREE 90
T EE
Sbjct: 95 TEEE 98
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 43/73 (58%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
N + N + +E ++ + KSY + +E RF +F+DNL++I+D N + + + G+
Sbjct: 29 NSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGL 88
Query: 81 NHLSDLTREEMKS 93
N +DLT EE +S
Sbjct: 89 NRFADLTDEEYRS 101
>gi|68076993|ref|XP_680416.1| falcipain 2 precursor [Plasmodium berghei strain ANKA]
gi|56501341|emb|CAI05700.1| falcipain 2 precursor, putative [Plasmodium berghei]
Length = 470
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F+++++K Y + EE+ +RF +F +NLK IE NK H T GIN SD+
Sbjct: 147 NLESVNIFYNFMKEYNKQYNSAEEIQERFYIFSENLKKIEKHNKENH-LYTKGINAFSDM 205
Query: 87 TREEMKSR 94
EE K +
Sbjct: 206 RHEEFKMK 213
>gi|228861649|ref|YP_002854669.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
gi|226425097|gb|ACO53509.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
Length = 334
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 43/67 (64%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ +++K+Y E KR+ +F+DNL+ I + NK + TA Y IN SDL+ E+ S
Sbjct: 37 FELFVANYNKNYTDPLEKTKRYHIFKDNLEEINNKNK-SNDTAVYRINKFSDLSTNELIS 95
Query: 94 RL-GLNL 99
+ GLN+
Sbjct: 96 KYTGLNV 102
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 45/68 (66%), Gaps = 2/68 (2%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ FEK++ + K+Y + EE +RF VF+DNL I+D+NK + + G+N +DLT +E
Sbjct: 48 IELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINK-KVTSYWLGLNEFADLTHDE 106
Query: 91 MKSR-LGL 97
K+ LGL
Sbjct: 107 FKATYLGL 114
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 1/75 (1%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
S + +T++ E + +E ++ KSY E KRF +F+DNL+ I++ N E+ +
Sbjct: 36 SKSSWRTDD-EVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKV 94
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 95 GLNRFADLTNEEYRS 109
>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
Length = 323
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
Length = 323
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|309752918|gb|ADO85436.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 32/89 (35%), Positives = 50/89 (56%), Gaps = 6/89 (6%)
Query: 6 SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
S TL G+ + N EN +++ FE FI+ ++KSY T +E A ++ F++NLK+I
Sbjct: 13 SVMLTLCHLGETVTYN---LENSDNI--FEDFIKKYNKSYATDQERAIKYENFKNNLKMI 67
Query: 66 EDLNKGEHGTATYGINHLSDLTREEMKSR 94
D N G A + IN SDL + ++ R
Sbjct: 68 NDKNNGSK-DAVFDINAFSDLNKNDLLRR 95
>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|328711164|ref|XP_003244460.1| PREDICTED: cathepsin O-like [Acyrthosiphon pisum]
Length = 339
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F KFI+ ++KSY + E KRF F+ +LK I+ L++ +G YGI SDL+ EE
Sbjct: 35 KFNKFIKMYNKSYMNETEHNKRFEHFKKSLKTIQLLSQKCNGCTNYGITEFSDLSTEEF- 93
Query: 93 SRLGLN 98
+++ LN
Sbjct: 94 TKIYLN 99
>gi|47779249|gb|AAT38521.1| cysteine protease [Bombyx mori NPV]
Length = 323
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
Length = 323
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|288804650|ref|YP_003429335.1| cathepsin [Pieris rapae granulovirus]
gi|270161225|gb|ACZ63497.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 32/89 (35%), Positives = 50/89 (56%), Gaps = 6/89 (6%)
Query: 6 SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
S TL G+ + N EN +++ FE FI+ ++KSY T +E A ++ F++NLK+I
Sbjct: 13 SVMLTLCHLGETVTYN---LENSDNI--FEDFIKKYNKSYATDQERAIKYENFKNNLKMI 67
Query: 66 EDLNKGEHGTATYGINHLSDLTREEMKSR 94
D N G A + IN SDL + ++ R
Sbjct: 68 NDKNNGSK-YAVFDINAFSDLNKNDLLRR 95
>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
Length = 323
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
Length = 381
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 84 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 143
>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
Length = 392
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 95 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 154
>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
Length = 410
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ FI ++++Y T+EE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 113 FKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 172
>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
Length = 338
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 41 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 100
>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
Length = 490
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R ++F +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 193 FKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 38/64 (59%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N + +F ++ +++SYPT EE +RF V+ N++ IE N+ + T T G N +DL
Sbjct: 50 NKLMMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADL 109
Query: 87 TREE 90
T EE
Sbjct: 110 TEEE 113
>gi|145531433|ref|XP_001451483.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419138|emb|CAK84086.1| unnamed protein product [Paramecium tetraurelia]
Length = 314
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 29/71 (40%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 28 PEHLK---QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
PE L QF+K+ F K Y T E A RF V++D +K I+ LN E+ T +G +
Sbjct: 23 PESLDLRVQFDKYTNQFGKFY-TPAERAYRFQVYQDAMKQIQILNSEENSTTVFGETQFT 81
Query: 85 DLTREEMKSRL 95
DLT EE + L
Sbjct: 82 DLTNEEFAALL 92
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 36/64 (56%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ F + Y E RF +F+ NLK +E N + T T +N SDLT EE
Sbjct: 32 IEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEE 91
Query: 91 MKSR 94
K+R
Sbjct: 92 FKAR 95
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 39/67 (58%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + FE ++ ++ KSY E +RF +F+DNL+ +++ N + + G+N SDL
Sbjct: 41 NDEVMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDL 100
Query: 87 TREEMKS 93
T EE S
Sbjct: 101 TLEEYSS 107
>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
Length = 338
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 41 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 100
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 38/65 (58%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E ++ +E ++ K+Y E KRFA+F+DNL+ I+ N + T G+N +DLT
Sbjct: 48 EVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGLNKFADLTN 107
Query: 89 EEMKS 93
EE +S
Sbjct: 108 EEFRS 112
>gi|341888719|gb|EGT44654.1| hypothetical protein CAEBREN_19265 [Caenorhabditis brenneri]
Length = 396
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 1/64 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF+ F + F + + + EE RF VF+ NL+ IE+LN ++ + YGIN SD T E+
Sbjct: 86 QQFKDFNKKFGREHKSLEEYKMRFEVFQKNLRDIEELNL-KNPSVQYGINRFSDKTESEL 144
Query: 92 KSRL 95
K+ L
Sbjct: 145 KNLL 148
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/74 (35%), Positives = 46/74 (62%), Gaps = 4/74 (5%)
Query: 29 EHLKQFEKFIRDFSKSY-PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
EHL F +F+ + Y + ++ +RF +F++N++ + +LN E GTATYG+ +DLT
Sbjct: 2368 EHL--FYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTATYGVTRFADLT 2425
Query: 88 REEMKSR-LGLNLS 100
EE ++ +G+ S
Sbjct: 2426 YEEFSTKHMGMKAS 2439
>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
Length = 324
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 46/68 (67%), Gaps = 4/68 (5%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
FE+F+ F+K+Y ++ E +RF +F+ NL+ E +NK ++ T A Y IN SDL+++E
Sbjct: 28 FEEFLHKFNKNYSSESEKLRRFKIFQHNLE--EIINKNQNDTSAQYEINKFSDLSKDETI 85
Query: 93 SRL-GLNL 99
S+ GL+L
Sbjct: 86 SKYTGLSL 93
>gi|118350036|ref|XP_001008299.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89290066|gb|EAR88054.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 332
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 7/63 (11%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+++++F++ S +Y T EE RFAVF DNLK IE G + YGI DLT EE
Sbjct: 41 QKWQEFLKKHSITYKTIEEKLHRFAVFRDNLKKIE-------GHSNYGITKFMDLTSEEF 93
Query: 92 KSR 94
+ R
Sbjct: 94 QQR 96
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 40/69 (57%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
+++ + + E+++ +S+ Y E A+RF VF+ N+K IE N G + G+N +
Sbjct: 28 SDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKFIESFNAGGNNKFWLGVNQFA 87
Query: 85 DLTREEMKS 93
DLT +E +S
Sbjct: 88 DLTNDEFRS 96
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 9/101 (8%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAV 57
A + ATL+L + + +PE L+ FE +I +F K+Y T EE RF V
Sbjct: 15 ALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKLLRFEV 74
Query: 58 FEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS-RLGL 97
F+DNLK I++ NK + + G+N +DL+ EE K LGL
Sbjct: 75 FKDNLKHIDETNK-KVKSYWLGLNEFADLSHEEFKKMYLGL 114
>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
Length = 338
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 38/57 (66%), Gaps = 2/57 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
FE+FI+D++K Y E+ +RF +F +NLK I +N+ A YGIN SDL++EE
Sbjct: 41 FEQFIKDYNKEYDESEK-EERFKIFVNNLKDINAMNE-RSSNAVYGINKFSDLSKEE 95
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
FE + ++ KSY ++EE + R VFEDN + N + + + +N +DLT E K
Sbjct: 29 FETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKT 88
Query: 93 SRLGLN 98
SRLGL+
Sbjct: 89 SRLGLS 94
>gi|11359985|pir||T46294 hypothetical protein DKFZp434F0610.1 - human (fragment)
gi|6808322|emb|CAB70900.1| hypothetical protein [Homo sapiens]
Length = 308
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 27 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 86
>gi|351724281|ref|NP_001237820.1| cysteine protease-like precursor [Glycine max]
gi|149393486|gb|ABR26679.1| putative cysteine protease [Glycine max]
Length = 355
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F +F+ F KSY ++EE+ +R+ +F NL+ I NK T +NH +D T EE K
Sbjct: 54 KFARFMSRFGKSYRSEEEMRERYEIFSQNLRFIRSHNKNRL-PYTLSVNHFADWTWEEFK 112
Query: 93 S-RLG 96
RLG
Sbjct: 113 RHRLG 117
>gi|390470786|ref|XP_003734355.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin W [Callithrix jacchus]
Length = 373
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 28/70 (40%), Positives = 39/70 (55%), Gaps = 1/70 (1%)
Query: 28 PEHLKQFEKFIR-DFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ KF + F++SY T EE A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKFFQIQFNRSYLTPEEHARRLDIFAHNLVQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
Length = 366
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/78 (30%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 22 ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
E+ + + F++F+ +F+K Y T++ A+++ +F+ N+ + + L + E GTA YG
Sbjct: 54 EMNAKEARSWENFKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPT 113
Query: 82 HLSDLTREEM-KSRLGLN 98
+D+T EE K+ L N
Sbjct: 114 IFADMTPEEFRKTHLNFN 131
>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple
nucleopolyhedrovirus]
gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple
nucleopolyhedrovirus]
Length = 323
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 44/67 (65%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
Length = 379
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 35/60 (58%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F+ ++++Y +KEE R ++F N+ + + + GTA YG+ SDLT EE ++
Sbjct: 82 FRNFVITYNRTYESKEEAQWRLSIFAHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/63 (36%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+FE+++ ++Y E +RF V+++NL LIE+ N G HG T N +DLT EE +
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGY-TLTDNKFADLTNEEFR 176
Query: 93 SRL 95
+++
Sbjct: 177 AKM 179
>gi|3941390|gb|AAC82352.1| group 1 allergen Eur m 1 0102 [Euroglyphus maynei]
Length = 327
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 32/73 (43%), Positives = 46/73 (63%), Gaps = 12/73 (16%)
Query: 28 PEHLKQFEKFIRDFSKSY--PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+KSY P KEEVA++ F ++LK +E NKG INHLSD
Sbjct: 26 PASIKTFEEFKKAFNKSYATPEKEEVARK--NFLESLKYVES-NKG-------AINHLSD 75
Query: 86 LTREEMKSRLGLN 98
L+ +E K++ +N
Sbjct: 76 LSLDEFKNQFLMN 88
>gi|158284547|ref|XP_307325.4| Anopheles gambiae str. PEST AGAP012577-PA [Anopheles gambiae str.
PEST]
gi|157021017|gb|EAA03137.4| AGAP012577-PA [Anopheles gambiae str. PEST]
Length = 547
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 32/90 (35%), Positives = 45/90 (50%), Gaps = 4/90 (4%)
Query: 12 ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F M+ ++E EHL +F +F KSY + E +R VF NL+ I N+
Sbjct: 223 ATFNPMQEFVHPRSE--EHLHNEFGRFKNKHGKSYASPLEHERRLNVFRQNLRFIHSHNR 280
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
G T +NHL+D T EE+K+ G S
Sbjct: 281 ANRGF-TVAVNHLADRTEEELKALRGFRSS 309
>gi|341886805|gb|EGT42740.1| hypothetical protein CAEBREN_23878 [Caenorhabditis brenneri]
Length = 396
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 1/64 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF+ F + F + + + EE RF VF+ NL+ E+LN+ ++ + YGIN SD T E+
Sbjct: 86 QQFKDFNKKFGREHKSLEEYKMRFEVFQKNLREFEELNQ-KNPSVQYGINKFSDKTESEL 144
Query: 92 KSRL 95
K+ L
Sbjct: 145 KNLL 148
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 20/71 (28%), Positives = 42/71 (59%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K+FE+++ ++ + Y +E +RF +F++N+ IE N + T GIN +D+T+ E
Sbjct: 34 MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSE 93
Query: 91 MKSRLGLNLSK 101
++ +S+
Sbjct: 94 FVAQYTGGISR 104
>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
Length = 347
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/62 (41%), Positives = 39/62 (62%), Gaps = 3/62 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
FE+F ++K Y + EE A+R A+F+++L IE N + G TY G+N +DLTREE
Sbjct: 31 FEEFKDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTYLVGVNEFADLTREE 90
Query: 91 MK 92
+
Sbjct: 91 FR 92
>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
Length = 489
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 35/60 (58%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F+ ++++Y +KEE R +VF N+ + + + GTA YG+ SDLT EE ++
Sbjct: 193 FRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 32/62 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F FI+ F + Y + E RF + NL +E L E GTA YG+ SD++ EE +
Sbjct: 170 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQK 229
Query: 94 RL 95
+
Sbjct: 230 TM 231
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 32/62 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F FI+ F + Y + E RF + NL +E L E GTA YG+ SD++ EE +
Sbjct: 135 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQK 194
Query: 94 RL 95
+
Sbjct: 195 TM 196
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 3/72 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH-GTATY--GINHLSDLTRE 89
+F++++ K+Y +E AKR A+F DN + + N+ G ++ +NHL+DLTRE
Sbjct: 69 RFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLADLTRE 128
Query: 90 EMKSRLGLNLSK 101
E K LG + SK
Sbjct: 129 EFKHMLGYDASK 140
>gi|195494228|ref|XP_002094747.1| GE21992 [Drosophila yakuba]
gi|194180848|gb|EDW94459.1| GE21992 [Drosophila yakuba]
Length = 549
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/69 (39%), Positives = 39/69 (56%), Gaps = 2/69 (2%)
Query: 29 EHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
EH+ K F F R +YP++ E R +F NL+ I N+ + T T +NHL+D T
Sbjct: 239 EHVDKAFHHFKRKHGVAYPSETEHEHRKNIFRQNLRYIHSKNRAKL-TYTLAVNHLADKT 297
Query: 88 REEMKSRLG 96
EE+K+R G
Sbjct: 298 EEELKARRG 306
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 45/71 (63%), Gaps = 2/71 (2%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ FEKF+ + K+Y + EE +RF VF+DNL I++ NK G G+N +DLT +E
Sbjct: 49 MELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITGY-WLGLNEFADLTHDE 107
Query: 91 MKSR-LGLNLS 100
K+ LGL L+
Sbjct: 108 FKAAYLGLTLT 118
>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
Length = 323
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 44/67 (65%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K Y ++ E +RF +F+ NL E +NK ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIINKDQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 30/74 (40%), Positives = 45/74 (60%), Gaps = 2/74 (2%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
+ N ++ FEK++ K+Y + EE RF VF+DNLK I+ +N+ E + G+N +
Sbjct: 141 SSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNR-EVTSYWLGLNEFA 199
Query: 85 DLTREEMKSR-LGL 97
DLT EE K+ LGL
Sbjct: 200 DLTHEEFKATYLGL 213
>gi|324514421|gb|ADY45863.1| Viral cathepsin [Ascaris suum]
Length = 399
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 4/93 (4%)
Query: 1 MAEDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
++ D SA + + M EL + P ++ F KF++++ + Y + +E RF F
Sbjct: 69 LSSDPSAGSLETILADM---GELSNDYPIYIDSFVKFMQEYDRQYSSNDETRLRFRNFVR 125
Query: 61 NLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
N+K I+ KG +GI +D + EMKS
Sbjct: 126 NMKFIKKAQKGRD-NVVFGITRFTDWSEAEMKS 157
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F + K Y + EE+ +RF VF DNLK+I NK + + G+N +DLT +
Sbjct: 57 HALSFARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNK-KGLSYKLGVNEFTDLTWD 115
Query: 90 EM-KSRLG 96
E + RLG
Sbjct: 116 EFRRDRLG 123
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 4/74 (5%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
++E F F K Y EE + R +VF D LK I++ N + + G TY IN+ SDLT E
Sbjct: 19 EWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHE 78
Query: 90 E-MKSRLGLNLSKH 102
E + ++ G+ +H
Sbjct: 79 EVLATKTGMTRRRH 92
>gi|171460937|ref|NP_001116343.1| cathepsin W precursor [Felis catus]
gi|6165261|emb|CAB59816.1| cysteine protease [Felis catus]
Length = 344
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LKQ F F +++SY EE A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKQAFTLFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGRLYG 104
>gi|22653681|sp|Q9TST1.2|CATW_FELCA RecName: Full=Cathepsin W; Flags: Precursor
Length = 374
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LKQ F F +++SY EE A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKQAFTLFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGRLYG 104
>gi|3273233|dbj|BAA31161.1| tetrain [Tetrahymena pyriformis]
Length = 330
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 40/75 (53%), Gaps = 5/75 (6%)
Query: 25 TENPE---HLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
T NP HL+ F+KF R+F +Y + E + R +VF +NLK IE N T
Sbjct: 22 TRNPNADGHLEHYAFQKFKRNFGVTYKNQGEESYRLSVFLENLKSIEANNANPLSTHVEE 81
Query: 80 INHLSDLTREEMKSR 94
+N +DLT EE +R
Sbjct: 82 VNSFTDLTEEEFAAR 96
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 43 KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
+SY T EE+ KRF +F N+K + L K E GTA YG+ SD++ +E K LGL
Sbjct: 509 RSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHYLGL 564
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/73 (31%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTRE 89
L++F+ + +++++Y T EE +RF V+ +NL+ I+ +N+ G++ G N +DLT E
Sbjct: 37 LERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEE 96
Query: 90 EMKSRLGLNLSKH 102
E K + L +
Sbjct: 97 EFKDTYLMKLDEQ 109
>gi|440798492|gb|ELR19560.1| papain family cysteine protease containing protein [Acanthamoeba
castellanii str. Neff]
Length = 385
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 21/65 (32%), Positives = 39/65 (60%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++++ +F+K+Y + +EV R A+FE L I+ N+ + G+N L+D + E++
Sbjct: 35 FDRYVVEFNKAYASDDEVVSRRAIFESRLAAIKAHNRDASKSWKQGVNQLTDRSEAEIRQ 94
Query: 94 RLGLN 98
LG N
Sbjct: 95 LLGYN 99
>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
Length = 302
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 37/60 (61%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y +KEE R +VF +N+ + + + GTA YG+ SDLT EE ++
Sbjct: 5 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 64
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/73 (31%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTRE 89
L++F+ + +++++Y T EE +RF V+ +NL+ I+ +N+ G++ G N +DLT E
Sbjct: 37 LERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEE 96
Query: 90 EMKSRLGLNLSKH 102
E K + L +
Sbjct: 97 EFKDTYLMKLDEQ 109
>gi|348565006|ref|XP_003468295.1| PREDICTED: cathepsin W-like [Cavia porcellus]
Length = 375
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + E A+R +F NL + L + E GTA +G+ SDL
Sbjct: 35 PLELKEVFKLFQIQFNRSYSNQAEYARRLDIFVHNLATAQRLQEEELGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 39/68 (57%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+ + + E+++ +S+ Y E A+RF VF+ N+K IE N G + GIN +D
Sbjct: 29 EDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKFIESFNTGGNRKFWLGINQFAD 88
Query: 86 LTREEMKS 93
LT +E ++
Sbjct: 89 LTNDEFRT 96
>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
Length = 396
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 39/64 (60%), Gaps = 1/64 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF+ F F + + T EE RF +F+ NL+ IE+LN ++ + YGIN SD T E+
Sbjct: 86 QQFKDFNAKFQREHKTLEEYKMRFEIFQKNLRDIEELNL-KNPSVQYGINKFSDKTESEL 144
Query: 92 KSRL 95
K+ L
Sbjct: 145 KNLL 148
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
L Q+ +NE N EH F F FSK+Y TKEE RF VF+ N++ + L+
Sbjct: 3 LIRQVVDDNEDHVLNAEH--HFSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAK-LHAKL 59
Query: 73 HGTATYGINHLSDLTREEMKSR-LGL 97
+A +G+ SDLT E + + LGL
Sbjct: 60 DPSAVHGVTKFSDLTPSEFRRQFLGL 85
>gi|341893155|gb|EGT49090.1| hypothetical protein CAEBREN_13400 [Caenorhabditis brenneri]
Length = 372
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 3/98 (3%)
Query: 1 MAEDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
M ++S L F + + L + E LK FEKF K Y T EE KR F
Sbjct: 1 MVPNSSCSLILFAFFSIFAEYSLAQHSQEVLKNFEKFQTHHKKHYRTAEEKKKRLGHFAK 60
Query: 61 NLKLIEDLN---KGEHGTATYGINHLSDLTREEMKSRL 95
N + I++LN K T+G+N +D+ +EE +RL
Sbjct: 61 NHQRIKELNEEAKKAGRNVTFGLNKFADMPKEERHARL 98
>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
Length = 605
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+HL F F + + Y E R +F NL+ I++LN E G+A YGI +D+T
Sbjct: 296 DHL--FHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQELNDNEQGSAKYGITEFADMTS 353
Query: 89 EEMKSRLGL 97
E R GL
Sbjct: 354 SEYTQRAGL 362
>gi|165969032|ref|YP_001650932.1| peptidase [Orgyia leucostigma NPV]
gi|164663528|gb|ABY65748.1| peptidase [Orgyia leucostigma NPV]
Length = 328
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 42/66 (63%), Gaps = 2/66 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ ++ K+Y E +KR+ +F+DNL+ I N+ + TA Y IN SDL++ E+ S
Sbjct: 29 FESFVANYQKNYNDDLEKSKRYTIFKDNLEEINVKNR-LNDTAVYRINKFSDLSKTEIIS 87
Query: 94 RL-GLN 98
+ GLN
Sbjct: 88 KYTGLN 93
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 9/79 (11%)
Query: 27 NPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
+PEHL FE +I K+Y + EE RF VF++NLK I+ NK E + G
Sbjct: 33 SPEHLTSVDKLVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNK-EVTSYWLG 91
Query: 80 INHLSDLTREEMKSR-LGL 97
+N +DL+ EE KS+ LGL
Sbjct: 92 LNEFADLSHEEFKSKFLGL 110
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 2/66 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM- 91
Q+E+F F +SY +EE A+R VF N++LI + N H T T G+N +DLT EE
Sbjct: 18 QWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFADLTVEEFS 76
Query: 92 KSRLGL 97
K+ +G
Sbjct: 77 KTYMGF 82
>gi|340508003|gb|EGR33817.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 334
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
++N ++ +FE F ++K Y ++++ R VF +NLK IE NK + T G+N +S
Sbjct: 40 SQNVNYVSEFENFNFKYNKQYQSQQQYQYRLQVFTENLKYIEQQNKKSQ-SFTLGVNSIS 98
Query: 85 DLTREE-MKSRLGLNL 99
LTREE +++ LGLN+
Sbjct: 99 HLTREEFIQTYLGLNI 114
>gi|73983670|ref|XP_540846.2| PREDICTED: cathepsin W [Canis lupus familiaris]
Length = 374
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 32/93 (34%), Positives = 45/93 (48%), Gaps = 1/93 (1%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
A + A+LA + N+ P LKQ F F +++SY EE A+R +F NL
Sbjct: 12 ALSVASLAHGIKRSLKNQDPGPQPLELKQVFALFQIQYNRSYSNPEEYARRLDIFAHNLA 71
Query: 64 LIEDLNKGEHGTATYGINHLSDLTREEMKSRLG 96
+ L + GTA +G+ SDLT EE G
Sbjct: 72 QAQQLEDEDLGTAEFGVTPFSDLTEEEFGQFYG 104
>gi|312106123|ref|XP_003150646.1| hypothetical protein LOAG_15105 [Loa loa]
gi|307754189|gb|EFO13423.1| hypothetical protein LOAG_15105 [Loa loa]
Length = 139
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F FI+ ++ Y +K+E+ KRF +++ NL+L + + K E TA YG SD+T+EE +
Sbjct: 30 FANFIQQHNRKYRSKKELLKRFRIYKRNLRLAKLIQKNEQDTAIYGETPFSDMTQEEFR 88
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 49.7 bits (117), Expect = 3e-04, Method: Composition-based stats.
Identities = 21/69 (30%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 34 FEKFIRDFSKSYPTK-EEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F+ F+ + ++Y + +E +RF +F+ N ++++ LN+ E GTA YGI D++ EE
Sbjct: 169 FDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEYH 228
Query: 93 SRLGLNLSK 101
L ++
Sbjct: 229 RTLAPGFTR 237
>gi|395545396|ref|XP_003774588.1| PREDICTED: cathepsin W [Sarcophilus harrisii]
Length = 358
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 22/61 (36%), Positives = 33/61 (54%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++F+ F ++KSYP E R +F DNL + L + G A +G+ SDLT EE
Sbjct: 42 ERFKAFQIQYNKSYPDAAEQECRLKIFADNLARAQQLTEEHQGLAQFGVTRFSDLTEEEF 101
Query: 92 K 92
+
Sbjct: 102 R 102
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 46/75 (61%), Gaps = 2/75 (2%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
+ N ++ FEK++ K+Y + EE RF VF+DNLK I+ +N+ E + G+N +
Sbjct: 40 SSNERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINR-EVTSYWLGLNEFA 98
Query: 85 DLTREEMKSR-LGLN 98
DLT +E K+ LGL+
Sbjct: 99 DLTHDEFKAAYLGLD 113
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 23/67 (34%), Positives = 38/67 (56%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + FE ++ ++ KSY E +RF +F+DNL+ +++ N + + G+N SDL
Sbjct: 41 NDEVIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDL 100
Query: 87 TREEMKS 93
T E S
Sbjct: 101 TDAEYSS 107
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 22/67 (32%), Positives = 38/67 (56%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E ++ ++ + Y E A+RF VF+DN+ +E N ++ GIN +DLT EE
Sbjct: 33 VERHENWMVEYGRVYKDAAEKARRFEVFKDNVAFVESFNTNKNNKFWLGINQFADLTIEE 92
Query: 91 MKSRLGL 97
K+ G
Sbjct: 93 FKANKGF 99
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 32/80 (40%), Positives = 47/80 (58%), Gaps = 4/80 (5%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
T N + + FE +I F + Y + EE +RF +F+DNL I+D NK G+N +
Sbjct: 38 TSNDKLIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKVR-NYWLGLNEFA 96
Query: 85 DLTREEMKSR-LGL--NLSK 101
DL+ EE K++ LGL +LSK
Sbjct: 97 DLSHEEFKNKYLGLKPDLSK 116
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 36/60 (60%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+ +F + +++SYPT EE +RF V+ N++ IE N+ + T T G N +DLT EE
Sbjct: 46 MDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEE 105
>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 42/67 (62%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ F+KSY ++ E +RF +F NL+ I + N + TA Y IN +DL+++E S
Sbjct: 28 FEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHND-STAQYEINKFADLSKDETIS 86
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 87 KYTGLSL 93
>gi|413944254|gb|AFW76903.1| hypothetical protein ZEAMMB73_202256 [Zea mays]
Length = 151
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 39/68 (57%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+ + + E+++ +S+ Y E A+RF VF+ N+K IE N G + GIN +D
Sbjct: 29 EDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKFIESFNTGGNRKFWLGINQFAD 88
Query: 86 LTREEMKS 93
LT +E ++
Sbjct: 89 LTNDEFRT 96
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 36/59 (61%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ ++Y EE +RF +F+ NLK IE+ N + T G+NH +DLT EE
Sbjct: 36 EKHEQWMARHGRTYQDDEEKERRFHIFKKNLKHIENFNNAFNRTYKLGLNHFADLTDEE 94
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 44/72 (61%), Gaps = 3/72 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTRE 89
+F+ F + K+Y + E +KRF +F DN++ IE N E G +Y GIN +D+++E
Sbjct: 25 KFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQE 84
Query: 90 EMKSRLGLNLSK 101
E K+ L L+ S+
Sbjct: 85 EFKTMLTLSASR 96
>gi|70935030|ref|XP_738656.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56515053|emb|CAH79945.1| hypothetical protein PC000617.03.0 [Plasmodium chabaudi chabaudi]
Length = 221
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 39/68 (57%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F++ F+K Y + EE+ +RF +F +NLK +E NK + GIN SD+
Sbjct: 149 NLESVNIFYNFMKKFNKQYNSAEEMQERFYIFTENLKKVEKHNKEKKYMYKKGINPFSDM 208
Query: 87 TREEMKSR 94
EE K R
Sbjct: 209 RPEEFKMR 216
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F + K Y T EE+ +RF VF DNLK+I NK + + G+N +D+T +
Sbjct: 57 HALLFARFAHRYGKRYETVEEIKQRFEVFLDNLKMIRSHNK-KGLSYKLGVNEFTDITWD 115
Query: 90 EM-KSRLG 96
E + RLG
Sbjct: 116 EFRRDRLG 123
>gi|14424447|sp|P25780.2|PEPT1_EURMA RecName: Full=Peptidase 1; AltName: Full=Allergen Eur m I;
AltName: Full=Mite group 1 allergen Eur m 1; AltName:
Allergen=Eur m 1; Flags: Precursor
gi|3941388|gb|AAC82351.1| group 1 allergen Eur m 1 0101 precursor [Euroglyphus maynei]
Length = 321
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 12/73 (16%)
Query: 28 PEHLKQFEKFIRDFSKSY--PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y P KEEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKTYATPEKEEVARK--NFLESLKYVES-NKG-------AINHLSD 69
Query: 86 LTREEMKSRLGLN 98
L+ +E K++ +N
Sbjct: 70 LSLDEFKNQFLMN 82
>gi|323452406|gb|EGB08280.1| hypothetical protein AURANDRAFT_26549 [Aureococcus
anophagefferens]
Length = 339
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F KF DFS +Y + +E ++RF F+ NL +I+ LNK H A +GI +D + +E
Sbjct: 21 FSKFQEDFSTTYSSPDETSERFTYFKKNLGMIDKLNK-VHPHALFGITKFADKSEDERAV 79
Query: 94 R 94
R
Sbjct: 80 R 80
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+KQFE+++ ++ + Y +E RF +F++N+ IE N + T GIN +D+T E
Sbjct: 34 MKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNE 93
Query: 91 MKSRL-GLNL 99
++ GL+L
Sbjct: 94 FVAQYTGLSL 103
>gi|389608785|dbj|BAM18004.1| unknown unsecreted protein [Papilio xuthus]
Length = 88
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKF++DF K+Y + + + F +L I ++N + G+ATYG+N +D T EE K
Sbjct: 22 FEKFVKDFDKNYKDDADREEHYQAFIKSLHRINEMN-SKDGSATYGVNKFADYTEEETKQ 80
Query: 94 RLGL 97
G+
Sbjct: 81 MRGM 84
>gi|301762528|ref|XP_002916735.1| PREDICTED: cathepsin W-like [Ailuropoda melanoleuca]
Length = 374
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LKQ F F +++SY EE A+R +F NL + L + GTA +G+ SDL
Sbjct: 35 PLELKQVFTLFQIQYNRSYSNPEEYARRLDIFARNLAQAQQLEAEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|281350618|gb|EFB26202.1| hypothetical protein PANDA_004780 [Ailuropoda melanoleuca]
Length = 373
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LKQ F F +++SY EE A+R +F NL + L + GTA +G+ SDL
Sbjct: 35 PLELKQVFTLFQIQYNRSYSNPEEYARRLDIFARNLAQAQQLEAEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 35/87 (40%), Positives = 45/87 (51%), Gaps = 4/87 (4%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
L Q+ E N EH F F FSK+Y TKEE RF VF+ NL + L++
Sbjct: 32 LIRQVVDTAEDHILNAEH--HFTSFKSKFSKNYATKEEHDYRFGVFKSNL-IKAKLHQKL 88
Query: 73 HGTATYGINHLSDLTREEMKSR-LGLN 98
+A +GI SDLT E + + LGLN
Sbjct: 89 DPSAQHGITKFSDLTASEFRRQFLGLN 115
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F + K Y + EE+ +RF VF DNLK+I NK + + G+N +DLT +
Sbjct: 57 HALSFVRFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNK-KGLSYKLGVNEFTDLTWD 115
Query: 90 EM-KSRLG 96
E + RLG
Sbjct: 116 EFRRDRLG 123
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 54/99 (54%), Gaps = 9/99 (9%)
Query: 9 ATLALFGQMKSNNELKTENPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDN 61
ATL + + + + +PEHL FE ++ SK+Y + EE RF +F DN
Sbjct: 15 ATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFEIFLDN 74
Query: 62 LKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGLNL 99
LK I++ NK + + G+N +DL+ EE KS+ LGL +
Sbjct: 75 LKHIDETNK-KVSSYWLGLNEFADLSHEEFKSKYLGLRV 112
>gi|194870649|ref|XP_001972693.1| GG15663 [Drosophila erecta]
gi|190654476|gb|EDV51719.1| GG15663 [Drosophila erecta]
Length = 549
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/69 (39%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 29 EHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
EH+ K F F R +YP+ E R +F NL+ I N+ + T T +NHL+D T
Sbjct: 239 EHVDKAFHHFKRKHGVAYPSDTEHEHRKNIFRQNLRYIHSKNRAKL-TYTLAVNHLADKT 297
Query: 88 REEMKSRLG 96
EE+K+R G
Sbjct: 298 EEELKARRG 306
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 4/82 (4%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
LA+ Q N E KT F FI+ F + Y + EE RF ++ N+ + L
Sbjct: 140 LAMNSQEWQNEEKKT----LWSDFMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQF 195
Query: 71 GEHGTATYGINHLSDLTREEMK 92
E GTA YG SD+T EE +
Sbjct: 196 EEKGTAIYGATKFSDMTAEEFQ 217
>gi|312377879|gb|EFR24605.1| hypothetical protein AND_10691 [Anopheles darlingi]
Length = 375
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 32/90 (35%), Positives = 44/90 (48%), Gaps = 4/90 (4%)
Query: 12 ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F M+ ++E EHL +F +F K+Y + E R VF NL+ I N+
Sbjct: 52 ATFNPMQEFVHPRSE--EHLHDEFSRFKGKHQKTYASDREHEHRLNVFRQNLRFIHSHNR 109
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
G T +NHL+D T +EMKS G S
Sbjct: 110 ANRGF-TVAVNHLADRTEDEMKSLRGFRSS 138
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 2/83 (2%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
LA++ S EL +++ EK++ K Y EE +RF +F++N++ IE N
Sbjct: 18 LAMWADQASTRELHEST--MVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSNA 75
Query: 71 GEHGTATYGINHLSDLTREEMKS 93
+ + GIN +DLT EE ++
Sbjct: 76 AGNNSYMLGINRFADLTNEEFRA 98
>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
Length = 353
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 24/74 (32%), Positives = 41/74 (55%)
Query: 20 NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
+ + T + K + +FI++++KSY +E+ R+ VF N+ K ++ T YG
Sbjct: 41 SQDTATHHDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARAMLFQKHDNATGRYG 100
Query: 80 INHLSDLTREEMKS 93
LSDLT +E+KS
Sbjct: 101 FTKLSDLTDQEVKS 114
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 20/71 (28%), Positives = 41/71 (57%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K+FE+++ ++ + Y +E +RF +F++N+ IE N + T GIN +D+T E
Sbjct: 34 MKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNENSYTLGINQFTDMTNNE 93
Query: 91 MKSRLGLNLSK 101
++ +S+
Sbjct: 94 FIAQYTGGISR 104
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +DLT +
Sbjct: 55 HVLSFARFTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNK-KRLSYKLGVNQFADLTWQ 113
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 114 EFQRNKLG 121
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats.
Identities = 22/66 (33%), Positives = 36/66 (54%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E+++ + + Y E A+RF VF+DNL +E N + G+N +DLT EE
Sbjct: 39 ERHERWMAVYGRVYKDAAEKARRFEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEF 98
Query: 92 KSRLGL 97
K+ G
Sbjct: 99 KANKGF 104
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 44/69 (63%), Gaps = 4/69 (5%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY-GINHLSDLTRE 89
++ FE +I +F K+Y T EE RF VF+DNLK I++ NK G + + G+N +DL+ E
Sbjct: 48 IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNK--KGKSYWLGLNEFADLSHE 105
Query: 90 EMKS-RLGL 97
E K LGL
Sbjct: 106 EFKKMYLGL 114
>gi|332249835|ref|XP_003274061.1| PREDICTED: cathepsin W [Nomascus leucogenys]
Length = 403
Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A+R +F NL + L + + GTA +G+ SDL
Sbjct: 62 PLELKEAFKLFQIQFNRSYLSPEEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 121
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 122 TEEEFGQLYG 131
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats.
Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 1/71 (1%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
K + E + +E ++ KSY E +RF +F+DNL+ IE+ N + T G+N
Sbjct: 44 KRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHN-AVNRTYKVGLNRF 102
Query: 84 SDLTREEMKSR 94
+DLT EE +SR
Sbjct: 103 ADLTNEEYRSR 113
>gi|351701945|gb|EHB04864.1| Cathepsin W [Heterocephalus glaber]
Length = 373
Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F+KSY E A+R +F NL + + L + + GTA +G+ SDL
Sbjct: 35 PLELKEVFKLFQIQFNKSYSNPAEHARRLDIFVHNLAMAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|31096290|gb|AAP43630.1| chabaupain-2 [Plasmodium chabaudi chabaudi]
Length = 471
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 39/68 (57%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F++ F+K Y + EE+ +RF +F +NLK +E NK + GIN SD+
Sbjct: 147 NLESVNIFYNFMKKFNKQYNSAEEMQERFYIFTENLKKVEKHNKEKKYMYKKGINPFSDM 206
Query: 87 TREEMKSR 94
EE K R
Sbjct: 207 RPEEFKMR 214
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 44/69 (63%), Gaps = 4/69 (5%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY-GINHLSDLTRE 89
++ FE +I +F K+Y T EE RF VF+DNLK I++ NK G + + G+N +DL+ E
Sbjct: 48 IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNK--KGKSYWLGLNEFADLSHE 105
Query: 90 EMKS-RLGL 97
E K LGL
Sbjct: 106 EFKKMYLGL 114
>gi|229594208|ref|XP_001031647.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225567000|gb|EAR83984.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 331
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 35/89 (39%), Positives = 45/89 (50%), Gaps = 7/89 (7%)
Query: 11 LALFGQMKSNNELKTENPE---HLK--QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
LAL G + L T NP HL F KF R F+ Y + E + R +VF +NLK+I
Sbjct: 10 LALIG--AATVYLITRNPNGDGHLDMYSFLKFKRSFNVQYHNESEESYRLSVFLENLKMI 67
Query: 66 EDLNKGEHGTATYGINHLSDLTREEMKSR 94
E N T +N +DLT EE +SR
Sbjct: 68 EKHNADSTRTYDQEVNQFADLTIEEFESR 96
>gi|297688135|ref|XP_002821545.1| PREDICTED: cathepsin W [Pongo abelii]
Length = 376
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A R +F +NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFANNLAQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
Length = 324
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 4/68 (5%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
FE F+ +F+K+Y +K E RF +F+ NL+ E +NK + T A Y IN SDL+++E
Sbjct: 28 FEDFLHNFNKNYSSKSEKLHRFKIFQHNLE--EIINKNLNDTSAQYEINKFSDLSKDETI 85
Query: 93 SRL-GLNL 99
S+ GL+L
Sbjct: 86 SKYTGLSL 93
>gi|8050826|gb|AAF71757.1| cysteine protease falcipain-3 [Plasmodium falciparum]
Length = 488
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 1/77 (1%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
+N E + F F+++ +K Y T EE+ KRF +F +N + IE NK + G+N D
Sbjct: 159 DNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGD 218
Query: 86 LTREEMKSRLGLNLSKH 102
L+ EE +S+ LNL H
Sbjct: 219 LSPEEFRSKY-LNLKTH 234
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/66 (31%), Positives = 36/66 (54%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E+++ ++ + Y E A+RF VF+DN +E N + G+N +DLT EE
Sbjct: 3 ERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEEF 62
Query: 92 KSRLGL 97
K+ G
Sbjct: 63 KANKGF 68
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
GQ +S++ L T H F F R F KSY ++EE RF+VF+ NL+ K +
Sbjct: 37 GQDESSSNLLTAEQHH---FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLD-P 92
Query: 75 TATYGINHLSDLTREEMKSR-LGL 97
TA++G+ SDLT E + + LGL
Sbjct: 93 TASHGVTQFSDLTSAEFRKQVLGL 116
>gi|402892809|ref|XP_003909601.1| PREDICTED: cathepsin W [Papio anubis]
Length = 375
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTLFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
Length = 1118
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 38/57 (66%), Gaps = 2/57 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
FE+FI+D++K Y E+ +RF +F +NLK I +N+ A YGIN SDL++EE
Sbjct: 302 FEQFIKDYNKEYDESEK-EERFKIFVNNLKDINAMNE-RSSNAVYGINKFSDLSKEE 356
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 38/57 (66%), Gaps = 2/57 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
FE+FI+D++K Y E+ +RF +F +NLK I +N+ A YGIN SDL++EE
Sbjct: 519 FEQFIKDYNKEYDESEK-EERFKIFVNNLKDINAMNE-RSSNAVYGINKFSDLSKEE 573
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 38/57 (66%), Gaps = 2/57 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
FE+FI+D++K Y E+ +RF +F +NLK I +N+ A YGIN SDL+++E
Sbjct: 819 FEQFIKDYNKEYDESEK-EERFKIFVNNLKDINAMNE-RSSNAVYGINKFSDLSKDE 873
>gi|124803852|ref|XP_001347833.1| falcipain-3 [Plasmodium falciparum 3D7]
gi|9255922|gb|AAF86352.1|AF282974_1 cysteine protease falcipain-3 [Plasmodium falciparum]
gi|23496085|gb|AAN35746.1|AE014838_24 falcipain-3 [Plasmodium falciparum 3D7]
Length = 492
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 1/77 (1%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
+N E + F F+++ +K Y T EE+ KRF +F +N + IE NK + G+N D
Sbjct: 163 DNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGD 222
Query: 86 LTREEMKSRLGLNLSKH 102
L+ EE +S+ LNL H
Sbjct: 223 LSPEEFRSKY-LNLKTH 238
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
GQ +S++ L T H F F R F KSY ++EE RF+VF+ NL+ K +
Sbjct: 37 GQDESSSNLLTAEQHH---FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLD-P 92
Query: 75 TATYGINHLSDLTREEMKSR-LGL 97
TA++G+ SDLT E + + LGL
Sbjct: 93 TASHGVTQFSDLTSAEFRKQVLGL 116
>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 4/68 (5%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
FE F+ +F+K+Y +K E RF +F+ NL+ E +NK + T A Y IN SDL+++E
Sbjct: 28 FEDFLHNFNKNYSSKSEKLHRFKIFQHNLE--EIINKNLNDTSAQYEINKFSDLSKDETI 85
Query: 93 SRL-GLNL 99
S+ GL+L
Sbjct: 86 SKYTGLSL 93
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 35/59 (59%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
E+++ +S+ Y E A+RF VF+ N++ IE N G + G+N +DLT +E +S
Sbjct: 131 EQWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQFADLTNDEFRS 189
>gi|358347416|ref|XP_003637753.1| Cysteine proteinase [Medicago truncatula]
gi|355503688|gb|AES84891.1| Cysteine proteinase [Medicago truncatula]
Length = 323
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 38/63 (60%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
K E++++DF ++Y E KRF +F NL+ IE+ N+ + T G+N DLT++E
Sbjct: 32 KTHEQWMKDFGRTYADDVEKEKRFKIFAKNLEYIENFNRAGNETYELGLNQFLDLTKKEF 91
Query: 92 KSR 94
S+
Sbjct: 92 TSK 94
>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
Length = 352
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 22/65 (33%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F + K Y + EE+ RF +F +NL+LI+ NK + + G+NH +DL+ +
Sbjct: 49 HAASFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTNK-KRLSYKLGLNHFADLSWD 107
Query: 90 EMKSR 94
E +++
Sbjct: 108 EFRTQ 112
>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
Length = 362
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 34/86 (39%), Positives = 47/86 (54%), Gaps = 4/86 (4%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
L Q+ + + + N EH F F FSKSY TKEE RF VF+ NLK + L++
Sbjct: 28 LIRQVTDHEDDQLLNAEH--HFTTFKSKFSKSYATKEEHDYRFGVFKSNLKKAK-LHQKL 84
Query: 73 HGTATYGINHLSDLTREEMKSR-LGL 97
+A +G+ SDLT E + + LGL
Sbjct: 85 DPSAEHGVTKFSDLTASEFRRQFLGL 110
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 47/81 (58%), Gaps = 9/81 (11%)
Query: 27 NPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
+PEHL FE ++ SK+Y + EE RF +F DNLK I++ NK + + G
Sbjct: 33 SPEHLASMDKTIELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNK-KVSSYWLG 91
Query: 80 INHLSDLTREEMKSR-LGLNL 99
+N +DL+ EE KS+ LGL +
Sbjct: 92 LNEFADLSHEEFKSKYLGLRV 112
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 40/69 (57%), Gaps = 2/69 (2%)
Query: 25 TENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
T PE+ +Q +E+F + + K+Y ++ RF+VF++NL L E GTA YG+
Sbjct: 297 TPEPENARQLYEEFKQKYKKTYVNDDD-EYRFSVFKENLLRAHQLQTMEQGTAEYGVTQF 355
Query: 84 SDLTREEMK 92
DLT +E +
Sbjct: 356 FDLTSQEFQ 364
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/67 (31%), Positives = 38/67 (56%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E ++ ++ + Y E A+RF VF+DN+ +E N ++ G+N +DLT EE
Sbjct: 33 VERHENWMVEYGRVYKDAAEKARRFQVFKDNVAFVESFNTNKNNKFWLGVNQFADLTTEE 92
Query: 91 MKSRLGL 97
K+ G
Sbjct: 93 FKANKGF 99
>gi|118397782|ref|XP_001031222.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89285547|gb|EAR83559.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 331
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/67 (31%), Positives = 37/67 (55%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E L+ + KF R++ + Y + E R A+F +N + I+D N T G+N SD+T+
Sbjct: 27 EALQAYNKFTRNYPRIYLNEAESDYRLAIFLENYQKIQDHNNNPENTYQIGVNRFSDMTQ 86
Query: 89 EEMKSRL 95
+E ++
Sbjct: 87 QEFSQKI 93
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 48.5 bits (114), Expect = 4e-04, Method: Composition-based stats.
Identities = 27/83 (32%), Positives = 49/83 (59%), Gaps = 5/83 (6%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
+++ G++ S+ +T++ E + +E ++ KSY E KRF +F+DNL+ I++ N
Sbjct: 27 MSIIGELSSS---RTDD-EVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHN- 81
Query: 71 GEHGTATYGINHLSDLTREEMKS 93
E T G+N +DLT +E +S
Sbjct: 82 AESRTYKVGLNRFADLTNDEYRS 104
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 48.5 bits (114), Expect = 4e-04, Method: Composition-based stats.
Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 7/80 (8%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
G + S N L ++ F++++ K Y + EE A+R +F NL+ I NK +
Sbjct: 31 GDINSGNGL-------VRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNS 83
Query: 75 TATYGINHLSDLTREEMKSR 94
+ G+N +DLT EE K+R
Sbjct: 84 SFRLGLNKFADLTNEEFKTR 103
>gi|118387039|ref|XP_001026636.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89308403|gb|EAS06391.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 336
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 39/64 (60%), Gaps = 3/64 (4%)
Query: 29 EHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
+HL+Q F F + F+K Y ++E RF VF +NLK IE LNK E TA + + SD
Sbjct: 33 KHLQQQSFLDFKKSFAKKYNSQEHELFRFNVFLENLKEIERLNK-EITTAKFDVTQFSDY 91
Query: 87 TREE 90
T+EE
Sbjct: 92 TKEE 95
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 19/60 (31%), Positives = 36/60 (60%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K+FE+++ ++ + Y +E +RF +F++N+ IE N + T GIN +D+T E
Sbjct: 34 MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNGNSYTLGINQFTDMTNNE 93
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 20/71 (28%), Positives = 41/71 (57%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K+FE+++ ++ + Y +E +RF +F++N+ IE N + T GIN +D+T E
Sbjct: 34 MKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNKDSYTLGINQFTDMTNNE 93
Query: 91 MKSRLGLNLSK 101
++ +S+
Sbjct: 94 FVAQYTGGISR 104
>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
Length = 715
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/63 (33%), Positives = 36/63 (57%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F F + Y +K+E RF +F +N++ + L E GTA YG+ +D++ E K
Sbjct: 418 FQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFKQ 477
Query: 94 RLG 96
+G
Sbjct: 478 YVG 480
>gi|11066228|gb|AAG28508.1|AF197480_1 cathepsin F [Mus musculus]
Length = 462
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE R VF N+ + + + GTA YGI SDLT EE +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
>gi|77628008|ref|NP_001029282.1| cathepsin F precursor [Rattus norvegicus]
gi|71681040|gb|AAH99780.1| Cathepsin F [Rattus norvegicus]
gi|149062007|gb|EDM12430.1| cathepsin F, isoform CRA_a [Rattus norvegicus]
gi|159895422|gb|ABX09995.1| cathepsin F [Rattus norvegicus]
Length = 462
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE R VF N+ + + + GTA YGI SDLT EE +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
>gi|9845246|ref|NP_063914.1| cathepsin F precursor [Mus musculus]
gi|12643321|sp|Q9R013.1|CATF_MOUSE RecName: Full=Cathepsin F; Flags: Precursor
gi|6467384|gb|AAF13147.1|AF136280_1 cathepsin F precursor [Mus musculus]
gi|7141165|gb|AAF37228.1|AF217224_1 cathepsin F [Mus musculus]
gi|26344728|dbj|BAC36013.1| unnamed protein product [Mus musculus]
gi|37589148|gb|AAH58758.1| Cathepsin F [Mus musculus]
gi|148701127|gb|EDL33074.1| cathepsin F, isoform CRA_b [Mus musculus]
Length = 462
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE R VF N+ + + + GTA YGI SDLT EE +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
Length = 343
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 26/65 (40%), Positives = 35/65 (53%), Gaps = 1/65 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F+ F++ F K Y T EE R VF+ NL + L K + TA +GI +DLT EE+
Sbjct: 45 HFKHFMQKFGKVYGTTEEYVHRLKVFQANLAHVMSLKK-QDPTAIHGITSFADLTPEELS 103
Query: 93 SRLGL 97
LG
Sbjct: 104 RFLGF 108
>gi|148701126|gb|EDL33073.1| cathepsin F, isoform CRA_a [Mus musculus]
Length = 417
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE R VF N+ + + + GTA YGI SDLT EE +
Sbjct: 120 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 179
>gi|4826565|emb|CAB42884.1| cathepsin F [Mus musculus]
Length = 462
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE R VF N+ + + + GTA YGI SDLT EE +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F + K Y + +E+ RF +F +NL+LI NK + G+NH +D T E
Sbjct: 47 HAVSFARFANRYGKRYDSVDEMKLRFKIFSENLELIRSSNK-RRLSYKLGVNHFADWTWE 105
Query: 90 EMKS-RLG 96
E +S RLG
Sbjct: 106 EFRSHRLG 113
>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
Length = 352
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 22/65 (33%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F + K Y + EE+ RF +F +NL+LI+ NK + + G+NH +DL+ +
Sbjct: 49 HAVSFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTNK-KRLSYKLGLNHFADLSWD 107
Query: 90 EMKSR 94
E +++
Sbjct: 108 EFRTQ 112
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 4/82 (4%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+KS K + ++ E+++ + K Y E+ KRF +FE+N++ IE N +
Sbjct: 23 QVKSR---KLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKP 79
Query: 76 ATYGINHLSDLTREE-MKSRLG 96
INHL+D T EE M S G
Sbjct: 80 YKLSINHLADQTNEEFMASHKG 101
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
FE + + K+Y ++EE A R VFE+N + N + + T +N +DLT E K
Sbjct: 29 FEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFKA 88
Query: 93 SRLGLN 98
SRLG +
Sbjct: 89 SRLGFS 94
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN----KGEHGTATYGINHLSDLTR 88
F+ F K+Y + E KRFA+F +NL+ IE N +G H + T GIN +D+TR
Sbjct: 25 HFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIH-SYTQGINKFADMTR 83
Query: 89 EEMKSRLG 96
E K+ L
Sbjct: 84 AEFKAMLA 91
>gi|195128649|ref|XP_002008774.1| GI11630 [Drosophila mojavensis]
gi|193920383|gb|EDW19250.1| GI11630 [Drosophila mojavensis]
Length = 547
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 2/88 (2%)
Query: 14 FGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
F +E + + EH+ K F F R + Y T++E R +F NL+ I N+ +
Sbjct: 222 FATFNPMHEFISGSDEHVEKAFHHFKRKHAIDYSTEKEHEHRKNIFRQNLRYIHSKNRAK 281
Query: 73 HGTATYGINHLSDLTREEMKSRLGLNLS 100
T +NHL+D T EEMK+R G S
Sbjct: 282 L-TYKLAVNHLADKTDEEMKARRGYKSS 308
>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
Length = 332
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++++Y ++EE R VF N+ + + + GTA YGI SDLT EE +
Sbjct: 35 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 94
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 25/65 (38%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + +E+++ K+Y E KRF +F+DNL I+ N E+ T T G+N +DLT
Sbjct: 37 EVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS-ENRTYTVGLNRFADLTN 95
Query: 89 EEMKS 93
EE +S
Sbjct: 96 EEFRS 100
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 25/65 (38%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + +E+++ K+Y E KRF +F+DNL I+ N E+ T T G+N +DLT
Sbjct: 46 EVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS-ENRTYTVGLNRFADLTN 104
Query: 89 EEMKS 93
EE +S
Sbjct: 105 EEFRS 109
>gi|241111179|ref|XP_002399230.1| cysteine protease and A protease inhibitor, putative [Ixodes
scapularis]
gi|215492918|gb|EEC02559.1| cysteine protease and A protease inhibitor, putative [Ixodes
scapularis]
Length = 363
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 27/84 (32%), Positives = 48/84 (57%), Gaps = 3/84 (3%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPT-KEEVAKRFAVFEDNLKLIEDLNK-GEH 73
+ ++++ +T +P FE++++ ++K+Y + E +KR F D L IED N+ G H
Sbjct: 29 RTETDDTNRTADPSVEAAFEQYVKRYNKTYASGSAEYSKRLNAFRDALIRIEDRNRHGNH 88
Query: 74 GT-ATYGINHLSDLTREEMKSRLG 96
A YG+ SDLT +E ++ L
Sbjct: 89 SNGALYGLTPYSDLTPDEFRALLA 112
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 20/71 (28%), Positives = 41/71 (57%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K+FE+++ ++ + Y +E +RF +F++N+ IE N + T GIN +D+T E
Sbjct: 34 MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNE 93
Query: 91 MKSRLGLNLSK 101
++ +S+
Sbjct: 94 FVAQYTGGISR 104
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 14/81 (17%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG- 74
++K N+ L TE K+F+ F++D+SK Y T EE R +F N+ + EH
Sbjct: 42 ELKDNDLLTTE-----KKFKLFMKDYSKKYSTTEEYLLRLGIFAKNM-----VKAAEHQA 91
Query: 75 ---TATYGINHLSDLTREEMK 92
TA +G+ SDL+ EE +
Sbjct: 92 LDPTAIHGVTQFSDLSEEEFE 112
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+++ K+Y E KRF +F+DNL+ I+D N ++ T G+N +DLT EE ++
Sbjct: 4 YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHN-ADNRTYKLGLNRFADLTNEEYRA 62
Query: 94 R 94
R
Sbjct: 63 R 63
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 48.5 bits (114), Expect = 6e-04, Method: Composition-based stats.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
GQ +S+ L T HL F+ R F KSY ++EE RF+VF+ NL+ K +
Sbjct: 43 GQDESSPNLLTAEQHHLSLFK---RKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLD-P 98
Query: 75 TATYGINHLSDLTREEMKSR-LGL 97
TA++G+ SDLT E + + LGL
Sbjct: 99 TASHGVTQFSDLTSAEFRKQVLGL 122
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 48.5 bits (114), Expect = 6e-04, Method: Composition-based stats.
Identities = 22/71 (30%), Positives = 42/71 (59%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
L+++ + ++E ++ + ++Y E KRF +F+DNL+ IE+ N + T G+N
Sbjct: 39 LQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQ 98
Query: 83 LSDLTREEMKS 93
+DLT EE ++
Sbjct: 99 FADLTNEEYRT 109
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 4/69 (5%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGINHLSDLTR 88
H F +F R + K Y + EE+ +RF +F DNL++I N KG + G+N SDLT
Sbjct: 55 HALSFARFARRYGKRYDSVEEIKQRFDIFLDNLEMINSHNDKGL--SYKLGVNEFSDLTW 112
Query: 89 EEM-KSRLG 96
+E + RLG
Sbjct: 113 DEFRRDRLG 121
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 28/72 (38%), Positives = 42/72 (58%), Gaps = 3/72 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTRE 89
+F+ F K+Y + E RF +F+DNL+ IE N E G +Y GIN +D+T+E
Sbjct: 24 KFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQE 83
Query: 90 EMKSRLGLNLSK 101
E ++ L L+ SK
Sbjct: 84 EFRAFLTLSSSK 95
>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
Length = 308
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 36/63 (57%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H + FI ++++Y K+E+ KRF +++ NL+ + E GTA YG SDLT+
Sbjct: 3 HGISVDGFIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQA 62
Query: 90 EMK 92
E +
Sbjct: 63 EFR 65
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 2/70 (2%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ FE++I + K Y T EE RF VF+DNLK I++ NK + + G+N +DLT +E
Sbjct: 45 IELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNK-KVTSYWLGVNEFADLTHQE 103
Query: 91 MKSR-LGLNL 99
K+ LGL +
Sbjct: 104 FKNMYLGLKV 113
>gi|401758206|gb|AFQ01138.1| cathepsin L3-like protease, partial [Chilo suppressalis]
Length = 330
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F++F + SK Y E+AKR +F NL+ I N+ G T +NHL+D T +EM
Sbjct: 244 EFDRFAKKHSKQYQNDVELAKRLNIFRQNLRYIHSNNRARRGF-TLSVNHLADRTDDEMA 302
Query: 93 SRLGLNLS 100
+ G S
Sbjct: 303 ALRGRRYS 310
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+E ++ K+Y E +RF +F+DNL+ I++ N G+H T G+N +DLT EE +
Sbjct: 52 YESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDH-TYKLGLNKFADLTNEEYR 109
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 5/68 (7%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTREEMK 92
E+++ + + Y T+ E KRF +F++N++ IE NK GT Y GIN +DLT +E K
Sbjct: 38 EQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKA--GTKPYKLGINAFADLTNQEFK 95
Query: 93 -SRLGLNL 99
SR G L
Sbjct: 96 ASRNGYKL 103
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 19/64 (29%), Positives = 38/64 (59%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K+FE+++ ++ + Y +E +RF +F++N+ IE N + T GIN +D+T E
Sbjct: 34 MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNE 93
Query: 91 MKSR 94
++
Sbjct: 94 FVTQ 97
>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
Length = 403
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 31/74 (41%), Positives = 38/74 (51%), Gaps = 7/74 (9%)
Query: 27 NPEHLKQ------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
N EHL F+KFI + K Y T EE +R +FE NL L N+ TA +GI
Sbjct: 77 NREHLLNLRSKTLFDKFIVEHGKVYSTIEEYVRRLRIFEKNL-LKAAENQALDPTAVHGI 135
Query: 81 NHLSDLTREEMKSR 94
SDLT E +SR
Sbjct: 136 TPFSDLTEYEFESR 149
>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
Length = 343
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 26/65 (40%), Positives = 35/65 (53%), Gaps = 1/65 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F+ F++ F K Y T EE R VF+ NL + L K + TA +GI +DLT EE+
Sbjct: 45 HFKHFMQKFGKVYGTTEEYVHRLKVFQANLVHVMSLKK-QDPTAIHGITSFADLTPEELS 103
Query: 93 SRLGL 97
LG
Sbjct: 104 RFLGF 108
>gi|440292376|gb|ELP85581.1| cathepsin L, putative [Entamoeba invadens IP1]
Length = 421
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF++F+++ + Y T E+ +R +FE +L+ IE+ NK H T GI SD T EE +
Sbjct: 16 QFKEFLKENNIVYTTPSELLRRRLIFEQSLREIEEFNKSPH-TFQIGITQFSDQTNEEFQ 74
Query: 93 SRLGLNLSKH 102
++ L + +
Sbjct: 75 NQFSLTMDRQ 84
>gi|341893196|gb|EGT49131.1| hypothetical protein CAEBREN_18227 [Caenorhabditis brenneri]
Length = 381
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 31/70 (44%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG--EHGTAT-YGINHLSD 85
E K FE+F F K Y T EE R VF N + LNK ++G T +GIN SD
Sbjct: 40 EAFKAFEEFKIRFHKKYKTPEEEKMRGEVFLKNHNTVGILNKKAEQNGQGTKFGINKFSD 99
Query: 86 LTREEMKSRL 95
LT++E +SRL
Sbjct: 100 LTKKEFQSRL 109
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 26/82 (31%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L +FG + +N L ++ +FE +I F K Y E KRF++F+ N+ + N
Sbjct: 158 LLIFGLIAISNALLFSEEQYKNEFENWIDRFEKKYDV-SEFKKRFSIFKSNMDFVHSWNS 216
Query: 71 GEHGTATYGINHLSDLTREEMK 92
++ G+NHL+DLT E +
Sbjct: 217 -KNSQTVLGLNHLADLTNLEYR 237
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN----KGEHGTATYGINHLSDLTR 88
F+ F K+Y + E KRFA+F +NL+ IE N +G H + T GIN +D+TR
Sbjct: 25 HFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIH-SYTQGINKFADMTR 83
Query: 89 EEMKSRLG 96
E K+ L
Sbjct: 84 AEFKAMLA 91
>gi|158519867|ref|NP_001103540.1| cathepsin W precursor [Bos taurus]
gi|158455042|gb|AAI13313.1| CTSW protein [Bos taurus]
gi|296471607|tpg|DAA13722.1| TPA: cathepsin W [Bos taurus]
Length = 272
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F F +++SYP E A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDL 94
Query: 87 TREEMKSRLGLNLS 100
T EE G ++
Sbjct: 95 TEEEFVQLYGSQVA 108
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F RD +K Y T+EE KR+A+F++NL I + N + + +N DLT EE +
Sbjct: 88 QFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQGY-SYVLKMNKFGDLTLEEFR 146
Query: 93 SR 94
R
Sbjct: 147 QR 148
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 2/70 (2%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ FE++I + K Y T EE RF VF+DNLK I++ NK + + G+N +DLT +E
Sbjct: 42 IELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNK-KVTSYWLGVNEFADLTHQE 100
Query: 91 MKSR-LGLNL 99
K+ LGL +
Sbjct: 101 FKNMYLGLKV 110
>gi|355751954|gb|EHH56074.1| Cathepsin W [Macaca fascicularis]
Length = 375
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 25 TENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
+ P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+
Sbjct: 32 SPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPF 91
Query: 84 SDLTREEMKSRLG 96
SDLT EE G
Sbjct: 92 SDLTEEEFGQLYG 104
>gi|109105377|ref|XP_001112560.1| PREDICTED: cathepsin W-like isoform 2 [Macaca mulatta]
gi|355566302|gb|EHH22681.1| Cathepsin W [Macaca mulatta]
Length = 375
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 25 TENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
+ P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+
Sbjct: 32 SPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPF 91
Query: 84 SDLTREEMKSRLG 96
SDLT EE G
Sbjct: 92 SDLTEEEFGQLYG 104
>gi|20129967|ref|NP_610907.1| CG6357 [Drosophila melanogaster]
gi|7303269|gb|AAF58330.1| CG6357 [Drosophila melanogaster]
Length = 439
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--GINHLSDLTREE 90
++KF+ DF Y ++E KR +F DN K I++ N+ E G ++ GIN SDLT EE
Sbjct: 347 WKKFLIDFGAKYQDEKETEKRRTIFCDNWKAIQEHNEQFELGVESFKKGINQWSDLTVEE 406
Query: 91 MKSRLGLNLS 100
K++ NL+
Sbjct: 407 WKTKQRPNLA 416
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTAT 77
S +E+ +N +EKF+ DF SY E KR VF DN K I N + + G +
Sbjct: 237 STSEIDNDNIICQPAWEKFLIDFKPSYQDDTETEKRRNVFCDNFKSIHKHNVQFDLGNIS 296
Query: 78 Y--GINHLSDLTREEMKSR 94
+ GIN SDLT EE K++
Sbjct: 297 FKKGINQWSDLTVEEWKNK 315
Score = 38.1 bits (87), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 3/64 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
+++F+ DF Y E KR +F +N + + D N K + G ++ GIN SDLT EE
Sbjct: 72 WQRFLVDFDVHYDNDYERQKRRDIFCENWQKVRDHNLKYDLGVVSFKKGINQWSDLTFEE 131
Query: 91 MKSR 94
K +
Sbjct: 132 WKEK 135
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+ +FE ++ K Y + EE RF VF +NL I++ NK E + G+N +DL+ EE
Sbjct: 401 IARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNK-EVSSYWLGLNEFADLSHEE 459
Query: 91 MKSR-LGL 97
KS+ LGL
Sbjct: 460 FKSKYLGL 467
>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 336
Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 38/64 (59%), Gaps = 2/64 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F + K Y T EE+ RF F +++KL+E NKG+H + + +N +D+T EE +
Sbjct: 28 HFAGFAAKYKKEYKTVEELKHRFVTFLESVKLVETHNKGQH-SYSLAVNEFADMTFEEFR 86
Query: 93 -SRL 95
SRL
Sbjct: 87 DSRL 90
>gi|426369199|ref|XP_004051582.1| PREDICTED: cathepsin W [Gorilla gorilla gorilla]
Length = 376
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|23110964|ref|NP_001326.2| cathepsin W preproprotein [Homo sapiens]
gi|29476894|gb|AAH48255.1| Cathepsin W [Homo sapiens]
gi|119594870|gb|EAW74464.1| cathepsin W (lymphopain), isoform CRA_b [Homo sapiens]
Length = 376
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|2582045|gb|AAB82449.1| lymphopain [Homo sapiens]
gi|2582181|gb|AAB82457.1| lymphopain [Homo sapiens]
gi|3033547|gb|AAC32181.1| cathepsin W [Homo sapiens]
Length = 376
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|397516975|ref|XP_003828695.1| PREDICTED: cathepsin W [Pan paniscus]
Length = 376
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|259016196|sp|P56202.2|CATW_HUMAN RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
Precursor
Length = 376
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|114638622|ref|XP_001170363.1| PREDICTED: cathepsin W [Pan troglodytes]
Length = 376
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY + EE A R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|294874412|ref|XP_002766943.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239868318|gb|EEQ99660.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 366
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
F F F K Y +KEE KR A+F+ NL IE +N ++ + T G+N +DLT EE
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVN-AQNLSYTLGVNEYADLTHEE 83
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 22/57 (38%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
F+KF + ++K Y ++E R ++F++NL+ IE NK + A +GI +DLT EE
Sbjct: 30 FKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKNDE--AQHGITQFADLTHEE 84
>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
Length = 338
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 9 ATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDL 68
AT + M +N + N E L F++F+ + K Y E RF VF+ NL +I +
Sbjct: 15 ATTPIVSSM-NNLQYDLSNSEVL--FDEFVTKYGKVYANDAERKSRFDVFKANLAIINER 71
Query: 69 NKGEHGTATYGINHLSDLTREE-MKSRLGLNLSKH 102
N E +AT+GIN SDL+ E ++ + G + H
Sbjct: 72 NAQEE-SATFGINFYSDLSSNELLRKQTGFKTALH 105
>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ F+K Y ++ E +RF +F+ NL+ I N+ + TA Y IN SDL+++E S
Sbjct: 28 FEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQND-TTAQYEINKFSDLSKDETIS 86
Query: 94 R 94
+
Sbjct: 87 K 87
>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ F+K Y ++ E +RF +F+ NL+ I N+ + TA Y IN SDL+++E S
Sbjct: 28 FEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQND-TTAQYEINKFSDLSKDETIS 86
Query: 94 R 94
+
Sbjct: 87 K 87
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 4/82 (4%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+KS K + ++ E+++ + K Y E KRF +FE+N++ IE N +
Sbjct: 23 QVKSR---KLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKP 79
Query: 76 ATYGINHLSDLTREE-MKSRLG 96
INHL+D T EE M S G
Sbjct: 80 YKLSINHLADQTNEEFMASHKG 101
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 1/89 (1%)
Query: 6 SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
+++ ++ + Q +N+ ++T++ E + + ++ KSY E RF +F+DNL+ I
Sbjct: 22 ASDMSIINYDQTHTNSLIRTDD-EVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYI 80
Query: 66 EDLNKGEHGTATYGINHLSDLTREEMKSR 94
++ N + G+N +DLT EE +++
Sbjct: 81 DNHNADPDRSYELGLNRFADLTNEEYRAK 109
>gi|118387041|ref|XP_001026637.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89308404|gb|EAS06392.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 335
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 1/62 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
+ + F F ++F+K Y ++E RF VF +NLK IE LNK E +A + + SD T+
Sbjct: 34 QQQQSFLDFKKNFAKKYHSQEHEQYRFNVFLENLKEIERLNK-EITSAKFAVTQFSDYTK 92
Query: 89 EE 90
EE
Sbjct: 93 EE 94
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 1/76 (1%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
EN L++ E+++ + Y E A RF +F N++ IE N H G+N +D
Sbjct: 33 ENKSMLERHEQWMAQHGRVYKNAAEKAHRFEIFRANVERIESFNAENH-KFKLGVNQFAD 91
Query: 86 LTREEMKSRLGLNLSK 101
LT EE K+R L SK
Sbjct: 92 LTNEEFKTRNTLKPSK 107
>gi|155970232|gb|ABU41785.1| cysteine protease [Rosa x borboniana]
Length = 357
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H++ F +F + K Y + EE+ +RF +F +N KLI N+ + + G+N +D T E
Sbjct: 54 HVRSFARFAYRYEKRYESVEEMGRRFEIFAENKKLIRSTNR-KGLSYKLGVNRFADWTWE 112
Query: 90 EM-KSRLG 96
E + RLG
Sbjct: 113 EFQRHRLG 120
>gi|357631369|gb|EHJ78914.1| cysteine protease [Danaus plexippus]
Length = 329
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 46/70 (65%), Gaps = 4/70 (5%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
K+ + E L F +++ ++K Y ++E +RF +F++NL+ I +LN+ + T YGINHL
Sbjct: 30 KSTDAEDL--FIEYVHKYNKRY-NEDEYDRRFQIFKENLENINELNRKSNLT-VYGINHL 85
Query: 84 SDLTREEMKS 93
+DL EE+ S
Sbjct: 86 TDLKYEEVAS 95
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 2/68 (2%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K FE+++ + K+Y + EE RF VF+DNL I++ NK + T G+N +DLT +E
Sbjct: 63 IKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANK-KVTTYWLGLNAFADLTHDE 121
Query: 91 MKSR-LGL 97
K+ LGL
Sbjct: 122 FKATYLGL 129
>gi|167534377|ref|XP_001748864.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772544|gb|EDQ86194.1| predicted protein [Monosiga brevicollis MX1]
Length = 340
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 34/67 (50%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE + ++ K+Y T E R VFE NL I N T G+NH+SD T EE +
Sbjct: 32 FEHYKAEYKKAYATTTEHEYRRQVFEQNLAKIRAHNADTTKTWKEGVNHMSDWTSEEFRR 91
Query: 94 RLGLNLS 100
LG + S
Sbjct: 92 LLGYDQS 98
>gi|326526731|dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 341
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 22/64 (34%), Positives = 36/64 (56%), Gaps = 2/64 (3%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
+ +F++F FSK+Y + EE R+A F DNL+ + LN + G +G+ D+T
Sbjct: 28 NFAKFQEFTARFSKNYKSVEEYTTRYATFLDNLERVAKLN--QDGRGVFGVTKFMDMTPA 85
Query: 90 EMKS 93
E K+
Sbjct: 86 EFKA 89
>gi|156089449|ref|XP_001612131.1| papain family cysteine protease containing protein [Babesia bovis]
gi|154799385|gb|EDO08563.1| papain family cysteine protease containing protein [Babesia bovis]
Length = 435
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 34/69 (49%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F RDF + + E +RFA F N+ I + N H T T IN +D+T E+
Sbjct: 119 QFNDFNRDFKRHDNSISEKIERFATFYRNVTRIREFNMNVHKTYTMKINQFADMTPEQFM 178
Query: 93 SRLGLNLSK 101
S G SK
Sbjct: 179 SLQGTRASK 187
>gi|294874400|ref|XP_002766937.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239868312|gb|EEQ99654.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 347
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
F F F K Y +KEE KR A+F+ NL IE +N ++ + T G+N +DLT EE
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVN-AQNLSYTLGVNEYADLTHEE 83
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 38/62 (61%), Gaps = 1/62 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L +F+ + +++++Y T EE +RF V+ +N+K IE +N+ + G N +DLT EE
Sbjct: 34 LDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQ-PGSSYELGENQFADLTEEE 92
Query: 91 MK 92
K
Sbjct: 93 FK 94
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
T+N E + +E+++ K Y E KRF VF+DNL I++ N ++ T G+N +
Sbjct: 32 TDN-EVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFA 90
Query: 85 DLTREEMK 92
D+T EE +
Sbjct: 91 DMTNEEYR 98
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 35/59 (59%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
E+++ +S+ Y E A+RF VF+ N+K IE N G + G+N +DLT +E ++
Sbjct: 38 EQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRA 96
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 21/73 (28%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTRE 89
L++F+ + +++++Y T EE +RF ++ +N++ I+ +N+ G++ G N +DLT E
Sbjct: 35 LERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEE 94
Query: 90 EMKSRLGLNLSKH 102
E K + L +
Sbjct: 95 EFKDTYLMKLDEQ 107
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 29/74 (39%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSD 85
+H FE F + KSY + E +RF +F N+ IE N E G +Y IN +D
Sbjct: 21 KHQALFETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTD 80
Query: 86 LTREEMKSRLGLNL 99
LT+EE K+ LGL++
Sbjct: 81 LTQEEFKAYLGLHV 94
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 26/62 (41%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EH +Q+ + FS+ Y + E RF VF+ NLK IE NK T G+N +D TR
Sbjct: 21 EHHQQW---MTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTR 77
Query: 89 EE 90
EE
Sbjct: 78 EE 79
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 26/62 (41%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EH +Q+ + FS+ Y + E RF VF+ NLK IE NK T G+N +D TR
Sbjct: 45 EHHQQW---MTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTR 101
Query: 89 EE 90
EE
Sbjct: 102 EE 103
>gi|443691408|gb|ELT93269.1| hypothetical protein CAPTEDRAFT_181131 [Capitella teleta]
Length = 541
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 2/86 (2%)
Query: 16 QMKSNNELKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
Q+ +E +N EH+ F+ + +D+SK Y E A R VF+ NL+ IE N+
Sbjct: 222 QVNPMHEYIHDNDEHIHGMFDGYKKDYSKDYKDDFEHASRLHVFKHNLRYIESQNR-RGL 280
Query: 75 TATYGINHLSDLTREEMKSRLGLNLS 100
T T +NHL+D E+ S G + S
Sbjct: 281 TYTLAMNHLADRKDRELVSLRGFHRS 306
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 9/78 (11%)
Query: 28 PEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
PEHL FE ++ + SK+Y + EE RF VF +NL I+ N E + G+
Sbjct: 38 PEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNN-EINSYWLGL 96
Query: 81 NHLSDLTREEMKSR-LGL 97
N +DLT EE K R LGL
Sbjct: 97 NEFADLTHEEFKGRYLGL 114
>gi|395852405|ref|XP_003798729.1| PREDICTED: cathepsin W [Otolemur garnettii]
Length = 367
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 26/70 (37%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY E ++R +F NL + L + + GTA +G+ LSDL
Sbjct: 35 PLELKEVFKLFQVQFNRSYSNPAEHSRRLDIFAHNLAKAQQLQEEDLGTAEFGMTSLSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGKIFG 104
>gi|407394331|gb|EKF26898.1| cysteine proteinase, putative [Trypanosoma cruzi marinkellei]
Length = 392
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F++++ K Y KE V +R A+FE L + N+ + GINH+SD T EE S
Sbjct: 55 FDRFLQEYGKKYDAKEYVRRR-AIFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEEFTS 113
Query: 94 RLG 96
G
Sbjct: 114 LNG 116
>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 377
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 6/77 (7%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
+++ N L+TE K+F F+ ++ K Y T+EE +R +F N+ L N+ T
Sbjct: 40 KLQDNQLLRTE-----KKFNVFMENYGKKYSTREEYLQRLEIFAGNM-LRAPENQALDPT 93
Query: 76 ATYGINHLSDLTREEMK 92
A +G+ SDLT +E +
Sbjct: 94 AIHGVTQFSDLTEDEFQ 110
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 26/68 (38%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H +F +F + K Y T EE+ RF +F ++L+LI+ NK + + G+N +D T E
Sbjct: 53 HALRFARFAHRYGKKYETAEEMKLRFGIFLESLELIKSTNK-QGLSYKLGVNQFADWTWE 111
Query: 90 EM-KSRLG 96
E K RLG
Sbjct: 112 EFRKHRLG 119
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 36/60 (60%), Gaps = 1/60 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E ++ KSY E +RF +F+DNL+ I++ N E+ T G+N +DLT EE +S
Sbjct: 51 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHN-AENRTYKVGLNRFADLTNEEYRS 109
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 42/69 (60%), Gaps = 2/69 (2%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
+EN E + +E+++ K Y +E KRF VF+DNL I+D N ++ T T G+N +
Sbjct: 28 SEN-EVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHN-AQNNTYTLGLNKFA 85
Query: 85 DLTREEMKS 93
D+T EE ++
Sbjct: 86 DITNEEYRA 94
>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +DLT +
Sbjct: 55 HVLTFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 114 EFQRTKLG 121
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 36/60 (60%), Gaps = 1/60 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E ++ KSY E +RF +F+DNL+ I++ N E+ T G+N +DLT EE +S
Sbjct: 53 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHN-AENRTYKVGLNRFADLTNEEYRS 111
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 21/73 (28%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTRE 89
L++F+ + +++++Y T EE +RF ++ +N++ I+ +N+ G++ G N +DLT E
Sbjct: 61 LERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEE 120
Query: 90 EMKSRLGLNLSKH 102
E K + L +
Sbjct: 121 EFKDTYLMKLDEQ 133
>gi|345314917|ref|XP_003429566.1| PREDICTED: cathepsin F-like, partial [Ornithorhynchus anatinus]
Length = 219
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 37/65 (56%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + F++F+ +S+SY E +R +F NL+ + + + G+A YG+ SDLT
Sbjct: 54 EVISLFKEFLTTYSRSYANATETQRRLGIFAHNLERARRIQELDQGSARYGVTKFSDLTE 113
Query: 89 EEMKS 93
EE ++
Sbjct: 114 EEFRT 118
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 35/59 (59%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
E+++ +S+ Y E A+RF VF+ N+K IE N G + G+N +DLT +E ++
Sbjct: 38 EQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRA 96
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 36/60 (60%), Gaps = 1/60 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTREE 90
++ E+++ + K Y +E KRF +F +N+K IE N G++ + GIN +DLT EE
Sbjct: 37 ERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEE 96
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 38/62 (61%), Gaps = 1/62 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L +F+ + +++++Y T EE +RF V+ +N+K IE +N+ + G N +DLT EE
Sbjct: 34 LDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQ-PGSSYELGENRFADLTEEE 92
Query: 91 MK 92
K
Sbjct: 93 FK 94
>gi|354496134|ref|XP_003510182.1| PREDICTED: cathepsin F [Cricetulus griseus]
gi|344250261|gb|EGW06365.1| Cathepsin F [Cricetulus griseus]
Length = 462
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 33/57 (57%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
F+ F+ ++++Y ++EE R VF N+ + + + GTA YGI SDLT EE
Sbjct: 165 FKDFMITYNRTYESREETQWRLTVFTRNMVKAQKIEALDRGTAQYGITKFSDLTEEE 221
>gi|294883334|ref|XP_002770714.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873999|gb|EER02719.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 330
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 39/64 (60%), Gaps = 2/64 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F F K+Y +KEE KR A+F+ NL LIE +N ++ + G+N +DLT EE +
Sbjct: 28 FMGFQHKFGKNYESKEEEVKRNAIFQANLHLIEQVN-AKNLSYKLGVNEYADLTHEEFAA 86
Query: 94 -RLG 96
+LG
Sbjct: 87 LKLG 90
>gi|189236657|ref|XP_970512.2| PREDICTED: similar to cathepsin o [Tribolium castaneum]
Length = 329
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 48/82 (58%), Gaps = 3/82 (3%)
Query: 23 LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGI 80
++ + P+ + QF+++++ F+K+Y R F+ +L+ IE LN K +G+A YG+
Sbjct: 23 IRIKGPDQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIETLNSKKRNGSALYGL 82
Query: 81 NHLSDLTREE-MKSRLGLNLSK 101
SDL EE ++ L NLS+
Sbjct: 83 TKFSDLLPEEFFQTYLQSNLSQ 104
>gi|270006364|gb|EFA02812.1| cathepsin O precursor [Tribolium castaneum]
Length = 326
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 48/82 (58%), Gaps = 3/82 (3%)
Query: 23 LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGI 80
++ + P+ + QF+++++ F+K+Y R F+ +L+ IE LN K +G+A YG+
Sbjct: 23 IRIKGPDQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIETLNSKKRNGSALYGL 82
Query: 81 NHLSDLTREE-MKSRLGLNLSK 101
SDL EE ++ L NLS+
Sbjct: 83 TKFSDLLPEEFFQTYLQSNLSQ 104
>gi|113931178|ref|NP_001039033.1| cathepsin W [Xenopus (Silurana) tropicalis]
gi|89269052|emb|CAJ83515.1| cathepsin W [Xenopus (Silurana) tropicalis]
Length = 303
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 31/51 (60%)
Query: 41 FSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+++SY T+EE R +F +NLK L + E GTA YG+ SDLT EE
Sbjct: 4 YNRSYKTREEFKYRLRIFSENLKEASRLQREELGTAQYGVTKFSDLTDEEF 54
>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
+ H F F + KSY T +E+ RF +F +NLKLI N+ + T +N +D
Sbjct: 56 DTRHAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNR-KGLPYTLAVNQFADW 114
Query: 87 TREEMKS-RLG 96
T EE + RLG
Sbjct: 115 TWEEFRRHRLG 125
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 9/78 (11%)
Query: 28 PEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
PEHL FE ++ + SK+Y + EE RF VF +NL I+ N E + G+
Sbjct: 38 PEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNN-EINSYWLGL 96
Query: 81 NHLSDLTREEMKSR-LGL 97
N +DLT EE K R LGL
Sbjct: 97 NEFADLTHEEFKGRYLGL 114
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 22/71 (30%), Positives = 41/71 (57%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
L+++ + ++E ++ + ++Y E KRF +F+DNL+ IE N + T G+N
Sbjct: 39 LQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQ 98
Query: 83 LSDLTREEMKS 93
+DLT EE ++
Sbjct: 99 FADLTNEEYRT 109
>gi|359484377|ref|XP_003633102.1| PREDICTED: thiol protease aleurain-like isoform 2 [Vitis vinifera]
Length = 318
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
+ H F F + KSY T +E+ RF +F +NLKLI N+ + T +N +D
Sbjct: 56 DTRHAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNR-KGLPYTLAVNQFADW 114
Query: 87 TREEMKS-RLG 96
T EE + RLG
Sbjct: 115 TWEEFRRHRLG 125
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
K N E L +E+++ KSY E KRF +F+DNLK I++ N G + T G+
Sbjct: 45 KRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHN-GLNSTYRLGLTRF 103
Query: 84 SDLTREEMKSRL 95
+DLT EE +S+
Sbjct: 104 ADLTNEEYRSKF 115
>gi|82705269|ref|XP_726900.1| berghepain-2 [Plasmodium yoelii yoelii 17XNL]
gi|23482498|gb|EAA18465.1| berghepain-2 [Plasmodium yoelii yoelii]
Length = 472
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F++ ++K Y + EE+ +RF +F + LK IE NK H T GIN SD+
Sbjct: 149 NLESVNLFYSFMKKYNKEYSSAEEMQERFYIFSEKLKKIEKHNKENH-LYTKGINAFSDM 207
Query: 87 TREEMKSR 94
EE K +
Sbjct: 208 RHEEFKMK 215
>gi|344295866|ref|XP_003419631.1| PREDICTED: cathepsin W-like [Loxodonta africana]
Length = 376
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F F +++SY E A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PLELKEVFALFQLQYNRSYSNPAEHARRLDIFARNLAQAQQLQEEDLGTAKFGVTPFSDL 94
Query: 87 TREEMKSRLG 96
T EE + G
Sbjct: 95 TEEEFRQVYG 104
>gi|294874404|ref|XP_002766939.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239868314|gb|EEQ99656.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 339
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
F F F K Y +KEE KR A+F+ NL IE +N ++ + T G+N +DLT EE
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVN-AQNLSYTLGVNEYADLTHEE 83
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 19/60 (31%), Positives = 38/60 (63%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ ++Y EE +RF +F+ NL+ I++ NK + T G+N+ +DL+ EE
Sbjct: 36 VEKHEQWMARHGRTYQDSEEKERRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEE 95
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 34/59 (57%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E++++ + K Y E KR +F+DN++ IE N + GINHL+D T EE
Sbjct: 36 ERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLGINHLADQTNEE 94
>gi|334265690|ref|YP_004376219.1| cathepsin [Clostera anachoreta granulovirus]
gi|315451014|gb|ADU24593.1| cathepsin [Clostera anachoreta granulovirus]
gi|327553705|gb|AEB00299.1| cathepsin [Clostera anachoreta granulovirus]
Length = 332
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 43/69 (62%), Gaps = 3/69 (4%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
+N E L FE+F+ +F+K+Y +++E R+ +F+ NL LI + N E AT+ IN SD
Sbjct: 23 DNSETL--FEEFVTNFNKTYSSQDEKLIRYEIFKKNLALINNKNM-ESKHATFDINIYSD 79
Query: 86 LTREEMKSR 94
L + ++ R
Sbjct: 80 LHKNDLLHR 88
>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
Length = 465
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 8/87 (9%)
Query: 10 TLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
T+ L M + +L E QF +F ++K Y T E A+RFA F+ NLK+I++ N
Sbjct: 8 TVLLLVSMAAAKKLSLEE----TQFRQFQIKYNKQY-TSSEYAERFATFKSNLKVIDEKN 62
Query: 70 K---GEHGTATYGINHLSDLTREEMKS 93
+ + +G+N +DL++ E ++
Sbjct: 63 RDAASRKSSVRFGVNEFADLSQSEFRA 89
>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYKSAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/89 (26%), Positives = 48/89 (53%), Gaps = 1/89 (1%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
++ + ++ + Q ++ + E + +E ++ K+Y E KRF +F+DNL+
Sbjct: 20 SAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKEKRFGIFKDNLRF 79
Query: 65 IEDLNKGEHGTATYGINHLSDLTREEMKS 93
I++ N ++ T G+N +DLT EE +S
Sbjct: 80 IDEHNS-QNLTYRLGLNRFADLTNEEYRS 107
>gi|194749983|ref|XP_001957411.1| GF24054 [Drosophila ananassae]
gi|190624693|gb|EDV40217.1| GF24054 [Drosophila ananassae]
Length = 549
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 2/84 (2%)
Query: 14 FGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
F E + EH+ K F F R SY +E R +F NL+ I N+ +
Sbjct: 224 FATFNPMQEFVSGVDEHVEKAFHHFKRKHGVSYNNDKEHEHRLNIFRQNLRYIHSKNRAK 283
Query: 73 HGTATYGINHLSDLTREEMKSRLG 96
T T +NHL+D T +E+K+R G
Sbjct: 284 L-TYTLAVNHLADKTEDELKARRG 306
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 22/67 (32%), Positives = 39/67 (58%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + +E ++ + KSY + E R +F++NL+ I++ N + + T G+N +DL
Sbjct: 35 NDEVMALYESWLVKYGKSYNSLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADL 94
Query: 87 TREEMKS 93
T EE +S
Sbjct: 95 TDEEYRS 101
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 23/63 (36%), Positives = 36/63 (57%)
Query: 28 PEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
P L++ E+++ FS+ Y + E R VF+ NLK IE+ NK + + G+N +D T
Sbjct: 33 PSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWT 92
Query: 88 REE 90
EE
Sbjct: 93 NEE 95
>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
Full=Major cysteine proteinase; Flags: Precursor
gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 22 ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
E T + L+ FE ++ + SK Y + EE RF VF +NL I+ N E + G+N
Sbjct: 39 EQLTSTEKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNN-EINSYWLGLN 97
Query: 82 HLSDLTREEMKSR-LGL 97
+DLT EE K R LGL
Sbjct: 98 EFADLTHEEFKGRYLGL 114
>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + KSY KEE KR A+F DNL IE++N ++ + G+N +DLT EE +
Sbjct: 27 FIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVN-AQNLSYKLGVNEYTDLTLEEFAA 85
>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
Length = 500
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 70 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 128
Query: 93 SR 94
SR
Sbjct: 129 SR 130
>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 426
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|71660475|ref|XP_821954.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|3063559|gb|AAC14094.1| TcC31.13 [Trypanosoma cruzi]
gi|70887345|gb|EAO00103.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 322
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 70 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 128
Query: 93 SR 94
SR
Sbjct: 129 SR 130
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 37/62 (59%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E+++ + K Y E KRF +F+DN++ IE N ++ +NHL+DLT +E
Sbjct: 38 ERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTLDEF 97
Query: 92 KS 93
K+
Sbjct: 98 KA 99
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/66 (36%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
FE + + K+Y ++EE R VF+DN + + N + + T +N +DLT E K
Sbjct: 30 FETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFKA 89
Query: 93 SRLGLN 98
SRLGL+
Sbjct: 90 SRLGLS 95
>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
Length = 359
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +D+T +
Sbjct: 56 HVLSFARFTHRYGKRYENAEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFTDMTWQ 114
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 115 EFQRTKLG 122
>gi|145508365|ref|XP_001440132.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407338|emb|CAK72735.1| unnamed protein product [Paramecium tetraurelia]
Length = 321
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 24/85 (28%), Positives = 47/85 (55%), Gaps = 6/85 (7%)
Query: 6 SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
SA + F + +++ K +KQ++++ + ++K YPT+ E RF++++ N+ I
Sbjct: 15 SAGVYFSKFYEQNDHDQFKI-----IKQYQEWQQKYNKRYPTQNEQIYRFSIYQQNIMKI 69
Query: 66 EDLNKGEHGTATYGINHLSDLTREE 90
ED N ++ + IN DLT +E
Sbjct: 70 EDFNS-QNNSYKQKINKFGDLTDQE 93
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 18/64 (28%), Positives = 37/64 (57%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++FE+++ ++ + Y E +RF +F++N+ IE N + T G+N +D+T E
Sbjct: 7 MERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNNE 66
Query: 91 MKSR 94
+R
Sbjct: 67 FLAR 70
>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 361
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +DLT +
Sbjct: 55 HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 114 EFQRTKLG 121
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 38/76 (50%), Gaps = 7/76 (9%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--G 79
L NP +E+F F + Y EE R VF DNL+ IE+ NK E G TY
Sbjct: 13 LAAANP----SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLA 68
Query: 80 INHLSDLTREEMKSRL 95
IN SDLT +E S +
Sbjct: 69 INQFSDLTNDEFNSMM 84
>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
Length = 358
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +DLT +
Sbjct: 55 HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 114 EFQRTKLG 121
>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTAFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|334311632|ref|XP_001373241.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 328
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 4/81 (4%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--G 79
L +N + ++E + + K+Y KEE +R V+E NLKLI D N+ + G +Y G
Sbjct: 18 LSPKNEKLDAEWEAWKTTYGKNYSEKEESFRR-QVWEKNLKLINDHNRLFKEGKKSYFMG 76
Query: 80 INHLSDLTREEMKSRLGLNLS 100
+N D+T +E +SRL L ++
Sbjct: 77 MNQFGDMTDKEFESRLNLRIA 97
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 25/74 (33%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
+ E + +EK++ K+Y E +RF +F+DNL+ +++ N G+ G+N +DL
Sbjct: 40 DAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHN-AVAGSYRVGLNRFADL 98
Query: 87 TREEMKSR-LGLNL 99
T EE +S LG N+
Sbjct: 99 TNEEYRSMFLGGNM 112
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ N + N E +E ++ + KSY + E +RF +F++ L+ I++ N + +
Sbjct: 27 TKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 87 GLNQFADLTDEEFRS 101
>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
Length = 323
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 43/67 (64%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K Y ++ E +RF +F+ NL E + K ++ +A Y IN SDL+++E +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIIIKNQNDSAKYEINKFSDLSKDETIA 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
Length = 467
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F + + + Y + E A R +VF NL L L+ + AT+G+ SDLTREE +
Sbjct: 37 QFADFKQRYGRVYKSAAEEAFRLSVFRKNL-LDAKLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F +F + KSY + EV KRF +F ++L+L+ N+ + + GIN SD++ EE +
Sbjct: 61 RFARFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNR-KGLSYRLGINRFSDMSWEEFR 119
Query: 93 -SRLG 96
+RLG
Sbjct: 120 ATRLG 124
>gi|410493601|ref|YP_006908539.1| V-CATH [Epinotia aporema granulovirus]
gi|354805035|gb|AER41457.1| V-CATH [Epinotia aporema granulovirus]
Length = 329
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 26/71 (36%), Positives = 41/71 (57%), Gaps = 3/71 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F+ F+ ++K Y T EE A ++ +F +NL +I + N + A Y IN LSDL + E+
Sbjct: 28 FDDFVIKYNKVYATDEERAAKYEIFRNNLVVINEKNS-KTTNALYDINRLSDLNKNELLR 86
Query: 94 RLG--LNLSKH 102
G +NL K+
Sbjct: 87 STGFSVNLKKN 97
>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
Length = 367
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
L Q+ S+ E N EH F F F K+Y T+EE RF VF+ NL+ + ++
Sbjct: 32 LIRQVVSDGEDDLLNAEH--HFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK-HQMI 88
Query: 73 HGTATYGINHLSDLTREEMKSR-LGL 97
TA +GI SDLT +E + + LGL
Sbjct: 89 DPTAAHGITKFSDLTPKEFRRQFLGL 114
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 26/84 (30%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 11 LALF-GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
LALF G + +L ++ + + E+++ +++ Y E A+RF VF+ N+K IE N
Sbjct: 14 LALFCGAALAARDL-NDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKFIESFN 72
Query: 70 KGEHGTATYGINHLSDLTREEMKS 93
G + G+N +DLT +E ++
Sbjct: 73 AGGNRKFWLGVNQFADLTNDEFRA 96
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ N + N E +E ++ + KSY + E +RF +F++ L+ I++ N + +
Sbjct: 27 AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 87 GLNQFADLTDEEFRS 101
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ N + N E +E ++ + KSY + E +RF +F++ L+ I++ N + +
Sbjct: 27 AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 87 GLNQFADLTDEEFRS 101
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 21/67 (31%), Positives = 35/67 (52%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
+ P + E+++ F K Y E +RF +F+DN++ IE N + +N +D
Sbjct: 30 QEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFAD 89
Query: 86 LTREEMK 92
LT EE+K
Sbjct: 90 LTNEELK 96
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E L +E+++ + K+Y E +RF +F+DNLK IE+ N + + G+N SDLT
Sbjct: 36 EVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTA 95
Query: 89 EEMK-SRLGLNLSK 101
+E + S LG + K
Sbjct: 96 DEFQASYLGGKMEK 109
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ N + N E +E ++ + KSY + E +RF +F++ L+ I++ N + +
Sbjct: 27 AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 87 GLNQFADLTDEEFRS 101
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ N + N E +E ++ + KSY + E +RF +F++ L+ I++ N + +
Sbjct: 27 AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 87 GLNQFADLTDEEFRS 101
>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 33/64 (51%), Gaps = 9/64 (14%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLTR 88
+F F+ D+ K+Y T+EE R +F N+ L EH TA +G+ SDLT
Sbjct: 50 KFRVFMSDYGKNYSTREEYIHRLGIFAKNV-----LKAAEHQMMDPTAVHGVTQFSDLTE 104
Query: 89 EEMK 92
EE K
Sbjct: 105 EEFK 108
>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
L Q+ S+ E N EH F F F K+Y T+EE RF VF+ NL+ + ++
Sbjct: 32 LIRQVVSDGEDDLLNAEH--HFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK-HQMI 88
Query: 73 HGTATYGINHLSDLTREEMKSR-LGL 97
TA +GI SDLT +E + + LGL
Sbjct: 89 DPTAAHGITKFSDLTPKEFRRQFLGL 114
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + +E ++ K+Y + E +RF VF+DNL+ I++ N E+ T G+N +DLT
Sbjct: 37 EVMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNS-ENRTYRVGLNRFADLTN 95
Query: 89 EEMKS 93
EE +S
Sbjct: 96 EEYRS 100
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
KT++ E FE ++ KSY E KRF +F++NL+ I++ N E G+N
Sbjct: 35 FKTDD-EATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNK 93
Query: 83 LSDLTREEMKSR 94
+DLT EE +S+
Sbjct: 94 FADLTNEEYRSK 105
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 35/59 (59%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
E+++ +S+ Y E A+RF VF+ N+K IE N G + G+N +DLT +E ++
Sbjct: 6 EQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTNDEFRA 64
>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
Full=Senescence-associated gene product 2; Flags:
Precursor
gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 358
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +DLT +
Sbjct: 55 HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 114 EFQRTKLG 121
>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 31/72 (43%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N EH F F F KSY T+EE RF VF NL+ + L+ +A +G+ SDL
Sbjct: 39 NAEH--HFTTFKTKFGKSYATQEEHDYRFGVFRANLRRAK-LHAKLDPSAEHGVTKFSDL 95
Query: 87 TREEMKSR-LGL 97
T EE K + LGL
Sbjct: 96 TPEEFKRQYLGL 107
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
+ E + +E+++ K Y EE KRF +F+DNL IE+ N + T G+N SDL
Sbjct: 45 DEEVMSIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHN-AVNRTYKVGLNRFSDL 103
Query: 87 TREEMKSR 94
+ EE +S+
Sbjct: 104 SNEEYRSK 111
>gi|407867877|gb|EKG08706.1| cysteine proteinase, putative [Trypanosoma cruzi]
Length = 392
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F++++ K Y +E V +R A+FE L + N+ + GINH+SD T EE+ S
Sbjct: 55 FDRFLQEYGKKYDAREYVRRR-ALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELAS 113
Query: 94 RLG 96
G
Sbjct: 114 LNG 116
>gi|71415597|ref|XP_809860.1| cysteine proteinase [Trypanosoma cruzi strain CL Brener]
gi|70874305|gb|EAN88009.1| cysteine proteinase, putative [Trypanosoma cruzi]
Length = 392
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F++++ K Y +E V +R A+FE L + N+ + GINH+SD T EE+ S
Sbjct: 55 FDRFLQEYGKKYDAREYVRRR-ALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELAS 113
Query: 94 RLG 96
G
Sbjct: 114 LNG 116
>gi|71421935|ref|XP_811957.1| cysteine proteinase [Trypanosoma cruzi strain CL Brener]
gi|70876682|gb|EAN90106.1| cysteine proteinase, putative [Trypanosoma cruzi]
Length = 392
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F++F++++ K Y +E V +R A+FE L + N+ + GINH+SD T EE+ S
Sbjct: 55 FDRFLQEYGKKYDAREYVRRR-ALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELAS 113
Query: 94 RLG 96
G
Sbjct: 114 LNG 116
>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 337
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 26/62 (41%), Positives = 35/62 (56%), Gaps = 2/62 (3%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT-YGINHLSDLTREE 90
+ F +++ + K+Y T EE +R V+ N IE LNK EHG T Y +N SDLT E
Sbjct: 33 ESFNMWMKKYEKTYSTMEEYNERLRVYTSNYYYIEQLNK-EHGPHTEYELNQFSDLTFAE 91
Query: 91 MK 92
K
Sbjct: 92 FK 93
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ N + N E +E ++ + KSY + E +RF +F++ L+ I++ N + +
Sbjct: 27 AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86
Query: 79 GINHLSDLTREEMKS 93
G+N +DLT EE +S
Sbjct: 87 GLNQFADLTDEEFRS 101
>gi|323457344|gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus
anophagefferens]
Length = 346
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHLSDLTREEM 91
FE F D+ KSY + E A+RF +F NL+ E LN + + A +G+ DLT E
Sbjct: 20 FELFKSDYVKSYNSTEAEAERFTIFSANLRKTEALNAQRVDEDDAEFGVTQFMDLTEAEF 79
Query: 92 KSR 94
K++
Sbjct: 80 KAQ 82
>gi|440792913|gb|ELR14120.1| papain family cysteine protease subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 321
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 30/64 (46%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F+ F+K Y E +R A F NL IE N +GI SD+T E
Sbjct: 31 EQFYAFVGRFNKKYANDNEYQQRLAAFTHNLAQIEAFNAKYGEKTQFGITQFSDMTPTEF 90
Query: 92 KSRL 95
K R+
Sbjct: 91 KERV 94
>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
Length = 337
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 37/61 (60%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y +++E R+ +F N++ I N + +A Y IN +D+T+ E+ +
Sbjct: 40 FEKFISQYNKQYSSEDEKKYRYNIFRHNIESINAKNS-RNDSAVYKINRFADMTKNEVVN 98
Query: 94 R 94
R
Sbjct: 99 R 99
>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 357
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +DLT +
Sbjct: 55 HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 114 EFQRTKLG 121
>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
Length = 342
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 33/96 (34%), Positives = 53/96 (55%), Gaps = 8/96 (8%)
Query: 6 SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
S EA+ F ++ N++ + L + ++ + KSY KE+V +R +++E NL+ I
Sbjct: 14 SVEASSLKFQPLRHQNDVMSSELNEL--WTEYKETYGKSYDMKEDVVRR-SLWEGNLRHI 70
Query: 66 EDLNK----GEHGTATYGINHLSDLTREEMKSRLGL 97
N G+H + + GIN LSDLT E + RLGL
Sbjct: 71 SMHNVKHDLGKH-SFSMGINELSDLTPSEYRQRLGL 105
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 23/79 (29%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
GQ+ E +T L+ +E ++ + K+Y E +RF +F+DNLK ++ N +
Sbjct: 35 GQVPERTEAET-----LRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNP 89
Query: 75 TATYGINHLSDLTREEMKS 93
+ G+N +DL+ EE ++
Sbjct: 90 SYKLGLNKFADLSNEEYRA 108
>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
Length = 364
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 32/75 (42%), Positives = 42/75 (56%), Gaps = 10/75 (13%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL---KLIEDLNKGEHGTATYGINHL 83
N EH F F FSK+Y TKEE RF VF+ NL K ++L+ +A +G+
Sbjct: 44 NAEH--HFSAFKTKFSKTYATKEEHDYRFGVFKSNLLRAKSHQELDP----SAIHGVTKF 97
Query: 84 SDLTREEMKSR-LGL 97
SDLT E +S+ LGL
Sbjct: 98 SDLTPSEFRSQFLGL 112
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y EE+ RF++F++NL LI NK + + G+N +D+T +
Sbjct: 56 HVISFARFAHRYGKRYENAEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADMTWQ 114
Query: 90 EM-KSRLG 96
E +++LG
Sbjct: 115 EFQRTKLG 122
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 34/59 (57%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ ++K Y EE KRF +F++N+ IE N + GIN +DLT EE
Sbjct: 37 ERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEE 95
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H +F + K Y + EE+ +RF VF DNLK+I NK + + G+N +DLT +
Sbjct: 57 HALSSARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNK-KGLSYKLGVNEFTDLTWD 115
Query: 90 EM-KSRLG 96
E + RLG
Sbjct: 116 EFRRDRLG 123
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 9/65 (13%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLT 87
++F F+ + KSYPT++E RF +F NL + EH TA +G+ SDL+
Sbjct: 87 RKFVMFMEKYGKSYPTRKEYLHRFGIFVKNL-----IRAAEHQALDPTAVHGVTQFSDLS 141
Query: 88 REEMK 92
EE +
Sbjct: 142 EEEFE 146
>gi|17978641|gb|AAL48319.1| vinckepain-2 [Plasmodium vinckei]
Length = 470
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 36/69 (52%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F++ ++K Y + EE+ +RF +F + LK IE NK IN SDL
Sbjct: 146 NLEAVNIFYNFMKKYNKQYNSAEEMQERFYIFSEKLKKIEKHNKENKYMYKKAINSFSDL 205
Query: 87 TREEMKSRL 95
EE K R
Sbjct: 206 HPEEFKMRF 214
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
T+N E + +E+++ K Y E KRF VF+DNL I++ N ++ T G+N +
Sbjct: 32 TDN-EVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFA 90
Query: 85 DLTREEMK 92
D+T EE +
Sbjct: 91 DMTNEEYR 98
>gi|225707828|gb|ACO09760.1| Cathepsin S precursor [Osmerus mordax]
Length = 282
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 39/63 (61%), Gaps = 5/63 (7%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI----EDLNKGEHGTATYGINHLSDLTR 88
Q+EK+ + KSY K E R V+E NL+L+ E+ + G+H A G+NHL+D+T
Sbjct: 29 QWEKWKDKYQKSYGNKVEDLHRRIVWEKNLRLVHKHNEETSTGQHSFAM-GVNHLTDMTA 87
Query: 89 EEM 91
EE+
Sbjct: 88 EEV 90
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F++ F+K Y EE A+RF++F+ NL K + A +GIN SDLT EE
Sbjct: 74 HFAHFVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDR-DAIHGINKFSDLTEEEFH 132
Query: 93 SR-LGL 97
+ LGL
Sbjct: 133 EQYLGL 138
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 5/63 (7%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN----KGEHGTATYGINHLSDLTRE 89
F+ F F+K Y + EE A+RF+VF N+ I N +G H T T +N +DLT E
Sbjct: 30 FDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVH-THTVDVNQFADLTNE 88
Query: 90 EMK 92
E +
Sbjct: 89 EYR 91
>gi|156106765|gb|ABU49605.1| Der f 1 allergen [Dermatophagoides farinae]
Length = 321
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 70 LSLDEFKNR 78
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 9/65 (13%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLT 87
++F F+ + KSYPT++E RF +F NL + EH TA +G+ SDL+
Sbjct: 87 RKFVMFMEKYGKSYPTRKEYLHRFGIFVKNL-----IRAAEHQALDPTAVHGVTQFSDLS 141
Query: 88 REEMK 92
EE +
Sbjct: 142 EEEFE 146
>gi|121531592|gb|ABM55481.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 318
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 3/68 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
Q+ F + K+Y + E RF +F++NL+ IE+ N K + G TY G+N +D+T E
Sbjct: 19 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEEHNAKYDKGEETYYMGVNQFADMTAE 78
Query: 90 EMKSRLGL 97
E + LGL
Sbjct: 79 EFRHMLGL 86
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E+++ K Y E KRF +F+DNL+ I+ N E+ T G+N +DLT EE ++
Sbjct: 40 YEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQQN-AENRTYKLGLNRFADLTNEEYRA 98
Query: 94 R 94
R
Sbjct: 99 R 99
>gi|119633262|gb|ABL84750.1| Der f 1 allergen [Dermatophagoides farinae]
Length = 321
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 70 LSLDEFKNR 78
>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
Length = 337
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 22/61 (36%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y T++E R+ +F N++ I N + +A Y IN +D+T+ E+
Sbjct: 40 FEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINHKNS-RNDSAIYKINRFADMTKNEVVI 98
Query: 94 R 94
R
Sbjct: 99 R 99
>gi|121531590|gb|ABM55480.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 321
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 3/68 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
Q+ F + K+Y + E RF +F++NL+ IE+ N K + G TY G+N +D+T E
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEEHNAKYDKGEETYYMGVNQFADMTAE 81
Query: 90 EMKSRLGL 97
E + LGL
Sbjct: 82 EFRHMLGL 89
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EH +Q+ + FS+ Y + E RF VF+ NLK IE NK T G+N +D T+
Sbjct: 36 EHHQQW---MTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTK 92
Query: 89 EE 90
EE
Sbjct: 93 EE 94
>gi|170064305|ref|XP_001867470.1| cathepsin l [Culex quinquefasciatus]
gi|167881732|gb|EDS45115.1| cathepsin l [Culex quinquefasciatus]
Length = 547
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 4/90 (4%)
Query: 12 ALFGQMKSNNELKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F MK ++E EHL+ +F +F K+Y ++E +R +F NL+ I N+
Sbjct: 223 ATFNPMKEFIHPRSE--EHLQDEFTRFKYKHGKTYNGEKEHDRRQDIFRQNLRFIHSHNR 280
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
G T +NHL+D T EE+++ G S
Sbjct: 281 ANKGY-TVAVNHLADRTDEEIQALRGFKSS 309
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
E+++ + + Y E A+RF VF+ N+ IE N G H G+N +DLT +E +S
Sbjct: 38 ERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNH-KFWLGVNQFADLTNDEFRS 95
>gi|432882407|ref|XP_004074015.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 330
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 27/79 (34%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 28 PEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG---EHGTATYGINHLS 84
P+ + +E++ + KSY + E+ R AV+E NL ++ N+ E + T G+NHLS
Sbjct: 20 PDVNRLWEEWKQKHDKSYSNQTEMNFRRAVWEKNLHVVMKHNQQATEEKHSFTVGLNHLS 79
Query: 85 DLTREEMKSRL-GLNLSKH 102
D+T EE+ +L G + +H
Sbjct: 80 DMTAEEINEKLNGFKMEEH 98
>gi|440804881|gb|ELR25744.1| papain family cysteine protease subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 383
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 24/63 (38%), Positives = 33/63 (52%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE ++++F+K Y + EE R AVFE L I N T G+NHL+D E +
Sbjct: 41 FEAYVKEFNKVYASLEEREARRAVFEARLAKIRAHNADPTKTWKEGVNHLTDRHEHEFRR 100
Query: 94 RLG 96
LG
Sbjct: 101 LLG 103
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L +E+++ + K+Y E +RF +F+DNLK IE+ N + + G+N SDLT +E
Sbjct: 38 LTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADE 97
Query: 91 MK-SRLGLNLSK 101
+ S LG + K
Sbjct: 98 FQASYLGGKMEK 109
>gi|403223173|dbj|BAM41304.1| cysteine protease precursor TacP [Theileria orientalis strain
Shintoku]
Length = 463
Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E L+ FEKF D++K + T +E +RF VF +N +E L H T T +N SDLT
Sbjct: 140 EALRSFEKFKADYNKVHATDDERRERFLVFRNN--YLETLTHKGHETFTKSVNFFSDLTE 197
Query: 89 EEM 91
EE+
Sbjct: 198 EEL 200
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 33/62 (53%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + + Y E KRF +F+DN+ IE NK T IN +DLT EE
Sbjct: 37 ERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEF 96
Query: 92 KS 93
+S
Sbjct: 97 RS 98
>gi|119633264|gb|ABL84751.1| Der f 1 allergen [Dermatophagoides farinae]
Length = 321
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 70 LSLDEFKNR 78
>gi|27530349|dbj|BAC53948.1| Der f 1 allergen preproenzyme [Dermatophagoides farinae]
Length = 321
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 70 LSLDEFKNR 78
>gi|357630543|gb|EHJ78591.1| hypothetical protein KGM_15350 [Danaus plexippus]
Length = 87
Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKF +D++++Y + + + F F LK I N E AT+ IN +D T EE K+
Sbjct: 18 FEKFTKDYNRNYKDEADRQEHFQAFIKTLKSINKAN-AESSHATFDINKFADYTPEERKN 76
Query: 94 RLGLNL 99
GLNL
Sbjct: 77 MFGLNL 82
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
E+++ + + Y E A+RF VF+ N+ IE N G H G+N +DLT +E +S
Sbjct: 38 ERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNH-KFWLGVNQFADLTNDEFRS 95
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 24/83 (28%), Positives = 42/83 (50%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L L + N K N E +E ++ + KSY + E +RF +F++ L+ I++ N
Sbjct: 19 LVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRFIDEHNA 78
Query: 71 GEHGTATYGINHLSDLTREEMKS 93
+ + G+N +D T EE +S
Sbjct: 79 DTNRSYRVGLNQFADQTNEEFQS 101
>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
Length = 341
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 37/61 (60%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y +++E R+ +F N++ I N + +A Y IN +D+T+ E+ +
Sbjct: 44 FEKFITQYNKQYSSEDEKKYRYNIFRHNIESINAKNS-RNDSAVYKINRFADMTKNEVVN 102
Query: 94 R 94
R
Sbjct: 103 R 103
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
E+++ + + Y E A+RF VF+ N+ IE N G H G+N +DLT +E +S
Sbjct: 38 ERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNH-KFWLGVNQFADLTNDEFRS 95
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 32/51 (62%)
Query: 43 KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
K+Y KRF +F+DNL+ I++ NKG + + G+N +DL+ EE KS
Sbjct: 16 KNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKS 66
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 41/71 (57%), Gaps = 5/71 (7%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK----GEHGTATYGINHLSDLTRE 89
F ++ + K+Y ++EE R +F DN + ++ N GEH T G+NHL+DLT++
Sbjct: 68 FHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEH-THFVGLNHLADLTKD 126
Query: 90 EMKSRLGLNLS 100
E K LG N +
Sbjct: 127 EFKKMLGYNAA 137
>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
Length = 324
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 43/67 (64%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F+ F+K+Y ++ E +RF +F+ NL I + N+ + A Y IN SDL+++E +
Sbjct: 28 FEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEIINKNQND-SAAKYEINKFSDLSKDETIA 86
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 87 KYTGLSL 93
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 2/66 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM- 91
F F + F KSY +KEE RF VF+ NLK + ++ +AT+G+ SDLT E
Sbjct: 59 HFSVFKQKFGKSYASKEEHDHRFRVFKANLKRAQR-HQALDPSATHGVTQFSDLTPSEFR 117
Query: 92 KSRLGL 97
+S LGL
Sbjct: 118 RSFLGL 123
>gi|119633260|gb|ABL84749.1| Der f 1 allergen [Dermatophagoides farinae]
Length = 321
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 70 LSLDEFKNR 78
>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
Length = 467
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F + K Y + E A R VF++NL L L+ + A++G+ SDLTREE +
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENL-LFARLHAAANPHASFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|403180727|gb|AEW46900.2| cathepsin-like protease, partial [Chilo suppressalis]
Length = 100
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI+D+++SY + + + F NL+ I +LNK ++ ATYGIN +D T E K
Sbjct: 33 FEKFIKDYNRSYRDEYDKKVHYEAFVINLQEINELNK-KNPRATYGINKFADYTDAEKKR 91
Query: 94 RLGL 97
G
Sbjct: 92 MFGF 95
>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella
moellendorffii]
gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella
moellendorffii]
Length = 330
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F+ FI F K+Y T E A R VFE NL ++ +A +GI SDLT EE K
Sbjct: 20 HFKSFIARFGKAYATAEAYAHRLKVFEANLVRAVS-HQALDPSAVHGITQFSDLTEEEFK 78
Query: 93 SR-LGLNL 99
+ LGL +
Sbjct: 79 QQFLGLRV 86
>gi|20428641|ref|NP_620470.1| 26-29kD-proteinase [Drosophila melanogaster]
gi|6448467|dbj|BAA86910.1| homologue of Sarcophaga 26,29kDa proteinase [Drosophila
melanogaster]
gi|7294432|gb|AAF49777.1| 26-29kD-proteinase [Drosophila melanogaster]
gi|21483518|gb|AAM52734.1| RE18380p [Drosophila melanogaster]
Length = 549
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 12 ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F M+ E + EH+ K F F R +Y + E R +F NL+ I N+
Sbjct: 225 ATFNPMQ---EFISGTDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR 281
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
+ T T +NHL+D T EE+K+R G S
Sbjct: 282 AKL-TYTLAVNHLADKTEEELKARRGYKSS 310
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 35/59 (59%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ ++K Y +E +RF +F++N+ IE N + T GIN +DLT EE
Sbjct: 37 ERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEE 95
>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
Length = 343
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 38/69 (55%), Gaps = 4/69 (5%)
Query: 28 PEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK---GEHGTATYGINHLS 84
PE QF +F F+K Y + EE +RF +F+ NL IE+LN +G+N +
Sbjct: 23 PEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81
Query: 85 DLTREEMKS 93
DL+ +E K+
Sbjct: 82 DLSSDEFKN 90
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 35/59 (59%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ ++K Y +E +RF +F++N+ IE N + T GIN +DLT EE
Sbjct: 37 ERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEE 95
>gi|282158089|ref|NP_001164088.1| cathepsin L precursor [Tribolium castaneum]
Length = 552
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 23 LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
++ E H+ +F KF R K+Y K E R +F N++ I +N+ G T +N
Sbjct: 237 IRPEKSGHVDFEFGKFTRKHGKNYQNKTETLMRKDIFRQNVRFIHSMNRQNRGF-TLTVN 295
Query: 82 HLSDLTREEMKSRLGLNLS 100
HL+D T E+K+ G S
Sbjct: 296 HLADKTPTELKALRGRTYS 314
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 35/60 (58%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E ++ + KSY E KRF +F+DNLK I++ N + + G+ +DLT EE +S
Sbjct: 49 YESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRS 108
>gi|730035|sp|P16311.2|PEPT1_DERFA RecName: Full=Peptidase 1; AltName: Full=Allergen Der f I;
AltName: Full=Major mite fecal allergen Der f 1;
AltName: Allergen=Der f 1; Flags: Precursor
Length = 321
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 70 LSLDEFKNR 78
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 26/89 (29%), Positives = 48/89 (53%), Gaps = 1/89 (1%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
++++ ++ + Q + + E + +E+++ K Y E KRF VF+DNL+
Sbjct: 23 SASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRF 82
Query: 65 IEDLNKGEHGTATYGINHLSDLTREEMKS 93
I++ N E+ T G+N +DLT EE +S
Sbjct: 83 IDEHNS-ENRTYKLGLNGFADLTNEEYRS 110
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 32/54 (59%)
Query: 42 SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRL 95
++SY + EE +RF V+ DN++ IE N+ T G N +DLTREE +R
Sbjct: 50 NQSYRSAEERLRRFQVYRDNVEYIETTNRRGDLTYQLGENQFADLTREEFIARF 103
>gi|195590156|ref|XP_002084812.1| GD14469 [Drosophila simulans]
gi|194196821|gb|EDX10397.1| GD14469 [Drosophila simulans]
Length = 549
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 12 ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F M+ E + EH+ K F F R +Y + E R +F NL+ I N+
Sbjct: 225 ATFNPMQ---EFISGTDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR 281
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
+ T T +NHL+D T EE+K+R G S
Sbjct: 282 AKL-TYTLAVNHLADKTEEELKARRGYKSS 310
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 25/91 (27%), Positives = 51/91 (56%), Gaps = 1/91 (1%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
A+ +TL L S++ ++++ E + ++ ++ K+Y E KRF +F+DNL+
Sbjct: 17 ATYISTLTLNQNHPSSSSWRSDD-EVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRF 75
Query: 65 IEDLNKGEHGTATYGINHLSDLTREEMKSRL 95
I++ N + T G+N +DLT +E +++
Sbjct: 76 IDEHNSNNNTTYKLGLNKFADLTNQEYRAKF 106
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
+E+++ + K+Y E +RF +F+DNLK IE+ N + + G+N SDLT +E +
Sbjct: 41 YERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGLNQFSDLTVDEFQA 100
Query: 93 SRLGLNLSK 101
S LG + K
Sbjct: 101 SYLGGKIEK 109
>gi|1323748|gb|AAC49287.1| thiol protease [Triticum aestivum]
Length = 374
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++F ++ KSY EE +RF +F N++ IE N+ + T G+N +DLT EE
Sbjct: 47 MERFHGWMAKHGKSYAGVEEKLRRFDIFRRNVEFIEAANRDGRLSYTLGVNQFADLTHEE 106
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE ++ KSY + EE RF VF+DNLK I++ NK + + G+N +DL+ EE K
Sbjct: 48 FESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNK-KVSSYWLGLNEFADLSHEEFKR 106
Query: 94 R-LGLNL 99
+ LGL +
Sbjct: 107 KYLGLKI 113
>gi|195327474|ref|XP_002030443.1| GM25442 [Drosophila sechellia]
gi|194119386|gb|EDW41429.1| GM25442 [Drosophila sechellia]
Length = 549
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 12 ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F M+ E + EH+ K F F R +Y + E R +F NL+ I N+
Sbjct: 225 ATFNPMQ---EFISGTDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR 281
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
+ T T +NHL+D T EE+K+R G S
Sbjct: 282 AKL-TYTLAVNHLADKTEEELKARRGYKSS 310
>gi|440291172|gb|ELP84441.1| cysteine protease, putative, partial [Entamoeba invadens IP1]
Length = 472
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHLSDLTREEM 91
F++F +SK Y T + + A+F D+LK I +LN + A +GIN+ SDLT +EM
Sbjct: 25 FKEFELKYSKKYETPAQRLSKLALFRDSLKKIRELNSQRTRKSDAIFGINYYSDLTPKEM 84
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 41/71 (57%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K+FE+++ ++ + Y +E +RF +F++N+ IE N + T GIN +D+T E
Sbjct: 34 MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNE 93
Query: 91 MKSRLGLNLSK 101
++ +S+
Sbjct: 94 FVAQYTGGISR 104
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 5/68 (7%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTREEMK 92
E+++ + + Y + E KRF +F++N++ IE NK GT Y GIN +DLT +E K
Sbjct: 40 EQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKA--GTKPYKLGINAFADLTNQEFK 97
Query: 93 -SRLGLNL 99
SR G L
Sbjct: 98 ASRNGYKL 105
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 35/59 (59%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ ++K Y +E +RF +F++N+ IE N + T GIN +DLT EE
Sbjct: 37 ERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEE 95
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 24/70 (34%), Positives = 43/70 (61%), Gaps = 2/70 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F ++KSY T+EE +R+A+F++NL I N+ + + + +NH DL+R+E +
Sbjct: 117 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRR 175
Query: 94 R-LGLNLSKH 102
+ LG S++
Sbjct: 176 KYLGFKKSRN 185
>gi|388519111|gb|AFK47617.1| unknown [Medicago truncatula]
Length = 241
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 46/90 (51%), Gaps = 10/90 (11%)
Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL---KLIEDLN 69
L Q+ E N EH F F FSK+Y TKEE RF VF+ NL KL + L+
Sbjct: 32 LIRQVVDTAEDHILNAEH--HFTSFKSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLD 89
Query: 70 KGEHGTATYGINHLSDLTREEMKSR-LGLN 98
+A +GI SDLT E + + LGLN
Sbjct: 90 P----SAQHGITKFSDLTASEFRRQFLGLN 115
>gi|298713906|emb|CBJ33775.1| Cathepsin-like proteinase [Ectocarpus siliculosus]
Length = 462
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 29/72 (40%), Positives = 39/72 (54%), Gaps = 3/72 (4%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
+EL + E L F++F F KSY +E A RF VF+ NLK I++ N G Y +
Sbjct: 115 SELSDQELESL--FQEFGIKFEKSYENDDEKAMRFEVFKRNLKRIDERNSKSLGV-KYDV 171
Query: 81 NHLSDLTREEMK 92
+DLT EE K
Sbjct: 172 TMWTDLTHEEFK 183
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 45.8 bits (107), Expect = 0.004, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 42/69 (60%), Gaps = 2/69 (2%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
+EN E + +E+++ K Y +E KRF VF+DNL I+D N ++ T T G+N +
Sbjct: 28 SEN-EVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHN-AQNNTYTLGLNKFA 85
Query: 85 DLTREEMKS 93
D+T +E ++
Sbjct: 86 DITNKEYRA 94
>gi|413917779|gb|AFW57711.1| hypothetical protein ZEAMMB73_361217 [Zea mays]
Length = 390
Score = 45.8 bits (107), Expect = 0.004, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 33/58 (56%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+F ++ +SYPT EE +RF ++ N++LIE N+ T T G N +DL+ E
Sbjct: 58 RFHAWMAAHGRSYPTAEEKLRRFHIYRANVELIEATNRDTSKTFTCGENQFTDLSHHE 115
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 45.8 bits (107), Expect = 0.004, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 33/59 (55%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ + K Y EE KRF VF++N+ IE N + GIN +DLT EE
Sbjct: 37 ERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLGINQFADLTSEE 95
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella
moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella
moellendorffii]
Length = 345
Score = 45.8 bits (107), Expect = 0.004, Method: Composition-based stats.
Identities = 18/59 (30%), Positives = 35/59 (59%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
++K+I++ K+Y + E KRF +F++N+ I N + + + G+N +DLT E +
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFR 96
>gi|403376395|gb|EJY88173.1| Cysteine protease-5 [Oxytricha trifallax]
Length = 401
Score = 45.8 bits (107), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 36/65 (55%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F +F+ ++ K+Y TK + RF +F N ++I+ N+ E GIN SD+
Sbjct: 65 NHETQQAFIQFVAEYGKTYATKNHLNSRFDIFAKNFEMIKSHNENEEKHYEMGINKFSDM 124
Query: 87 TREEM 91
T EE
Sbjct: 125 THEEF 129
>gi|307169691|gb|EFN62267.1| Cathepsin O [Camponotus floridanus]
Length = 358
Score = 45.8 bits (107), Expect = 0.004, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKE-EVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINH 82
+N E K FE +I ++KSY E KRF F+ +L+ IE +N + +A YG+
Sbjct: 28 KNVEDAKLFENYIVQYNKSYRNDSTEYKKRFECFQKSLRHIEKMNSFQSSQESAYYGLTK 87
Query: 83 LSDLTREE 90
SDL+ +E
Sbjct: 88 FSDLSEDE 95
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 45.8 bits (107), Expect = 0.004, Method: Composition-based stats.
Identities = 24/70 (34%), Positives = 43/70 (61%), Gaps = 2/70 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F ++KSY T+EE +R+A+F++NL I N+ + + + +NH DL+R+E +
Sbjct: 116 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRR 174
Query: 94 R-LGLNLSKH 102
+ LG S++
Sbjct: 175 KYLGFKKSRN 184
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 45.8 bits (107), Expect = 0.004, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ ++K Y EE KRF +F++N+ IE N GIN +DLT EE
Sbjct: 37 ERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEE 95
>gi|42564149|gb|AAS20588.1| digestive cysteine proteinase intestain [Leptinotarsa
decemlineata]
Length = 322
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
Q+ F + K+Y + E RF +F++NL+ IE N K E G TY + +D+TR+
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTRD 81
Query: 90 EMKSRLGL 97
E + +LGL
Sbjct: 82 EFRKKLGL 89
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 22/71 (30%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINH 82
+T + ++ +++ + K Y +E KRF +F +N+ IE NKG++ T G+N
Sbjct: 28 RTLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQ 87
Query: 83 LSDLTREEMKS 93
+DLT +E S
Sbjct: 88 FADLTNDEFTS 98
>gi|270011045|gb|EFA07493.1| cathepsin L precursor [Tribolium castaneum]
Length = 429
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 23 LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
++ E H+ +F KF R K+Y K E R +F N++ I +N+ G T +N
Sbjct: 235 IRPEKSGHVDFEFGKFTRKHGKNYQNKTETLMRKDIFRQNVRFIHSMNRQNRGF-TLTVN 293
Query: 82 HLSDLTREEMKSRLGLNLS 100
HL+D T E+K+ G S
Sbjct: 294 HLADKTPTELKALRGRTYS 312
>gi|149725427|ref|XP_001494683.1| PREDICTED: cathepsin W-like [Equus caballus]
Length = 373
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F F +++SY + E A R +F NL + L + + GTA +G++ SDL
Sbjct: 35 PLELKEVFTLFQIQYNRSYSSPAEYAHRLDIFARNLAQAQRLQEDDLGTAEFGVSPFSDL 94
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 95 TEEEFGQLYG 104
>gi|42564153|gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinotarsa
decemlineata]
Length = 322
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
Q+ F + K+Y + E RF +F++NL+ IE N K E G TY + +D+TR+
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTRD 81
Query: 90 EMKSRLGL 97
E + +LGL
Sbjct: 82 EFRKKLGL 89
>gi|33590494|gb|AAQ22984.1| cathepsin L-like cysteine proteinase precursor [Acanthoscelides
obtectus]
Length = 321
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 41/69 (59%), Gaps = 3/69 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH-GTATY--GINHLSDLTR 88
+++++F ++Y T E +RF +F+ NL+ IE+ N+ H G T+ GIN D+T+
Sbjct: 21 EKWQQFKIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDMTQ 80
Query: 89 EEMKSRLGL 97
EE K L L
Sbjct: 81 EEFKRMLAL 89
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 3/81 (3%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
G+ E N EH F F F K Y TKEE +RF VF+ NL+ L+
Sbjct: 36 GEAAEKEEDHLLNAEH--HFASFKAKFGKKYATKEEHDRRFGVFKSNLRRAR-LHAKLDP 92
Query: 75 TATYGINHLSDLTREEMKSRL 95
+A +G+ SDLT E + +
Sbjct: 93 SAVHGVTKFSDLTPAEFRRQF 113
>gi|170579222|ref|XP_001894733.1| cathepsin F-like cysteine proteinase [Brugia malayi]
gi|158598547|gb|EDP36418.1| cathepsin F-like cysteine proteinase, putative [Brugia malayi]
Length = 284
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 4/82 (4%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
LA+ Q N E KT F FI+ F + Y + EE RF ++ N+ + L
Sbjct: 156 LAMNSQEWQNEEKKT----LWSDFMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQF 211
Query: 71 GEHGTATYGINHLSDLTREEMK 92
E GTA YG SD+T EE +
Sbjct: 212 EEKGTAIYGATKFSDMTAEEFQ 233
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ F++ Y E RF +F+ NLK +E N + T +N SDLT EE
Sbjct: 15 IEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEE 74
Query: 91 MKSR-LGL 97
++R +GL
Sbjct: 75 FQARYMGL 82
>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + KSY K+E KR A+F DNL IE++N ++ + G+N +DLT EE +
Sbjct: 27 FIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVN-AQNLSYKLGVNEYTDLTLEEFAA 85
>gi|432091112|gb|ELK24324.1| Cathepsin W [Myotis davidii]
Length = 370
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 21 NELKTENP-----EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
+ L+ +NP E + F F +++SY E A R +F NL + L + + GT
Sbjct: 24 DSLRVQNPGAGPLELKEVFTLFQIQYNRSYSNPAEYAHRLDIFARNLAHAQRLQEEDLGT 83
Query: 76 ATYGINHLSDLTREEMKSRLG 96
A +G+ SDLT EE G
Sbjct: 84 AEFGVTAFSDLTEEEFDQLYG 104
>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
Length = 383
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHATFGVTAFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 29/72 (40%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N EH F F F+K+Y TKEE RF VF+ NL+ L+ +A +G+ SDL
Sbjct: 51 NAEH--HFASFKAKFAKTYATKEEHDHRFGVFKSNLRRAR-LHAKLDPSAVHGVTKFSDL 107
Query: 87 TREEMKSR-LGL 97
T E + + LGL
Sbjct: 108 TPAEFRRQFLGL 119
>gi|260819200|ref|XP_002604925.1| hypothetical protein BRAFLDRAFT_77225 [Branchiostoma floridae]
gi|229290254|gb|EEN60935.1| hypothetical protein BRAFLDRAFT_77225 [Branchiostoma floridae]
Length = 520
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 20/56 (35%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 40 DFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRL 95
+ ++ Y T +E RFA F+DNL IE LN E+ + N +D++ EE +S++
Sbjct: 181 EHNRRYKTADEEKARFATFQDNLLKIEKLN-AEYSGTEFATNQFADMSEEEFRSKI 235
>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 367
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+ S+ E N EH F F F K+Y T+EE RF VF+ NL+ + ++ T
Sbjct: 35 QVVSDGEDDLLNAEH--HFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK-HQMIDPT 91
Query: 76 ATYGINHLSDLTREEMKSR-LGL 97
A +G+ SDLT +E + + LGL
Sbjct: 92 AAHGVTKFSDLTPKEFRRQFLGL 114
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
+T N H F +F + K Y + EE+ RFA+F +NL+LI N+ GIN
Sbjct: 51 QTRNALH---FARFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNR-RGLPYKLGINRY 106
Query: 84 SDLTREEMK-SRLG 96
+D++ EE + SRLG
Sbjct: 107 ADMSWEEFRASRLG 120
>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
Length = 367
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F+ FI F K+Y T E A R VFE NL ++ +A +GI SDLT EE K
Sbjct: 57 HFKSFIARFGKAYATAEAYAHRLKVFEANLVRAVS-HQALDPSAVHGITQFSDLTEEEFK 115
Query: 93 SR-LGLNL 99
+ LGL +
Sbjct: 116 QQFLGLRV 123
>gi|71666438|ref|XP_820178.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
gi|70885512|gb|EAN98327.1| cruzipain precursor, putative, partial [Trypanosoma cruzi]
Length = 174
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE
Sbjct: 36 SQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEF 94
Query: 92 KSR 94
+SR
Sbjct: 95 RSR 97
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 45.4 bits (106), Expect = 0.005, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E+++ F + Y +E R+ +F++N++ IE NK + GIN +DLT EE
Sbjct: 37 EKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADLTNEEF 96
Query: 92 KS 93
K+
Sbjct: 97 KT 98
>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
Length = 360
Score = 45.4 bits (106), Expect = 0.005, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F +F + KSY + EV KRF +F ++L+L+ N+ + + GIN +D++ EE +
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116
Query: 93 -SRLG 96
+RLG
Sbjct: 117 ATRLG 121
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 45.4 bits (106), Expect = 0.005, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F +F + KSY + EV KRF +F ++L+L+ N+ + + GIN +D++ EE +
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116
Query: 93 -SRLG 96
+RLG
Sbjct: 117 ATRLG 121
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 45.4 bits (106), Expect = 0.005, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F +F + KSY + EV KRF +F ++L+L+ N+ + + GIN +D++ EE +
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116
Query: 93 -SRLG 96
+RLG
Sbjct: 117 ATRLG 121
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 45.4 bits (106), Expect = 0.005, Method: Composition-based stats.
Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 14/81 (17%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG- 74
++ N L+TE K+F+ F+ ++ +SY T+EE +R +F N+ + EH
Sbjct: 41 KLGDNELLRTE-----KKFKVFMENYGRSYSTEEEYLRRLGIFAQNM-----VRAAEHQA 90
Query: 75 ---TATYGINHLSDLTREEMK 92
TA +G+ SDLT +E +
Sbjct: 91 LDPTAVHGVTQFSDLTEDEFE 111
>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
Length = 364
Score = 45.4 bits (106), Expect = 0.005, Method: Composition-based stats.
Identities = 29/72 (40%), Positives = 38/72 (52%), Gaps = 4/72 (5%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N EH F F F K+Y TKEE RF VF+ NL+ L+ +A +G+ SDL
Sbjct: 45 NAEH--HFSNFKAKFGKTYATKEEHDHRFGVFKSNLRRAR-LHAQLDPSAVHGVTKFSDL 101
Query: 87 TREEMKSR-LGL 97
T E + + LGL
Sbjct: 102 TAAEFQRQFLGL 113
>gi|26245865|gb|AAN77408.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 173
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
Q+ F + K+Y + E RF +F++NL+ IE N K E G TY + +D+TR+
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTRD 81
Query: 90 EMKSRLGL 97
E + +LGL
Sbjct: 82 EFRKKLGL 89
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 45.4 bits (106), Expect = 0.005, Method: Composition-based stats.
Identities = 23/69 (33%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N + L +E+++ K+Y E KRF +F+DNL I++ N ++ + G+N +DL
Sbjct: 40 NDQVLTMYEEWLVKHGKNYNALGEKEKRFEIFKDNLGFIDEHNS-KNLSFRLGLNRFADL 98
Query: 87 TREEMKSRL 95
T EE ++R
Sbjct: 99 TNEEYRTRF 107
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 45.4 bits (106), Expect = 0.005, Method: Composition-based stats.
Identities = 29/74 (39%), Positives = 37/74 (50%), Gaps = 7/74 (9%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--G 79
L NP +E+F F + Y EE R VF DNL+ IE+ NK E G TY
Sbjct: 13 LAAANP----SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLA 68
Query: 80 INHLSDLTREEMKS 93
IN SD+T E+ +
Sbjct: 69 INQFSDMTNEKFNA 82
>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
Length = 360
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F +F + KSY + EV KRF +F ++L+L+ N+ + + GIN +D++ EE +
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116
Query: 93 -SRLG 96
+RLG
Sbjct: 117 ATRLG 121
>gi|310656787|gb|ADP02216.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 195
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ F++ Y E A+RF VF+ N+ IE N G H G+N +DLT +E
Sbjct: 2 VERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFNAGNH-KFWLGVNQFTDLTNDE 60
Query: 91 MKS 93
K+
Sbjct: 61 FKA 63
>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
Length = 340
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y +++E R+ +F N++ I N + +A Y IN +D+T+ E+
Sbjct: 43 FEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNS-RNDSAVYKINRFADMTKNEIVI 101
Query: 94 R 94
R
Sbjct: 102 R 102
>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
Length = 339
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y +++E R+ +F N++ I N + +A Y IN +D+T+ E+
Sbjct: 42 FEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNS-RNDSAVYKINRFADMTKNEIVI 100
Query: 94 R 94
R
Sbjct: 101 R 101
>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
Length = 363
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 9/64 (14%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLTR 88
+F F+ D+ K+Y T+EE R +F N+ L EH +A +G+ SDLT
Sbjct: 50 KFRLFMSDYGKNYSTREEYIHRLGIFAKNV-----LKAAEHQMMDPSAVHGVTQFSDLTE 104
Query: 89 EEMK 92
EE K
Sbjct: 105 EEFK 108
>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 367
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 9/64 (14%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLTR 88
+F F+ D+ K+Y T+EE R +F N+ L EH +A +G+ SDLT
Sbjct: 50 KFRLFMSDYGKNYSTREEYIHRLGIFAKNV-----LKAAEHQMMDPSAVHGVTQFSDLTE 104
Query: 89 EEMK 92
EE K
Sbjct: 105 EEFK 108
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 34/60 (56%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E ++ K+Y E +RF +F+DNL+ IE+ N + G+N +DLT EE ++
Sbjct: 48 YEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEYRA 107
>gi|33622213|ref|NP_891858.1| cathepsin [Cryptophlebia leucotreta granulovirus]
gi|33569322|gb|AAQ21608.1| cathepsin [Cryptophlebia leucotreta granulovirus]
Length = 332
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 39/60 (65%), Gaps = 1/60 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
K F+ F++ ++K+Y T+EE +F F++NL++I + N+G A + IN SDL + ++
Sbjct: 28 KLFDSFVKQYNKTYLTEEERMIKFDNFKNNLRIINEKNRGSK-HAVFDINKYSDLNKNDL 86
>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
Length = 467
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|407838603|gb|EKG00105.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
C1, cathepsin L-like, putative, partial [Trypanosoma
cruzi]
Length = 326
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF NL + L+ + AT+G+ SDLTREE +
Sbjct: 70 QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHATFGVTPFSDLTREEFR 128
Query: 93 SR 94
SR
Sbjct: 129 SR 130
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 20/63 (31%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ F++ Y E A+RF F+ N+ IE N G H G+N +DLT +E
Sbjct: 34 VEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFIESFNTGNH-KFWLGVNQFTDLTNDE 92
Query: 91 MKS 93
++
Sbjct: 93 FRA 95
>gi|223996996|ref|XP_002288171.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975279|gb|EED93607.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 413
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 25/77 (32%), Positives = 41/77 (53%), Gaps = 10/77 (12%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT--------YGINHLSD 85
FE+++ F KSY +E +R +F +NL++I + NKG ++ G+N +D
Sbjct: 35 FEQYLAHFDKSYSNPDESIRRSRIFNNNLQIILNHNKGRDMDSSGRVQEGFVMGVNQFTD 94
Query: 86 LTREEMKSRLGLNLSKH 102
+ R E+ LG N S H
Sbjct: 95 VERSELP--LGYNKSLH 109
>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF NL + L+ + AT+G+ SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHATFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 31/60 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE + KSY + E A+R +F D L IE N + T T G+N SDLT E ++
Sbjct: 41 FEDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 100
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 24/72 (33%), Positives = 43/72 (59%), Gaps = 7/72 (9%)
Query: 35 EKFIRDF----SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
E RDF +++Y + +E KRF +F N+K +LN+ ++ AT+G N +D++ EE
Sbjct: 22 EVLFRDFKTTHARNYASADEERKRFEIFAANMKKAAELNR-KNPMATFGPNEFADMSSEE 80
Query: 91 MKSRLGLNLSKH 102
++R N ++H
Sbjct: 81 FQTR--HNAARH 90
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 35/60 (58%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E ++ + KSY E KRF +F+DNL+ I++ N + + G+ +DLT EE +S
Sbjct: 49 YESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRS 108
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTREE 90
F+ F SKSY + E AKR A+F +NL+ IE+ N G +Y +N +DLT +E
Sbjct: 25 FQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQFTDLTIDE 84
Query: 91 MKSRLGLN 98
K+ L L+
Sbjct: 85 FKAYLTLH 92
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 29/91 (31%), Positives = 49/91 (53%), Gaps = 5/91 (5%)
Query: 10 TLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDL 68
T +L M SN + + E + Q F+ + ++ + Y +EE AKRF +F+ NL+ I ++
Sbjct: 20 TCSLSLAMSSNQLEQFASEEEVFQLFQAWQKEHKREYGNQEEKAKRFQIFQSNLRYINEM 79
Query: 69 NKGEHGTAT---YGINHLSDLTREE-MKSRL 95
N T G+N +D++ EE MK+ L
Sbjct: 80 NAKRKSPTTQHRLGLNKFADMSPEEFMKTYL 110
>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
Length = 367
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q+ S+ E N EH F F F K+Y T+EE RF VF+ NL+ + ++ T
Sbjct: 35 QVVSDGEDDLLNAEH--HFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK-HQMIDPT 91
Query: 76 ATYGINHLSDLTREEMKSR-LGL 97
A +G+ SDLT +E + + LGL
Sbjct: 92 AAHGVTKFSDLTPKEFRRQFLGL 114
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 33/59 (55%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E++++ + K Y E KR +F+DN++ IE N + INHL+D T EE
Sbjct: 36 ERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLADQTNEE 94
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 12 ALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG 71
AL Q+ + L ++ ++ E+++ + + Y E KR+ +FE+N+ LIE NK
Sbjct: 18 ALASQLAAARSL--QDASMRERHEEWMASYGRVYKDINEKQKRYKIFEENVALIESSNKD 75
Query: 72 EHGTATYGINHLSDLTREEMKS 93
+ +N +DLT EE K+
Sbjct: 76 ANKPYKLSVNQFADLTNEEFKA 97
>gi|405953314|gb|EKC21001.1| Cathepsin F [Crassostrea gigas]
Length = 397
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L F+K+ + +K Y +F VF +NLK+I +LN G T+G+N L+DL+++E
Sbjct: 51 LPLFQKWKSEHNKIYRNHMIERSKFKVFLENLKVINELNGQFQGKTTFGLNQLADLSQKE 110
Query: 91 MKSRLGL 97
SR+ L
Sbjct: 111 F-SRIVL 116
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 38/63 (60%), Gaps = 1/63 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MKS 93
E+++ F ++Y +E A+R VF N + ++ +N+ + T T G+NH SDLT E ++
Sbjct: 39 ERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTDHEFLQQ 98
Query: 94 RLG 96
LG
Sbjct: 99 HLG 101
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 2/94 (2%)
Query: 1 MAEDASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFE 59
MA ++ A L+ F S + L + +++ ++ ++ K+Y +E KRF +F+
Sbjct: 1 MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFK 60
Query: 60 DNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+NLK I+D N E+ T G+N +DLT EE ++
Sbjct: 61 ENLKFIDDHNS-ENRTYKVGLNMFADLTNEEYRA 93
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 18/61 (29%), Positives = 39/61 (63%), Gaps = 1/61 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+++K++ + + Y ++EE +RF +++ N++ I++ N H + T N+ +DLT EE K
Sbjct: 18 RYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNH-SHTLAENNFADLTNEEFK 76
Query: 93 S 93
+
Sbjct: 77 A 77
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 18/61 (29%), Positives = 39/61 (63%), Gaps = 1/61 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+++K++ + + Y ++EE +RF +++ N++ I++ N H + T N+ +DLT EE K
Sbjct: 18 RYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNH-SHTLAENNFADLTNEEFK 76
Query: 93 S 93
+
Sbjct: 77 A 77
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 37/68 (54%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E LK+ E+++ + Y +E KR+ +F++N++ IE N G G+N +D
Sbjct: 32 EQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFAD 91
Query: 86 LTREEMKS 93
LT EE ++
Sbjct: 92 LTNEEFRA 99
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 2/69 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E ++ K+Y E +RF +F+DNL+ I++ N+ E T G+ +DLT EE ++
Sbjct: 62 YESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNR-ESRTYKVGLTRFADLTNEEYRA 120
Query: 94 R-LGLNLSK 101
R LG S+
Sbjct: 121 RFLGGRFSR 129
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 31/60 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE + KSY + E A+R +F D L IE N + T T G+N SDLT E ++
Sbjct: 41 FEDWAAKHGKSYSSDWEKARRMTIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 100
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella
moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella
moellendorffii]
Length = 337
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 31/60 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE + KSY + E A+R +F D L IE N + T T G+N SDLT E ++
Sbjct: 37 FEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 96
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F R F K Y + EE RF+VF+ NL+ K + +AT+G+ SDLTR E +
Sbjct: 50 HFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLD-PSATHGVTQFSDLTRSEFR 108
Query: 93 SR 94
+
Sbjct: 109 KK 110
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 35/59 (59%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ ++Y E +RF +F++NL IE+ NK + T G+N SDL+ EE
Sbjct: 38 EKHEQWMARHGRTYHDNAEKERRFQIFKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEE 96
>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF +NL + L+ + AT+G+ SDLTREE
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFW 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats.
Identities = 32/98 (32%), Positives = 53/98 (54%), Gaps = 9/98 (9%)
Query: 6 SAEATLALFGQMKSNNELKTE-----NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
+A+++ L Q+ N+E + E +PEH F+ F F ++Y T+EE R VF+
Sbjct: 19 TADSSDPLIRQVVQNDETEIESDPLLDPEH--HFKLFKNKFGRTYDTEEEHEYRLTVFKS 76
Query: 61 NLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
NL+ + ++ TA +G+ SDLT E + + LGL
Sbjct: 77 NLRRAKR-HQVLDPTAKHGVTKFSDLTPSEFRKKYLGL 113
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats.
Identities = 22/70 (31%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
T + + F+ ++ ++ K Y +E RF +F+DNLK I++ NK ++ T G+ +
Sbjct: 39 TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNK-KNNTYWLGLTSFT 97
Query: 85 DLTREEMKSR 94
DLT +E K +
Sbjct: 98 DLTNDEFKEK 107
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats.
Identities = 22/70 (31%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
T + + F+ ++ ++ K Y +E RF +F+DNLK I++ NK ++ T G+ +
Sbjct: 39 TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNK-KNNTYWLGLTSFT 97
Query: 85 DLTREEMKSR 94
DLT +E K +
Sbjct: 98 DLTNDEFKEK 107
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F R F K Y + EE RF+VF+ NL+ K + +AT+G+ SDLTR E +
Sbjct: 50 HFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLD-PSATHGVTQFSDLTRSEFR 108
Query: 93 SR 94
+
Sbjct: 109 KK 110
>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 374
Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats.
Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 14/81 (17%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG- 74
++ N L+TE K+F+ F+ ++ +SY T+EE +R +F N+ L EH
Sbjct: 41 KVGDNELLRTE-----KKFKVFMENYGRSYSTREEYLRRLGIFSQNM-----LRAAEHQA 90
Query: 75 ---TATYGINHLSDLTREEMK 92
TA +G+ SDLT E +
Sbjct: 91 LDPTAVHGVTQFSDLTEVEFE 111
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
F+ + + K+Y ++EE +R +F+DN + N + T + +N +DLT E K
Sbjct: 30 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 89
Query: 93 SRLGLNLS 100
SRLGL++S
Sbjct: 90 SRLGLSVS 97
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
F+ + + K+Y ++EE +R +F+DN + N + T + +N +DLT E K
Sbjct: 32 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 93 SRLGLNLS 100
SRLGL++S
Sbjct: 92 SRLGLSVS 99
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
F+ + + K+Y ++EE +R +F+DN + N + T + +N +DLT E K
Sbjct: 32 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 93 SRLGLNLS 100
SRLGL++S
Sbjct: 92 SRLGLSVS 99
>gi|115533516|ref|NP_001041281.1| Protein R07E3.1, isoform b [Caenorhabditis elegans]
gi|85539716|emb|CAJ58500.1| Protein R07E3.1, isoform b [Caenorhabditis elegans]
Length = 348
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 1/65 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGINHLSDLTREE 90
K++ + F KSY T +E KR + + + I + N + EHG+A YG N +SD T EE
Sbjct: 34 KEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDWTDEE 93
Query: 91 MKSRL 95
+ L
Sbjct: 94 FEKTL 98
>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|345488505|ref|XP_001599980.2| PREDICTED: crustapain-like [Nasonia vitripennis]
Length = 111
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
++E++ F+K Y EE +R+ ++ D K +E+ N K +G ++ GINH +D T E
Sbjct: 22 EWEQYKIKFNKKYANPEEEQRRYKIYLDTKKKVEEHNVKYNNGEVSFSLGINHFADRTPE 81
Query: 90 EMKSRLGLNL 99
E+KS GL L
Sbjct: 82 ELKSMHGLRL 91
>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 450
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 451
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|60679562|gb|AAX34043.1| Sui m 1 allergen [Suidasia medanensis]
Length = 336
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 27/60 (45%), Positives = 38/60 (63%), Gaps = 3/60 (5%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F + K Y T EE +R A+FE+NL+ I++ N G+HG A +N +DLT EE S
Sbjct: 28 FEQFKELYGKQY-TAEEEPQRRAIFEENLRWIQE-NHGKHG-AGLEVNEHADLTAEEFSS 84
>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
Length = 450
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 35/64 (54%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L+ F ++ +FSK Y T EE R F N ++I N+ E T T G+N +DLT E
Sbjct: 39 LRAFRQYEVEFSKMYETAEERRIRAQTFSKNFEMITSHNQREDVTWTMGLNFDADLTFSE 98
Query: 91 MKSR 94
+SR
Sbjct: 99 FQSR 102
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 2/61 (3%)
Query: 38 IRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LG 96
I + K+Y + EE +RF VF+DNL I+D+NK + + G+N +DLT +E K+ LG
Sbjct: 33 IVGYRKAYASFEEKVRRFEVFKDNLNHIDDINK-KVTSYWLGLNEFADLTHDEFKATYLG 91
Query: 97 L 97
L
Sbjct: 92 L 92
>gi|156553312|ref|XP_001599758.1| PREDICTED: cathepsin O-like [Nasonia vitripennis]
Length = 345
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 39/69 (56%), Gaps = 6/69 (8%)
Query: 34 FEKFIRDFSKSYPTK-EEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHLSDLTREE 90
FE +++D+ K Y +E +RF F+ +L+ IE LN + +A YG+ SD+T +E
Sbjct: 26 FEAYVQDYKKPYKNDPDEYERRFGRFQQSLRKIESLNRLRSSADSARYGLTDYSDMTEQE 85
Query: 91 MKSRLGLNL 99
L LNL
Sbjct: 86 F---LALNL 91
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 39/65 (60%), Gaps = 2/65 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE ++ SK Y + +E RF +F DNLK I+D NK + G+N +DLT EE K+
Sbjct: 49 FESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDDTNK-KVSNYWLGLNEFADLTHEEFKN 107
Query: 94 R-LGL 97
+ LGL
Sbjct: 108 KFLGL 112
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 1/86 (1%)
Query: 9 ATLALFGQMKSNNELKT-ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED 67
A + L G + S +T ++ ++ E+++ F + Y E R+ +F++N++ IE
Sbjct: 13 ALIFLLGALVSQAMARTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIES 72
Query: 68 LNKGEHGTATYGINHLSDLTREEMKS 93
NK + GIN +DLT EE K+
Sbjct: 73 FNKASGKSYKLGINQFADLTNEEFKT 98
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 20/63 (31%), Positives = 36/63 (57%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
LK+ E+++ + Y +E KR+ +F++N++ IE N G G+N +DLT EE
Sbjct: 2 LKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEE 61
Query: 91 MKS 93
++
Sbjct: 62 FRA 64
>gi|125552771|gb|EAY98480.1| hypothetical protein OsI_20393 [Oryza sativa Indica Group]
Length = 296
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
E+K + +F ++++ +SY T+EE A+R+ VF++ + +N E T YG
Sbjct: 178 QEVKVDEATMKARFHDLMKEYGRSYSTEEEKARRYEVFKEATLWADKVNALEPRTIPYGP 237
Query: 81 NHLSDLTREEMK 92
N +D T EE K
Sbjct: 238 NGYADFTDEEFK 249
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 7/98 (7%)
Query: 3 EDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL 62
+ A E L Q+K++ L P H + +++F F K Y T EE KRF +F D L
Sbjct: 27 QPAKVEHASNLKLQVKASTRL---GPYH-ETWKEFKTLFGKVYDTVEEEIKRFDIFRDTL 82
Query: 63 KLIEDLNKGEH-GTATY--GINHLSDLTREEMKSRLGL 97
+ IE+ N+ H G +Y G+N SD++ +E GL
Sbjct: 83 ERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEYLRHNGL 120
>gi|66730453|ref|NP_001019413.1| cathepsin W precursor [Rattus norvegicus]
gi|62531092|gb|AAH93401.1| Cathepsin W [Rattus norvegicus]
gi|149062072|gb|EDM12495.1| cathepsin W [Rattus norvegicus]
Length = 371
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 25/70 (35%), Positives = 35/70 (50%), Gaps = 1/70 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F+ F F++SY E +R +F NL + L + + GTA +G SDL
Sbjct: 33 PLELKEVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDL 92
Query: 87 TREEMKSRLG 96
T EE G
Sbjct: 93 TEEEFGQLYG 102
>gi|444724527|gb|ELW65130.1| Cathepsin W [Tupaia chinensis]
Length = 491
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F F +++SY + E A+R +F NL + L + + GTA +G+ SDL
Sbjct: 161 PLELKEVFALFQIQYNRSYSSPAEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 220
Query: 87 TREE 90
T EE
Sbjct: 221 TDEE 224
>gi|356569685|ref|XP_003553027.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 3-like [Glycine
max]
Length = 428
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F +F K Y + E+ F +F DNLKLI N+ T T G+NH +D T E
Sbjct: 50 HALSFARFACRHDKRYHSVGEIRNDFQIFSDNLKLIRSTNR-RSLTYTLGVNHFADWTWE 108
Query: 90 E 90
E
Sbjct: 109 E 109
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 2/61 (3%)
Query: 43 KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-KSRLGLNLSK 101
K YP K E R ++++NLK I N+G+H + +NHL D+T E+ ++ LGL L K
Sbjct: 38 KEYPNKNEETMRNFIWQNNLKKIVTHNEGKH-SFKLAMNHLGDMTSLEISQTLLGLKLKK 96
Query: 102 H 102
H
Sbjct: 97 H 97
>gi|357613024|gb|EHJ68277.1| BCP inhibitor [Danaus plexippus]
Length = 90
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 35/68 (51%), Gaps = 1/68 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI+DF K+Y E+ + F +LK I LN E TY IN +D T + +
Sbjct: 22 FEKFIKDFDKTYKDAEDREIHYQAFVQSLKDINRLN-SEQPDTTYDINQFADYTEADQQG 80
Query: 94 RLGLNLSK 101
GL L +
Sbjct: 81 MRGLILPE 88
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96
Query: 92 KS 93
++
Sbjct: 97 RA 98
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 20/64 (31%), Positives = 35/64 (54%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+ +E ++ KSY E +RF +F+DN I++ N + + G+N +DLT EE
Sbjct: 41 MAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEE 100
Query: 91 MKSR 94
+S+
Sbjct: 101 YRSK 104
>gi|402584107|gb|EJW78049.1| hypothetical protein WUBG_11042, partial [Wuchereria bancrofti]
Length = 213
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 34/52 (65%)
Query: 41 FSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+++ Y +K+E KRF +++ NL+L + + E GTA YG SD+T+EE +
Sbjct: 1 YNRKYRSKKEFLKRFRIYKRNLRLAKLIQNKEEGTAIYGETPYSDMTQEEFR 52
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 24/66 (36%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
FE + + KSY + EE R VF DN + + N ++ + T +N +DLT E K
Sbjct: 29 FEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFKV 88
Query: 93 SRLGLN 98
SRLG +
Sbjct: 89 SRLGFS 94
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 36/70 (51%)
Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
+ NP + +E F + +K Y + E R +FE+N + IED N + G+NH
Sbjct: 72 SPNPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFG 131
Query: 85 DLTREEMKSR 94
DLT +E + R
Sbjct: 132 DLTNKEYRER 141
>gi|1019667|gb|AAA79287.1| rangelipain, partial [Trypanosoma rangeli]
Length = 263
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 1/63 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
QF F + K Y + E A R VF++NL L L+ + A++G+ SDLTREE
Sbjct: 36 SQFAAFKQRHGKVYGSAAEEAFRLGVFKENL-LFARLHAAANPHASFGVTPFSDLTREEF 94
Query: 92 KSR 94
+SR
Sbjct: 95 RSR 97
>gi|357631370|gb|EHJ78915.1| cathepsin [Danaus plexippus]
Length = 327
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 42/67 (62%), Gaps = 3/67 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-K 92
F+K++ ++ K Y +EE + +F+DNL+ I +LNK + T Y IN +DL EE+
Sbjct: 37 FQKYVIEYDKHY-NEEEYWAHYEIFKDNLEKINELNKNSNST-VYDINQFTDLKFEEVAN 94
Query: 93 SRLGLNL 99
+ +G++L
Sbjct: 95 TYMGMSL 101
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 1/86 (1%)
Query: 9 ATLALFGQMK-SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED 67
A L LFG S N E+ ++ E+++ K Y E R+ +F+ N+K IE
Sbjct: 13 ALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIEG 72
Query: 68 LNKGEHGTATYGINHLSDLTREEMKS 93
N + + G+N +DLT EE K+
Sbjct: 73 FNNAGNKSHKLGVNQFADLTEEEFKA 98
>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 454
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92
Query: 94 R 94
R
Sbjct: 93 R 93
>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
Length = 367
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92
Query: 94 R 94
R
Sbjct: 93 R 93
>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma
vivax Y486]
Length = 389
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92
Query: 94 R 94
R
Sbjct: 93 R 93
>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like,
fragment, partial [Trypanosoma vivax Y486]
Length = 323
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92
Query: 94 R 94
R
Sbjct: 93 R 93
>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma
vivax Y486]
Length = 447
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 26 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 84
Query: 94 R 94
R
Sbjct: 85 R 85
>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 441
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92
Query: 94 R 94
R
Sbjct: 93 R 93
>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 452
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92
Query: 94 R 94
R
Sbjct: 93 R 93
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96
Query: 92 KS 93
++
Sbjct: 97 RA 98
>gi|115533514|ref|NP_001041280.1| Protein R07E3.1, isoform a [Caenorhabditis elegans]
gi|3878958|emb|CAA89070.1| Protein R07E3.1, isoform a [Caenorhabditis elegans]
Length = 402
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 1/65 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGINHLSDLTREE 90
K++ + F KSY T +E KR + + + I + N + EHG+A YG N +SD T EE
Sbjct: 88 KEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDWTDEE 147
Query: 91 MKSRL 95
+ L
Sbjct: 148 FEKTL 152
>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
Length = 364
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y ++E R+ +F N++ I N + +A Y IN +D+T+ E+
Sbjct: 67 FEKFISQYNKHYKNEDEKKYRYNIFRHNIESINHKNS-RNDSAVYKINRFADMTKNEVVI 125
Query: 94 R 94
R
Sbjct: 126 R 126
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE
Sbjct: 37 ERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96
Query: 92 KS 93
++
Sbjct: 97 RA 98
>gi|313221004|emb|CBY31836.1| unnamed protein product [Oikopleura dioica]
Length = 323
Score = 44.3 bits (103), Expect = 0.008, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
FE + + KSY T EE R VFE+N+ IE +NK E+ + T G+N SDLT +E +
Sbjct: 18 FEDWTAEHWKSYETAEEEKFRKGVFEENVAKIEQINK-ENRSWTAGLNKFSDLTWDEFQ 75
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 44.3 bits (103), Expect = 0.008, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 42/68 (61%), Gaps = 2/68 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+++ ++Y + EE +RF VF+DNL I++ N+ + + G+N +DLT +E K+
Sbjct: 59 FERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNR-KVSSYWLGLNEFADLTHDEFKA 117
Query: 94 R-LGLNLS 100
LGL S
Sbjct: 118 TYLGLRSS 125
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 44.3 bits (103), Expect = 0.008, Method: Composition-based stats.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ NEL+ H F F+R F KSY +E R +VF NL+ + + +A +
Sbjct: 46 AENELELNAEAH---FASFVRRFGKSYRDADEHEHRLSVFRANLRRARRHQRLD-PSAVH 101
Query: 79 GINHLSDLTREEMKSR-LGLNLSK 101
GI SDLT +E + R LGL S+
Sbjct: 102 GITKFSDLTPDEFRERFLGLRKSR 125
>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
Length = 264
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/63 (30%), Positives = 33/63 (52%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
E+++ ++ + Y + A+RF VF+DN +E N + G+N +DLT E K+
Sbjct: 42 ERWMAEYGRVYKDAADKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEAFKAN 101
Query: 95 LGL 97
G
Sbjct: 102 KGF 104
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 44.3 bits (103), Expect = 0.008, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
F+ + + K+Y ++EE +R +F+DN + N + T + +N +DLT E K
Sbjct: 32 FDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 93 SRLGLNLS 100
SRLGL++S
Sbjct: 92 SRLGLSVS 99
>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
Length = 367
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
P LK+ F F +++SY E A+R +F NL + L + + GTA +G+ SDL
Sbjct: 35 PMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDL 94
Query: 87 TREE 90
T EE
Sbjct: 95 TEEE 98
>gi|332376813|gb|AEE63546.1| unknown [Dendroctonus ponderosae]
Length = 312
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 42/67 (62%), Gaps = 3/67 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTR 88
+++ KF +K Y T E RFA+F++N+++IE+ N+ E G ATY +N +DL+R
Sbjct: 23 EKWLKFKNQHNKVYETVYEEKLRFAIFQENVQIIEEQNRLYEAGEATYRMAVNKFADLSR 82
Query: 89 EEMKSRL 95
EE S L
Sbjct: 83 EEYLSIL 89
>gi|195379510|ref|XP_002048521.1| GJ11312 [Drosophila virilis]
gi|194155679|gb|EDW70863.1| GJ11312 [Drosophila virilis]
Length = 549
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 14 FGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
F E + + EH+ K F F R Y ++E R +F NL+ I N+ +
Sbjct: 224 FATFNPMQEFISGSDEHVDKAFHHFKRKHGVDYRNEKEHEHRKNIFRQNLRYIHSKNRAK 283
Query: 73 HGTATYGINHLSDLTREEMKSRLGLNLS 100
T +NHL+D T EE+K+R G S
Sbjct: 284 L-TYKLAVNHLADKTEEELKARRGYKSS 310
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
E+++ + + Y E A+RF VF+ N+ IE N G H G+N +DLT +E +
Sbjct: 38 ERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNH-NFWLGVNQFADLTNDEFR 94
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
E+++ + + Y E A+RF VF+ N+ IE N G H G+N +DLT +E +
Sbjct: 38 ERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNH-NFWLGVNQFADLTNDEFR 94
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 19/67 (28%), Positives = 35/67 (52%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E ++ ++ + Y E A+RF F+ N+ +E N + G+N +DLT EE
Sbjct: 33 VERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFADLTTEE 92
Query: 91 MKSRLGL 97
K+ G
Sbjct: 93 FKANKGF 99
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
[Strongylocentrotus purpuratus]
Length = 453
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 6/93 (6%)
Query: 6 SAEATLALFGQ---MKSNNELKTENPEHLKQFEKFIRDFSKSYPTKE---EVAKRFAVFE 59
++E++L+L Q + + + E+ F+KF+ F + Y + E R++VF
Sbjct: 125 NSESSLSLKAQDFSITKDCQASDIKDEYRDLFDKFLMTFKREYRQNDGTNEYEYRYSVFV 184
Query: 60 DNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
N+ +E N+ E GTA YG +D+T E +
Sbjct: 185 QNMLTVEMFNQFEQGTAKYGPTKFADMTEAEFR 217
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella
moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella
moellendorffii]
Length = 358
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 36/60 (60%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+EK++ D + Y E +RF +F DN + IE+ N+ + T G+N+ +D+T +E K+
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 22 ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
+ K N E +E ++ KSY + E +RF +F++ L+ I++ N + G+N
Sbjct: 26 DAKRTNDEVKAMYESWLIKHGKSYNSLGERERRFEIFKETLRFIDEHNADTSRSYKVGLN 85
Query: 82 HLSDLTREEMKS 93
+DLT EE +S
Sbjct: 86 QFADLTNEEFRS 97
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 34/59 (57%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ + K Y +E KRF VF++N+ IE N + + GIN +DLT +E
Sbjct: 37 ERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIEAFNNAANKSYKLGINQFADLTNKE 95
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella
moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella
moellendorffii]
Length = 358
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 36/60 (60%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+EK++ D + Y E +RF +F DN + IE+ N+ + T G+N+ +D+T +E K+
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93
>gi|326523323|dbj|BAJ88702.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 161
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 38/64 (59%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + F +++ ++ K Y + E +R+A+F+D L+ ++ LN YGIN LSD+T
Sbjct: 64 ETRRVFAEWMVEYGKKYSSAGEEDRRYALFKDELRRVDLLNAAFGPNPIYGINFLSDITD 123
Query: 89 EEMK 92
+E +
Sbjct: 124 KEWR 127
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 37/62 (59%), Gaps = 1/62 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+ +E ++ + K+Y E KRF +F+DNL+ I++ N + + G+N +DLT EE
Sbjct: 49 RMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDR-SYKVGLNRFADLTNEEY 107
Query: 92 KS 93
K+
Sbjct: 108 KA 109
>gi|125979159|ref|XP_001353612.1| GA21427 [Drosophila pseudoobscura pseudoobscura]
gi|54642377|gb|EAL31126.1| GA21427 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 12 ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F M+ E + EH+ + F F +Y ++E R +F NL+ I N+
Sbjct: 225 ATFNPMQ---EFISHTDEHVDRAFHHFKHKHGMAYRNEQEHEHRKNIFRQNLRYIHSKNR 281
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
+ T T +NHL+D T EE+K+R G S
Sbjct: 282 AKL-TYTLAVNHLADKTEEELKARRGYKSS 310
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 33/59 (55%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E++++ + K Y E KR +F+DN++ IE N + INHL+D T EE
Sbjct: 36 ERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNRPYKLSINHLADQTNEE 94
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96
Query: 92 KS 93
++
Sbjct: 97 RA 98
>gi|215261456|pdb|3F75|P Chain P, Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In Complex
With Its Propeptide
Length = 106
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 42/69 (60%), Gaps = 2/69 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F ++KSY T+EE +R+A+F++NL I N+ + + + +NH DL+R+E +
Sbjct: 25 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRR 83
Query: 94 R-LGLNLSK 101
+ LG S+
Sbjct: 84 KYLGFKKSR 92
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis
vinifera]
Length = 341
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96
Query: 92 KS 93
++
Sbjct: 97 RA 98
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96
Query: 92 KS 93
++
Sbjct: 97 RA 98
>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
Length = 373
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 6/77 (7%)
Query: 26 ENPEHL----KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
EN EHL F F + K+Y T+EE RF VF+ NL+ N+ +A +G+
Sbjct: 43 ENDEHLLNAEHHFSLFKSKYEKTYATQEEHDHRFRVFKANLRRARR-NQLLDPSAVHGVT 101
Query: 82 HLSDLTREEMKSR-LGL 97
SDLT +E + + LGL
Sbjct: 102 QFSDLTPKEFRRKFLGL 118
>gi|91992512|gb|ABE72972.1| cathepsin L [Aedes aegypti]
Length = 548
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)
Query: 12 ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F M+ ++E EHL +F +F KSY ++E R +F NL+ I N+
Sbjct: 223 ATFNPMQEFIHPRSE--EHLDNEFTRFRYKHGKSYHNEKEHDLRRDIFRQNLRFIHSHNR 280
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
G T +NHL+D T EE+K+ G S
Sbjct: 281 AGKGF-TVAVNHLADRTDEELKALRGFKSS 309
>gi|33242865|gb|AAQ01137.1| cathepsin [Branchiostoma lanceolatum]
Length = 328
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 43/72 (59%), Gaps = 6/72 (8%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI----EDLNKGEHGTATY 78
+ T +P Q+E F + +++ Y +EE A+R +FEDNLK I E+ ++G H T
Sbjct: 12 MATASPLMNPQWEVFKKAYNRVYAAEEEFARRL-IFEDNLKTIQMHNEEADRGLH-TFRL 69
Query: 79 GINHLSDLTREE 90
G+N +D+T +E
Sbjct: 70 GVNQYADMTHKE 81
>gi|1581747|prf||2117247C Cys protease:ISOTYPE=3
Length = 469
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F + K Y + E R VF++NL L L+ + A++G+ SDLTREE +
Sbjct: 37 QFAAFKQRHGKVYGSAAEETFRLGVFKENL-LFARLHAAANPHASFGVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|223998002|ref|XP_002288674.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975782|gb|EED94110.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 415
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 10/77 (12%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT--------YGINHLSD 85
FE+++ +F KSY +E +R +F +NL++I + NKG ++ G+N +D
Sbjct: 40 FEQYLANFDKSYSNPDEFTRRSRIFNNNLQIILNHNKGRDMDSSGRVKEGFVMGVNQFTD 99
Query: 86 LTREEMKSRLGLNLSKH 102
+ R E+ +G N H
Sbjct: 100 VERSELP--MGYNKGLH 114
>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
Length = 337
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE FI +++K YP + RF +F+ NL+ I + NK + +A Y IN SDL++ E+ +
Sbjct: 32 FETFIINYNKQYPDTKTKNYRFKIFKQNLEDINEKNK-LNDSAIYNINKFSDLSKNELLT 90
Query: 94 R 94
+
Sbjct: 91 K 91
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 20/67 (29%), Positives = 38/67 (56%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + ++ ++ K+Y E KRF +F+DNL+ I++ N + T G+N +DLT
Sbjct: 41 EVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTN 100
Query: 89 EEMKSRL 95
+E +++
Sbjct: 101 QEYRAKF 107
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 2/83 (2%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
LA+ ++ EL E + EK++ K Y +E +RF +F+ N+ IE N
Sbjct: 18 LAMCADQAASREL--HELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNT 75
Query: 71 GEHGTATYGINHLSDLTREEMKS 93
+ + GIN +DLT EE ++
Sbjct: 76 AGNKSYMLGINKFADLTNEEFRA 98
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
++++ + + E+++ + + Y E A+RF +F+ N+ IE N G H G+N
Sbjct: 27 QSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNH-KFWLGVNQF 85
Query: 84 SDLTREEMKS 93
+DLT E ++
Sbjct: 86 ADLTNYEFRA 95
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
++++ + + E+++ + + Y E A+RF +F+ N+ IE N G H G+N
Sbjct: 27 QSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNH-KFWLGVNQF 85
Query: 84 SDLTREEMKS 93
+DLT E ++
Sbjct: 86 ADLTNYEFRA 95
>gi|324518532|gb|ADY47133.1| Cysteine proteinase [Ascaris suum]
Length = 334
Score = 44.3 bits (103), Expect = 0.011, Method: Composition-based stats.
Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 2/74 (2%)
Query: 18 KSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA 76
+ E + E ++ + +F+ D+ ++ T++E RFA+F+ N+ LI++LN + +
Sbjct: 17 RGEAEYSKNDTEQMRTLYNQFLHDYRRTNITEDEYKFRFAIFQKNMLLIDELNS-RNDSI 75
Query: 77 TYGINHLSDLTREE 90
YGI +D T E
Sbjct: 76 VYGITQFADWTDSE 89
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 44.3 bits (103), Expect = 0.011, Method: Composition-based stats.
Identities = 19/67 (28%), Positives = 35/67 (52%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E ++ ++ + Y E A+RF F+ N+ +E N + G+N +DLT EE
Sbjct: 33 VERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFADLTTEE 92
Query: 91 MKSRLGL 97
K+ G
Sbjct: 93 FKANKGF 99
>gi|33667928|gb|AAQ24541.1| Blo t 1 allergen [Blomia tropicalis]
Length = 333
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E +K FE+F + F K Y EE A+R F++ LK +E+ N G G Y IN SD++
Sbjct: 23 EEIKTFEQFKKVFGKVYRNAEEEARREHHFKEQLKWVEEHN-GIDGV-EYAINEYSDMSE 80
Query: 89 EEMKSRL---GLNLS 100
+E L GLN +
Sbjct: 81 QEFSFHLSGGGLNFT 95
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 27/73 (36%), Positives = 40/73 (54%), Gaps = 6/73 (8%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI-----EDLNKGEHGTATYGINHLSDLTR 88
FEK+ ++ SK+Y ++EE R VFEDN + N + + T +N +DLT
Sbjct: 33 FEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADLTH 92
Query: 89 EEMK-SRLGLNLS 100
E K +RLGL L+
Sbjct: 93 HEFKTTRLGLPLT 105
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 21/67 (31%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MKS 93
++++ + ++Y E+ KR +F++NL+ IE+ N + + G+N SDLT EE + S
Sbjct: 34 QQWMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSDLTSEEFIAS 93
Query: 94 RLGLNLS 100
G +S
Sbjct: 94 HTGFKVS 100
>gi|195428245|ref|XP_002062184.1| GK16790 [Drosophila willistoni]
gi|194158269|gb|EDW73170.1| GK16790 [Drosophila willistoni]
Length = 549
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 1/67 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F R +Y + +E R +F NL+ I N+ + T T +NHL+D T EE+K+
Sbjct: 245 FHHFKRKHGVAYRSDKEHEHRKNIFRQNLRYIHSKNRAKL-TYTLAVNHLADKTEEELKA 303
Query: 94 RLGLNLS 100
R G S
Sbjct: 304 RRGYKSS 310
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 22/77 (28%), Positives = 42/77 (54%)
Query: 17 MKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA 76
+K++ + + E + +E+++ K Y + KRF VF+DNL I++ N + T
Sbjct: 21 IKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTY 80
Query: 77 TYGINHLSDLTREEMKS 93
G+N +D+T EE ++
Sbjct: 81 KLGLNKFADMTNEEYRA 97
>gi|20151497|gb|AAM11108.1| GM07827p [Drosophila melanogaster]
Length = 219
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTAT 77
S +E+ +N +EKF+ DF SY E KR VF DN K I N + + G +
Sbjct: 58 STSEIDNDNIICQPAWEKFLIDFKPSYQDDTETEKRRNVFCDNFKSIHKHNVQFDLGNIS 117
Query: 78 Y--GINHLSDLTREEMKSR 94
+ GIN SDLT EE K++
Sbjct: 118 FKKGINQWSDLTVEEWKNK 136
>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 356
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 2/64 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-K 92
F +F K Y + EE+ +RF +F DNLK+I N+ + + GIN +DLT +E K
Sbjct: 57 FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNR-KGLSYKLGINEFTDLTWDEFRK 115
Query: 93 SRLG 96
+LG
Sbjct: 116 HKLG 119
>gi|194883258|ref|XP_001975720.1| GG20406 [Drosophila erecta]
gi|190658907|gb|EDV56120.1| GG20406 [Drosophila erecta]
Length = 345
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%), Gaps = 3/70 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
+ KF+ DF Y E KR +F DN I+ N + + G ++ GIN SDLT EE
Sbjct: 271 WNKFLIDFGPKYSDDTETKKRRNIFCDNWNSIQKHNVQYDLGNISFKKGINQWSDLTVEE 330
Query: 91 MKSRLGLNLS 100
KS+ NLS
Sbjct: 331 WKSKQQPNLS 340
Score = 40.0 bits (92), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 3/64 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
+EKF+ DF + Y E +R +F DN I+ N + + G ++ GIN SDLT EE
Sbjct: 179 WEKFMIDFKRKYEDDNETKQRRNIFCDNWNSIQKHNVQYDLGNISFRKGINQWSDLTVEE 238
Query: 91 MKSR 94
K +
Sbjct: 239 WKKK 242
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 24/76 (31%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
S N+L T ++ FE ++ +K Y +E RF +F+DNLK I++ NK ++ +
Sbjct: 34 SQNDL-TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNK-KNNSYWL 91
Query: 79 GINHLSDLTREEMKSR 94
G+N +D++ +E K +
Sbjct: 92 GLNVFADMSNDEFKEK 107
>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
Length = 325
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ +++K Y +E A R+ +F+ NL+ I N+ E A + IN SD+++ E+ S
Sbjct: 27 FESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVED-HAVFSINKFSDMSKSEIIS 85
Query: 94 RL-GLNL 99
+ GL+L
Sbjct: 86 KYTGLSL 92
>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
Length = 327
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 24/74 (32%), Positives = 45/74 (60%), Gaps = 4/74 (5%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
K FE F++ ++KSY ++EE +F F++N++ I + N + +A Y IN SD+ + E+
Sbjct: 23 KLFEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINEKNSLSN-SAVYDINFYSDMNKNEL 81
Query: 92 ---KSRLGLNLSKH 102
++ +NL K+
Sbjct: 82 LRKQTGFKINLKKN 95
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 21/45 (46%), Positives = 30/45 (66%), Gaps = 1/45 (2%)
Query: 50 EVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
E KRF +F+DNLK I++ N E+ T G+N +DL+ EE +SR
Sbjct: 71 EKDKRFEIFKDNLKFIDEHN-AENRTYKVGLNRFADLSNEEYRSR 114
>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
Length = 358
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 37/64 (57%), Gaps = 2/64 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-K 92
F +F+ K Y +++E+ RFA+F +NL I N+ + + T +N +DLT +E K
Sbjct: 59 FSRFVYRHGKRYQSEDEMKMRFAIFSENLDFIRSTNR-KGLSYTLAVNDFADLTWQEFQK 117
Query: 93 SRLG 96
RLG
Sbjct: 118 HRLG 121
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 28/70 (40%), Positives = 36/70 (51%), Gaps = 10/70 (14%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT----ATYGINHLSDLTR 88
F FIR + K Y EE RF VF+ NL L EH A++G+ SDLT+
Sbjct: 56 HFRHFIRRYGKKYSGPEEHEHRFGVFKSNL-----LRALEHQKLDPRASHGVTKFSDLTQ 110
Query: 89 EEMKSR-LGL 97
EE + + LGL
Sbjct: 111 EEFRHQYLGL 120
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 22/72 (30%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+K +E ++ K+Y E +RF +F+DNL+ +++ N T G+ +DLT EE
Sbjct: 49 MKMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEE 108
Query: 91 MKSR-LGLNLSK 101
++ LG + K
Sbjct: 109 YRAMYLGAKMEK 120
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 23/63 (36%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-KS 93
E+++ F + Y E A+R AVF N + ++ +N+ + T T G+N SDLT E K+
Sbjct: 41 EQWMAKFGRVYTDANEKARRQAVFGANARYVDAVNRAGNRTYTLGLNEFSDLTDNEFAKT 100
Query: 94 RLG 96
LG
Sbjct: 101 HLG 103
>gi|294883332|ref|XP_002770713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873998|gb|EER02718.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 332
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F F K+Y +KEE KR A+F+ NL+ IE +N + + G+N +DLT EE +
Sbjct: 28 FMGFKHKFGKNYESKEEEVKRNAIFQANLQHIEQVNAKDL-SYKLGVNEHADLTHEEFAA 86
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 28/70 (40%), Positives = 36/70 (51%), Gaps = 10/70 (14%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT----ATYGINHLSDLTR 88
F FIR + K Y EE RF VF+ NL L EH A++G+ SDLT+
Sbjct: 56 HFRHFIRRYGKKYSGPEEHEHRFGVFKSNL-----LRALEHQKLDPRASHGVTKFSDLTQ 110
Query: 89 EEMKSR-LGL 97
EE + + LGL
Sbjct: 111 EEFRHQYLGL 120
>gi|2582055|gb|AAB82455.1| lymphopain [Mus musculus]
Length = 371
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)
Query: 11 LALFGQMKSNNELKTE---NPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
L L GQ S++ L + P LK+ F+ F F++SY E +R ++F NL +
Sbjct: 13 LLLAGQGLSDSLLTKDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQ 72
Query: 67 DLNKGEHGTATYGINHLSDLTREEMKSRLG 96
L + + GTA +G SDLT EE G
Sbjct: 73 RLQQEDLGTAEFGETPFSDLTEEEFGQLYG 102
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 32/59 (54%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E++ + + K Y E KR +F+DN++ IE N + INHL+D T EE
Sbjct: 38 ERHEQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNEE 96
>gi|353441042|gb|AEQ94105.1| putative drought-inducible cysteine proteinase [Elaeis guineensis]
Length = 187
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F+R F KSY ++E A RF+VF+ NL+ K + TA +GI SDLT E +
Sbjct: 54 HFSSFLRRFGKSYADEKEHAYRFSVFKANLRRARRHQKMD-PTAVHGITKFSDLTPAEFR 112
>gi|313213098|emb|CBY36961.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 25/59 (42%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
FE + + KSY T E+ R VFE+N+ IE +NK E+ + T G+N SDLT +E +
Sbjct: 18 FEDWTSEHWKSYETAEDEKFRKGVFEENIAKIEQINK-ENRSWTAGLNKFSDLTWDEFQ 75
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 23/75 (30%), Positives = 45/75 (60%), Gaps = 2/75 (2%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
S+ +L+ E+ ++ +E ++ + ++Y +E KRF+VF+DN I + N+G +
Sbjct: 28 SSKDLR-EDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQGNR-SYKL 85
Query: 79 GINHLSDLTREEMKS 93
G+N +DL+ EE K+
Sbjct: 86 GLNQFADLSHEEFKA 100
>gi|157113282|ref|XP_001657758.1| cathepsin l [Aedes aegypti]
gi|108877803|gb|EAT42028.1| AAEL006389-PA, partial [Aedes aegypti]
Length = 538
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)
Query: 12 ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
A F M+ ++E EHL +F +F KSY ++E R +F NL+ I N+
Sbjct: 213 ATFNPMQEFIHPRSE--EHLDNEFTRFRYKHGKSYHNEKEHDLRRDIFRQNLRFIHSHNR 270
Query: 71 GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
G T +NHL+D T EE+K+ G S
Sbjct: 271 AGKGF-TVAVNHLADRTDEELKALRGFKSS 299
>gi|31981819|ref|NP_034115.2| cathepsin W preproprotein [Mus musculus]
gi|341940311|sp|P56203.2|CATW_MOUSE RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
Precursor
gi|26353368|dbj|BAC40314.1| unnamed protein product [Mus musculus]
gi|44890089|gb|AAS48498.1| cathepsin W precursor [Mus musculus]
gi|148701190|gb|EDL33137.1| cathepsin W, isoform CRA_b [Mus musculus]
gi|162317774|gb|AAI56226.1| Cathepsin W [synthetic construct]
gi|162318342|gb|AAI56999.1| Cathepsin W [synthetic construct]
Length = 371
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)
Query: 11 LALFGQMKSNNELKTE---NPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
L L GQ S++ L + P LK+ F+ F F++SY E +R ++F NL +
Sbjct: 13 LLLAGQGLSDSLLTKDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQ 72
Query: 67 DLNKGEHGTATYGINHLSDLTREEMKSRLG 96
L + + GTA +G SDLT EE G
Sbjct: 73 RLQQEDLGTAEFGETPFSDLTEEEFGQLYG 102
>gi|255626679|gb|ACU13684.1| unknown [Glycine max]
Length = 229
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 36/64 (56%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + +E+++ K Y E KRF VF+DNL I++ N ++ T G+N +D+T
Sbjct: 35 EVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTN 94
Query: 89 EEMK 92
EE +
Sbjct: 95 EEYR 98
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 27/44 (61%)
Query: 54 RFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGL 97
RF VF NL I LN E GTATYGI +D+T++E LGL
Sbjct: 1442 RFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQKEFSRSLGL 1485
>gi|67605684|ref|XP_666697.1| cryptopain precursor [Cryptosporidium hominis TU502]
gi|54657738|gb|EAL36466.1| cryptopain precursor [Cryptosporidium hominis]
Length = 401
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 20/67 (29%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E+ K FE+F + ++K+Y + EE +RF +++ N+ I+ N + + +N DL++
Sbjct: 81 EYRKSFEEFKKKYNKTYSSMEEENQRFEIYKQNMNFIKTTNS-QGFSYVLEMNEFGDLSK 139
Query: 89 EEMKSRL 95
EE +R
Sbjct: 140 EEFMARF 146
>gi|1594287|gb|AAC48340.1| cathepsin L-like cysteine proteinase [Toxocara canis]
Length = 360
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT-YGINHLSDLTRE 89
L +FE+FIR + K Y + EE A+RF ++ +N+ + LN+ T YG N +D
Sbjct: 47 LDRFEEFIRKYDKVYDSNEEFAERFRIYVNNMLEAQKLNQRNRDYGTIYGENEFADWNVN 106
Query: 90 EMKSRL 95
E + L
Sbjct: 107 EFREIL 112
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 27/44 (61%)
Query: 54 RFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGL 97
RF VF NL I LN E GTATYGI +D+T++E LGL
Sbjct: 1477 RFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQKEFSRSLGL 1520
>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
Length = 377
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y + EE+ RF+VF++NL LI NK + + +N +DLT +
Sbjct: 55 HVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113
Query: 90 EMK 92
E +
Sbjct: 114 EFQ 116
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 6/69 (8%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTREEM 91
FE +I K Y + EE RF +F+DNL I++ NK Y G+N +DL+ EE
Sbjct: 33 FESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNK---KVVNYWLGLNEFADLSHEEF 89
Query: 92 KSR-LGLNL 99
K++ LGLN+
Sbjct: 90 KNKYLGLNV 98
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 46/95 (48%), Gaps = 6/95 (6%)
Query: 6 SAEATLALFGQMKSNNELKTE--NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
S +A L Q+ E++ N EH F F F K+Y TKEE RF VF+ N++
Sbjct: 23 STDADDILIRQVVPEGEVEDHLLNAEH--HFSTFKSKFGKTYATKEEHDHRFGVFKSNMR 80
Query: 64 LIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
L+ +A +G+ SDLT E + LGL
Sbjct: 81 RAR-LHAQLDPSAVHGVTKFSDLTPAEFHRKFLGL 114
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 22/61 (36%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG-TATYGINHLSDLTREEMK 92
+E ++ K+Y E +RF +F DNL+ I+D N+ E+ + T G+ +DLT EE +
Sbjct: 38 YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEYR 97
Query: 93 S 93
S
Sbjct: 98 S 98
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 20/61 (32%), Positives = 31/61 (50%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
E+++ + Y E A+R VF+ N+ IE N G G+N +DLT EE K+
Sbjct: 45 ERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKAT 104
Query: 95 L 95
+
Sbjct: 105 M 105
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + GIN +D+T EE
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEF 96
Query: 92 KSRL-GLNLSKH 102
++ G+N+ +
Sbjct: 97 LTKFTGINIPSY 108
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 20/61 (32%), Positives = 31/61 (50%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
E+++ + Y E A+R VF+ N+ IE N G G+N +DLT EE K+
Sbjct: 45 ERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKAT 104
Query: 95 L 95
+
Sbjct: 105 M 105
>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 358
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y + EE+ RF+VF++NL LI NK + + +N +DLT +
Sbjct: 55 HVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113
Query: 90 EMK 92
E +
Sbjct: 114 EFQ 116
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 20/67 (29%), Positives = 38/67 (56%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + + ++ SK+Y E KRF +F++NL+ I++ N ++ T G+ +DLT
Sbjct: 43 EVISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTN 102
Query: 89 EEMKSRL 95
EE +++
Sbjct: 103 EEYRAKF 109
>gi|10441624|gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CAL1 [Diabrotica virgifera
virgifera]
Length = 322
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 3/69 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTR 88
+ +E F K Y E RF+VF+ NLK I + N K E G Y +N +D+T
Sbjct: 19 QHWESFKVQHGKVYKNPIEERVRFSVFQANLKTINEHNAKYEQGLVGYTMAVNQFADMTP 78
Query: 89 EEMKSRLGL 97
EE K++LG+
Sbjct: 79 EEFKAKLGM 87
>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName:
Full=Cysteine proteinase; Short=CP; Flags: Precursor
gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
Length = 331
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 23/67 (34%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F+ +++K Y E +RF++F+ L+ I N+ + +A Y IN +DL++ E+ S
Sbjct: 31 FETFLANYNKMYNDTSEKERRFSIFQQTLEEINYKNR-LNDSAVYQINKFADLSKNEIIS 89
Query: 94 RL-GLNL 99
+ GLN+
Sbjct: 90 KYTGLNM 96
>gi|225718616|gb|ACO15154.1| Cathepsin K precursor [Caligus clemensi]
Length = 377
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 35/69 (50%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
+P +++FE+F + F K Y + +KR +F NL++I N + +N +DL
Sbjct: 24 SPYEIQRFEEFQKTFGKVYDDRMTYSKRLRIFIHNLRVINAHNANPGRSYDLAVNKFTDL 83
Query: 87 TREEMKSRL 95
T +E R
Sbjct: 84 TEKEFTQRF 92
>gi|403334193|gb|EJY66252.1| Cysteine protease [Oxytricha trifallax]
Length = 397
Score = 43.5 bits (101), Expect = 0.014, Method: Composition-based stats.
Identities = 22/70 (31%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT---YGINHL 83
+PE + F F+ +S+ TKEE R + F DN + I+ N+G G+N
Sbjct: 70 DPETQQAFSDFVAKHQRSFLTKEEYKARLSNFRDNYQTIKSHNEGRRKNGVSFKMGVNQF 129
Query: 84 SDLTREEMKS 93
SD ++ E+ S
Sbjct: 130 SDWSKAELNS 139
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 43.5 bits (101), Expect = 0.014, Method: Composition-based stats.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 4/72 (5%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N EH F F F K+Y TKEE RF VF+ N++ L+ +A +G+ SDL
Sbjct: 46 NAEH--HFSTFKAKFGKTYATKEEHDHRFGVFKSNMRRAR-LHAQLDPSAVHGVTKFSDL 102
Query: 87 TREEMKSR-LGL 97
T E + LGL
Sbjct: 103 TPAEFHRKFLGL 114
>gi|1581745|prf||2117247A Cys protease:ISOTYPE=1
Length = 467
Score = 43.5 bits (101), Expect = 0.014, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F + K Y + E A R VF++NL L L+ + A++ + SDLTREE +
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENL-LFARLHAAANPHASFAVTPFSDLTREEFR 95
Query: 93 SR 94
SR
Sbjct: 96 SR 97
>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 357
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y + EE+ RF+VF++NL LI NK + + +N +DLT +
Sbjct: 55 HVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113
Query: 90 EMK 92
E +
Sbjct: 114 EFQ 116
>gi|194352766|emb|CAQ00111.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 384
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 35/68 (51%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L +F ++ +SYPT EE +RF V+ N++ IE N+ + + G +DLT +E
Sbjct: 49 LGRFHGWMAAHGRSYPTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHDE 108
Query: 91 MKSRLGLN 98
+ N
Sbjct: 109 FMAMYSSN 116
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYP-TKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
L + E + +E ++ + KSY E KRF +F+DNL+ I++ N + G+N
Sbjct: 38 LSRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLN 97
Query: 82 HLSDLTREEMKS 93
+DLT EE +S
Sbjct: 98 RFADLTNEEYRS 109
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 1/58 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
E+++ + + Y E A+RF VF+ N IE N G H G+N +DLT +E +
Sbjct: 38 ERWMAQYGRMYKDDAEKARRFEVFKANAAFIESFNAGNH-KFWLGVNQFADLTNDEFR 94
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ EK++ + K Y E KRF +F++N++ IE N IN +DL EE
Sbjct: 35 ERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEF 94
Query: 92 KSRLGLNLSK 101
K+ L +N+ K
Sbjct: 95 KASL-INVQK 103
>gi|340053967|emb|CCC48260.1| cysteine peptidase precursor, fragment, partial [Trypanosoma
vivax Y486]
Length = 182
Score = 43.5 bits (101), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92
Query: 94 R 94
R
Sbjct: 93 R 93
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 18/59 (30%), Positives = 33/59 (55%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
++++ + + Y T E +R +F++NLK I+ NK + G+N +DLT EE +
Sbjct: 40 DQWMARYGRVYKTANEKNRRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTT 98
>gi|91992516|gb|ABE72974.1| cathepsin L [Ochlerotatus atropalpus]
Length = 313
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 29 EHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
EHL +F +F K+Y +E +R +F NL+ I N+ G T +NHL+D T
Sbjct: 3 EHLDNEFSRFKNKHGKNYHNDKEHDRRRDIFRQNLRFIHSHNRAGKGF-TVAVNHLADRT 61
Query: 88 REEMKSRLGLNLS 100
EE+K+ G S
Sbjct: 62 DEELKALRGFKSS 74
>gi|357439381|ref|XP_003589967.1| Cysteine proteinase [Medicago truncatula]
gi|357439401|ref|XP_003589977.1| Cysteine proteinase [Medicago truncatula]
gi|357439405|ref|XP_003589979.1| Cysteine proteinase [Medicago truncatula]
gi|355479015|gb|AES60218.1| Cysteine proteinase [Medicago truncatula]
gi|355479025|gb|AES60228.1| Cysteine proteinase [Medicago truncatula]
gi|355479027|gb|AES60230.1| Cysteine proteinase [Medicago truncatula]
Length = 127
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
K F+++I ++ ++Y E+ KR +F++ LK ++ NK T G+N SD T EE
Sbjct: 35 KAFQQWIHEYGRTYSNTTEMNKRRVIFKEELKYVKKFNKAGDEGYTIGLNQYSDWTDEEY 94
Query: 92 KSRLGLNLSKH 102
G L K+
Sbjct: 95 ---FGSQLPKY 102
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L F +++ S+ Y + E +RF +F+DNL I + NK E + G+N SDLT +E
Sbjct: 49 LDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEK-SYWLGLNKFSDLTHDE 107
Query: 91 MKS 93
++
Sbjct: 108 FRA 110
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 1/90 (1%)
Query: 6 SAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
S+ A L +FG + +T LK+ E+++ + K Y E R +F++N++
Sbjct: 10 SSLALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQR 69
Query: 65 IEDLNKGEHGTATYGINHLSDLTREEMKSR 94
IE N + GIN +DLT EE K+R
Sbjct: 70 IEAFNNAGNKPYKLGINQFADLTNEEFKAR 99
>gi|224035611|gb|ACN36881.1| unknown [Zea mays]
Length = 327
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 42/68 (61%), Gaps = 2/68 (2%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+++ ++Y + EE +RF VF+DNL I++ N+ + + G+N +DLT +E K+
Sbjct: 59 FERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNR-KVSSYWLGLNEFADLTHDEFKA 117
Query: 94 R-LGLNLS 100
LGL S
Sbjct: 118 TYLGLRSS 125
>gi|4733887|gb|AAD02173.3| cysteine proteinase [Acanthamoeba culbertsoni]
Length = 482
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 2/58 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
QF ++R ++SY + +E +R+ + +N+ IE+ N+G H T T +N DLT EE
Sbjct: 63 QFNSWMRRHARSY-SNDEFLERYNTWRENMDFIEEFNRGNH-TFTVAMNEHGDLTPEE 118
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ EK++ + K Y E KRF +F++N++ IE N IN +DL EE
Sbjct: 35 ERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEF 94
Query: 92 KSRLGLNLSK 101
K+ L +N+ K
Sbjct: 95 KASL-INVQK 103
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+ +FE ++ K Y + EE RF VF +NL I++ NK E + G+N +DL+ EE
Sbjct: 46 IARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNK-EVSSYWLGLNEFADLSHEE 104
Query: 91 MKSR 94
KS+
Sbjct: 105 FKSK 108
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 13/95 (13%)
Query: 11 LALFGQMK-----------SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFE 59
+ LFG M S ++L T ++ F ++ +K+Y +E RF +F+
Sbjct: 15 ICLFGHMSLSYCDFSIVGYSQDDL-TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFK 73
Query: 60 DNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
DNLK I++ NK +G G+N SDL+ +E K +
Sbjct: 74 DNLKYIDERNKMINGY-WLGLNEFSDLSNDEFKEK 107
>gi|440797510|gb|ELR18596.1| Cathepsin L precursor (Cysteine proteinase 1), putative
[Acanthamoeba castellanii str. Neff]
Length = 340
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 28/84 (33%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 9 ATLALFGQMKSNNELKTENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
A LALF + + L + + ++ F +++R +KSY T +E + R+AV+ DN + IE
Sbjct: 5 AILALFAVVFVVSALASGHSTSAEEQIFAQWMRAHAKSYAT-QEFSHRWAVWRDNHRFIE 63
Query: 67 DLNKGEHGTATYGINHLSDLTREE 90
N+ + T T +N DLT E
Sbjct: 64 AHNRQPNKTFTLAMNQFGDLTDHE 87
>gi|328876826|gb|EGG25189.1| hypothetical protein DFA_03437 [Dictyostelium fasciculatum]
Length = 341
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 42/73 (57%), Gaps = 1/73 (1%)
Query: 20 NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
N L T + ++ +F+ ++ + +K Y +EE R + F N+ IE +N+ TAT+G
Sbjct: 19 NVRLSTAD-DYTTRFKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIEKMNRQYGRTATFG 77
Query: 80 INHLSDLTREEMK 92
+N SDL+ +E K
Sbjct: 78 LNKFSDLSLDEFK 90
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-KS 93
E+++ + + Y E +R VF N + I+ +N+ + T T G+NH SDLT EE ++
Sbjct: 42 ERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFAQT 101
Query: 94 RLG 96
LG
Sbjct: 102 HLG 104
>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
Length = 274
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 24/39 (61%)
Query: 54 RFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
R+ VF+DNLK E L E GTA YG+ DLT EE +
Sbjct: 1 RYFVFQDNLKKAETLQDSERGTAKYGVTKFMDLTEEEFR 39
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella
moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella
moellendorffii]
Length = 299
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 31/60 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE + KSY + E A+R +F D L IE N + T T G+N SDLT E ++
Sbjct: 2 FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
>gi|118359377|ref|XP_001012928.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294695|gb|EAR92683.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 377
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 37/62 (59%), Gaps = 1/62 (1%)
Query: 34 FEKFIRDFSKSYP-TKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
FE+++++F+K+Y E+ R ++FE NL I D N + + G+N +D T+ E+K
Sbjct: 29 FEQYVKEFNKNYGFNSEDYQLRKSIFERNLAEIIDFNNDPNHSYKKGVNQFTDQTQNELK 88
Query: 93 SR 94
+
Sbjct: 89 EK 90
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F + F K Y ++EE RF+VF+ NL+ K + +A +G+ SDLTR E K
Sbjct: 50 HFSLFKKKFGKVYASREEHDYRFSVFKSNLRRARRHQKLD-PSARHGVTQFSDLTRSEFK 108
Query: 93 SR 94
+
Sbjct: 109 RK 110
>gi|353441136|gb|AEQ94152.1| drought-inducible cysteine proteinase [Elaeis guineensis]
Length = 252
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 1/59 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
F F+R F KSY ++E A RF+VF+ NL+ K + TA +GI SDLT E +
Sbjct: 55 FSSFLRRFGKSYADEKEHAYRFSVFKANLRRARRHQKMDP-TAVHGITKFSDLTPAEFR 112
>gi|294885122|ref|XP_002771197.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239874644|gb|EER03013.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 111
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 2/65 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MK 92
F F F K Y +KEE KR A+F+ NL IE +N ++ + T G+N +DLT EE +
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVN-AQNLSYTLGVNEYADLTHEEFVA 86
Query: 93 SRLGL 97
++G+
Sbjct: 87 QKVGI 91
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGI 80
L+ E ++ ++ + +SY E +RF VF DNLK ++ N EHG G+
Sbjct: 38 LERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGM 97
Query: 81 NHLSDLTREEMKS 93
N +DLT +E +S
Sbjct: 98 NRFADLTNDEFRS 110
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED---LNKGEHGTATYGINHLSD 85
E + +E+++ K Y + E KRF +F+DNL+ I+ NK H T G+N +D
Sbjct: 29 EVIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFAD 88
Query: 86 LTREEMKS 93
LT +E S
Sbjct: 89 LTLDEFSS 96
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T EE
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N EH F F F+K+Y T+EE RF +F++NL + K + +A +G+ SDL
Sbjct: 46 NAEH--HFSAFKTKFAKTYATQEEHDHRFRIFKNNLLRAKSHQKLD-PSAVHGVTRFSDL 102
Query: 87 TREEMKSR-LGL 97
T E + + LGL
Sbjct: 103 TPSEFRGQFLGL 114
>gi|6448469|dbj|BAA86911.1| homologue of Sarcophaga 26,29kDa proteinase [Periplaneta americana]
Length = 552
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 29 EHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
EH++ F+ F + SK Y + E KR +F NL+ I N+ G T +NHL+D T
Sbjct: 242 EHVETAFDHFRKRHSKDYASNLEHTKRKEIFRQNLRFIHSKNRARLGF-TLDVNHLADRT 300
Query: 88 REEMKSRLG 96
E+K+ G
Sbjct: 301 ELELKALRG 309
>gi|389611850|dbj|BAM19484.1| cathepsin L [Papilio xuthus]
Length = 342
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F++F +K Y ++ E AKR +F NL+ I N+ G T +NHL+D +E+
Sbjct: 35 EFDRFKAKHNKKYASEIEHAKRLNIFRQNLRYIHSNNRARRGF-TLAVNHLADWAEDELA 93
Query: 93 SRLGLNLS 100
+ G S
Sbjct: 94 ALRGRRYS 101
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 4/73 (5%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ F++ Y E A+RF VF+ N+ IE N E+ G+N +DLT +E
Sbjct: 34 VERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFN-AENRKFWLGVNQFTDLTNDE 92
Query: 91 M---KSRLGLNLS 100
K+ GL +S
Sbjct: 93 FRATKTNKGLKMS 105
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 19/59 (32%), Positives = 33/59 (55%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE
Sbjct: 37 ERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEE 95
>gi|76097507|gb|ABA39436.1| Der f 1 allergen precursor [Dermatophagoides farinae]
Length = 276
Score = 43.5 bits (101), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 2 PASIKTFEEFKKAFNKNYATVEEEEVARKN--FLESLKYVE-ANKG-------AINHLSD 51
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 52 LSLDEFKNR 60
>gi|354494740|ref|XP_003509493.1| PREDICTED: cathepsin W-like [Cricetulus griseus]
gi|344243260|gb|EGV99363.1| Cathepsin W [Cricetulus griseus]
Length = 376
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 35/68 (51%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E ++ F+ F +++SY E A+R +F NL + L + + GTA +G SDLT
Sbjct: 35 ELIEVFKLFQIKYNRSYANPAEYARRLNIFAHNLAQAQRLQEEDLGTAEFGETPFSDLTE 94
Query: 89 EEMKSRLG 96
EE G
Sbjct: 95 EEFGQLYG 102
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + + Y E +KR+ +F+DN+ IE NK + + IN +DLT EE
Sbjct: 37 ERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEF 96
Query: 92 KS 93
++
Sbjct: 97 RA 98
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 42 SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK-SRLGL 97
SK Y + +E KR+ +F+ NL+ I + N+ +G+ G+NH +D+ EE K S LGL
Sbjct: 63 SKIYASPKEKVKRYEIFKRNLRHIVETNR-RNGSYWLGLNHFADIAHEEFKASYLGL 118
>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
Length = 343
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 3/91 (3%)
Query: 9 ATLALFGQMK---SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
AT+ LF + S N E + F ++ + KSY TKEE R+ ++ N+ +
Sbjct: 15 ATVGLFAISEAPASTNLFAIEVTQDNVAFANYLAKYGKSYGTKEEFQFRYEQYQKNMAKV 74
Query: 66 EDLNKGEHGTATYGINHLSDLTREEMKSRLG 96
N T GIN +D T EE K LG
Sbjct: 75 AQYNGQNGNTFRLGINKFTDYTPEEYKVLLG 105
>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 376
Score = 43.1 bits (100), Expect = 0.018, Method: Composition-based stats.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
NEL+ H F F++ F+KSY +E A R +VF NL+ + + +A +G+
Sbjct: 42 NELELNAEAH---FASFVQRFNKSYRDADEHAHRLSVFTANLRRARRHQRLD-PSAVHGV 97
Query: 81 NHLSDLTREEMKSR-LGL 97
SDLT +E + R LGL
Sbjct: 98 TKFSDLTPDEFRDRFLGL 115
>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
Length = 337
Score = 43.1 bits (100), Expect = 0.018, Method: Composition-based stats.
Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 9/74 (12%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FEKFI ++K Y +++E R+ +F N++ I N + +A Y IN +D+ + E+
Sbjct: 40 FEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQKNS-RNDSAVYKINRFADMPKNEIVI 98
Query: 94 R--------LGLNL 99
R LGLN
Sbjct: 99 RHTGLASGELGLNF 112
>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 43.1 bits (100), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 2/67 (2%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
KQFE F+ + K Y +++E RF F +NLK I+ N E G+A YG+ +DL+ E
Sbjct: 48 KQFENFLLEHPKMY-SEQESHSRFQTFWENLKRIKFHNHIEQGSAKYGVTEFTDLSDFEF 106
Query: 92 KSR-LGL 97
+ LGL
Sbjct: 107 RRHYLGL 113
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella
moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella
moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella
moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella
moellendorffii]
Length = 300
Score = 43.1 bits (100), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 31/60 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE + KSY + E A+R VF D L IE N + T T G+N SDLT E ++
Sbjct: 2 FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 43.1 bits (100), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 43/72 (59%), Gaps = 6/72 (8%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTR 88
++ FE ++ K Y T EE RF VF+DNLK I+D NK + Y G+N +DL+
Sbjct: 44 IELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNK---VVSNYWLGLNEFADLSH 100
Query: 89 EEMKSR-LGLNL 99
+E K++ LGL +
Sbjct: 101 QEFKNKYLGLKV 112
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 43.1 bits (100), Expect = 0.018, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 42 SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK-SRLGL 97
SK Y + +E KR+ +F+ NL+ I + N+ +G+ G+NH +D+ EE K S LGL
Sbjct: 54 SKIYASPKEKVKRYEIFKRNLRHIVETNR-RNGSYWLGLNHFADIAHEEFKASYLGL 109
>gi|37958161|gb|AAP35075.1| Der f 1 allergen [Dermatophagoides farinae]
Length = 263
Score = 43.1 bits (100), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARKN--FLESLKYVE-ANKG-------AINHLSD 69
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 70 LSLDEFKNR 78
>gi|440800456|gb|ELR21495.1| cathepsin Llike proteinase [Acanthamoeba castellanii str. Neff]
Length = 557
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 24/81 (29%), Positives = 37/81 (45%), Gaps = 1/81 (1%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L++ +M + + N E QF F K Y E + R +F NL+ IE NK
Sbjct: 211 LSVLRKMFQASPVPDHNDEVAAQFAAHAHKFGKVYADHSEYSMRLNIFRKNLEYIEQYNK 270
Query: 71 GEHGTATYGINHLSDLTREEM 91
+ G +NH D+T +E+
Sbjct: 271 KDTGM-KLAMNHFGDMTYDEI 290
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 33/60 (55%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
+E ++ K+Y E RF +F DNLK I++ N + + G+N +DLT EE +S
Sbjct: 36 YELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFADLTNEEYRS 95
>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
Length = 365
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H +F +F + KSY + EV +RF +F ++L+ + N+ + + GIN SD++ E
Sbjct: 60 HALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNR-KGLSYRLGINRFSDMSWE 118
Query: 90 EMK-SRLG 96
E + +RLG
Sbjct: 119 EFQATRLG 126
>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 3/62 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL-KLIEDLNKGEHGTATYGINHLSDLTREE 90
++F+ FI++ +K Y T+EE RF +F NL + +E ++ TA +G+ DLT EE
Sbjct: 12 EKFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVE--HQALDPTAIHGVTPFMDLTEEE 69
Query: 91 MK 92
+
Sbjct: 70 FE 71
>gi|407844577|gb|EKG02025.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
C1, cathepsin L-like, putative, partial [Trypanosoma
cruzi]
Length = 308
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF +F + + Y + E A R +VF NL + L+ + A +G+ SDLTREE +
Sbjct: 65 QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHANFGVTPFSDLTREEFR 123
Query: 93 SR 94
SR
Sbjct: 124 SR 125
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T EE
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|326515410|dbj|BAK03618.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 202
Score = 43.1 bits (100), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHG--TATYGINHLSDLTREE 90
F + F K+Y + E +R+AVF++ L+L++ N GE G A GIN L+D+T EE
Sbjct: 50 FGAWKAKFGKTYSSVGEEERRYAVFKETLRLVDQHNAAGEAGVPVARMGINGLADMTTEE 109
Query: 91 M 91
Sbjct: 110 W 110
>gi|357630541|gb|EHJ78589.1| hypothetical protein KGM_15348 [Danaus plexippus]
Length = 98
Score = 43.1 bits (100), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 36/66 (54%), Gaps = 5/66 (7%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT--ATYGINHLSDLTRE 89
K FEKF+ D+ K Y + + A + F +L I NKG + TY INHL+D T E
Sbjct: 30 KLFEKFMADYDKHYKDQIDTANHYNAFLASLVTI---NKGNRDSPLTTYDINHLADYTPE 86
Query: 90 EMKSRL 95
E+ S L
Sbjct: 87 EIDSTL 92
>gi|348513249|ref|XP_003444155.1| PREDICTED: cathepsin K-like [Oreochromis niloticus]
Length = 330
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 27/80 (33%), Positives = 47/80 (58%), Gaps = 6/80 (7%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI----EDLNKGEHGTATYGINH 82
+P K +E++ K Y + E+ R AV+E N+ L+ ++ + G+H + T G+NH
Sbjct: 19 SPAVNKLWEEWKTKHGKVYDNQTEIDFRRAVWEKNVHLVLRHNQEASAGKH-SFTLGLNH 77
Query: 83 LSDLTREEMKSRL-GLNLSK 101
L+D+T EE+ +L GL L +
Sbjct: 78 LADMTAEEINEKLNGLKLEE 97
>gi|357473651|ref|XP_003607110.1| Cysteine proteinase [Medicago truncatula]
gi|355508165|gb|AES89307.1| Cysteine proteinase [Medicago truncatula]
Length = 331
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 13/78 (16%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLTR 88
QF +F + F K Y +K+E RF VF+ NL HG +AT+G+ SDLT
Sbjct: 47 QFNEFKQRFGKVYSSKDEHDYRFNVFKSNLH-----RAKRHGIMDPSATHGVTRFSDLTP 101
Query: 89 EEMKSRL----GLNLSKH 102
E ++ + G+ L +H
Sbjct: 102 REFRNSILGLKGVGLPRH 119
>gi|345316917|ref|XP_001511419.2| PREDICTED: cathepsin W-like, partial [Ornithorhynchus anatinus]
Length = 252
Score = 43.1 bits (100), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 39/73 (53%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
GQ + L E + +F++F ++KSY + E A+RF +F NL L + + G
Sbjct: 28 GQNQHPQPLPDTTLELMDKFKEFQIRYNKSYEDQAEHARRFEIFVQNLARARKLQEEDQG 87
Query: 75 TATYGINHLSDLT 87
TA +G+ SDL+
Sbjct: 88 TAEFGVTPFSDLS 100
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 22/64 (34%), Positives = 33/64 (51%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ EK++ + K Y E KRF VF++N++ IE N IN +DL EE
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92
Query: 92 KSRL 95
K+ L
Sbjct: 93 KALL 96
>gi|38423491|emb|CAD80247.1| salarin [Salvelinus alpinus]
Length = 342
Score = 43.1 bits (100), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 3/67 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHG--TATYGINHLSDLTR 88
K+FE + + K+YP+ EE AKR ++ K++ + NK E+G + T +NH +DLT
Sbjct: 272 KEFETWKVKYGKTYPSTEEEAKRKEIWLATRKMVTEHNKRAENGQESFTMAVNHFADLTT 331
Query: 89 EEMKSRL 95
EE+ L
Sbjct: 332 EEVPKGL 338
Score = 38.9 bits (89), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 3/67 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTR 88
K+FE + + KSYP+ EE AKR ++ K + + N + +G +Y +NH +DLT
Sbjct: 32 KEFETWKVKYGKSYPSTEEEAKRKEMWLATRKRVMEHNTRAGNGLESYTMAVNHFADLTT 91
Query: 89 EEMKSRL 95
EE+ L
Sbjct: 92 EEVPKGL 98
>gi|387765908|gb|AFJ95133.1| cathepsin-L [Toxocara canis]
Length = 360
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 23/66 (34%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT-YGINHLSDLTRE 89
L +FE FIR + K Y + EE A+RF ++ +N+ + LN+ T YG N +D
Sbjct: 47 LDRFEDFIRKYDKVYDSNEEFAERFRIYVNNMLEAQKLNQRNRDYGTIYGENEFADWNVN 106
Query: 90 EMKSRL 95
E + L
Sbjct: 107 EFREIL 112
>gi|258618831|gb|ACV84238.1| cysteine proteinase L [Anisakis simplex]
Length = 411
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+ ++ QF F+ + + Y E +RF F +N+K I+ + +G+ +GI +D
Sbjct: 96 EDFAYIDQFIDFMNVYGRKYHGYHETRERFQNFVNNMKYIKKIQQGKQ-NVQFGITRFAD 154
Query: 86 LTREEMKS 93
+ EEMKS
Sbjct: 155 WSEEEMKS 162
>gi|198435380|ref|XP_002128293.1| PREDICTED: similar to cathepsin H [Ciona intestinalis]
Length = 438
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 20 NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
NN+++ E K ++ + K Y +EE KRF +F +LK I++ N T G
Sbjct: 124 NNDVEVEERNLFKGWQI---EHGKQYINQEEAEKRFQIFSKSLKTIKEFNNRVDRTWEMG 180
Query: 80 INHLSDLTREEMKS 93
+N SD T EE S
Sbjct: 181 LNEFSDRTFEEFAS 194
>gi|313229615|emb|CBY18430.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 25/59 (42%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
FE + + KSY T E+ R VFE+N+ IE +NK E+ + T G+N SDLT +E +
Sbjct: 18 FEDWTAEHWKSYETAEDEKFRKGVFEENVAKIEKINK-ENRSWTAGLNKFSDLTWDEFQ 75
>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
Length = 365
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 14/83 (16%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-----------HGTATYGINH 82
F+ F++ ++KSY +E R+ VF+DNL I N+ +A +G+N
Sbjct: 55 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 114
Query: 83 LSDLTREE-MKSRLG--LNLSKH 102
SD T +E + S G LNLS+H
Sbjct: 115 FSDKTPDEVLHSNTGFFLNLSQH 137
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 43.1 bits (100), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 2/67 (2%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
KQFE F+ + K Y +++E RF F +NLK I+ N E G+A YG+ +DL+ E
Sbjct: 48 KQFENFLLEHPKMY-SEQESHSRFQTFWENLKRIKFHNHIEQGSAKYGVTEFADLSDFEF 106
Query: 92 KSR-LGL 97
+ LGL
Sbjct: 107 RRHYLGL 113
>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
[Cucumis sativus]
Length = 381
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 4/70 (5%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EH F F R F KSY T+EE +RF +F+ N++ E ++ +A +G+ SDLT
Sbjct: 56 EH--HFSLFKRRFGKSYATEEEHDRRFKIFKANMRRAER-HQSFDPSAIHGVTQFSDLTP 112
Query: 89 EEM-KSRLGL 97
E K+ LGL
Sbjct: 113 FEFRKAFLGL 122
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 24/76 (31%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
S N+L T ++ FE ++ +K Y +E RF +F+DNLK I++ NK ++ +
Sbjct: 52 SQNDL-TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNK-KNNSYWL 109
Query: 79 GINHLSDLTREEMKSR 94
G+N +D++ +E K +
Sbjct: 110 GLNVFADMSNDEFKEK 125
>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
Australia]
Length = 367
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 14/83 (16%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-----------HGTATYGINH 82
F+ F++ ++KSY +E R+ VF+DNL I N+ +A +G+N
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116
Query: 83 LSDLTREE-MKSRLG--LNLSKH 102
SD T +E + S G LNLS+H
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQH 139
>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
Length = 367
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 14/83 (16%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-----------HGTATYGINH 82
F+ F++ ++KSY +E R+ VF+DNL I N+ +A +G+N
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116
Query: 83 LSDLTREE-MKSRLG--LNLSKH 102
SD T +E + S G LNLS+H
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQH 139
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine
endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 19/59 (32%), Positives = 33/59 (55%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE
Sbjct: 37 ERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEE 95
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 43.1 bits (100), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 41/69 (59%), Gaps = 6/69 (8%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTREEM 91
FE ++ K Y T EE RF VF+DNLK I+D NK + Y G+N +DL+ +E
Sbjct: 47 FESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNK---IVSNYWLGLNEFADLSHQEF 103
Query: 92 KSR-LGLNL 99
K++ LGL +
Sbjct: 104 KNKYLGLKV 112
>gi|2695929|emb|CAA10983.1| putative thiol protease [Hordeum vulgare subsp. vulgare]
Length = 111
Score = 43.1 bits (100), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 33/60 (55%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L +F ++ +SYPT EE +RF V+ N++ IE N+ + + G +DLT EE
Sbjct: 46 LGRFHGWMAAHGRSYPTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHEE 105
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 4/70 (5%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
EH F F R F KSY T+EE +RF +F+ N++ E ++ +A +G+ SDLT
Sbjct: 56 EH--HFSLFKRRFGKSYATEEEHDRRFKIFKANMRRAER-HQSFDPSAIHGVTQFSDLTP 112
Query: 89 EEM-KSRLGL 97
E K+ LGL
Sbjct: 113 FEFRKAFLGL 122
>gi|297819034|ref|XP_002877400.1| hypothetical protein ARALYDRAFT_323209 [Arabidopsis lyrata subsp.
lyrata]
gi|297323238|gb|EFH53659.1| hypothetical protein ARALYDRAFT_323209 [Arabidopsis lyrata subsp.
lyrata]
Length = 317
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H+ F +F + K Y + EE+ RF+VF++NL LI NK + + +N +DLT +
Sbjct: 55 HVLSFSRFAHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113
Query: 90 EMK 92
E +
Sbjct: 114 EFQ 116
>gi|297613009|ref|NP_001066557.2| Os12g0273800 [Oryza sativa Japonica Group]
gi|255670224|dbj|BAF29576.2| Os12g0273800 [Oryza sativa Japonica Group]
Length = 210
Score = 43.1 bits (100), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 31/61 (50%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
E+++ + Y E A+R VF+ N+ IE N G G+N +DLT EE K+
Sbjct: 45 ERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKAT 104
Query: 95 L 95
+
Sbjct: 105 M 105
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 30/62 (48%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+ FE+++ F K YP E RF VF DN++ I + +N +DLT +E
Sbjct: 17 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 76
Query: 92 KS 93
S
Sbjct: 77 VS 78
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 30/62 (48%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+ FE+++ F K YP E RF VF DN++ I + +N +DLT +E
Sbjct: 39 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 98
Query: 92 KS 93
S
Sbjct: 99 VS 100
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 43.1 bits (100), Expect = 0.022, Method: Composition-based stats.
Identities = 23/78 (29%), Positives = 45/78 (57%), Gaps = 4/78 (5%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
E+P + F+ +++ ++Y + EE +RF V+ DNL+ + + N G H + + +D
Sbjct: 34 ESPR--EAFDFWVQTLKRAYASAEEYERRFDVWLDNLRFVHEYNAG-HTSHWLSMGVYAD 90
Query: 86 LTREEMKSR-LGLNLSKH 102
L+++E +S+ LG N H
Sbjct: 91 LSQDEYRSKALGYNADLH 108
>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
Length = 322
Score = 43.1 bits (100), Expect = 0.022, Method: Composition-based stats.
Identities = 22/67 (32%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MKS 93
++++ FS+ Y + E R VF+ NLK IE+ N + + T G+N +D T EE + +
Sbjct: 39 QQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSYTVGVNEFTDWTIEEFLAT 98
Query: 94 RLGLNLS 100
GL ++
Sbjct: 99 HTGLRVN 105
>gi|42564157|gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinotarsa
decemlineata]
Length = 322
Score = 43.1 bits (100), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
Q+ F + K+Y + E RF +F++NL+ IE N + E G TY + +D+TR+
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAEYEEGKVTYYMAVTQFADMTRD 81
Query: 90 EMKSRLGL 97
E + +LGL
Sbjct: 82 EFRKKLGL 89
>gi|340053969|emb|CCC48263.1| cysteine peptidase precursor, fragment, partial [Trypanosoma
vivax Y486]
Length = 259
Score = 43.1 bits (100), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92
Query: 94 R 94
R
Sbjct: 93 R 93
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 43.1 bits (100), Expect = 0.023, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H +F +F + KSY + EV +RF +F ++L+ + N+ + + GIN SD++ E
Sbjct: 58 HALRFARFAVRYGKSYESAAEVQRRFRIFSESLEEVRSTNQ-KGLSYRLGINRYSDMSWE 116
Query: 90 EMK-SRLG 96
E + SRLG
Sbjct: 117 EFQASRLG 124
>gi|403348594|gb|EJY73736.1| Cysteine protease [Oxytricha trifallax]
Length = 362
Score = 43.1 bits (100), Expect = 0.023, Method: Composition-based stats.
Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 8/92 (8%)
Query: 9 ATLALFGQMKSN--------NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
A LAL G + N N L +PE F F+ +S+ T+EE R A+F D
Sbjct: 12 AALALIGVLNLNESSLENNSNLLLKVSPEVQSAFNNFVSRQQRSFLTQEEFKARLAIFRD 71
Query: 61 NLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
N + ++ N + + IN +D +++E++
Sbjct: 72 NYERVQLHNSQKDVSFKLAINKFADWSKQELQ 103
>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
Length = 325
Score = 43.1 bits (100), Expect = 0.023, Method: Composition-based stats.
Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F + F K+Y +EE R +VF +NLK+++ N + + GI DL+ +E +
Sbjct: 23 QFAAFEKKFGKTYVGEEERRFRMSVFSNNLKIVDYYNS-KQSSFVLGITPFIDLSNDEFR 81
Query: 93 SRLGLN 98
R N
Sbjct: 82 ERFASN 87
>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
Length = 373
Score = 43.1 bits (100), Expect = 0.024, Method: Composition-based stats.
Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 3/94 (3%)
Query: 1 MAEDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
+AE +S++ + Q+ E K + E F F R F K Y + EE R +VF+
Sbjct: 25 VAETSSSDGDDLVIRQVVDGAEPKVLSSE--DHFSLFKRKFGKVYASSEEHDYRLSVFKA 82
Query: 61 NLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
NL+ K + +A +G+ SDLTR E + +
Sbjct: 83 NLRRARRHQKLD-PSARHGVTQFSDLTRSEFRKK 115
>gi|118397743|ref|XP_001031203.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89285527|gb|EAR83540.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 358
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 37 FIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
F +SK Y +KE RFA F +NLK I+ LN E TA + I+ SD T+EE
Sbjct: 42 FKNTYSKVYESKEVEQFRFATFVENLKEIDRLN-AEVTTAQFDISFFSDFTKEE 94
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F +++Y + +E KRF +F N+K LN+ ++ AT+G N +D+T EE ++
Sbjct: 25 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNR-KNPMATFGPNEFADMTSEEFQT 83
Query: 94 R 94
R
Sbjct: 84 R 84
>gi|332374780|gb|AEE62531.1| unknown [Dendroctonus ponderosae]
Length = 544
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 2/80 (2%)
Query: 23 LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
+K E HL+ +F KF + + Y + E R +F N++ I N+ G + +N
Sbjct: 228 IKPETTGHLEFEFNKFTKKHRRIYTNQNERLLRMEIFRQNVRFIHSHNRKNVGF-SLSVN 286
Query: 82 HLSDLTREEMKSRLGLNLSK 101
HL+D T E+K+ G SK
Sbjct: 287 HLADKTETELKALRGKTYSK 306
>gi|343412631|emb|CCD21595.1| hypothetical protein, conserved in T. vivax [Trypanosoma vivax
Y486]
Length = 257
Score = 42.7 bits (99), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + + +SY T E A R VFEDN++ + + AT+G+ SDLT EE ++
Sbjct: 14 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 72
Query: 94 R 94
R
Sbjct: 73 R 73
>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 37/64 (57%), Gaps = 2/64 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F F K+Y +KEE KR A+F+ NL IE +N + + G+N +DLT EE +
Sbjct: 28 FMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEQVNAKDL-SYKLGVNEHADLTHEEFAA 86
Query: 94 -RLG 96
+LG
Sbjct: 87 LKLG 90
>gi|145547990|ref|XP_001459676.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427502|emb|CAK92279.1| unnamed protein product [Paramecium tetraurelia]
Length = 329
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 36/60 (60%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
L QF+++ +F+K+Y +K E RF ++ NL++I+ N + + T G N DLT +E
Sbjct: 22 LNQFQEWKTEFNKNYQSKYEEIYRFQIYIANLEIIQTHNSNNNYSYTLGENQFMDLTNDE 81
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F +++Y + +E KRF +F N+K LN+ ++ AT+G N +D+T EE ++
Sbjct: 10 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNR-KNPMATFGPNEFADMTSEEFQT 68
Query: 94 R 94
R
Sbjct: 69 R 69
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 20/72 (27%), Positives = 39/72 (54%)
Query: 22 ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
E E + +E+++ + K+Y E +RF +F+DNLK +E+ + + T G+
Sbjct: 31 ETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLT 90
Query: 82 HLSDLTREEMKS 93
+DLT +E ++
Sbjct: 91 RFADLTNDEFRA 102
>gi|294869083|ref|XP_002765753.1| Cysteine proteinase 3 precursor, putative [Perkinsus marinus ATCC
50983]
gi|239865917|gb|EEQ98470.1| Cysteine proteinase 3 precursor, putative [Perkinsus marinus ATCC
50983]
Length = 174
Score = 42.7 bits (99), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + KSY KEE KR A+F DNL IE++N ++ + G+N +DLT EE +
Sbjct: 27 FIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNA-QNLSYKLGVNEYTDLTLEEFAA 85
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 3 EDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL 62
ED ++E+ L + ++ N L L+QF + K+Y E+ RFAV++DNL
Sbjct: 30 EDGTSESFLHMTTDLEHENLL-------LEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNL 82
Query: 63 KLIEDLNKGEHGTATYGINHLSDLTREEMK 92
I + + T + G+ +DLT EE +
Sbjct: 83 AYIR--HSETNRTYSLGLTKFADLTNEEFR 110
>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
Length = 343
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 4/68 (5%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK---GEHGTATYGINHLSD 85
E QF +F F+K Y + EE +RF +F+ NL IE+LN +G+N +D
Sbjct: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFAD 82
Query: 86 LTREEMKS 93
L+ +E K+
Sbjct: 83 LSSDEFKN 90
>gi|389615359|dbj|BAM20657.1| cathepsin L, partial [Papilio polytes]
Length = 377
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F++F +K Y ++ E AKR +F NL+ I N+ G T +NHL+D +E+
Sbjct: 246 EFDRFKMKHNKKYASEIEHAKRLNIFRQNLRYIHSNNRARRGY-TLAVNHLADWAEDELA 304
Query: 93 SRLGLNLS 100
+ G S
Sbjct: 305 ALRGRRYS 312
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 18/63 (28%), Positives = 37/63 (58%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ F++ Y + E RF +F+ NL+ ++ N ++ T +N SDLT EE
Sbjct: 32 IEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEFVQSFNMNKNITYKLDVNEFSDLTDEE 91
Query: 91 MKS 93
++
Sbjct: 92 FRA 94
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 22/64 (34%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ FE ++ K Y T +E RF F+DNL I++ NK ++ + G+N +DLT +E
Sbjct: 45 IRLFESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDETNK-KNNSYWLGLNEFADLTHDE 103
Query: 91 MKSR 94
K +
Sbjct: 104 FKEK 107
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 3/72 (4%)
Query: 31 LKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
LKQ FEK+++ SK Y ++E RF +++ N++LI+ +N H N +D+T
Sbjct: 39 LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINS-LHLPFKLTDNRFADMTNS 97
Query: 90 EMKSR-LGLNLS 100
E K+ LGLN S
Sbjct: 98 EFKAHFLGLNTS 109
>gi|323447420|gb|EGB03341.1| hypothetical protein AURANDRAFT_15921 [Aureococcus
anophagefferens]
Length = 124
Score = 42.7 bits (99), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
F+ + DFS++Y T +E A+R+A F+ NL ++ LN G H A +G+ +D + E
Sbjct: 8 FDAWAADFSRAYATADERAERYAHFKKNLAEVDRLN-GAHPYALFGLTRFADRSDAE 63
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 3/72 (4%)
Query: 31 LKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
LKQ FEK+++ SK Y ++E RF +++ N++LI+ +N H N +D+T
Sbjct: 39 LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINS-LHLPFKLTDNRFADMTNS 97
Query: 90 EMKSR-LGLNLS 100
E K+ LGLN S
Sbjct: 98 EFKAHFLGLNTS 109
>gi|294901125|ref|XP_002777247.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239884778|gb|EER09063.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 214
Score = 42.7 bits (99), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F + KSY KEE KR A+F DNL IE++N ++ + G+N +DLT EE +
Sbjct: 27 FIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNA-QNLSYKLGVNEYTDLTLEEFAA 85
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 1/85 (1%)
Query: 11 LALFGQMKSNNELKT-ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
L +FG + +T E+ ++ E+++ + K Y E R +F++N++ IE N
Sbjct: 15 LLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFN 74
Query: 70 KGEHGTATYGINHLSDLTREEMKSR 94
+ + GIN +DLT EE K+R
Sbjct: 75 NAGNKSYKLGINQFADLTNEEFKAR 99
>gi|389583697|dbj|GAB66431.1| vivapain [Plasmodium cynomolgi strain B]
Length = 487
Score = 42.7 bits (99), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 37/69 (53%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F+++F K Y T +E+ +R+ F +NL I+ N E+ G+N DL
Sbjct: 160 NLESVNSFYLFVKEFGKKYKTADEMQQRYQSFVENLAKIKAHNSKENVLYRKGMNQFGDL 219
Query: 87 TREEMKSRL 95
+ EE K +
Sbjct: 220 SFEEFKKKF 228
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 43 KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
+ Y EE AKRF +F++NLK + + N H T G+N +D++ EE K +
Sbjct: 55 RVYKHAEETAKRFEIFKENLKYVIERNSKGH-RHTLGMNKFADMSNEEFKEK 105
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 30/84 (35%), Positives = 51/84 (60%), Gaps = 7/84 (8%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
S+ +LK+ + + ++ FE +I K Y + EE RF +F+DNLK I++ NK + Y
Sbjct: 34 SSEDLKSMD-KLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNK---VVSNY 89
Query: 79 --GINHLSDLTREEMKSR-LGLNL 99
G+N +DL+ +E K++ LGL +
Sbjct: 90 WLGLNEFADLSHQEFKNKYLGLKV 113
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 5/79 (6%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
GQ+ E +T + +E ++ ++Y E +RF +F+DNLK I++ N +
Sbjct: 11 GQVPERTEAETR-----RIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNP 65
Query: 75 TATYGINHLSDLTREEMKS 93
+ G+N +DL+ +E +S
Sbjct: 66 SYKLGLNKFADLSNDEYRS 84
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ + + Y + E + RF +F DN+K IE+ NK + +N +D T EE
Sbjct: 54 FERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEE 113
Query: 91 MK-SRLGLNLS 100
+ SR G ++
Sbjct: 114 FQASRNGYKMA 124
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 21/65 (32%), Positives = 36/65 (55%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + +E+++ + K+Y E RF +F DNLK IE+ N + T G+ +DLT
Sbjct: 38 EARRMYEQWLVENRKNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTN 97
Query: 89 EEMKS 93
+E ++
Sbjct: 98 DEFRA 102
>gi|47606562|gb|AAT36265.1| vivapain-3 [Plasmodium vivax]
gi|47606564|gb|AAT36266.1| vivapain-3 [Plasmodium vivax]
gi|47606566|gb|AAT36267.1| vivapain-3 [Plasmodium vivax]
gi|47606568|gb|AAT36268.1| vivapain-3 [Plasmodium vivax]
gi|47606570|gb|AAT36269.1| vivapain-3 [Plasmodium vivax]
gi|47606572|gb|AAT36270.1| vivapain-3 [Plasmodium vivax]
gi|47606574|gb|AAT36271.1| vivapain-3 [Plasmodium vivax]
gi|47606588|gb|AAT36278.1| vivapain-3 [Plasmodium vivax]
gi|47606590|gb|AAT36279.1| vivapain-3 [Plasmodium vivax]
gi|47606592|gb|AAT36280.1| vivapain-3 [Plasmodium vivax]
gi|47606594|gb|AAT36281.1| vivapain-3 [Plasmodium vivax]
gi|47606596|gb|AAT36282.1| vivapain-3 [Plasmodium vivax]
gi|47606598|gb|AAT36283.1| vivapain-3 [Plasmodium vivax]
gi|47606600|gb|AAT36284.1| vivapain-3 [Plasmodium vivax]
Length = 495
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 37/68 (54%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F+++ K Y T +E+ +R+ F +NL I+ N E+ G+N DL
Sbjct: 166 NLETVNSFYLFMKEHGKEYSTADEMQQRYLSFAENLAKIKAHNSRENVLYRKGMNRFGDL 225
Query: 87 TREEMKSR 94
+ EE+K +
Sbjct: 226 SFEEIKKK 233
>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 331
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 6/72 (8%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK----GEHGTATYGINHLSDLTR 88
+E++ +K Y T EE R A++E NL++IE N+ G H T T G+N D+T+
Sbjct: 27 HWEEWKMTHTKEYITVEEEGIRRAIWEKNLRMIEAHNQEAALGMH-TYTLGMNQFGDMTQ 85
Query: 89 EEMKSRL-GLNL 99
EE+ R+ GL +
Sbjct: 86 EEVVERMTGLQM 97
>gi|387178006|gb|AFJ68066.1| Der f 1 variant, partial [Dermatophagoides farinae]
Length = 305
Score = 42.7 bits (99), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)
Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
P +K FE+F + F+K+Y T +EEVA++ F ++LK +E NKG INHLSD
Sbjct: 4 PASIKIFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 53
Query: 86 LTREEMKSR 94
L+ +E K+R
Sbjct: 54 LSLDEFKNR 62
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 19/63 (30%), Positives = 36/63 (57%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ E+++ F++ Y + E RF +F+ NL+ +++ N T IN SDLT EE
Sbjct: 32 IEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEE 91
Query: 91 MKS 93
++
Sbjct: 92 FRA 94
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ EK++ + + Y E KRF VF++N+ IE N IN +DL EE
Sbjct: 35 ERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEF 94
Query: 92 KSRLGLNLSK 101
K+ L +N+ K
Sbjct: 95 KALL-INVQK 103
>gi|47606576|gb|AAT36272.1| vivapain-3 [Plasmodium vivax]
gi|47606584|gb|AAT36276.1| vivapain-3 [Plasmodium vivax]
gi|47606586|gb|AAT36277.1| vivapain-3 [Plasmodium vivax]
Length = 495
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 37/68 (54%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F+++ K Y T +E+ +R+ F +NL I+ N E+ G+N DL
Sbjct: 166 NLETVNSFYLFMKEHGKEYSTADEMQQRYLSFAENLAKIKAHNSRENVLYRKGMNRFGDL 225
Query: 87 TREEMKSR 94
+ EE+K +
Sbjct: 226 SFEEIKKK 233
>gi|156098482|ref|XP_001615273.1| vivapain-3 [Plasmodium vivax Sal-1]
gi|32395685|gb|AAP04594.1| vivapain-3 [Plasmodium vivax]
gi|47606602|gb|AAT36285.1| vivapain-3 [Plasmodium vivax]
gi|47606604|gb|AAT36286.1| vivapain-3 [Plasmodium vivax]
gi|148804147|gb|EDL45546.1| vivapain-3 [Plasmodium vivax]
Length = 495
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 37/68 (54%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F+++ K Y T +E+ +R+ F +NL I+ N E+ G+N DL
Sbjct: 166 NLETVNSFYLFMKEHGKEYSTADEMQQRYLSFAENLAKIKAHNSRENVLYRKGMNRFGDL 225
Query: 87 TREEMKSR 94
+ EE+K +
Sbjct: 226 SFEEIKKK 233
>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
Length = 368
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+ FEKF F K+Y T EE RF VF+ NL+ + ++ +A +G+ SDLT E
Sbjct: 50 RHFEKFKARFQKTYATPEEHDYRFNVFKANLRRAKR-HQLLDPSAVHGVTQFSDLTPAEF 108
Query: 92 -KSRLGLN 98
+ LGLN
Sbjct: 109 RRDYLGLN 116
>gi|326516056|dbj|BAJ88051.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 42.7 bits (99), Expect = 0.027, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H +F +F + KSY + EV +RF +F ++L+ + N+ + GIN SD++ E
Sbjct: 57 HALRFARFAVGYGKSYESAAEVRRRFRIFSESLEEVRSTNR-KGLPYRLGINRFSDMSWE 115
Query: 90 EMK-SRLG 96
E + +RLG
Sbjct: 116 EFQATRLG 123
>gi|47606578|gb|AAT36273.1| vivapain-3 [Plasmodium vivax]
gi|47606580|gb|AAT36274.1| vivapain-3 [Plasmodium vivax]
gi|47606582|gb|AAT36275.1| vivapain-3 [Plasmodium vivax]
Length = 495
Score = 42.7 bits (99), Expect = 0.027, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 37/68 (54%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N E + F F+++ K Y T +E+ +R+ F +NL I+ N E+ G+N DL
Sbjct: 166 NLETVNSFYLFMKEHGKEYSTADEMQQRYLSFAENLAKIKAHNSRENVLYRKGMNRFGDL 225
Query: 87 TREEMKSR 94
+ EE+K +
Sbjct: 226 SFEEIKKK 233
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 42.7 bits (99), Expect = 0.027, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+ + E+++ FS+ Y + E R VF+ NLK IE+ NK + + G+N +D T EE
Sbjct: 36 VDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEE 95
>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
Length = 397
Score = 42.7 bits (99), Expect = 0.027, Method: Composition-based stats.
Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 11/83 (13%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
G+ +N+ L E F+ F+ ++ K+Y T EE R +F NL + EH
Sbjct: 57 GRSSANHRLLGTTTE--VHFKSFVEEYEKTYSTHEEYVHRLGIFAKNL-----IKAAEHQ 109
Query: 75 ----TATYGINHLSDLTREEMKS 93
+A +G+ SDLT EE ++
Sbjct: 110 AMDPSAIHGVTQFSDLTEEEFEA 132
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 42.7 bits (99), Expect = 0.028, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+ + E+++ FS+ Y + E R VF+ NLK IE+ NK + + G+N +D T EE
Sbjct: 36 VDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEE 95
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 42.7 bits (99), Expect = 0.028, Method: Composition-based stats.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
Query: 16 QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
Q +S + L+ E + EK++ + K Y E+ KRF +F++N++ IE N
Sbjct: 100 QCRSKSRLEACTSE---RHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNVAGDKP 156
Query: 76 ATYGINHLSDLTREEMKSRL 95
IN DL EE K+ L
Sbjct: 157 FNIRINQFPDLHDEEFKALL 176
>gi|195484884|ref|XP_002090861.1| GE12567 [Drosophila yakuba]
gi|194176962|gb|EDW90573.1| GE12567 [Drosophila yakuba]
Length = 299
Score = 42.7 bits (99), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 3/64 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
+EKF+ DF Y E KR +F DN K I++ N + + G ++ G+N SDLT EE
Sbjct: 227 WEKFLVDFKVKYQDDTETEKRRNIFCDNWKAIQEHNVQFDLGVESFKKGVNQWSDLTVEE 286
Query: 91 MKSR 94
K++
Sbjct: 287 WKNK 290
Score = 40.0 bits (92), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 3/64 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
++KF+ DF Y E+ +R VF N + + D N K + G ++ GIN SDLT EE
Sbjct: 71 WKKFLVDFDVHYDNYSELQRRRKVFCGNWQKVSDHNLKYDSGVVSFRKGINQFSDLTFEE 130
Query: 91 MKSR 94
K +
Sbjct: 131 WKEK 134
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 42.7 bits (99), Expect = 0.028, Method: Composition-based stats.
Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 1/82 (1%)
Query: 18 KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
++ + + P +K++ +FS+ Y + E R VF +NLK IE+ N +
Sbjct: 22 EATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYK 81
Query: 78 YGINHLSDLTREE-MKSRLGLN 98
G+N +D T+EE + + GL+
Sbjct: 82 LGVNKFTDWTKEEFLATHTGLS 103
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 42.7 bits (99), Expect = 0.029, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 42 SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
SK Y + EE KR+ VF+ NLK I + N+ +G+ G+N +D+ EE KS LGL
Sbjct: 56 SKIYVSPEEKVKRYEVFKQNLKHIVETNR-RNGSYWLGLNQFADVAHEEFKSTYLGL 111
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 42.7 bits (99), Expect = 0.029, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ EK++ + + Y E KRF VF++N+ IE N IN +DL EE
Sbjct: 35 ERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEF 94
Query: 92 KSRLGLNLSK 101
K+ L +N+ K
Sbjct: 95 KALL-INVQK 103
>gi|340375899|ref|XP_003386471.1| PREDICTED: probable cysteine proteinase A494-like [Amphimedon
queenslandica]
Length = 373
Score = 42.7 bits (99), Expect = 0.030, Method: Composition-based stats.
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 1/87 (1%)
Query: 7 AEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
AE+ LF + + +L + E F + + SKSY T E +R +V++ N L++
Sbjct: 6 AESISFLFIFLLCSFQLAVSSNEPQLSFTDWCKLHSKSYRTITEAKERESVYKSNADLVQ 65
Query: 67 DLN-KGEHGTATYGINHLSDLTREEMK 92
LN + T+ +NH +DL+ EE K
Sbjct: 66 QLNNEYRERNVTFSLNHFADLSIEEFK 92
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 42.7 bits (99), Expect = 0.030, Method: Composition-based stats.
Identities = 33/98 (33%), Positives = 57/98 (58%), Gaps = 11/98 (11%)
Query: 9 ATLALFGQMK----SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
A+LA+ G S+ +LK+ + + ++ FE ++ K Y + EE RF +F+DNLK
Sbjct: 19 ASLAVAGDFSIVGYSSEDLKSMD-KLIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKH 77
Query: 65 IEDLNKGEHGTATY--GINHLSDLTREEMKSR-LGLNL 99
I++ NK + Y G+N +DL+ +E K++ LGL +
Sbjct: 78 IDERNK---VVSNYWLGLNEFADLSHQEFKNKYLGLKV 112
>gi|54300680|gb|AAV32963.1| cysteine proteinase inhibitor [Oncorhynchus mykiss]
Length = 131
Score = 42.7 bits (99), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 39/63 (61%), Gaps = 3/63 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTR 88
K+F+ + + K+YP+ EE AKR ++ K++ + NK E+G +Y +NH +DLT
Sbjct: 61 KEFQTWKVKYGKTYPSPEEEAKRKEIWLATRKMVTEHNKRAENGLESYTLAVNHFADLTT 120
Query: 89 EEM 91
+E+
Sbjct: 121 QEV 123
>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
Length = 362
Score = 42.7 bits (99), Expect = 0.030, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H +F +F + KSY + EV +RF +F ++L+ + N+ + GIN SD++ E
Sbjct: 57 HALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNR-KGLPYRLGINRFSDMSWE 115
Query: 90 EMK-SRLG 96
E + +RLG
Sbjct: 116 EFQATRLG 123
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 42.7 bits (99), Expect = 0.031, Method: Composition-based stats.
Identities = 27/69 (39%), Positives = 36/69 (52%), Gaps = 4/69 (5%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGINHLSDLTR 88
H F +F R + K Y + EE+ RFA F NL LI N KG + G+N +D +
Sbjct: 58 HALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGL--SYRLGLNKFADWSW 115
Query: 89 EEM-KSRLG 96
EE + RLG
Sbjct: 116 EEFQRHRLG 124
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 42.7 bits (99), Expect = 0.031, Method: Composition-based stats.
Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
L L G + + EL ++ + + E ++ + + Y E A++F VF+ N + I N
Sbjct: 15 LCLCGSVLAAREL-NDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEFINSFNA 73
Query: 71 GEHGTATYGINHLSDLTREEMKS 93
G H GIN +D+T EE K+
Sbjct: 74 GNH-KFWLGINQFADITNEEFKA 95
>gi|1019670|gb|AAA79289.1| rangelipain, partial [Trypanosoma rangeli]
Length = 265
Score = 42.4 bits (98), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
QF F + K Y + E R VF++NL L L+ + A++G+ SDLTREE
Sbjct: 36 SQFAAFKQRHGKVYGSAAEETFRLGVFKENL-LFARLHAAANPHASFGVTPFSDLTREEF 94
Query: 92 KSR 94
+SR
Sbjct: 95 RSR 97
>gi|302143414|emb|CBI21975.3| unnamed protein product [Vitis vinifera]
Length = 286
Score = 42.4 bits (98), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 33/59 (55%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
E ++ + + Y +E +KR+ +F+DN+ IE NK + IN +DLT EE ++
Sbjct: 40 EDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA 98
>gi|261289781|ref|XP_002611752.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
gi|229297124|gb|EEN67762.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
Length = 327
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 25/72 (34%), Positives = 43/72 (59%), Gaps = 6/72 (8%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI----EDLNKGEHGTATY 78
+ T +P ++E F + +++ Y +EE A+R +FEDNLK I E+ ++G H T
Sbjct: 12 MATASPLMNPEWEVFKKAYNRVYAAEEEYARRL-IFEDNLKTIQMHNEEADRGLH-TFRL 69
Query: 79 GINHLSDLTREE 90
G+N +D+T +E
Sbjct: 70 GVNQYADMTHKE 81
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 24/82 (29%), Positives = 44/82 (53%), Gaps = 2/82 (2%)
Query: 12 ALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG 71
AL G + + +L ++ + + E+++ + + Y E A+RF VF+ N+ LIE +N G
Sbjct: 20 ALSGSLAAR-DLADQDQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKANMALIESVNAG 78
Query: 72 EHGTATYGINHLSDLTREEMKS 93
H N +DLT +E ++
Sbjct: 79 NHKFWLEA-NRFADLTDDEFRA 99
>gi|403340410|gb|EJY69490.1| Cysteine protease [Oxytricha trifallax]
Length = 355
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
++ + + +F FI K+Y TKEE R +F+ N I+ N E+ +N +D
Sbjct: 40 QDQQVMLKFNDFISKHQKNYLTKEEYKARLGLFKQNFDYIQKSN-AENKDYVLDLNAFAD 98
Query: 86 LTREEMKSRLGL 97
++ EE RLG
Sbjct: 99 MSDEEYNKRLGF 110
>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
Length = 343
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+FI ++K Y + E RF +F N++ I N + +A Y IN +D+T+ E+
Sbjct: 45 FEQFISQYNKQYKNEAEKRHRFNIFMHNIEEINQKNS-RNDSAVYKINRFADMTKNEVVI 103
Query: 94 R 94
R
Sbjct: 104 R 104
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ EK++ + + Y E KRF VF++N+ IE N IN +DL EE
Sbjct: 35 ERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEF 94
Query: 92 KSRLGLNLSK 101
K+ L +N+ K
Sbjct: 95 KALL-INVQK 103
>gi|328722454|ref|XP_001951172.2| PREDICTED: counting factor associated protein D-like [Acyrthosiphon
pisum]
Length = 558
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVA-KRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
FE+F R +K+YP + R +F NL+ I N+ G T +NHL+D + E+K
Sbjct: 252 FEEFKRKHNKNYPNDTIIHFDRKNIFRQNLRYIRSKNRANVGY-TLAVNHLADYSSTELK 310
Query: 93 SRLG 96
S LG
Sbjct: 311 SMLG 314
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E + ++ ++ K+Y E KRF +F+DNLK I++ N ++ T G+N +DLT
Sbjct: 41 EVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHN-AQNRTYKVGLNRFADLTN 99
Query: 89 EEMKS 93
EE ++
Sbjct: 100 EEYRA 104
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 38/69 (55%), Gaps = 4/69 (5%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTRE 89
K ++++ + +SY E+ KRF +F +NL+ IE N G +Y +N SDLT E
Sbjct: 36 KTHQQWMLQYGRSYTNDAEMEKRFKIFMENLEYIEKFNNAP-GNKSYKLDLNQFSDLTNE 94
Query: 90 E-MKSRLGL 97
E + S GL
Sbjct: 95 EFIASHTGL 103
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 42.4 bits (98), Expect = 0.033, Method: Composition-based stats.
Identities = 18/55 (32%), Positives = 32/55 (58%)
Query: 36 KFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+++ ++K Y +E KRF +F++N+ IE N ++ + IN +DLT EE
Sbjct: 41 QWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLDINQFADLTNEE 95
>gi|403342666|gb|EJY70658.1| Cysteine protease [Oxytricha trifallax]
Length = 367
Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats.
Identities = 25/90 (27%), Positives = 43/90 (47%), Gaps = 1/90 (1%)
Query: 10 TLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
T ++ G +N +LK + P F F+ +S+ T+EE R A+F D + ++ N
Sbjct: 34 TESVSGNAATNLKLKVD-PSIQTAFNNFVSRHQRSFLTQEEYKARLAIFRDTFEAVQLHN 92
Query: 70 KGEHGTATYGINHLSDLTREEMKSRLGLNL 99
E + IN SD++++E L L
Sbjct: 93 SLESKSYKLAINKFSDMSKDEFSKFSSLQL 122
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 3/73 (4%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK--GEHGTATYGI 80
L+ PE +E ++ + ++Y E +RF VF DNL+ ++ N+ EHG G+
Sbjct: 41 LERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF-RLGM 99
Query: 81 NHLSDLTREEMKS 93
N +DLT +E ++
Sbjct: 100 NQFADLTNDEFRA 112
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 3/73 (4%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK--GEHGTATYGI 80
L+ PE +E ++ + ++Y E +RF VF DNL+ ++ N+ EHG G+
Sbjct: 38 LERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF-RLGM 96
Query: 81 NHLSDLTREEMKS 93
N +DLT +E ++
Sbjct: 97 NQFADLTNDEFRA 109
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats.
Identities = 25/61 (40%), Positives = 35/61 (57%), Gaps = 4/61 (6%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTRE 89
Q+E F K Y KEE A+R +F+DNLK IE N+ + G +Y G+N +D+T
Sbjct: 23 QWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTIESHNQEADTGKHSYWLGVNQFADMTHA 81
Query: 90 E 90
E
Sbjct: 82 E 82
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 42.4 bits (98), Expect = 0.035, Method: Composition-based stats.
Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 11/87 (12%)
Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI---EDLNKG 71
GQ S++ L + H F F F KSY ++EE RF+VF+ NL+ ++L+
Sbjct: 37 GQDASSSNLLSAEQHH---FSLFKSKFKKSYGSQEEHDYRFSVFKANLRRAARHQELDP- 92
Query: 72 EHGTATYGINHLSDLTREEMKSR-LGL 97
TA++G+ SDLT E + + LGL
Sbjct: 93 ---TASHGVTQFSDLTPAEFRKQVLGL 116
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 42.4 bits (98), Expect = 0.036, Method: Composition-based stats.
Identities = 21/75 (28%), Positives = 40/75 (53%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ E++ E +E+++ + K+Y E +RF +F+DNLK +++ N T
Sbjct: 29 TETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEV 88
Query: 79 GINHLSDLTREEMKS 93
G+ +DLT EE ++
Sbjct: 89 GLTRFADLTNEEFRA 103
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 42.4 bits (98), Expect = 0.036, Method: Composition-based stats.
Identities = 21/75 (28%), Positives = 40/75 (53%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
+ E++ E +E+++ + K+Y E +RF +F+DNLK +++ N T
Sbjct: 29 TETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEV 88
Query: 79 GINHLSDLTREEMKS 93
G+ +DLT EE ++
Sbjct: 89 GLTRFADLTNEEFRA 103
>gi|58201366|gb|AAW66804.1| cysteine protease [Pinus taeda]
gi|58201368|gb|AAW66805.1| cysteine protease [Pinus taeda]
gi|58201392|gb|AAW66817.1| cysteine protease [Pinus taeda]
gi|58201394|gb|AAW66818.1| cysteine protease [Pinus taeda]
gi|58201398|gb|AAW66820.1| cysteine protease [Pinus taeda]
Length = 193
Score = 42.4 bits (98), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
SN +L+ E+ ++ +E ++ + K+Y +E KRF VF+DN I + N+G +
Sbjct: 28 SNKDLR-EDDAIMELYELWVAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHNQGNR-SYKL 85
Query: 79 GINHLSDLTREEMKS 93
G+N +DL+ EE K+
Sbjct: 86 GLNQFADLSHEEFKA 100
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 42.4 bits (98), Expect = 0.036, Method: Composition-based stats.
Identities = 19/63 (30%), Positives = 35/63 (55%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+ +F F ++++Y + EE +RF V+ N+ IE +N+ T G N +DLT +E
Sbjct: 37 MDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQE 96
Query: 91 MKS 93
++
Sbjct: 97 FRA 99
>gi|118364806|ref|XP_001015624.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297391|gb|EAR95379.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 375
Score = 42.4 bits (98), Expect = 0.036, Method: Composition-based stats.
Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 34 FEKFIRDFSKSYPTKE-EVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
FE++I DF K Y E +R FE NL I N +H + G+N +DLT +E +
Sbjct: 29 FEQYIVDFEKEYEVDSVEYNQRKQTFEKNLVEIIAFNNKDH-SYKKGVNRNTDLTTKEFQ 87
Query: 93 SRLGLNLS 100
+LGL S
Sbjct: 88 VQLGLKKS 95
>gi|356570072|ref|XP_003553215.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 3-like,
partial [Glycine max]
Length = 301
Score = 42.4 bits (98), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 34/68 (50%), Gaps = 2/68 (2%)
Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
H F F K Y + +E+ F +F DNLKLI N+ T G+NH +D T E
Sbjct: 29 HALSFACFACHHDKRYHSIDEIRNGFQIFSDNLKLIRSTNR-RSLTYMLGVNHFADWTWE 87
Query: 90 EM-KSRLG 96
E + +LG
Sbjct: 88 EFTRHKLG 95
>gi|48374352|gb|AAT09103.1| digestive cysteine proteinase [Bigelowiella natans]
Length = 360
Score = 42.4 bits (98), Expect = 0.037, Method: Composition-based stats.
Identities = 20/65 (30%), Positives = 38/65 (58%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
+ +FE + ++F KSY + K F +N ++I+ LN+ E G+A YG SD++ E+
Sbjct: 21 IPKFEAWKKEFGKSYEEAGKEDKARLNFVENERIIQGLNENELGSAVYGHTRFSDMSPEQ 80
Query: 91 MKSRL 95
++ +
Sbjct: 81 FRAMM 85
>gi|390344145|ref|XP_798313.2| PREDICTED: cathepsin O-like [Strongylocentrotus purpuratus]
Length = 361
Score = 42.4 bits (98), Expect = 0.037, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 34 FEKFIRDFSKSYPT-KEEVAKRFAVFEDNLKLIEDLNK--GEHGTATYGINHLSDLTREE 90
F+ FI+ F+K+Y +E KR+ +F+++L E LN ATYGI SDLT EE
Sbjct: 54 FQIFIQKFNKTYTRGSQEYFKRYRIFKESLLKHEMLNAIATHRDHATYGITKFSDLTSEE 113
Query: 91 MK 92
+
Sbjct: 114 FQ 115
>gi|326428462|gb|EGD74032.1| hypothetical protein PTSG_05727 [Salpingoeca sp. ATCC 50818]
Length = 398
Score = 42.4 bits (98), Expect = 0.037, Method: Composition-based stats.
Identities = 25/65 (38%), Positives = 32/65 (49%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE F ++ K Y + EE R VFE L ++ N T GINH+SD T E K
Sbjct: 57 FEHFKAEYGKRYLSSEEHDFRRQVFERTLASVKAHNSDPTKTWKQGINHMSDWTDGEFKR 116
Query: 94 RLGLN 98
LG +
Sbjct: 117 LLGYD 121
>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 42.4 bits (98), Expect = 0.037, Method: Composition-based stats.
Identities = 32/90 (35%), Positives = 44/90 (48%), Gaps = 11/90 (12%)
Query: 5 ASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
+S++ L Q+ S E N EH F F F K+Y T+EE RF+VF+ NL
Sbjct: 24 SSSDLDDPLIRQVVSEGEDHLLNAEH--HFTTFKSKFGKNYATQEEHDYRFSVFKANL-- 79
Query: 65 IEDLNKGEHG----TATYGINHLSDLTREE 90
L +H TA +G+ SDLT +E
Sbjct: 80 ---LRAKKHQIMDPTAAHGVTKFSDLTPKE 106
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 42.4 bits (98), Expect = 0.037, Method: Composition-based stats.
Identities = 19/59 (32%), Positives = 32/59 (54%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ + K Y +E KRF +F++N+ IE N + IN +DLT EE
Sbjct: 55 ERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEE 113
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella
moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella
moellendorffii]
Length = 300
Score = 42.4 bits (98), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 31/60 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE + KSY + E A+R +F D L IE N + T T G+N SDLT E ++
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella
moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella
moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella
moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella
moellendorffii]
Length = 300
Score = 42.4 bits (98), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 31/60 (51%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE + KSY + E A+R +F D L IE N + T T G+N SDLT E ++
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
Length = 377
Score = 42.4 bits (98), Expect = 0.038, Method: Composition-based stats.
Identities = 29/85 (34%), Positives = 44/85 (51%), Gaps = 3/85 (3%)
Query: 14 FGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH 73
G ++ + E +H F F R F KSY ++EE RF VF+ NL+ + +
Sbjct: 43 LGDVEGSEEENLLTADH-HHFSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLD- 100
Query: 74 GTATYGINHLSDLTREEMK-SRLGL 97
+AT+G+ SDLT E + + LGL
Sbjct: 101 PSATHGVTQFSDLTPAEFRGTYLGL 125
>gi|440293210|gb|ELP86353.1| cysteine protease, putative [Entamoeba invadens IP1]
Length = 453
Score = 42.4 bits (98), Expect = 0.038, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 2/59 (3%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHLSDLTREE 90
F + F KSY T+ E +R A+F D + I + N + A G+N+LSDLT +E
Sbjct: 36 FASYKMLFQKSYNTQSEELRRLAIFADKSRFIAEFNTQRKSSNDALLGLNNLSDLTTDE 94
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 42.4 bits (98), Expect = 0.038, Method: Composition-based stats.
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGI 80
L+ E ++ ++ + +SY E +RF VF DNLK ++ N EHG G+
Sbjct: 38 LERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGM 97
Query: 81 NHLSDLTREEMKS 93
N +DLT +E ++
Sbjct: 98 NRFADLTNDEFRA 110
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 42.4 bits (98), Expect = 0.039, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 42.4 bits (98), Expect = 0.039, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.4 bits (98), Expect = 0.039, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.4 bits (98), Expect = 0.040, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|113120273|gb|ABI30276.1| VXH-C [Vasconcellea x heilbornii]
Length = 282
Score = 42.4 bits (98), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ FE ++ K Y + EE RF +F+DNL I++ NK ++ + G+N +DLT +E
Sbjct: 45 IRLFESWMLKHDKVYKSMEEKINRFEIFKDNLMYIDETNK-KNNSYWLGLNEFADLTHDE 103
Query: 91 MKSR 94
K +
Sbjct: 104 FKKK 107
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.4 bits (98), Expect = 0.040, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.4 bits (98), Expect = 0.040, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 98 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 156
Query: 94 R 94
R
Sbjct: 157 R 157
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 128 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 186
Query: 94 R 94
R
Sbjct: 187 R 187
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96
Query: 94 R 94
R
Sbjct: 97 R 97
>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
Length = 443
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96
Query: 94 R 94
R
Sbjct: 97 R 97
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96
Query: 94 R 94
R
Sbjct: 97 R 97
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96
Query: 94 R 94
R
Sbjct: 97 R 97
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
FE+F R + ++Y T E +R A FE NL+L+ + ++ + A +GI DL+ E +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96
Query: 94 R 94
R
Sbjct: 97 R 97
>gi|383852175|ref|XP_003701604.1| PREDICTED: cathepsin O-like [Megachile rotundata]
Length = 370
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 43/71 (60%), Gaps = 7/71 (9%)
Query: 29 EHLKQFEKFIRDFSKSY---PTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHL 83
E +K F+ ++ ++K+Y PT+ E +RF F+ +L+ IE +N + +A YG+
Sbjct: 47 EDIKLFKNYVTRYNKTYRNDPTEYE--ERFQRFQRSLRHIETMNSLRSSPESAFYGLTEF 104
Query: 84 SDLTREEMKSR 94
SD+T +E +S+
Sbjct: 105 SDMTEDEFRSQ 115
>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
Length = 309
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F+R + Y EE A+R VF NL ++ TA +G+ SDLTREE +
Sbjct: 47 QFAAFVRRHGREYSGPEEYARRLRVFAANLAR-AAAHQALDPTARHGVTPFSDLTREEFE 105
Query: 93 SRL 95
+RL
Sbjct: 106 ARL 108
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 42.0 bits (97), Expect = 0.041, Method: Composition-based stats.
Identities = 19/59 (32%), Positives = 34/59 (57%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ + K Y +E KRF +F++N+K IE N + G+N +DLT +E
Sbjct: 37 ERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYIEASNNAGNKPYKLGVNQFTDLTNKE 95
>gi|213514356|ref|NP_001134251.1| Cathepsin M precursor [Salmo salar]
gi|38423489|emb|CAD80246.1| cystein proteinase inhibitor protein [Salmo salar]
gi|209731860|gb|ACI66799.1| Cathepsin M precursor [Salmo salar]
Length = 342
Score = 42.0 bits (97), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 3/67 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHG--TATYGINHLSDLTR 88
K+FE + + K+YP+ E AKR ++ K++ + NK E+G + T G+NH +DLT
Sbjct: 272 KEFETWKVKYGKTYPSTVEEAKRKEIWLATRKMVMEHNKRAENGLESFTMGVNHFADLTA 331
Query: 89 EEMKSRL 95
EE+ L
Sbjct: 332 EEVPRGL 338
Score = 40.8 bits (94), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 3/67 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVF-EDNLKLIEDLNKGEHGTATY--GINHLSDLTR 88
K+FE + + KSYP+ EE AKR ++ K++E + +G +Y +NHL+DLT
Sbjct: 32 KEFETWKVKYGKSYPSTEEEAKRKEMWLATRKKVMEHNTRAGNGLESYTMAVNHLADLTT 91
Query: 89 EEMKSRL 95
EE+ L
Sbjct: 92 EEVPKGL 98
Score = 35.4 bits (80), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 37/66 (56%), Gaps = 3/66 (4%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVF-EDNLKLIEDLNKGEHGTATY--GINHLSDLTR 88
K+FE + K+Y + EE AKR ++ +++E + E G+ ++ G+NHLSD T
Sbjct: 195 KEFETWKVQHGKNYGSTEEEAKRKGIWLATRTRVMEHNKRAETGSESFTMGMNHLSDKTT 254
Query: 89 EEMKSR 94
E+ R
Sbjct: 255 AEVTGR 260
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 42.0 bits (97), Expect = 0.042, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 32/60 (53%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+FE + R F KSY E R AV+E N L++ N + T G+N +DLT EE K
Sbjct: 29 EFEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFK 88
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 42.0 bits (97), Expect = 0.042, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
N EH F F F K+Y T+EE RF +F++NL + K + +A +G+ SDL
Sbjct: 46 NAEH--HFSAFKTKFGKTYATQEEHDHRFRIFKNNLLRAKSHQKLD-PSAVHGVTRFSDL 102
Query: 87 TREEMKSR-LGL 97
T E + + LGL
Sbjct: 103 TPAEFRRQFLGL 114
>gi|145520919|ref|XP_001446315.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413792|emb|CAK78918.1| unnamed protein product [Paramecium tetraurelia]
Length = 317
Score = 42.0 bits (97), Expect = 0.042, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 4/90 (4%)
Query: 1 MAEDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
M++ A T+AL G + N+ ++ +++ +FE F + F K Y + EE A R AV+
Sbjct: 1 MSKTILALGTIALIGALLMANQ--PQSVDYVSKFEAFKQRFGKRYGSTEE-AYRLAVYTQ 57
Query: 61 NLKLIEDLNKGEHGTATYGINHLSDLTREE 90
NL E N + G +G DLT+EE
Sbjct: 58 NLLFAEAHNL-QKGKRVFGETIFFDLTQEE 86
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.042, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 42.0 bits (97), Expect = 0.042, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
Length = 709
Score = 42.0 bits (97), Expect = 0.042, Method: Composition-based stats.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
QF F+R + Y EE A+R VF NL ++ TA +G+ SDLTREE +
Sbjct: 47 QFAAFVRRHGREYSGPEEYARRLRVFAANLAR-AAAHQALDPTARHGVTPFSDLTREEFE 105
Query: 93 SRL 95
+RL
Sbjct: 106 ARL 108
>gi|443732032|gb|ELU16924.1| hypothetical protein CAPTEDRAFT_222012 [Capitella teleta]
Length = 342
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 3/81 (3%)
Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKE-EVAKRFAVFEDNLKLIEDLN--KGEHGTATYG 79
L+ N E F KF + K+Y E R +F DN K LN + + +A YG
Sbjct: 21 LRVSNEEIDDLFVKFTEKYHKTYLIGSLEYMHRRGIFRDNFKKHVALNSLRTNNASAWYG 80
Query: 80 INHLSDLTREEMKSRLGLNLS 100
+ SDLT+EE +R N +
Sbjct: 81 VTQFSDLTQEEFTNRFLSNFT 101
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
Length = 443
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 20/61 (32%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F +++Y + E KRF +F N+K +LN+ ++ AT+G N +D++ EE ++
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNR-KNPMATFGPNEFADMSSEEFQT 83
Query: 94 R 94
R
Sbjct: 84 R 84
>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
Length = 443
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 20/61 (32%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
F F +++Y + E KRF +F N+K +LN+ ++ AT+G N +D++ EE ++
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNR-KNPMATFGPNEFADMSSEEFQT 83
Query: 94 R 94
R
Sbjct: 84 R 84
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 23/87 (26%), Positives = 46/87 (52%), Gaps = 3/87 (3%)
Query: 9 ATLALFGQMKSNNELKT--ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
A L + G S + +T + P + ++ E+++ + + Y E A R+++F++N+ I+
Sbjct: 13 ALLFILGAWPSKSTARTLLDAPMY-ERHEQWMTQYGRVYKDDNERATRYSIFKENVARID 71
Query: 67 DLNKGEHGTATYGINHLSDLTREEMKS 93
N + G+N +DLT EE K+
Sbjct: 72 AFNSQTGKSYKLGVNQFADLTNEEFKA 98
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 24/66 (36%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 29 EHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
E +K+ +E ++ K Y E KRF +F+DNLK I++ N H T G+ +DLT
Sbjct: 39 EEVKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSENH-TYKMGLTPYTDLT 97
Query: 88 REEMKS 93
EE ++
Sbjct: 98 NEEFQA 103
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
Length = 365
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM- 91
F F R F KSY T+E+ RF+VF+ NL+ + + +A +G+ SDLT E
Sbjct: 49 HFRLFKRRFGKSYATQEDHDYRFSVFKTNLRRARHHQRLD-PSAVHGVTQFSDLTPAEFR 107
Query: 92 KSRLGL 97
++ LGL
Sbjct: 108 RNHLGL 113
>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
Length = 440
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F + +S+SY E A RF VF+ N++ ++ + AT+G+ SD++ EE
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97
Query: 92 KS 93
++
Sbjct: 98 RA 99
>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
Length = 444
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F + +S+SY E A RF VF+ N++ ++ + AT+G+ SD++ EE
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97
Query: 92 KS 93
++
Sbjct: 98 RA 99
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F + +S+SY E A RF VF+ N++ ++ + AT+G+ SD++ EE
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97
Query: 92 KS 93
++
Sbjct: 98 RA 99
>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F + +S+SY E A RF VF+ N++ ++ + AT+G+ SD++ EE
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97
Query: 92 KS 93
++
Sbjct: 98 RA 99
>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F + +S+SY E A RF VF+ N++ ++ + AT+G+ SD++ EE
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97
Query: 92 KS 93
++
Sbjct: 98 RA 99
>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F + +S+SY E A RF VF+ N++ ++ + AT+G+ SD++ EE
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97
Query: 92 KS 93
++
Sbjct: 98 RA 99
>gi|194352770|emb|CAQ00113.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 310
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 21/52 (40%), Positives = 29/52 (55%)
Query: 43 KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
KSYP +E +RF V+ N++ IE N+ T G N +DLT EE +R
Sbjct: 4 KSYPAVDEELRRFEVYRRNVERIEATNRDGGRGYTLGENQFTDLTSEEFLAR 55
>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
Length = 442
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
+QF F + +S+SY E A RF VF+ N++ ++ + AT+G+ SD++ EE
Sbjct: 34 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 92
Query: 92 KS 93
++
Sbjct: 93 RA 94
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
Length = 377
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 3/85 (3%)
Query: 14 FGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH 73
G ++ E +H F F R F KSY ++EE RF VF+ NL+ + +
Sbjct: 43 LGDVEGGEEENLLTADH-HHFSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLD- 100
Query: 74 GTATYGINHLSDLTREEMK-SRLGL 97
+AT+G+ SDLT E + + LGL
Sbjct: 101 PSATHGVTQFSDLTPAEFRGTYLGL 125
>gi|326515420|dbj|BAK03623.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326522532|dbj|BAK07728.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 205
Score = 42.0 bits (97), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 35/66 (53%), Gaps = 7/66 (10%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI---EDLNKGEHGTATYGINHLSDLTRE 89
QF F+R K Y EE A+R VF N+ + L+ G A +G+ SDLTRE
Sbjct: 49 QFAAFVRRHGKEYSGPEEYARRLRVFAANVARAAAHQALDPG----ARHGVTPFSDLTRE 104
Query: 90 EMKSRL 95
E ++RL
Sbjct: 105 EFEARL 110
>gi|195583147|ref|XP_002081385.1| GD10988 [Drosophila simulans]
gi|194193394|gb|EDX06970.1| GD10988 [Drosophila simulans]
Length = 349
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 28/70 (40%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
++KF+ DF Y + E KR +F DN K I++ N + E G ++ GIN SDLT EE
Sbjct: 257 WKKFLIDFGAKYQDETETEKRRTIFCDNWKAIQEHNVQFELGVQSFKKGINQWSDLTVEE 316
Query: 91 MKSRLGLNLS 100
K++ NL+
Sbjct: 317 WKTKQRPNLA 326
Score = 39.3 bits (90), Expect = 0.28, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 42/76 (55%), Gaps = 5/76 (6%)
Query: 24 KTENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY-- 78
K EN + + Q +EKF+ DF +Y E KR VF DN K I N + + G ++
Sbjct: 150 KIENYDIICQAAWEKFLIDFKPTYQDDTETEKRRNVFCDNFKSIHKHNVQYDLGNISFKK 209
Query: 79 GINHLSDLTREEMKSR 94
GIN SDLT EE K++
Sbjct: 210 GINQWSDLTVEEWKNK 225
>gi|66475996|ref|XP_627814.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|32399065|emb|CAD98305.1| cryptopain precursor [Cryptosporidium parvum]
gi|46229218|gb|EAK90067.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|76160841|gb|ABA40395.1| cryptopain-1 [Cryptosporidium parvum]
Length = 401
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
E+ K FE+F + + K Y + EE +RF +++ N+ I+ N + + +N DL++
Sbjct: 81 EYRKSFEEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNS-QGFSYVLEMNEFGDLSK 139
Query: 89 EEMKSRL 95
EE +R
Sbjct: 140 EEFMARF 146
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 19/59 (32%), Positives = 32/59 (54%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
++ E+++ + K Y +E KRF +F++N+ IE N + IN +DLT EE
Sbjct: 37 ERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEE 95
>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa
decemlineata]
Length = 324
Score = 42.0 bits (97), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 3/66 (4%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--GINHLSDLTRE 89
+++ F +FSKSY E +RF +F NL IE+ N+ G +TY G+N +DLT E
Sbjct: 22 KWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGLSTYEMGVNKFADLTPE 81
Query: 90 EMKSRL 95
E R
Sbjct: 82 EFMERF 87
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 21/63 (33%), Positives = 38/63 (60%), Gaps = 1/63 (1%)
Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MKS 93
E+++ F + Y +E A+R VF N + ++ +N+ + T T G+N SDLT +E +++
Sbjct: 40 EEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQT 99
Query: 94 RLG 96
LG
Sbjct: 100 HLG 102
>gi|21430502|gb|AAM50929.1| LP08365p [Drosophila melanogaster]
Length = 432
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--GINHLSDLTREE 90
++KF+ DF Y ++E KR +F DN K I++ N+ E G ++ GIN SDLT EE
Sbjct: 347 WKKFLIDFGAKYQDEKETEKRRTIFCDNWKAIQEHNEQFELGVESFKKGINQWSDLTVEE 406
Query: 91 MKSRLGLNLS 100
K++ NL+
Sbjct: 407 WKTKQRPNLA 416
Score = 40.8 bits (94), Expect = 0.094, Method: Composition-based stats.
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 19 SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTAT 77
S +E+ +N +EKF+ DF SY E KR VF DN K I N + + G +
Sbjct: 237 STSEIDNDNIICQPAWEKFLIDFKPSYQDDTETEKRRNVFCDNFKSIHKHNVQFDLGNIS 296
Query: 78 Y--GINHLSDLTREEMKSR 94
+ GIN SDLT EE K++
Sbjct: 297 FKKGINQWSDLTVEEWKNK 315
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.046, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 42.0 bits (97), Expect = 0.046, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
++ E ++ + Y + E +RF +F++N+K IE +NK + + G+N +D+T +E
Sbjct: 37 ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96
Query: 92 KSRL-GLNL 99
++ GLN+
Sbjct: 97 LAKFTGLNI 105
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 42.0 bits (97), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 37/59 (62%), Gaps = 2/59 (3%)
Query: 42 SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGLNL 99
KSY + EE RF VF+DNLK I++ NK + + G+N +DL+ EE K + LGL +
Sbjct: 5 GKSYRSFEEKLHRFEVFQDNLKHIDETNK-KVSSYWLGLNEFADLSHEEFKRKYLGLKI 62
>gi|195334170|ref|XP_002033757.1| GM21494 [Drosophila sechellia]
gi|194125727|gb|EDW47770.1| GM21494 [Drosophila sechellia]
Length = 427
Score = 42.0 bits (97), Expect = 0.046, Method: Composition-based stats.
Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 4/80 (5%)
Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GI 80
K +NP ++KF+ DF Y + E KR +F DN K I++ N + E G ++ GI
Sbjct: 338 KDDNPCQ-AAWKKFLVDFGVKYQDETETEKRRTIFCDNWKAIQEHNVQFELGVESFKKGI 396
Query: 81 NHLSDLTREEMKSRLGLNLS 100
N SDLT EE K++ NL+
Sbjct: 397 NQWSDLTVEEWKTKQRPNLA 416
Score = 38.9 bits (89), Expect = 0.38, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
+EKF+ DF +Y E KR VF DN K I N + + G ++ GIN SDLT EE
Sbjct: 252 WEKFLIDFKPTYQDHTETEKRRNVFCDNFKSIHKHNVEFDLGNISFKKGINQWSDLTVEE 311
Query: 91 MKSR 94
K++
Sbjct: 312 WKNK 315
>gi|261328619|emb|CBH11597.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
gambiense DAL972]
Length = 201
Score = 42.0 bits (97), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
+F F + + K Y +E A RF FE+N++ + + + AT+G+ SD+TREE +
Sbjct: 40 RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98
Query: 93 SR 94
+R
Sbjct: 99 AR 100
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
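
The nine per-volume figures listed above can be checked against the totals this report gives in its statistics ("Number of Sequences" and "length of database"). A minimal Python sketch follows; the counts are copied by hand from the listing, and the variable names are illustrative only, not part of BLAST.

```python
# (letters, sequences) for each database volume, copied from the listing above.
volumes = [
    (999_999_864, 2_912_245),  # nr (first volume)
    (999_999_666, 2_912_720),  # nr.01
    (999_999_938, 3_014_250),  # nr.02
    (999_999_780, 2_805_020),  # nr.03
    (999_999_551, 2_816_253),  # nr.04
    (999_999_897, 2_981_387),  # nr.05
    (999_999_649, 2_911_476),  # nr.06
    (999_999_452, 2_920_260),  # nr.07
    (64_230_274, 189_558),     # nr.08
]

# Sum letters and sequences over all volumes.
total_letters = sum(letters for letters, _ in volumes)
total_sequences = sum(seqs for _, seqs in volumes)

print(f"{total_letters:,} letters in {total_sequences:,} sequences")
```

Both sums agree with the database size reported for this search (8,064,228,071 letters in 23,463,169 sequences).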
Lambda K H
0.311 0.128 0.347
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,444,504,571
Number of Sequences: 23463169
Number of extensions: 51466450
Number of successful extensions: 157379
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 895
Number of HSP's successfully gapped in prelim test: 1195
Number of HSP's that attempted gapping in prelim test: 155848
Number of HSP's gapped (non-prelim): 2141
length of query: 102
length of database: 8,064,228,071
effective HSP length: 71
effective length of query: 31
effective length of database: 6,398,343,072
effective search space: 198348635232
effective search space used: 198348635232
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 69 (31.2 bits)
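
The search-space and score figures in the statistics above can be reproduced with a few lines of arithmetic. The Python sketch below assumes the standard Karlin-Altschul relations (bit score S' = (lambda * S - ln K) / ln 2, and E = m * n * 2^-S' over the effective search space) and assumes that the second Lambda/K pair in the statistics (0.267, 0.0410) is the gapped one that applies to the reported alignments. Function and variable names are illustrative, not part of BLAST.

```python
import math

# Values copied from the statistics block of this report.
query_len = 102
db_len = 8_064_228_071
num_seqs = 23_463_169
hsp_len = 71  # "effective HSP length"

# Effective lengths: the HSP length correction is subtracted from the query
# once and from the database once per sequence.
eff_query = query_len - hsp_len
eff_db = db_len - num_seqs * hsp_len
search_space = eff_query * eff_db

# Assumed gapped Karlin-Altschul parameters (second Lambda/K pair above).
LAMBDA, K = 0.267, 0.0410

def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)

def expect(raw_score):
    """Unadjusted expected number of chance hits with at least this score."""
    return search_space * 2.0 ** (-bit_score(raw_score))

print(eff_query, eff_db, search_space)
print(round(bit_score(97), 1), round(bit_score(69), 1))
print(f"{expect(97):.2g}")
```

The effective query length, database length, and search space match the lines above, and raw scores 97 and 69 convert to 42.0 and 31.2 bits as reported. The unadjusted E-value for a raw score of 97 comes out between 0.04 and 0.05, in line with the 0.043 to 0.046 shown for these hits; the small spread between hits with the same score presumably reflects the per-subject compositional adjustment, which this sketch does not model.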