BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy2558
(348 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 176/366 (48%), Positives = 222/366 (60%), Gaps = 24/366 (6%)
Query: 3 CFYFFAGVALLSLTVSVSSFMVVGD------EKLHHLHHVKHTALFNYFLEQH------- 49
C + G L LTV S++ D + +H + + H+A EQ
Sbjct: 442 CPFQEEGRMLCQLTVWERSWLKKIDLTSSKCDPIHTVMDISHSAELLGVDEQDKDYIKFK 501
Query: 50 ------NKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKY 103
++Y T E R IF N++K LQ TE G+ YG+ FSD+S+ EF+ Y
Sbjct: 502 FFTKKFQRSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHY 561
Query: 104 LGFKLK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
LG K + P + A IPNITLP +DWR Y+AVT VK+Q MCGS WAFS TGNIEG
Sbjct: 562 LGLKKRTPDIKFKQEMAQIPNITLPEEYDWRNYNAVTPVKNQGMCGSCWAFSVTGNIEGQ 621
Query: 163 YAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
YA KT LVSLSEQEL+DCD+ DDGCEGG A+ I GGLE E YPY G D
Sbjct: 622 YAIKTGNLVSLSEQELVDCDKYDDGCEGGLFETAYHAIEEL--GGLELESDYPYSGRDNT 679
Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
C N +V I V++S DETDMAK+LV NGP+++ INA A+QFY+ GVSHP++F CD
Sbjct: 680 CHFNSSEVRVSITSSVNISNDETDMAKWLVANGPISIGINANAMQFYLGGVSHPLKFLCD 739
Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
+ L H VLIVGYG+ RT H+ +PYW+IKNSW WG KGY+ LYRGDGSCG+N +
Sbjct: 740 P--KTLDHGVLIVGYGIHRTWLLHRHLPYWLIKNSWSSYWGAKGYYMLYRGDGSCGVNQW 797
Query: 343 VRSALV 348
SA++
Sbjct: 798 PSSAVL 803
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 164/311 (52%), Positives = 220/311 (70%), Gaps = 9/311 (2%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ HNK Y +L E R IF+ N++K++LLQ+ E GS +YG +F+DL+ EF+
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339
Query: 102 KYLGFKLKPSYADRSVP-AMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
KYLG + + +++P A+IP + ++P FDWR ++ VT VK+Q CGS WAFS NI
Sbjct: 340 KYLGLDSSMT-SKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANI 398
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG- 218
EG YA K+K+L+SLSEQELIDCD D+GC GG ++ AF+ + + GGLE E YPY G
Sbjct: 399 EGQYALKSKELLSLSEQELIDCDNLDNGCGGGLMTQAFEAVENL--GGLETESDYPYEGH 456
Query: 219 -DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
D K C+L K +V I+ V+VS DE D+AK+LV++GP++V +NA A+QFY+ GVSHPI
Sbjct: 457 ADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPI 516
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
C ++L H V IVGYGV RTK+THK +PYW+IKNSWG GWGEKGY+ LYRGDGSC
Sbjct: 517 HALCSP--KSLDHGVAIVGYGVHRTKYTHKNLPYWLIKNSWGPGWGEKGYYLLYRGDGSC 574
Query: 338 GINDYVRSALV 348
G+N V SA++
Sbjct: 575 GVNQMVSSAII 585
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 334 bits (856), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 165/327 (50%), Positives = 223/327 (68%), Gaps = 11/327 (3%)
Query: 27 DEKLHHL-HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
+EK+ + ++ LF F+ +N+TYAT E RL IF NL I+LL+ E G+G
Sbjct: 711 NEKMLRIAEDMRSERLFENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQ 770
Query: 86 YGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGV 141
YG+N+F+D+ST EF A YLG L+P + ++P A IP+I LP +FDWR+ AVT V
Sbjct: 771 YGVNQFADVSTEEFHAFYLG--LRPDLRTENNIPLRQAEIPDIELPNSFDWRQKGAVTPV 828
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
K+Q MCGS WAFS TGN+EG YA K KL+SLSEQEL+DCD D+GC GG NA+ I
Sbjct: 829 KNQGMCGSCWAFSVTGNVEGQYAIKHNKLLSLSEQELVDCDDLDEGCNGGLPDNAYRAI- 887
Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
KL GGLE E YPY +++ C K +V++ V+++ +ET +A++LV NGP+++ I
Sbjct: 888 EKL-GGLELESDYPYEAENERCHFKKNMAKVQVGSAVNITSNETQIAQWLVANGPISIGI 946
Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
NA A+QFY+ GVSHP +F C+ +NL H VLIVGYG HK +PYWI+KNSWG+
Sbjct: 947 NANAMQFYMGGVSHPFKFLCNP--KNLDHGVLIVGYGTSNYPLFHKKLPYWIVKNSWGDR 1004
Query: 322 WGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+R+YRGDG+CG+N SA+V
Sbjct: 1005 WGEQGYYRVYRGDGTCGLNTMASSAVV 1031
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 334 bits (856), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 165/323 (51%), Positives = 217/323 (67%), Gaps = 10/323 (3%)
Query: 30 LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
L V+ LFN F+ +N+TY+T E RL IF NL IQLL+ TE G+ Y +N
Sbjct: 570 LQIAEDVRSEQLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERGTAHYDVN 629
Query: 90 EFSDLSTAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQT 145
F+D+S EF+++YLG L+P ++ +P A IP++ LP FDWRE VT VKDQ
Sbjct: 630 MFADMSPEEFRSRYLG--LRPDLRSENDIPLREAEIPDVELPPKFDWREKSVVTPVKDQG 687
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
MCGS WAFS TGNIEG YA K +L+SLSEQEL+DCD D+GC GG NA+ I KL
Sbjct: 688 MCGSCWAFSVTGNIEGQYAIKHGRLLSLSEQELVDCDDLDEGCNGGLPDNAYRAI-EKL- 745
Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
GGLE E YPY +++ C K +V++ V+++ +ET MA++LV+NGP+++ INA A
Sbjct: 746 GGLELESDYPYEAENEKCHFKKNLAKVQLASAVNITSNETQMAQWLVQNGPISIGINANA 805
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
+QFYV GVSHP +F C+ +NL H VLIVGYG HK +PYW IKNSWG+ WGE+
Sbjct: 806 MQFYVGGVSHPFKFLCNP--KNLDHGVLIVGYGTSDYPLFHKKLPYWTIKNSWGKRWGEQ 863
Query: 326 GYFRLYRGDGSCGINDYVRSALV 348
GY+R+YRGDG+CG+N SA+V
Sbjct: 864 GYYRVYRGDGTCGLNTLATSAVV 886
>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
Length = 434
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 159/308 (51%), Positives = 211/308 (68%), Gaps = 9/308 (2%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ + NK Y + E+ R IF N++KI L E G+ YG+ EFSDLS EF+
Sbjct: 134 FKDFVLKFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQYGITEFSDLSVTEFK- 192
Query: 102 KYLGFKLKPSYADRSVP-AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YLG K KP + +P A IP++ LP FDWR Y+AVT VK+Q CGS WAFS TGNIE
Sbjct: 193 NYLGLKKKP---ESKLPTAEIPDVKLPDNFDWRHYNAVTPVKNQGSCGSCWAFSVTGNIE 249
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G++A K +L+SLSEQELIDCD+ D+GC GG + ++ IM KL GGLE E YPY ++
Sbjct: 250 GLWAIKKHELLSLSEQELIDCDKIDNGCNGGYMPETYEAIM-KL-GGLETETDYPYEAEN 307
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ C LNK +VKING V++++ E D+AK+L +NGP++ +NA A+QFY+ G+SHP +
Sbjct: 308 EKCNLNKTEIKVKINGAVNLTKSELDIAKWLYKNGPVSAGLNANAMQFYLGGISHPPKIL 367
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C+ E H +LIVGYG+ ++ + +PYWIIKNSWG+ WGEKGY+RLYRG G CGIN
Sbjct: 368 CNP--EEQDHGILIVGYGIHKSSILKRTIPYWIIKNSWGKHWGEKGYYRLYRGSGVCGIN 425
Query: 341 DYVRSALV 348
V SAL+
Sbjct: 426 QMVSSALI 433
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 162/315 (51%), Positives = 209/315 (66%), Gaps = 8/315 (2%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
K LF F++ +NKTY + E R +F NL+ I+ L+ E G+ VYG+ F+DL+
Sbjct: 574 KDELLFEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYGVTMFADLTP 633
Query: 97 AEFQAKYLGFKLKPSYADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
EF+ KYLG K + + +P A+IP+I LP FDWREY+AVT VKDQ CGS WAF
Sbjct: 634 EEFKTKYLGLKTNLN-QENDIPLQEAVIPDIDLPPKFDWREYNAVTPVKDQGQCGSCWAF 692
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG YA K KKL+SLSEQEL+DCD DDGC GG + NA+ T+ KL GGLE E
Sbjct: 693 SAIGNIEGQYAIKHKKLLSLSEQELVDCDNLDDGCGGGYMINAYKTV-EKL-GGLELETD 750
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
YPY ++ C K +V++ ++++ DE MA++LV+NGP++V INA A+QFY GV
Sbjct: 751 YPYDARNEKCHFLKNKAKVQVASALNITNDEKKMAQWLVKNGPISVGINANAMQFYFGGV 810
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
SHP +F CD NL H VLIVGY K +PYWIIKNSWG WGE+GY+R+YRG
Sbjct: 811 SHPFKFLCDPA--NLDHGVLIVGYATSTYPLFKKKLPYWIIKNSWGPKWGEQGYYRVYRG 868
Query: 334 DGSCGINDYVRSALV 348
DG+CG+N SA+V
Sbjct: 869 DGTCGVNAMASSAIV 883
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 157/317 (49%), Positives = 219/317 (69%), Gaps = 11/317 (3%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K LFN F+ +N+TY++L E R IF NL I+ L++TE G+G+YG+N F+D+S
Sbjct: 464 MKAERLFNNFMTTYNRTYSSL-ERNLRFKIFRENLNFIEELRETEQGTGIYGVNMFADMS 522
Query: 96 TAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
EF+ +YLG L+P ++ +P A IP+I LP +FDWR+ VT VK+Q CGS W
Sbjct: 523 QKEFRTRYLG--LRPDLQSENEIPLPKAEIPDIDLPSSFDWRQKGVVTPVKNQGQCGSCW 580
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
AFS TGN+EG YA K +L+SLSEQEL+DCD D+GC GG NA+ I + GGLE E
Sbjct: 581 AFSVTGNVEGQYAIKHGQLLSLSEQELVDCDHLDEGCNGGLPDNAYRAI--EQLGGLELE 638
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY +++ C + +V++ V+++ +ET +A++LV+NGP+A+ INA A+QFY+
Sbjct: 639 SDYPYEAENEKCHFKQNLVKVELASAVNITSNETQIAQWLVQNGPIAIGINANAMQFYMG 698
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GVSHP++ C+ NL+H VLIVGYG R HK +PYWIIKNSWG+ WGE+GY+R+Y
Sbjct: 699 GVSHPLKILCNPN--NLNHGVLIVGYGTSRYPLFHKNLPYWIIKNSWGKSWGEQGYYRVY 756
Query: 332 RGDGSCGINDYVRSALV 348
RGDG+CG+N SA+V
Sbjct: 757 RGDGTCGLNTMASSAVV 773
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 156/319 (48%), Positives = 211/319 (66%), Gaps = 10/319 (3%)
Query: 34 HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
+K LF F+++ KTY + E R IF NL+ I+ LQ E G+ YG+ F+D
Sbjct: 571 EEIKDETLFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFAD 630
Query: 94 LSTAEFQAKYLGFKLKPSYA-DRSVP---AMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
L+ EF+A+YLG L+P + +P A IP+++LP FDWR++ VT VKDQ CGS
Sbjct: 631 LTPKEFKARYLG--LRPELKHENEIPLPEAEIPDVSLPLKFDWRDHSVVTPVKDQGQCGS 688
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
WAFS TGN+EG YA K +L+SLSEQEL+DCD D+GC GG + NA+ I + GGLE
Sbjct: 689 CWAFSVTGNVEGQYAIKHNQLLSLSEQELVDCDSLDEGCNGGDMENAYKAI--ERLGGLE 746
Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
E YPY D+ C + +V++ V+++ DE MA++LV+NGP++V INA A+QFY
Sbjct: 747 LESDYPYDAKDEKCHFLQNKAKVQVVSAVNITSDEKRMAQWLVKNGPISVGINANAMQFY 806
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVSHP+ F C+ +NL H VLIVGYG+ + HK +PYWIIKNSWG WGE+GY+R
Sbjct: 807 FGGVSHPLNFLCNP--KNLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGPRWGERGYYR 864
Query: 330 LYRGDGSCGINDYVRSALV 348
+YRGDG+CG+N SA+V
Sbjct: 865 VYRGDGTCGVNTMATSAVV 883
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 323 bits (828), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 158/318 (49%), Positives = 207/318 (65%), Gaps = 7/318 (2%)
Query: 34 HHVKHTALFNYFLEQHNKTYAT-LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
HH++ LF FL + Y + R IF N+RK+ L E G+ YG+ F+
Sbjct: 2363 HHLQAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTATYGVTRFA 2422
Query: 93 DLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
DL+ EF K++G K L+ + A+IPN+T P +FDWR++ AVTGVKDQ CGS
Sbjct: 2423 DLTYEEFSTKHMGMKASLRDPNQVQFRKAVIPNVTAPDSFDWRDHGAVTGVKDQGSCGSC 2482
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG + KT LVSLSEQEL+DCD+ D GC GG NA+ I + GGLE
Sbjct: 2483 WAFSVTGNIEGQWKMKTGDLVSLSEQELVDCDKLDQGCNGGLPDNAYRAI--EQLGGLES 2540
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
E YPY G D C NK +V+I+G V+++ +ETDMAK+LV++GP+++ INA A+QFY+
Sbjct: 2541 EDDYPYEGSDDKCSFNKTLARVQISGAVNITSNETDMAKWLVKHGPISIGINANAMQFYM 2600
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
G+SHP + C+ NL H VLIVGYG HK +PYWIIKNSWG WGE+GY+R+
Sbjct: 2601 GGISHPWRMLCNPS--NLDHGVLIVGYGAKDYPLFHKHLPYWIIKNSWGTSWGEQGYYRV 2658
Query: 331 YRGDGSCGINDYVRSALV 348
YRGDG+CG+N SA+V
Sbjct: 2659 YRGDGTCGVNQMASSAVV 2676
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 160/319 (50%), Positives = 212/319 (66%), Gaps = 8/319 (2%)
Query: 34 HHVKHTALFNYFLEQHNKTYAT-LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
HHV+ LF F+ + Y VE R IF N++KI L E G+GVY + F+
Sbjct: 223 HHVQAEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRFT 282
Query: 93 DLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGS 149
DL+ EF++KYLG LK A IP + LP +FDWR AVT VKDQ CGS
Sbjct: 283 DLTYEEFKSKYLGLNPNLKKPNQIPMRQAEIPKVHQLPASFDWRPLGAVTEVKDQGACGS 342
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
WAFS TGNIEG + KT KL+SLSEQEL+DCD+ DDGC+GG + NA+ I + GGLE
Sbjct: 343 CWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCDKMDDGCDGGYMDNAYRAI--EQLGGLE 400
Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
E+ YPY +D C NK ++V+I+G V++S +ET+MAK+LV NGP+++ INA A+QFY
Sbjct: 401 TEEEYPYEAEDDKCSFNKSLSKVQISGAVNISSNETNMAKWLVHNGPISIGINANAMQFY 460
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
V GVSHP + C+ +N+ H VLIVGYG+ +K +PYW++KNSWG GWGE+GY+R
Sbjct: 461 VGGVSHPWKALCNP--KNIDHGVLIVGYGIKEYPLFNKQLPYWVVKNSWGPGWGEQGYYR 518
Query: 330 LYRGDGSCGINDYVRSALV 348
++RGDG+CG+N SA+V
Sbjct: 519 VFRGDGTCGVNTMASSAVV 537
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 160/323 (49%), Positives = 212/323 (65%), Gaps = 10/323 (3%)
Query: 30 LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
L +K LF+ F+ ++ K Y E R IF NL I+ LQ E G+G YG+
Sbjct: 719 LQQSRQLKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVT 778
Query: 90 EFSDLSTAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQT 145
+F+DL+ AEF+A++LG LKP+ ++ +P A IP+I LP +DWR ++ VT VKDQ
Sbjct: 779 QFTDLTKAEFKARHLG--LKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTPVKDQG 836
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
CGS WAFS TGNIEG YA K +L+SLSEQEL+DCD+ D GC GG A+ I
Sbjct: 837 SCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDCDKLDSGCNGGLPDTAYRAIEEL-- 894
Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
GGLE E YPY +D+ C NK +V I ++++ +ET MA++LV+NGPM++ INA A
Sbjct: 895 GGLELESDYPYDAEDEKCHFNKNKVKVNIVSGLNITSNETQMAQWLVKNGPMSIGINANA 954
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
+QFY+ GVSHP +F C ++L H VLIVGYGV K +PYWIIKNSWG WGE+
Sbjct: 955 MQFYMGGVSHPFKFLC--SPDSLDHGVLIVGYGVKFYPIFKKTMPYWIIKNSWGPRWGEQ 1012
Query: 326 GYFRLYRGDGSCGINDYVRSALV 348
GY+R+YRGDG+CG+N V SA+V
Sbjct: 1013 GYYRVYRGDGTCGVNKMVTSAVV 1035
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 154/311 (49%), Positives = 214/311 (68%), Gaps = 9/311 (2%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ HNK Y +L E R IF+ N++K++LLQ+ E GS +YG +F+DL+ EF+
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339
Query: 102 KYLGFKLKPSYADRSVP-AMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
KYLG + + +++P A+IP + ++P FDWR ++ VT VK+Q CGS WAFS NI
Sbjct: 340 KYLGLDSSMT-SKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANI 398
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG- 218
EG YA K+K+L+SLSEQELIDCD D+GC GG ++ AF+ + + GGLE E YPY G
Sbjct: 399 EGQYALKSKELLSLSEQELIDCDNLDNGCGGGLMTQAFEAVENL--GGLETESDYPYEGH 456
Query: 219 -DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
D K C+L K +V I+ V+VS DE D+AK+LV++GP++V +NA A+QFY+ GVSHPI
Sbjct: 457 ADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPI 516
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
C ++L H V IVGYGV + + + +P+W IKNSWG+ WG +GY+ LYRGDGSC
Sbjct: 517 HALCSP--KSLDHGVAIVGYGVHKYPYLNATLPFWTIKNSWGDKWGMQGYYLLYRGDGSC 574
Query: 338 GINDYVRSALV 348
G+N V SA++
Sbjct: 575 GVNQMVSSAII 585
>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
Length = 881
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 160/321 (49%), Positives = 218/321 (67%), Gaps = 6/321 (1%)
Query: 30 LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
L ++K+ LF F+ + NKT+++ E +R IF NL+ I+ LQ E G+ YG+
Sbjct: 564 LKLAQNIKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVT 623
Query: 90 EFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
F+DL+ EF+ +YLGF+ LK + +I LP FDWR+Y+AVT VKDQ +C
Sbjct: 624 MFADLTPKEFKTRYLGFRPELKQENEIPLAKIEVSDIFLPPKFDWRDYNAVTPVKDQGLC 683
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAFS TGN+EG YA K KKL+SLSEQEL+DCD D+GC GG + NA+ I KL GG
Sbjct: 684 GSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDTLDEGCNGGYMENAYKAI-EKL-GG 741
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
LE E YPY G ++ C KK +V++ G V+++ +ET MA++L++NGP+++ INA A+Q
Sbjct: 742 LELESDYPYDGRNEKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQ 801
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY+ GVSHP F C+ ++L H VLIVGYG+ + HK +PYWIIKNSWG WGE GY
Sbjct: 802 FYIGGVSHPFHFLCNP--KDLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGSRWGENGY 859
Query: 328 FRLYRGDGSCGINDYVRSALV 348
+R+YRGDG+CG+N SA+V
Sbjct: 860 YRVYRGDGTCGVNAMASSAIV 880
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/313 (48%), Positives = 212/313 (67%), Gaps = 8/313 (2%)
Query: 38 HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTA 97
H +LF FL+++NK Y Y R ++F NL +I++L E G+ YG+ F+D++
Sbjct: 1419 HLSLFTDFLKKYNKKYHKKE-YKYRFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQK 1477
Query: 98 EFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF ++ LG + L+ A IPNI LP+ FDWR+ + VT VK+Q CGS WAFS
Sbjct: 1478 EF-SRSLGLRTDLRNENETPFAQAKIPNIELPKEFDWRKKNVVTEVKNQEQCGSCWAFSV 1536
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
TGN+EG YA + KL+ SEQEL+DCD +D GC GG + A+ +I K+ GGLE E+ YP
Sbjct: 1537 TGNVEGQYALRHGKLLEFSEQELVDCDTDDQGCNGGLMDTAYRSI-EKI-GGLETEQDYP 1594
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y +D+ C N+ +V++ G +++S +ETDMAK+LV NGP+++AINA A+QFY+ GVSH
Sbjct: 1595 YDAEDEKCHFNRTLARVQVTGALNISHNETDMAKWLVANGPISIAINANAMQFYMGGVSH 1654
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
P +F C +NL H VLIVGYGV K++PYWI+KNSWG GWGE+GY+R+YRGDG
Sbjct: 1655 PFKFLC--SPKNLDHGVLIVGYGVHNYPLFKKSLPYWIVKNSWGTGWGEQGYYRVYRGDG 1712
Query: 336 SCGINDYVRSALV 348
+CG+N SA+V
Sbjct: 1713 TCGLNQTPSSAIV 1725
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 151/313 (48%), Positives = 210/313 (67%), Gaps = 8/313 (2%)
Query: 38 HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTA 97
H +LF FL+++ EY R ++F NL +I++L E G+ YG+ F+D++
Sbjct: 1454 HLSLFTDFLKKY-NKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQK 1512
Query: 98 EFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF ++ LG + L+ A IPNI LP+ FDWR+ + VT VK+Q CGS WAFS
Sbjct: 1513 EF-SRSLGLRTDLRNENETPFAQAKIPNIELPKEFDWRKKNVVTEVKNQEQCGSCWAFSV 1571
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
TGN+EG YA + KL+ SEQEL+DCD +D GC GG + A+ +I K+ GGLE E+ YP
Sbjct: 1572 TGNVEGQYALRHGKLLEFSEQELVDCDTDDQGCNGGLMDTAYRSI-EKI-GGLETEQDYP 1629
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y +D+ C N+ +V++ G +++S +ETDMAK+LV NGP+++AINA A+QFY+ GVSH
Sbjct: 1630 YDAEDEKCHFNRTLARVQVTGALNISHNETDMAKWLVANGPISIAINANAMQFYMGGVSH 1689
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
P +F C +NL H VLIVGYGV K++PYWI+KNSWG GWGE+GY+R+YRGDG
Sbjct: 1690 PFKFLC--SPKNLDHGVLIVGYGVHNYPLFKKSLPYWIVKNSWGTGWGEQGYYRVYRGDG 1747
Query: 336 SCGINDYVRSALV 348
+CG+N SA+V
Sbjct: 1748 TCGLNQTPSSAIV 1760
>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
Length = 715
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 158/310 (50%), Positives = 213/310 (68%), Gaps = 14/310 (4%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F + Y + E +R IF N+RK + LQD E G+ VYG+ +F+D+S +EF+
Sbjct: 417 VFQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFK 476
Query: 101 AKYLGFKLKPSYADRSVP-AMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y+G K+ A++ + A IP + +LP +FDWRE+ AVT VK+Q CGS WAFSTTGN
Sbjct: 477 -QYVG-KVWDQNANKGMKKAKIPEMNSLPNSFDWREHGAVTEVKNQGSCGSCWAFSTTGN 534
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEG +A KKLVSLSEQEL+DCD+ D+GC GG S A+ I+ GGLE E Y YRG
Sbjct: 535 IEGQWAISKKKLVSLSEQELVDCDKVDEGCNGGLPSQAYKEIIRL--GGLETETDYKYRG 592
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
++ C ++K +VKING VS+S +ET+MA +LV+NGP+++ INA+A+QFY+ G+SHP +
Sbjct: 593 HNEKCSMDKSKIRVKINGSVSISSNETEMAAWLVKNGPISIGINAFAMQFYMGGISHPWK 652
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
FC+ + L H VLIVGYGV +K PYWIIKNSWG WGEKGY+ +YRG G CG
Sbjct: 653 IFCN--PKELDHGVLIVGYGVKGSK------PYWIIKNSWGPDWGEKGYYLVYRGAGVCG 704
Query: 339 INDYVRSALV 348
+N SA+V
Sbjct: 705 LNTMCTSAVV 714
>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
mellifera]
Length = 881
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 159/321 (49%), Positives = 214/321 (66%), Gaps = 6/321 (1%)
Query: 30 LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
L +K LF F+ + NKT+++ E +R IF NL+ I LQ E G+ YG+
Sbjct: 564 LKLAQDIKDEMLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVT 623
Query: 90 EFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
F+DL+ EF+ +YLGF+ LK + +I LP FDWR+Y+ VT VKDQ +C
Sbjct: 624 MFADLTPKEFKTRYLGFRPELKQENEIPLAKIEVSDIFLPLKFDWRDYNVVTPVKDQGLC 683
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAFS TGN+EG YA K KKL+SLSEQEL+DCD D+GC GG + NA+ I KL GG
Sbjct: 684 GSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDTLDEGCNGGYMENAYKAI-EKL-GG 741
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
LE E YPY G ++ C KK +V++ G V+++ +ET MA++L++NGP+++ INA A+Q
Sbjct: 742 LELESDYPYDGRNEKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQ 801
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY+ GVSHP F C+ ++L H VLIVGYG+ + HK +PYWIIKNSWG WGE GY
Sbjct: 802 FYIGGVSHPFHFLCNP--KDLDHGVLIVGYGISKYPLFHKKLPYWIIKNSWGSRWGENGY 859
Query: 328 FRLYRGDGSCGINDYVRSALV 348
+R+YRGDG+CG+N SA+V
Sbjct: 860 YRVYRGDGTCGVNAMASSAIV 880
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
pulchellus]
Length = 475
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 158/349 (45%), Positives = 216/349 (61%), Gaps = 11/349 (3%)
Query: 7 FAGVALLSLTVSVSSFMVVGDEKL----HHLHHVKHTALFNYFLEQHNKTYATLVEYYSR 62
F A +S+ +S + D HL + +LF+ F +NKTY E+ +R
Sbjct: 127 FTCEAAMSIVTRISGVLDPKDLTFAYLSKHLKLSQERSLFSVFARTYNKTYKDKEEHEAR 186
Query: 63 LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAM 120
IF NL++I L E G+ YGL EFSDLS +EF+ YLG K L A+ +
Sbjct: 187 FMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFERHYLGLKKDLAEHKAEVKPIKV 246
Query: 121 IP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
P N LP FDWR AVT VK+Q MCGS WAFS TGN+EG + KL+SLSEQEL+
Sbjct: 247 GPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELV 306
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD D GC+GG + A ++ GGLE E YPY+G D C NK ++ ++ +V
Sbjct: 307 DCDHGDHGCKGGYMGQAMKAVIEM--GGLETESEYPYKGVDGTCEFNKTESKARVQSFVG 364
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ ++ET++A +L+++GP+++ INA A+QFY G+SHP +F C +L H VL+VG+GV
Sbjct: 365 LPQNETELAYWLMKHGPVSIGINANAMQFYFGGISHPWKFLCS--PTDLDHGVLLVGFGV 422
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
D+ F K VPYWI+KNSWG+ WGEKGY+R+YRGDG+CG+N SA+V
Sbjct: 423 DKRSFRRKPVPYWIVKNSWGKYWGEKGYYRVYRGDGTCGVNQMALSAVV 471
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 201/318 (63%), Gaps = 6/318 (1%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
H+L+ V+H LF F + + Y T +E R IF NL+ I+ L E GS YG+ E
Sbjct: 157 HNLNKVEH--LFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITE 214
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
F+D+++ E++ + ++ P A + A IPNI LP+ FDWRE A++ VK+Q CGS
Sbjct: 215 FADMTSPEYKQRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGNCGSC 274
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG++A +T L SEQEL+DCD D C GG NA++ I K+ GGLE
Sbjct: 275 WAFSVTGNIEGLHAVRTGVLEQYSEQELLDCDTSDSACNGGLPDNAYEAI-EKI-GGLEL 332
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
E YPY C N VK+ G+V + ++ET +A++L+ NGP+++ INA A+QFY
Sbjct: 333 ESDYPYHARKDQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPISIGINANAMQFYR 392
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVSHP C +NL H VLIVGYGV K +PYWI+KNSWG+ WGE+GY+R+
Sbjct: 393 GGVSHPPHILC--SRKNLDHGVLIVGYGVSDYPMFKKTLPYWIVKNSWGKKWGEQGYYRV 450
Query: 331 YRGDGSCGINDYVRSALV 348
YRGD +CG+++ SA++
Sbjct: 451 YRGDNTCGVSEMSSSAVL 468
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 159/314 (50%), Positives = 197/314 (62%), Gaps = 13/314 (4%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
K LF F++ ++K Y T E+ R IF NL K + LQ TE +G YG+ +F DLS
Sbjct: 49 KTQDLFQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSE 108
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD--AVTGVKDQTMCGSSWAFS 154
EF+ YL + S A IP T P AFDWR+ D AVT VK+Q CGS WAFS
Sbjct: 109 EEFRKYYLTPVWRGSDPHMK-KAEIPKGTPPAAFDWRDADKNAVTKVKNQGTCGSCWAFS 167
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TTGNIEG + K LVSLSEQEL+DCD+ D GC GG SNA+ IM GG+ E Y
Sbjct: 168 TTGNIEGQWKIKKGTLVSLSEQELVDCDKLDQGCNGGLPSNAYQEIMRF--GGIMSEDDY 225
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
PY G D+ C+LN +V ING +++S+DE DMA +L NGP+++ INA A+QFY GVS
Sbjct: 226 PYTGRDQDCKLNATLNKVYINGSMNISKDEGDMASWLAANGPISIGINANAMQFYFGGVS 285
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
HP + FC+ ENL H VLIVGYG T PYWIIKNSWG WG +GY+ +YRG
Sbjct: 286 HPWKIFCN--PENLDHGVLIVGYG------TKDGTPYWIIKNSWGRSWGVEGYYLVYRGG 337
Query: 335 GSCGINDYVRSALV 348
G CG+N+ SA+V
Sbjct: 338 GVCGLNEMCTSAIV 351
>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
Length = 283
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 147/287 (51%), Positives = 190/287 (66%), Gaps = 6/287 (2%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
R IF N++KI L D E G YG+ +FSDL+ EF+ YL K S+ V A I
Sbjct: 2 RFKIFRENMKKINTLNDNELGDAEYGVTQFSDLAEEEFRRYYLTPKWDLSHRPDLVRAKI 61
Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
P++ P +FDWR+++AVT VK+Q MCGS WAFSTT NIEG +A KLVSLSEQEL+DC
Sbjct: 62 PDVDPPASFDWRDHNAVTPVKNQGMCGSCWAFSTTENIEGQWAIHRNKLVSLSEQELVDC 121
Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
D+ DDGCEGG NA++ I+ GGLE EK YPY +D+ C+ V IN V++S
Sbjct: 122 DKLDDGCEGGLPVNAYEEIIRL--GGLESEKKYPYDAEDEKCKFTVGDVAVYINSSVNIS 179
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
+E DMA +L +NGP+++ INA+A+QFY+ GVSHP F C + L H VLIVGYG +
Sbjct: 180 SNEADMAAWLYKNGPISIGINAFAMQFYMGGVSHPFSFLC--SPDELDHGVLIVGYGTKK 237
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
F+ PYWI+KNSWG WG +GY+ +YRGDG CG+N SA+V
Sbjct: 238 GWFSDS--PYWIVKNSWGASWGVQGYYLVYRGDGVCGLNKMPTSAIV 282
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 200/318 (62%), Gaps = 6/318 (1%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
H+L+ V+H LF F + + Y T +E R IF NL+ I+ L E GS YG+ E
Sbjct: 157 HNLNKVEH--LFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITE 214
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
F+D+++ E++ + ++ P A + A IPNI LP+ FDWRE A++ VK+Q CGS
Sbjct: 215 FADMTSPEYKQRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGNCGSC 274
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG++A +T L SEQEL+DCD D C GG NA++ I K+ GGLE
Sbjct: 275 WAFSVTGNIEGLHAVRTGVLEQYSEQELLDCDTSDSACNGGLPDNAYEAI-EKI-GGLEL 332
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
E YPY C N VK+ G+V + ++ET +A++L+ NGP+++ INA A+QFY
Sbjct: 333 ESDYPYHARKDQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPISIGINANAMQFYR 392
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVSHP C +NL H VLIVGY V K +PYWI+KNSWG+ WGE+GY+R+
Sbjct: 393 GGVSHPPHILC--SRKNLDHGVLIVGYRVSDYPMFKKTLPYWIVKNSWGKKWGEQGYYRV 450
Query: 331 YRGDGSCGINDYVRSALV 348
YRGD +CG+++ SA++
Sbjct: 451 YRGDNTCGVSEMSSSAVL 468
>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
Length = 605
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 203/319 (63%), Gaps = 9/319 (2%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L+ V H LF+ F ++ + YA +E+ RL IF NLR IQ L D E GS YG+ EF+
Sbjct: 292 LNKVDH--LFHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQELNDNEQGSAKYGITEFA 349
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
D++++E+ + ++ + PA++P LP+ FDWRE +AVT VK+Q CGS
Sbjct: 350 DMTSSEYTQRAGLWQRSANKPTGGKPAVVPAYKGELPKEFDWREKNAVTQVKNQGSCGSC 409
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGLE
Sbjct: 410 WAFSVTGNIEGLYAIKTGELREFSEQELLDCDSTDSACNGGLMDNAYKAI--KDIGGLEY 467
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
E YPY K C NK + V++ +V + + +ET M ++L+ NGP+++ +NA A+QFY
Sbjct: 468 ESEYPYLAKKKQCHFNKTLSHVQVADFVDLPKGNETAMQEWLLANGPISIGLNANAMQFY 527
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVSHP C +NL H VLIVGYGV HK +PYWI+KNSWG WGE+GY+R
Sbjct: 528 RGGVSHPWGPLC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 585
Query: 330 LYRGDGSCGINDYVRSALV 348
+YRGD +CG+++ SA++
Sbjct: 586 IYRGDNTCGVSEMATSAVL 604
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 205/312 (65%), Gaps = 10/312 (3%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F H + YA+ E+ R +IF NL KI L E G+G YG+ +F+D++TAE++A
Sbjct: 1478 FEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTGKYGVTKFADMTTAEYRA 1537
Query: 102 KYLGFKLKPSYAD--RSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ G + +++ R+ A + +LP +FDWR++ AVTGVK+Q CGS WAFS G
Sbjct: 1538 -HTGLIVPKQHSNHIRNPIATVSTERTSLPTSFDWRDHGAVTGVKNQGNCGSCWAFSAIG 1596
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
NIEG++ KTKKL + SEQELIDCD D+GC GG + +AF I KL GGLE E YPY+
Sbjct: 1597 NIEGLHQIKTKKLEAYSEQELIDCDTVDNGCNGGYMDDAFKAI-EKL-GGLELEDEYPYQ 1654
Query: 218 GD-DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
K C NK + V++ G V + ++ET +A+YL+ENGP+A+ +NA A+QFY G+SHP
Sbjct: 1655 AKAQKTCHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIAIGLNANAMQFYRGGISHP 1714
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
C ++ + H VLIVGYGV +K +PYW IKNSWG WGE+GY+R+YRGD S
Sbjct: 1715 WHLLC--SHKQIDHGVLIVGYGVKEYPLFNKTLPYWTIKNSWGPKWGEQGYYRIYRGDNS 1772
Query: 337 CGINDYVRSALV 348
CG+++ SA++
Sbjct: 1773 CGVSEMASSAIL 1784
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 153/338 (45%), Positives = 207/338 (61%), Gaps = 25/338 (7%)
Query: 32 HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
HL + +LF+ F +NKTY E+ +R IF NL++I L E G+ YGL EF
Sbjct: 24 HLKLSQERSLFSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEF 83
Query: 92 SDLSTAEFQAKYLGFK--LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCG 148
SDLS +EF+ YLG K L A+ + P N LP FDWR AVT VK+Q MCG
Sbjct: 84 SDLSPSEFERHYLGLKKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCG 143
Query: 149 SSWAFS------------------TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
S WAFS TGN+EG + KL+SLSEQEL+DCD D GC+G
Sbjct: 144 SCWAFSXXTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDCDHGDHGCKG 203
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
G + A ++ GGLE E YPY+G D C NK ++ ++ +V + ++ET++A +
Sbjct: 204 GYMGQAMKAVIEM--GGLETESEYPYKGVDGTCEFNKTESKARVQSFVGLPQNETELAYW 261
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVP 310
L+++GP+++ INA A+QFY G+SHP +F C +L H VL+VG+GVD+ F K VP
Sbjct: 262 LMKHGPVSIGINANAMQFYFGGISHPWKFLCS--PTDLDHGVLLVGFGVDKRSFRRKPVP 319
Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
YWI+KNSWG+ WGEKGY+R+YRGDG+CG+N SA+V
Sbjct: 320 YWIVKNSWGKYWGEKGYYRVYRGDGTCGVNQMALSAVV 357
>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
Length = 599
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 146/319 (45%), Positives = 200/319 (62%), Gaps = 9/319 (2%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L+ V H LF+ F ++ + YA E+ RL IF +L+ IQ L E GS YG+ EF+
Sbjct: 286 LNKVDH--LFHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFA 343
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
D+++ E+ + ++ A++P LP+ FDWR+ +AVT VK+Q CGS
Sbjct: 344 DMTSTEYAQRAGLWQRSEGKPTGGAAAVVPAYAGELPKEFDWRQKNAVTHVKNQGQCGSC 403
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG YA KT L SEQEL+DCD +D C GG + NA+ I K GGLE
Sbjct: 404 WAFSVTGNIEGAYAIKTGDLQEFSEQELLDCDSKDSACNGGLMDNAYKAI--KDIGGLEY 461
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
E YPY G K C N+ + V+++G+V + + +ET M ++L+ NGP+++ INA A+QFY
Sbjct: 462 ESEYPYEGKKKQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFY 521
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVSHP C +NL H VLIVGYGV HK +PYWI+KNSWG WGE+GY+R
Sbjct: 522 RGGVSHPWSPLC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 579
Query: 330 LYRGDGSCGINDYVRSALV 348
+YRGD +CG+++ SAL+
Sbjct: 580 VYRGDNTCGVSEMATSALL 598
>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
Length = 459
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 149/309 (48%), Positives = 194/309 (62%), Gaps = 15/309 (4%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+ +H K Y + + R +F NL+ I+ Q+ E G+ VYG+ +FSDL+ EF+ YL
Sbjct: 160 FMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPEEFKKIYL 219
Query: 105 GFKL-KPSYADRSVPAMIP----NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
+ +P +R V N TLP +FDWR++ AVT VK+Q CGS WAFSTTGNI
Sbjct: 220 PYIWDEPIVPNRMVDLTAEGVHLNETLPESFDWRDHGAVTDVKNQGFCGSCWAFSTTGNI 279
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
EG + KKLVSLSEQEL+DCD+ DDGCEGG S A+ IM GGLE E YPY G
Sbjct: 280 EGQWFLAKKKLVSLSEQELVDCDKVDDGCEGGLPSQAYKEIMRM--GGLETESAYPYDGR 337
Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
+ C +N+ V IN V + DE M +LV+ GP+++ INA LQFY G+SHP +F
Sbjct: 338 GEECHINRTEFAVYINDSVELPHDEESMKAWLVKKGPISIGINANPLQFYRHGISHPWKF 397
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
FC+ L+H VL+VGYG ++ K PYWIIKNSWG WGE GY+RLYRG CG+
Sbjct: 398 FCEP--YMLNHGVLLVGYGSEKNK------PYWIIKNSWGPKWGENGYYRLYRGKNVCGV 449
Query: 340 NDYVRSALV 348
++ SA+V
Sbjct: 450 HEMPTSAVV 458
>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
Length = 620
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 201/319 (63%), Gaps = 9/319 (2%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L V+H LF+ F + + Y + E RL IF NL+ I+ L E GS YG+ EF+
Sbjct: 307 LDKVEH--LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFA 364
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
D+++ E++ + ++ + A PA++P + LP+ FDWR +AVTGVK+Q CGS
Sbjct: 365 DMTSTEYKERTGLWQRDEAKATGGSPAVVPAYSGELPKEFDWRSKNAVTGVKNQGQCGSC 424
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG+YA K +L SEQEL+DCD D C GG + NA+ I K GGLE
Sbjct: 425 WAFSVTGNIEGLYALKYGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 482
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
E YPY K C NK + V++ +V + + +ET M ++LV NGP+++ INA A+QFY
Sbjct: 483 EAEYPYEAKKKQCHFNKTMSHVQVKDFVDLPKGNETAMQEWLVSNGPISIGINANAMQFY 542
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVSHP + C +NL H VL+VGYGV HK +PYWI+KNSWG WGE+GY+R
Sbjct: 543 RGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNYHKTLPYWIVKNSWGPRWGEQGYYR 600
Query: 330 LYRGDGSCGINDYVRSALV 348
+YRGD +CG+++ SA++
Sbjct: 601 VYRGDNTCGVSEMATSAVL 619
>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
Length = 610
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 200/322 (62%), Gaps = 10/322 (3%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
H L V+H LF+ F + + Y VE RL IF NLR I+ L E GS YG+ E
Sbjct: 294 HSLDKVEH--LFHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAKYGITE 351
Query: 91 FSDLSTAEFQAK---YLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
F+D+++ E++ + + + +P+ ++V P LP+ FDWR+ AV+ VK+Q C
Sbjct: 352 FADMTSTEYKERTGLWQRTEGQPTGGQKAVVPSYPGGELPKEFDWRQKGAVSSVKNQGSC 411
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAFST GNIEG+ A KT +L SEQEL+DCD +D C GG NA+ I GG
Sbjct: 412 GSCWAFSTIGNIEGLNAVKTGQLKEFSEQELLDCDTKDSACNGGLPDNAYKAIQEI--GG 469
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYAL 266
LE E YPY+ + C NK V++ G+V + + +ET M ++L+ NGP+++ INA A+
Sbjct: 470 LEYESEYPYKARKEQCHFNKTLAHVQVTGFVDLPKNNETAMQEWLIANGPISIGINANAM 529
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY GVSHP + C+ NL H VLIVGYGV HK +PYWI+KNSWG WGE+G
Sbjct: 530 QFYRGGVSHPWKILCE--KSNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQG 587
Query: 327 YFRLYRGDGSCGINDYVRSALV 348
Y+R+YRGD +CG+++ SA++
Sbjct: 588 YYRVYRGDNTCGVSEMASSAIL 609
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 144/328 (43%), Positives = 210/328 (64%), Gaps = 15/328 (4%)
Query: 29 KLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL 88
K+ HV+ +F+ F H + YA+ +E+ R +IF NL KI+ L E G+ YG+
Sbjct: 1513 KIDDDAHVRR--MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGV 1570
Query: 89 NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-------LPRAFDWREYDAVTGV 141
+F+D++ AE++A + G + V + + LPR+FDWR++ AVT V
Sbjct: 1571 TKFADMTVAEYRA-HTGLVVPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEV 1629
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
K+Q CGS WAFS GN+EG++ KTKKL S SEQELIDCD+ D+GC GG + +AF I
Sbjct: 1630 KNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI- 1688
Query: 202 SKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVA 260
+ GGLE E YPY K+C N+ + V++ G V + ++ET +AKYL++NGP+A+
Sbjct: 1689 -EQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIG 1747
Query: 261 INAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
+NA A+QFY G+SHP C+ ++++ H VLIVGYG+ +K +PYWIIKNSWG
Sbjct: 1748 LNANAMQFYRGGISHPWHPLCN--HKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGP 1805
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+R+YRGD SCG+++ SA++
Sbjct: 1806 RWGEQGYYRIYRGDNSCGVSEMASSAIL 1833
>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
Precursor
gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
Length = 614
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 200/321 (62%), Gaps = 9/321 (2%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
H V H LF F + + Y + E RL IF NL+ I+ L E GS YG+ E
Sbjct: 299 HRFDKVDH--LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITE 356
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCG 148
F+D++++E++ + ++ + A A++P LP+ FDWR+ DAVT VK+Q CG
Sbjct: 357 FADMTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCG 416
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
S WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGL
Sbjct: 417 SCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGL 474
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQ 267
E E YPY+ C N+ + V++ G+V + + +ET M ++L+ NGP+++ INA A+Q
Sbjct: 475 EYEAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQ 534
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY GVSHP + C +NL H VL+VGYGV HK +PYWI+KNSWG WGE+GY
Sbjct: 535 FYRGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 592
Query: 328 FRLYRGDGSCGINDYVRSALV 348
+R+YRGD +CG+++ SA++
Sbjct: 593 YRVYRGDNTCGVSEMATSAVL 613
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 144/328 (43%), Positives = 210/328 (64%), Gaps = 15/328 (4%)
Query: 29 KLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL 88
K+ HV+ +F+ F H + YA+ +E+ R +IF NL KI+ L E G+ YG+
Sbjct: 1489 KIDDDAHVRR--MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGV 1546
Query: 89 NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-------LPRAFDWREYDAVTGV 141
+F+D++ AE++A + G + V + + LPR+FDWR++ AVT V
Sbjct: 1547 TKFADMTVAEYRA-HTGLVVPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEV 1605
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
K+Q CGS WAFS GN+EG++ KTKKL S SEQELIDCD+ D+GC GG + +AF I
Sbjct: 1606 KNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI- 1664
Query: 202 SKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVA 260
+ GGLE E YPY K+C N+ + V++ G V + ++ET +AKYL++NGP+A+
Sbjct: 1665 -EQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIG 1723
Query: 261 INAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
+NA A+QFY G+SHP C+ ++++ H VLIVGYG+ +K +PYWIIKNSWG
Sbjct: 1724 LNANAMQFYRGGISHPWHPLCN--HKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGP 1781
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+R+YRGD SCG+++ SA++
Sbjct: 1782 RWGEQGYYRIYRGDNSCGVSEMASSAIL 1809
>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
Length = 475
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 201/321 (62%), Gaps = 9/321 (2%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
H V H LF F + + Y + E RL IF NL+ I+ L E GS YG+ E
Sbjct: 160 HRFDKVDH--LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITE 217
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCG 148
F+D++++E++ + ++ + A A++P + LP+ FDWR+ DAVT VK+Q CG
Sbjct: 218 FADMTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCG 277
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
S WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGL
Sbjct: 278 SCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGL 335
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQ 267
E E YPY+ C N+ + V++ G+V + + +ET M ++L+ NGP+++ INA A+Q
Sbjct: 336 EYEAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQ 395
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY GVSHP + C +NL H VL+VGYGV HK +PYWI+KNSWG WGE+GY
Sbjct: 396 FYRGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 453
Query: 328 FRLYRGDGSCGINDYVRSALV 348
+R+YRGD +CG+++ SA++
Sbjct: 454 YRVYRGDNTCGVSEMATSAVL 474
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 143/322 (44%), Positives = 208/322 (64%), Gaps = 15/322 (4%)
Query: 35 HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
HV+ +F+ F H + YA+ +E+ R +IF NL KI+ L E G+ YG+ +F+D+
Sbjct: 638 HVRR--MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADM 695
Query: 95 STAEFQAKYLGFKLKPSYADRSVPAMIPNIT-------LPRAFDWREYDAVTGVKDQTMC 147
+ AE++A + G + V + + LPR+FDWR++ AVT VK+Q C
Sbjct: 696 TVAEYRA-HTGLVVPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEVKNQGSC 754
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAFS GN+EG++ KTKKL S SEQELIDCD+ D+GC GG + +AF I + GG
Sbjct: 755 GSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI--EQLGG 812
Query: 208 LEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
LE E YPY K+C N+ + V++ G V + ++ET +AKYL++NGP+A+ +NA A+
Sbjct: 813 LELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANAM 872
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY G+SHP C+ ++++ H VLIVGYG+ +K +PYWIIKNSWG WGE+G
Sbjct: 873 QFYRGGISHPWHPLCN--HKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQG 930
Query: 327 YFRLYRGDGSCGINDYVRSALV 348
Y+R+YRGD SCG+++ SA++
Sbjct: 931 YYRIYRGDNSCGVSEMASSAIL 952
>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
Length = 617
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 141/321 (43%), Positives = 200/321 (62%), Gaps = 9/321 (2%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
H L+ ++H LF+ F ++ + YA E+ RL IF NLR I+ L E GS YG+ +
Sbjct: 302 HTLNKIEH--LFHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQ 359
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCG 148
F+D+++ E++ ++ A++P +P+ FDWR+ AVT VK+Q CG
Sbjct: 360 FADMTSTEYKLHAGLWQRSEDKPTGGAAAVVPPYAGEMPKEFDWRQKKAVTHVKNQGQCG 419
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
S WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGL
Sbjct: 420 SCWAFSVTGNIEGLYAIKTGELEEFSEQELLDCDSTDSACNGGLMDNAYKAI--KDIGGL 477
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQ 267
E E YPY C N+ + V+++G+V + + +ET M ++L+ NGP+++ +NA A+Q
Sbjct: 478 EYESEYPYAAKKMQCHFNRTMSHVQLSGFVDLPKGNETAMQEWLLSNGPISIGLNANAMQ 537
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY GVSHP C +NL H VLIVGYGV HK +PYWI+KNSWG WGE+GY
Sbjct: 538 FYRGGVSHPWAPLC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 595
Query: 328 FRLYRGDGSCGINDYVRSALV 348
+R+YRGD +CG+++ SA++
Sbjct: 596 YRIYRGDNTCGVSEMATSAVL 616
>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
Length = 615
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 197/311 (63%), Gaps = 7/311 (2%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F + + Y + E RL IF NL+ I+ L E GS YG+ EF+D++++E++
Sbjct: 308 LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYK 367
Query: 101 AKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ ++ + A A++P LP+ FDWR+ DAVT VK+Q CGS WAFS TGN
Sbjct: 368 ERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGN 427
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGLE E YPY+
Sbjct: 428 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKA 485
Query: 219 DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
C N+ + V++ G+V + + +ET M ++L+ NGP+++ INA A+QFY GVSHP
Sbjct: 486 KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHPW 545
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+ C +NL H VL+VGYGV HK +PYWI+KNSWG WGE+GY+R+YRGD +C
Sbjct: 546 KALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTC 603
Query: 338 GINDYVRSALV 348
G+++ SA++
Sbjct: 604 GVSEMATSAVL 614
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/344 (43%), Positives = 213/344 (61%), Gaps = 19/344 (5%)
Query: 16 TVSVSSFMVVGDEKLHHL----HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
TV+ S + + HH H +H LF+ F +HN+TY + +E+ R IF NL
Sbjct: 1118 TVAKRSLRPHPNLEAHHYSKSEDHSRH--LFDKFKTRHNRTYQSSLEHEMRFRIFKNNLF 1175
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYAD-----RSVPAMI-PNIT 125
KI+ L E G+ YG+ F+D+++AE++A+ G + P D R+ A I ++
Sbjct: 1176 KIEQLNKYEQGTAKYGITHFADMTSAEYRAR-TGLVV-PREGDEVNHIRNPMAEIDEHME 1233
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
LP AFDWRE AV+ VK+Q CGS WAFS GNIEG++ KTKKL SEQEL+DCD D
Sbjct: 1234 LPDAFDWRELGAVSEVKNQGNCGSCWAFSVVGNIEGLHQVKTKKLEEYSEQELLDCDTVD 1293
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDE 244
C GG + +A+ I K+ GGLE E YPY K C NK V++ G V + ++E
Sbjct: 1294 SACNGGFMDDAYKAI-EKI-GGLELESEYPYLAKKQKTCHFNKTMAHVRVKGAVDLPKNE 1351
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
T +A++LV NGP+++ +NA A+QFY G+SHP + C +NL H VLIVGYGV
Sbjct: 1352 TAIAQFLVANGPVSIGLNANAMQFYRGGISHPWKPLC--SKKNLDHGVLIVGYGVKEYPM 1409
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+K +PYWI+KNSWG WGE+GY+R++RGD +CG+++ SA++
Sbjct: 1410 FNKTLPYWIVKNSWGPKWGEQGYYRVFRGDNTCGVSEMATSAVL 1453
>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
Length = 615
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 200/319 (62%), Gaps = 9/319 (2%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L V H LF+ F + + Y + E RL IF NL+ I+ L E GS YG+ EF+
Sbjct: 302 LDKVDH--LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFA 359
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSS 150
DL+++E++ + ++ + A A++P LP+ FDWR+ +AVT VK+Q CGS
Sbjct: 360 DLTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKNAVTPVKNQGSCGSC 419
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGLE
Sbjct: 420 WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 477
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
E YPY+ C N+ + V++ G+V + + +ET M ++L+ GP+++ INA A+QFY
Sbjct: 478 EAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTKGPISIGINANAMQFY 537
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVSHP + C +NL H VL+VGYGV HK +PYWI+KNSWG WGE+GY+R
Sbjct: 538 RGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 595
Query: 330 LYRGDGSCGINDYVRSALV 348
+YRGD +CG+++ SA++
Sbjct: 596 VYRGDNTCGVSEMATSAVL 614
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 148/309 (47%), Positives = 188/309 (60%), Gaps = 15/309 (4%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+++H K Y E R +F N + I+ LQ E G+ VYG +FSD++T EF+ L
Sbjct: 179 FIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKETML 238
Query: 105 GFKL-KPSYADRS----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
++ +P D++ I LP +FDWRE+ AVT VK+Q CGS WAFSTTGNI
Sbjct: 239 PYQWEQPVPMDQANFEKEGVTISEEDLPDSFDWREHGAVTQVKNQGSCGSCWAFSTTGNI 298
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
EG + KKLVSLSEQEL+DCD D GC GG SNA+ I+ GGLE E YPY G
Sbjct: 299 EGAWFLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDGR 356
Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
+ C L +K V ING V + DE +M K+LV GP+++ +NA LQFY GV HP +
Sbjct: 357 GETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKI 416
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
FC+ L+H VLIVGYG D K PYWI+KNSWG WGE GYF+LYRG CG+
Sbjct: 417 FCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPTWGEAGYFKLYRGKNVCGV 468
Query: 340 NDYVRSALV 348
+ S+LV
Sbjct: 469 QEMATSSLV 477
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 148/309 (47%), Positives = 188/309 (60%), Gaps = 15/309 (4%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+++H K Y E R +F N + I+ LQ E G+ VYG +FSD++T EF+ L
Sbjct: 179 FIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKETML 238
Query: 105 GFKL-KPSYADRS----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
++ +P D++ I LP +FDWRE+ AVT VK+Q CGS WAFSTTGNI
Sbjct: 239 PYQWEQPVPMDQANFEKEGVTISEEDLPDSFDWREHGAVTQVKNQGSCGSCWAFSTTGNI 298
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
EG + KKLVSLSEQEL+DCD D GC GG SNA+ I+ GGLE E YPY G
Sbjct: 299 EGAWFLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDGR 356
Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
+ C L +K V ING V + DE +M K+LV GP+++ +NA LQFY GV HP +
Sbjct: 357 GETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKI 416
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
FC+ L+H VLIVGYG D K PYWI+KNSWG WGE GYF+LYRG CG+
Sbjct: 417 FCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPTWGEAGYFKLYRGKNVCGV 468
Query: 340 NDYVRSALV 348
+ S+LV
Sbjct: 469 QEMATSSLV 477
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 192/310 (61%), Gaps = 13/310 (4%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F FL+ K Y + E R IF N++ +++LQ E G+ VYG+ F+DL+ EF+
Sbjct: 196 FKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFRK 255
Query: 102 KYLGFKLKPSYADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
F L P + +P A IP + +DWRE++AVT VK+Q MCGS WAF+T N
Sbjct: 256 ----FYLSPQWKRDQLPQRKASIPKGKIEDRWDWREHNAVTEVKNQGMCGSCWAFATIAN 311
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EGV+A K +LVSLSEQEL+DCD D GC GG SNA+ I+ GGL E Y Y G
Sbjct: 312 VEGVWAVKKGELVSLSEQELVDCDTLDQGCSGGYPSNAYKEIIRL--GGLTTETNYSYDG 369
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
+ CR + +V IN VS+ DET++A Y+ ENGP+AV INA+A+ FY G++HP +
Sbjct: 370 NQGTCRFKTQNAKVYINDSVSLPEDETEIAAYIRENGPVAVGINAFAMMFYRHGIAHPWR 429
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
F C + L H V IVGY V++ + K PYWIIKNSWG WGE GY+ LYRG G CG
Sbjct: 430 FLCSP--DALDHGVAIVGYDVEKQ--SKKPKPYWIIKNSWGTHWGEGGYYMLYRGAGVCG 485
Query: 339 INDYVRSALV 348
+N V SA++
Sbjct: 486 VNKMVTSAII 495
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
[Strongylocentrotus purpuratus]
Length = 453
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 143/330 (43%), Positives = 209/330 (63%), Gaps = 17/330 (5%)
Query: 14 SLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYAT---LVEYYSRLHIFSGNL 70
SL++ F + D + + ++ LF+ FL + Y EY R +F N+
Sbjct: 129 SLSLKAQDFSITKDCQASDIKD-EYRDLFDKFLMTFKREYRQNDGTNEYEYRYSVFVQNM 187
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAF 130
+++ E G+ YG +F+D++ AEF+ G LK + + A IP +P +
Sbjct: 188 LTVEMFNQFEQGTAKYGPTKFADMTEAEFRKLQSG-PLKKTGIKKQ--AAIPQGPVPEEY 244
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
DWR + AVT VK+Q MCGS WAFS GN+EG + K +L+SLSEQEL+DCD+ D GCEG
Sbjct: 245 DWRTHGAVTPVKNQGMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCDKVDGGCEG 304
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
G +S+A++ I+ KLGG + EEK YPYRG+++ C+ N +VKINGYV++S++ET+MA +
Sbjct: 305 GEMSDAYEAII-KLGGAMSEEK-YPYRGENEKCKFNMTDVRVKINGYVNISKNETEMAGW 362
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVP 310
L +GP+++ INA +QFY G++HP + FC ++L H VLIVGY V P
Sbjct: 363 LAAHGPISIGINALMMQFYFGGIAHPWKIFCS--PDSLDHGVLIVGYSV------KDGEP 414
Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
YWI+KNSWG+ WGE+GY+ +YRGDG+CG+N
Sbjct: 415 YWIVKNSWGKDWGEEGYYLVYRGDGTCGLN 444
>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 147/287 (51%), Positives = 186/287 (64%), Gaps = 13/287 (4%)
Query: 63 LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF-QAKYLGFKLKPSYADRSVPAMI 121
+ IF N+RK +Q + G+ YG FSDLS EF + K + KP Y + A I
Sbjct: 1 MKIFESNMRKAAKMQKMDSGTAQYGPTIFSDLSEEEFRKQKMMPGWGKPLYEMKD--AEI 58
Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
P +P + DWR+ VT VK+Q CGS WAFSTTGNIEG YA KT KLVSLSEQEL+DC
Sbjct: 59 PLGDIPESVDWRDKGVVTPVKNQGSCGSCWAFSTTGNIEGQYAIKTGKLVSLSEQELVDC 118
Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
D D GCEGG SNA+ I KL GGLE E YPY+G D C+ NK +V IN V +S
Sbjct: 119 DTIDKGCEGGLPSNAYKQI-EKL-GGLESESDYPYKGADSKCKFNKAEVKVTINSSVVIS 176
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
+DE ++A +L +NGP+++ INA A+QFY+ G++HP + FC+ +L+H VLIVGYGV
Sbjct: 177 KDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPWKIFCN--PSSLNHGVLIVGYGV-- 232
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG WGEKGY+ +YRG G CG+N SA++
Sbjct: 233 ----KNGTPYWIIKNSWGPSWGEKGYYLIYRGGGCCGLNTMCTSAVI 275
>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
Length = 615
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 197/311 (63%), Gaps = 7/311 (2%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF+ F + + Y + E RL IF NL+ I+ L E GS YG+ EF+D++++E++
Sbjct: 308 LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFADMTSSEYK 367
Query: 101 AKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ ++ + A A++P LP+ FDWR+ +AVT VK+Q CGS WAFS TGN
Sbjct: 368 ERTGLWQRNEAKATGGSVAVVPAYHGELPKEFDWRQKNAVTQVKNQGSCGSCWAFSVTGN 427
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEG++A KT L SEQEL+DCD D C GG + NA+ I K GGLE E YPY+
Sbjct: 428 IEGLHAVKTGDLKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKA 485
Query: 219 DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
C N+ + V++ G+V + + +ET M ++L+ NGP+++ INA A+QFY GVSHP
Sbjct: 486 KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHPW 545
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+ C +NL H VL+VGYGV HK +PYWI+KNSWG WGE+GY+R+YRGD +C
Sbjct: 546 KALC--SKKNLDHGVLVVGYGVSEYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTC 603
Query: 338 GINDYVRSALV 348
G+++ SA++
Sbjct: 604 GVSEMATSAVL 614
>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
Length = 427
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 153/341 (44%), Positives = 207/341 (60%), Gaps = 17/341 (4%)
Query: 7 FAGVALLSLTVSVSSFMVVG---DEKLHHLHHVKHTA-LFNYFLEQHNKTYATLVEYYSR 62
F +A+ L +S S +V + +L ++T+ LF F + K+Y++ + R
Sbjct: 88 FQRLAIEQLRISRRSIELVSLPSNIELLGFRLPQNTSRLFEEFQRKFRKSYSS--DTAKR 145
Query: 63 LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP 122
+F NL K+QL+Q E G+ YG+ +FSDLS EF+ K + S + A+ P
Sbjct: 146 YALFKYNLLKMQLIQRLEKGTANYGITKFSDLSAEEFRHSLANMKRRKSKGSQMETAIFP 205
Query: 123 NI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
+LP +FDWR AVT VKDQ MCGS WAF+TTGNIEG + KT KL+SLSEQ+L+D
Sbjct: 206 TTIQSLPPSFDWRANGAVTEVKDQGMCGSCWAFATTGNIEGQWFRKTNKLISLSEQQLLD 265
Query: 181 CDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVS 239
CD +D+ C GG A+D I+ GGL EK YPY +++C L + ING +
Sbjct: 266 CDTKDEACNGGLPEWAYDEIVKM--GGLMSEKDYPYEAMKEQSCHLRRPNISAYINGSAT 323
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ DE +A +LV+NGP++V +NA LQFY+ G+SHP C L H+VL+VGYGV
Sbjct: 324 LPSDEAKLAAWLVQNGPISVGVNANFLQFYLGGISHPPHMLCS--EAGLDHAVLLVGYGV 381
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
T PYWI+KNSWG GWGEKGYFR+YRGDG+CGIN
Sbjct: 382 S----TFLRRPYWIVKNSWGGGWGEKGYFRMYRGDGTCGIN 418
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 198/315 (62%), Gaps = 14/315 (4%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K LF F+ +NK Y+ E RL IFS NL+K Q++Q+ + G+ YG+ ++SDL+
Sbjct: 160 LKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQGTAEYGVTKYSDLT 219
Query: 96 TAEFQAKYLGFKL--KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
EF++ YL L KP Y + A++PN++ P +DWR++ AVT VK+Q MCGS WAF
Sbjct: 220 EDEFRSLYLNPLLSSKPLYQMKK--AIVPNMSAPDQWDWRDHGAVTEVKNQGMCGSCWAF 277
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + K LVSLSEQEL+DCD D C GG SNA++ I KL GG+E E+
Sbjct: 278 SVIGNIEGQWFLKKGSLVSLSEQELVDCDGVDHACAGGLPSNAYEAI-EKL-GGIETEQE 335
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
Y Y G C + IN V + +DE ++A +L +NGP+++A+NA+A+QFY G+
Sbjct: 336 YSYEGHKNTCSFSTSKVSAYINSSVEIPKDENEIAAWLAQNGPISIALNAFAMQFYRKGI 395
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
SHP + C+ + H+VL+VGYG P+W IKNSWG WGE+GY+ LYRG
Sbjct: 396 SHPFRILCNPW--MIDHAVLLVGYG------ERNGTPFWAIKNSWGTDWGEQGYYYLYRG 447
Query: 334 DGSCGINDYVRSALV 348
G+CG+N SA+V
Sbjct: 448 TGACGMNTMCSSAVV 462
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 188/310 (60%), Gaps = 16/310 (5%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+++H K Y+ E R F N + I+ LQ E GS VYG +FSD++T EF+ L
Sbjct: 177 FIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQGSAVYGFTKFSDMTTMEFKQTML 236
Query: 105 GFKL-KPSY----ADRSVPAM-IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
++ +P Y AD + I LP +FDWR++ AVT VK+Q CGS WAFSTTGN
Sbjct: 237 PYQWEQPVYPMAEADFEKEGVTISEDDLPDSFDWRDHGAVTQVKNQGNCGSCWAFSTTGN 296
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + KKLVSLSEQEL+DCD D GC GG SNA+ IM GGLE E YPY G
Sbjct: 297 VEGAWYLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIMRM--GGLEPEDAYPYDG 354
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
+ C + +K V ING V + DE + K+LV GP+++ +NA LQFY GV HP +
Sbjct: 355 KGETCHIVRKDIAVYINGSVELPHDEVKIQKWLVTKGPISIGLNANTLQFYRHGVVHPFK 414
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
FC+ L+H VLIVGYG D K PYWI+KNSWG WGE GYFRLYRG CG
Sbjct: 415 IFCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPTWGESGYFRLYRGKNVCG 466
Query: 339 INDYVRSALV 348
+ + SALV
Sbjct: 467 VQEMATSALV 476
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 186/310 (60%), Gaps = 16/310 (5%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+++H K Y E R +F N + I+ LQ E G+ VYG +FSD++T EF+ L
Sbjct: 177 FVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKIML 236
Query: 105 GFKL-KPSYADRSVPAMIPNIT-----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
++ +P Y ++T LP +FDWRE AVT VK+Q CGS WAFSTTGN
Sbjct: 237 PYQWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTGN 296
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + KLVSLSEQEL+DCD D GC GG SNA+ I+ GGLE E YPY G
Sbjct: 297 VEGAWFIAKNKLVSLSEQELVDCDSMDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDG 354
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
+ C L +K V ING V + DE +M K+LV GP+++ +NA LQFY GV HP +
Sbjct: 355 RGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFK 414
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
FC+ L+H VLIVGYG D K PYWI+KNSWG WGE GYF+LYRG CG
Sbjct: 415 IFCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPNWGEAGYFKLYRGKNVCG 466
Query: 339 INDYVRSALV 348
+ + SALV
Sbjct: 467 VQEMATSALV 476
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 198/308 (64%), Gaps = 11/308 (3%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ ++N+TY++ + RL IF NL+ + LQ + G+ YG+ +FSDL+ EF+
Sbjct: 177 FKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGVTKFSDLTEEEFRT 236
Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YL L RS+ PA +P+ P ++DWRE+ AV+ VK+Q MCGS WAFS TGNIE
Sbjct: 237 LYLNPLLSQQKLQRSMKPAAMPHGPAPPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNIE 296
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + KT KLVSLSEQEL+DCD D C GG SNA++ I KL GG+E E Y Y G
Sbjct: 297 GQWFVKTGKLVSLSEQELVDCDTADQACGGGLPSNAYEAI-EKL-GGVETETDYSYTGKK 354
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
++C IN V +S+DE ++A +L ENGP++VA+NA+A+QFY GVSHP++ F
Sbjct: 355 QSCDFTTDKVTAYINSSVELSKDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIF 414
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C+ + H+VL+VGYG + K P+W IKNSWGE +GE+GY+ LYRG CGIN
Sbjct: 415 CNPW--MIDHAVLLVGYGERQGK------PFWAIKNSWGEDYGEQGYYYLYRGSRLCGIN 466
Query: 341 DYVRSALV 348
SA+V
Sbjct: 467 TMCSSAIV 474
>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
Length = 474
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 197/308 (63%), Gaps = 11/308 (3%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ ++N+TY++ E RL +F NL+ + LQ + G+ YG+ +FSDL+ EF+
Sbjct: 176 FKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVTKFSDLTEEEFRT 235
Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YL L +S+ PA +P P ++DWRE+ AV+ VK+Q MCGS WAFS TGNIE
Sbjct: 236 LYLNPLLSQQNLQQSMKPAAMPRGPAPPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNIE 295
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + AKT KLVSLSEQEL+DCD D C GG SNA++ I KL GGLE E Y Y G
Sbjct: 296 GQWFAKTGKLVSLSEQELVDCDTVDQACGGGLPSNAYEAI-EKL-GGLETETDYSYTGKK 353
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
++C IN V +S DE ++A +L ENGP++VA+NA+A+QFY GVSHP++ F
Sbjct: 354 QSCDFTTDKVIAYINSSVELSTDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIF 413
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C+ + H+VL+VGYG + K P+W IKNSWGE +GE+GY+ LYRG CGIN
Sbjct: 414 CNPW--MIDHAVLLVGYGERQGK------PFWAIKNSWGEDYGEQGYYYLYRGSRLCGIN 465
Query: 341 DYVRSALV 348
SA+V
Sbjct: 466 KMCSSAIV 473
>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 629
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 198/319 (62%), Gaps = 9/319 (2%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L V H LF+ F + + Y E RL IF NL+ I+ L E GS YG+ EF+
Sbjct: 316 LDKVDH--LFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFA 373
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
D+++ E++ + ++ PA++P P+ FDWR+ +AVT VK+Q CGS
Sbjct: 374 DMTSTEYKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSC 433
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGLE
Sbjct: 434 WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 491
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
E YPY + C N+ + V+++G+V + + +ET M ++L+ +GP+++ +NA A+QFY
Sbjct: 492 EAEYPYEAKKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFY 551
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVSHP + C +NL H VLIVGYGV HK +PYWI+KNSWG WGE+GY+R
Sbjct: 552 RGGVSHPWKALC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 609
Query: 330 LYRGDGSCGINDYVRSALV 348
+YRGD +CG+++ SA++
Sbjct: 610 VYRGDNTCGVSEMATSAVL 628
>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
Length = 308
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 190/312 (60%), Gaps = 22/312 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+ ++N+TY+ E R I+ NLR ++ Q E G+ +YG +FSDL+ AEF+ L
Sbjct: 10 FIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQAEFRKIML 69
Query: 105 GFKLKPSYADRSVPAMIPNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
+K + VP + N +P +FDWRE +AVT VK+Q CGS WAFS T
Sbjct: 70 PYK----WETPKVPNKMANFKEFGIAQNDIPESFDWREKNAVTEVKNQGSCGSCWAFSVT 125
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIEG +A KT KLVSLSEQEL+DCD D GC GG SNA+ I+ GGLE E YPY
Sbjct: 126 GNIEGAWAIKTSKLVSLSEQELVDCDIIDQGCNGGLPSNAYREIIRM--GGLEAESDYPY 183
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G + C L KK V IN + + DE MA +LV GP+++ +NA LQFY G++HP
Sbjct: 184 DGRGEKCHLMKKDIAVYINDSLQLPHDEEKMAAWLVAKGPISIGLNANPLQFYRHGIAHP 243
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ FC ++L H VLIVGYG + K PYWIIKNSWG WGE+GYFRL+RG
Sbjct: 244 WRVFCS--PKHLDHGVLIVGYGSETDK------PYWIIKNSWGTKWGEEGYFRLFRGKNV 295
Query: 337 CGINDYVRSALV 348
CGI + +A++
Sbjct: 296 CGIQEMATTAII 307
>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
Length = 627
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 198/319 (62%), Gaps = 9/319 (2%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L V H LF+ F + + Y E RL IF NL+ I+ L E GS YG+ EF+
Sbjct: 314 LDKVDH--LFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFA 371
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
D+++ E++ + ++ PA++P P+ FDWR+ +AVT VK+Q CGS
Sbjct: 372 DMTSTEYKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSC 431
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGLE
Sbjct: 432 WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 489
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
E YPY + C N+ + V+++G+V + + +ET M ++L+ +GP+++ +NA A+QFY
Sbjct: 490 EAEYPYEAKKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFY 549
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVSHP + C +NL H VLIVGYGV HK +PYWI+KNSWG WGE+GY+R
Sbjct: 550 RGGVSHPWKALC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 607
Query: 330 LYRGDGSCGINDYVRSALV 348
+YRGD +CG+++ SA++
Sbjct: 608 VYRGDNTCGVSEMATSAVL 626
>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 477
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 198/319 (62%), Gaps = 9/319 (2%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L V H LF+ F + + Y E RL IF NL+ I+ L E GS YG+ EF+
Sbjct: 164 LDKVDH--LFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFA 221
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
D+++ E++ + ++ PA++P P+ FDWR+ +AVT VK+Q CGS
Sbjct: 222 DMTSTEYKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSC 281
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGLE
Sbjct: 282 WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 339
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
E YPY + C N+ + V+++G+V + + +ET M ++L+ +GP+++ +NA A+QFY
Sbjct: 340 EAEYPYEAKKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFY 399
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVSHP + C +NL H VLIVGYGV HK +PYWI+KNSWG WGE+GY+R
Sbjct: 400 RGGVSHPWKALCS--KKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 457
Query: 330 LYRGDGSCGINDYVRSALV 348
+YRGD +CG+++ SA++
Sbjct: 458 VYRGDNTCGVSEMATSAVL 476
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 200/320 (62%), Gaps = 14/320 (4%)
Query: 35 HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
H +H LF F +H++ Y + +E+ R IF NL KI+ L E G+ YG+ F+D+
Sbjct: 853 HARH--LFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADM 910
Query: 95 STAEFQAKYLGFKLKPSYADRS-----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
++AE++ + G + P DR+ + N+ LP +FDWRE AV+ VK+Q CGS
Sbjct: 911 TSAEYRQR-TGLVI-PRDEDRNHVGNPKAEIDENMELPESFDWRELGAVSPVKNQGNCGS 968
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
WAFS GNIEG++ KTK L SEQEL+DCD D C+GG + +A+ I K+ GGLE
Sbjct: 969 CWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCDAVDSACQGGYMDDAYKAI-EKI-GGLE 1026
Query: 210 EEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
E YPY K C N V++ G V + ++ET MA+YLV NGP+++ +NA A+QF
Sbjct: 1027 LESEYPYLAKKQKTCHFNSTEVHVRVKGAVDLPKNETAMAQYLVANGPISIGLNANAMQF 1086
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y G+SHP + C +NL H VLIVGYGV +K +PYWI+KNSWG WGE+GY+
Sbjct: 1087 YRGGISHPWKPLC--SKKNLDHGVLIVGYGVKEYPMFNKTMPYWIVKNSWGPKWGEQGYY 1144
Query: 329 RLYRGDGSCGINDYVRSALV 348
R++RGD +CG+++ SA++
Sbjct: 1145 RIFRGDNTCGVSEMASSAVL 1164
>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
Length = 266
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 137/269 (50%), Positives = 179/269 (66%), Gaps = 7/269 (2%)
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
G+ VYG FSD S AE++A GF L+ S A R A IP I LP FDWR + VT
Sbjct: 2 GTAVYGDTPFSDWSAAEYKAHLAGFNPSLRQSNA-RLRQAAIPEIDLPDEFDWRNHSVVT 60
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ CGS WAFS TGN+EG+YA + L+SLSEQEL+DCD+ D GC GG NA+
Sbjct: 61 PVKDQGSCGSCWAFSVTGNVEGIYAVRNGDLLSLSEQELVDCDKLDSGCNGGLPENAYKA 120
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I GGLE E YPY G + C+ N T+V++ G V +S +ET+MA++L++NGP+++
Sbjct: 121 IHDI--GGLETESDYPYNGHENKCKFNSNITRVQVTGGVEISTNETEMAQWLIQNGPISI 178
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
INA A+Q+Y GVSHP + C G + H VLIVGYGV + +K +PYWI+KNSWG
Sbjct: 179 GINANAMQYYRGGVSHPWKVLCRPG--GIDHGVLIVGYGVSQYPKFNKTLPYWIVKNSWG 236
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+R++RGDG+CG+N SA +
Sbjct: 237 TRWGEQGYYRVFRGDGTCGLNQMCTSATL 265
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 185/310 (59%), Gaps = 16/310 (5%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+++H K Y+ E R F N + I+ LQ E G+ VYG +FSD++T EF+ L
Sbjct: 175 FIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQGTAVYGFTKFSDMTTMEFKQTML 234
Query: 105 GFKL-KPSYADRSVPAMIPNIT-----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
++ +P Y IT LP +FDWR+ AVT VK+Q CGS WAFSTTGN
Sbjct: 235 PYQWEQPVYPMDQADFEKEGITISEEDLPESFDWRDKGAVTQVKNQGNCGSCWAFSTTGN 294
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + KLVSLSEQEL+DCD D GC GG SNA+ I+ GGLE E YPY G
Sbjct: 295 VEGAWFLAKNKLVSLSEQELVDCDGVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDG 352
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
+ C L +K V ING + + DE +M K+LV GP+++ +NA LQFY GV HP +
Sbjct: 353 KGETCHLVRKDIAVYINGSIELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFK 412
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
FC+ L+H VLIVGYG D K PYWI+KNSWG WGE GYF+LYRG CG
Sbjct: 413 IFCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPTWGESGYFKLYRGKNVCG 464
Query: 339 INDYVRSALV 348
+ + SALV
Sbjct: 465 VQEMATSALV 474
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 194/308 (62%), Gaps = 11/308 (3%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ ++NK Y++ E RL IF NL+ + LQ + GS YG+ +FSDL+ EF++
Sbjct: 177 FKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFRS 236
Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YL L R + PA P ++DWR++ AV+ VK+Q MCGS WAFS TGNIE
Sbjct: 237 TYLNPLLSQWTLHRPMKPASPAKGPAPASWDWRDHGAVSSVKNQGMCGSCWAFSVTGNIE 296
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + K LVSLSEQEL+DCD D C GG SNA++ I KL GGLE E Y Y G
Sbjct: 297 GQWFLKNGTLVSLSEQELVDCDGLDQACNGGLPSNAYEAI-EKL-GGLETETDYSYIGKK 354
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
++C K IN V +S+DE ++A +L ENGP++VA+NA+A+QFY GVSHP++ F
Sbjct: 355 QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIF 414
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C+ + H+VL+VGYG K +P+W IKNSWGE +GE+GY+ LYRG +CGIN
Sbjct: 415 CNPW--MIDHAVLMVGYG------ERKGIPFWAIKNSWGEDYGEQGYYNLYRGSNACGIN 466
Query: 341 DYVRSALV 348
SA+V
Sbjct: 467 KMCSSAVV 474
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 204/338 (60%), Gaps = 28/338 (8%)
Query: 26 GDEKLH------------HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI 73
GD++LH V+ +LF FL +NK+YA E RL IF+ NL
Sbjct: 126 GDQRLHWTSGRQAPAPAAQEDSVQLISLFKDFLTTYNKSYANATETQRRLGIFARNLELA 185
Query: 74 QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIPNITLPRAFD 131
+ +Q+ + GS YG+ +FSDL+ EF+ YL L P A R PA P ++D
Sbjct: 186 RKVQELDRGSAEYGVTKFSDLTEEEFRTSYLNPLLSSLPGRALRPGPAT--RGPAPASWD 243
Query: 132 WREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGG 191
WR++ AVTGVK+Q CGS WAFS TGN+EG + + L++LSEQEL+DCD D C GG
Sbjct: 244 WRDHGAVTGVKNQGACGSCWAFSVTGNVEGQWFLRRGALLALSEQELVDCDTLDQACGGG 303
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYL 251
SNA+ T + KL GGLE EK Y Y G + C + +V IN V +SRDE ++A +L
Sbjct: 304 LPSNAY-TAIEKL-GGLETEKDYSYEGRKERCSFSPDKARVYINSSVDLSRDEEELATWL 361
Query: 252 VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKA-VP 310
ENGP+++A+NA+A+QFY GVSHP + C + H+VL+VGYG H++ +P
Sbjct: 362 AENGPVSIALNAFAMQFYRRGVSHPFRPLCS--PWFIDHAVLLVGYG-------HRSGIP 412
Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+W IKNSWG WGE+GY+ LYRG +CG+N SA+V
Sbjct: 413 FWAIKNSWGPDWGEEGYYYLYRGARACGVNAMASSAIV 450
>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
Length = 353
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 200/315 (63%), Gaps = 14/315 (4%)
Query: 38 HTALF-NY--FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
H +F NY F++++NK+Y + E R +F+ N+ + L Q ++ +G YG + SDL
Sbjct: 48 HDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARAMLFQKHDNATGRYGFTKLSDL 107
Query: 95 STAEFQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
+ E ++ Y K P + A IP + +LP++FDWR AVT VKDQ CG+ WAF
Sbjct: 108 TDQEVKSFY-AMKKWPQQLYPTKKANIPQLNSLPQSFDWRSKGAVTAVKDQKRCGACWAF 166
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+TTGNIEG + KL SLSEQEL+DCD+ D+GC+GG NA+ +IM++L GGLE EK
Sbjct: 167 ATTGNIEGQWYLNKGKLYSLSEQELVDCDKIDEGCKGGLPLNAYHSIMNRL-GGLETEKD 225
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
YPY + C+LNK V IN V VS +ETD+A +LV +GP+A+ IN+ + Y G+
Sbjct: 226 YPYVAKNGKCKLNKSEEVVYINSSVKVSTNETDLAAWLVAHGPVAIGINSVNMLHYKGGI 285
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+HP C+ + L H VLIVGYG + K+ PYWIIKNSWG WGEKGY+R+ RG
Sbjct: 286 AHPTNKDCNP--KLLDHGVLIVGYGEE------KSTPYWIIKNSWGTDWGEKGYYRVVRG 337
Query: 334 DGSCGINDYVRSALV 348
G+CG+N SA+V
Sbjct: 338 IGACGLNKSATSAIV 352
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 194/308 (62%), Gaps = 11/308 (3%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ ++NK Y++ E RL IF NL+ + LQ + GS YG+ +FSDL+ EF++
Sbjct: 177 FKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFRS 236
Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YL L R + PA P ++DWR++ AV+ VK+Q MCGS WAFS TGNIE
Sbjct: 237 TYLNPLLSQWTLHRPMKPASPAKGPAPASWDWRDHGAVSSVKNQGMCGSCWAFSVTGNIE 296
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + K LVSLSEQEL+DCD D C GG SNA++ I KL GGLE E Y Y G
Sbjct: 297 GQWFLKNGTLVSLSEQELVDCDGLDQACNGGLPSNAYEAI-EKL-GGLETETDYSYIGKK 354
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
++C K IN V +S+DE ++A +L ENGP++VA+NA+A+QFY GVSHP++ F
Sbjct: 355 QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIF 414
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C+ + H+VL+VGYG K +P+W IKNSWGE +GE+GY+ L+RG +CGIN
Sbjct: 415 CNPW--MIDHAVLMVGYG------ERKGIPFWAIKNSWGEDYGEQGYYYLHRGSNACGIN 466
Query: 341 DYVRSALV 348
SA+V
Sbjct: 467 KMCSSAVV 474
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 199/330 (60%), Gaps = 19/330 (5%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
SS +GD V+ +LF FL +NK+YA E RL IF+ NL LQ+
Sbjct: 255 SSLPRMGDS-------VELISLFKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQEL 307
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAV 138
+ GS YG+ +FSDL+ EF+ YL L S R++ PA P ++DWR++ A+
Sbjct: 308 DQGSAQYGVTKFSDLTEEEFRMFYLNPLLS-SLPGRALRPAPRARGPAPASWDWRDHGAL 366
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
T K+Q MCGS WAFS TGN+EG + + L++LSEQEL+DCD D C GG SNA+
Sbjct: 367 TAAKNQGMCGSCWAFSVTGNVEGQWFLRRGALLTLSEQELVDCDTLDQACGGGLPSNAYT 426
Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMA 258
I + GGLE EK Y Y G + C + + IN V +SRDE ++A +L ENGP++
Sbjct: 427 AIETL--GGLETEKDYSYEGRKERCSFSPDKARAYINSSVDLSRDEQEIAAWLAENGPVS 484
Query: 259 VAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
+A+NA+A+QFY GVSHP + C + H+VL+VGYG DR+ +P+W IKNSW
Sbjct: 485 IALNAFAMQFYRRGVSHPFRPLCS--PWFIDHAVLLVGYG-DRS-----GIPFWAIKNSW 536
Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
G WGE+GY+ LYRG +CG+N SA+V
Sbjct: 537 GPDWGEEGYYYLYRGARACGMNTMASSAIV 566
>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
Length = 274
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 136/279 (48%), Positives = 175/279 (62%), Gaps = 6/279 (2%)
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRA 129
+ K + +Q+ E G YG + F+DL+ EF+ YL ++ PA IP T P A
Sbjct: 1 MIKARRIQEKEQGDATYGASPFADLTAEEFRKNYLSPVWNVTHDPFLKPASIPIETPPDA 60
Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCE 189
FDWR++DAVT VK+Q CGS WAFS TGN+EG +A + KKL+SLSEQEL+DCD+ D GC
Sbjct: 61 FDWRDHDAVTPVKNQGSCGSCWAFSVTGNVEGQWAIQKKKLLSLSEQELVDCDKVDLGCN 120
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
GG A+ IM GGLE EK YPY G C K +V I G V++S +E DM
Sbjct: 121 GGLPLQAYKEIMRI--GGLETEKDYPYEGKGDKCVFEKAEVEVNITGAVNISSNEDDMKA 178
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
+L +NGP+++ +NA A+QFY+ GVSHP F C +L H VLI GYG+ + +
Sbjct: 179 WLWKNGPISIGLNANAMQFYMGGVSHPFSFLCS--PSSLDHGVLITGYGIKQGWMSDS-- 234
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
P+W IKNSWGE WGEKGY+ LYRG G CG+N SA V
Sbjct: 235 PFWAIKNSWGESWGEKGYYLLYRGAGVCGVNQMPTSATV 273
>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
Length = 317
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 140/287 (48%), Positives = 180/287 (62%), Gaps = 9/287 (3%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
R IF NL K QL Q E GS VYG+ +SDL+T EF +L + S ++P
Sbjct: 39 RFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSRTHLTAPWRASSKRNTIPPRR 98
Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
+P FDWRE AVT VK+Q MCGS WAFSTTGNIE + KT KL+SLSEQ+L+DC
Sbjct: 99 EVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQLVDC 158
Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
D DDGC GG SNA+++I+ GGL E YPY ++ C L IN V+++
Sbjct: 159 DSLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPYDAKNEKCHLKVGNVAAYINSSVNLT 216
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
+DE+++A +L + ++V +NA LQFY G+SHP FC L H+VL+VGYGV
Sbjct: 217 QDESELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFCS--KYLLDHAVLLVGYGV-- 272
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ K P+WI+KNSWG WGEKGYFR+YRGDG+CGIN SAL+
Sbjct: 273 ---SEKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTGATSALI 316
>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 419
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/289 (48%), Positives = 184/289 (63%), Gaps = 11/289 (3%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
R +IF N+ K QL Q E GS +YG+ +SDL+T EF +L + PS + ++
Sbjct: 139 RFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 198
Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
+ +P+ FDWRE AVT VK+Q MCGS WAFSTTGN+E + KT KL+SLSEQ+L+
Sbjct: 199 GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 258
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD DDGC GG SNA+++I+ GGL E YPY ++ C L V IN V+
Sbjct: 259 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 316
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+++DET++A +L N ++V +NA LQFY G+SHP FC L H+VL+VGYGV
Sbjct: 317 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFCS--KYLLDHAVLLVGYGV 374
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ K P+WI+KNSWG WGE GYFR+YRGDG+CGIN SAL+
Sbjct: 375 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGTCGINTVATSALI 418
>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 457
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/289 (48%), Positives = 184/289 (63%), Gaps = 11/289 (3%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
R +IF N+ K QL Q E GS +YG+ +SDL+T EF +L + PS + ++
Sbjct: 177 RFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 236
Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
+ +P+ FDWRE AVT VK+Q MCGS WAFSTTGN+E + KT KL+SLSEQ+L+
Sbjct: 237 GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 296
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD DDGC GG SNA+++I+ GGL E YPY ++ C L V IN V+
Sbjct: 297 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 354
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+++DET++A +L N ++V +NA LQFY G+SHP FC L H+VL+VGYGV
Sbjct: 355 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFC--SKYLLDHAVLLVGYGV 412
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ K P+WI+KNSWG WGE GYFR+YRGDG+CGIN SAL+
Sbjct: 413 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGTCGINTVATSALI 456
>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 456
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/289 (48%), Positives = 184/289 (63%), Gaps = 11/289 (3%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
R +IF N+ K QL Q E GS +YG+ +SDL+T EF +L + PS + ++
Sbjct: 176 RFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 235
Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
+ +P+ FDWRE AVT VK+Q MCGS WAFSTTGN+E + KT KL+SLSEQ+L+
Sbjct: 236 GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 295
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD DDGC GG SNA+++I+ GGL E YPY ++ C L V IN V+
Sbjct: 296 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 353
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+++DET++A +L N ++V +NA LQFY G+SHP FC L H+VL+VGYGV
Sbjct: 354 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFCS--KYLLDHAVLLVGYGV 411
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ K P+WI+KNSWG WGE GYFR+YRGDG+CGIN SAL+
Sbjct: 412 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGTCGINTVATSALI 455
>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
Length = 454
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/287 (48%), Positives = 179/287 (62%), Gaps = 9/287 (3%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
R IF NL K QL Q E GS VYG+ +SDL+T EF +L + S ++
Sbjct: 176 RFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSRTHLTAPWRASSKRNTISPRR 235
Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
+P FDWRE AVT VK+Q MCGS WAFSTTGNIE + KT KL+SLSEQ+L+DC
Sbjct: 236 EVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQLVDC 295
Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
D DDGC GG SNA+++I+ GGL E YPY ++ C L IN V+++
Sbjct: 296 DSLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPYDAKNEKCHLKVANVAAYINSSVNLT 353
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
+DE+++A +L + ++V +NA LQFY G+SHP FC L H+VL+VGYGV
Sbjct: 354 QDESELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFC--SKYLLDHAVLLVGYGV-- 409
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ K P+WI+KNSWG WGEKGYFR+YRGDG+CGIN SAL+
Sbjct: 410 ---SEKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTDATSALI 453
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 190/308 (61%), Gaps = 11/308 (3%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ ++NK Y++ E RL IF NL+ + LQ + GS YG+ +FSDL+ EF++
Sbjct: 177 FKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFRS 236
Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YL L + + PA P ++DWR++ AV+ VK+Q MCGS WAFS GNIE
Sbjct: 237 TYLNPLLSQWTLHQPMKPATPAKGPSPDSWDWRDHGAVSPVKNQGMCGSCWAFSVIGNIE 296
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + K L+SLSEQEL+DCD D C GG SNA++ I KL GGLE E Y Y G
Sbjct: 297 GQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKL-GGLETESDYSYTGHK 354
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ C IN V + +DE ++A +L ENGP++VA+NA+A+QFY G+SHP++ F
Sbjct: 355 QRCDFTTGKVAAYINSSVELPKDEKEIAAWLAENGPVSVALNAFAMQFYRKGISHPLKIF 414
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C+ + H+VL+VGYG K +P+W IKNSWGE +GE+GY+ LYRG +CGIN
Sbjct: 415 CNPW--MIDHAVLLVGYG------ERKGIPFWAIKNSWGEDYGEQGYYYLYRGSNACGIN 466
Query: 341 DYVRSALV 348
SA+V
Sbjct: 467 KMCSSAVV 474
>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
Length = 454
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 138/287 (48%), Positives = 179/287 (62%), Gaps = 9/287 (3%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
R IF NL K QL Q E GS VYG+ +SDL+T EF +L + S ++
Sbjct: 176 RFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSRTHLTAPWRASSKRNTISPRR 235
Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
+P FDWR+ AVT VK+Q MCGS WAFSTTGNIE + KT KL+SLSEQ+L+DC
Sbjct: 236 EVGDIPNNFDWRKKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQLVDC 295
Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
D DDGC GG SNA+++I+ GGL E YPY ++ C L IN V+++
Sbjct: 296 DNLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPYDAKNEKCHLKVANVAAYINSSVNLT 353
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
+DE+++A +L + ++V +NA LQFY G+SHP FC L H+VL+VGYGV
Sbjct: 354 QDESELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFC--SKYLLDHAVLLVGYGV-- 409
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ K P+WI+KNSWG WGEKGYFR+YRGDG+CGIN SAL+
Sbjct: 410 ---SEKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTDATSALI 453
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 197/318 (61%), Gaps = 16/318 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F FL +H K Y+ E +SR F NL++I+ E GS YG+ EF+DLS EF+
Sbjct: 50 FENFLLEHPKMYSEQ-ESHSRFQTFWENLKRIKFHNHIEQGSAKYGVTEFADLSDFEFRR 108
Query: 102 KYLGFKLKPSYADR---------SVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
YLG K + +R S + T+ FDW E AVT VK+Q MCGS WA
Sbjct: 109 HYLGLKPELKIPNRKKYERKSRNSSKKLKFAKTVDETFDWVEKGAVTEVKNQGMCGSCWA 168
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FSTTGNIEG + T LVSLSEQEL+DCDQ+D GC GG + AF+ ++ GGLE E+
Sbjct: 169 FSTTGNIEGAWFKATGDLVSLSEQELVDCDQKDSGCNGGLMDQAFEEVIRI--GGLETEQ 226
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
YPY G + C K ++V+I+ ++ + DE ++A+ L E+GP+++AINA+ +QFY G
Sbjct: 227 QYPYDGVQETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGG 286
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVD-RTKFTHK-AVPYWIIKNSWGEGWGEKGYFRL 330
+SHP+ F C + L H VL+VGYGV+ T + H+ PYW IKNSWG WGE GY+R+
Sbjct: 287 ISHPLSFLC--SQDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWGEDGYYRV 344
Query: 331 YRGDGSCGINDYVRSALV 348
RG G CG+N V +++V
Sbjct: 345 ARGKGVCGVNKMVSTSIV 362
>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
gi|1094710|prf||2106314A cathepsin L
Length = 319
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 138/289 (47%), Positives = 183/289 (63%), Gaps = 11/289 (3%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
R +IF N+ K QL Q GS +YG+ +SDL+T EF +L + PS + ++
Sbjct: 39 RFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 98
Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
+ +P+ FDWRE AVT VK+Q MCGS WAFSTTGN+E + KT KL+SLSEQ+L+
Sbjct: 99 GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 158
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD DDGC GG SNA+++I+ GGL E YPY ++ C L V IN V+
Sbjct: 159 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 216
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+++DET++A +L N ++V +NA LQFY G+SHP FC L H+VL+VGYGV
Sbjct: 217 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFCS--KYLLDHAVLLVGYGV 274
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ K P+WI+KNSWG WGE GYFR+YRGDGSCGIN SA++
Sbjct: 275 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 195/314 (62%), Gaps = 11/314 (3%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
V+ +F F+ +N+TY++ E RL IF N++ Q LQ E GS YG+ +FSDL+
Sbjct: 169 VELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLT 228
Query: 96 TAEFQAKYLGFKLKP-SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
EF+ YL L S PA+ + P +DWR++ AV+ VK+Q MCGS WAFS
Sbjct: 229 EDEFRMMYLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFS 288
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TGNIEG + KT +L+SLSEQEL+DCD+ D C GG SNA++ I + GGLE E Y
Sbjct: 289 VTGNIEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEAIENL--GGLETETDY 346
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
Y G ++C + IN V + +DE ++A +L ENGP++ A+NA+A+QFY GVS
Sbjct: 347 SYTGHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVS 406
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
HP++ FC+ + H+VL+VG+G VP+W IKNSWGE +GE+GY+ LYRG
Sbjct: 407 HPLKIFCNPW--MIDHAVLLVGFG------QRNGVPFWAIKNSWGEDYGEQGYYYLYRGS 458
Query: 335 GSCGINDYVRSALV 348
G CGI+ SA+V
Sbjct: 459 GLCGIHKMCSSAIV 472
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 195/314 (62%), Gaps = 11/314 (3%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
V+ +F F+ +N+TY++ E RL IF N++ Q LQ E GS YG+ +FSDL+
Sbjct: 169 VELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLT 228
Query: 96 TAEFQAKYLGFKLKP-SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
EF+ YL L S PA+ + P +DWR++ AV+ VK+Q MCGS WAFS
Sbjct: 229 EDEFRMMYLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFS 288
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TGNIEG + KT +L+SLSEQEL+DCD+ D C GG SNA++ I + GGLE E Y
Sbjct: 289 VTGNIEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEAIENL--GGLETETDY 346
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
Y G ++C + IN V + +DE ++A +L ENGP++ A+NA+A+QFY GVS
Sbjct: 347 SYTGHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVS 406
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
HP++ FC+ + H+VL+VG+G VP+W IKNSWGE +GE+GY+ LYRG
Sbjct: 407 HPLKIFCNPW--MIDHAVLLVGFG------QRNGVPFWAIKNSWGEDYGEQGYYYLYRGS 458
Query: 335 GSCGINDYVRSALV 348
G CGI+ SA+V
Sbjct: 459 GLCGIHKMCSSAIV 472
>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
Length = 368
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 148/335 (44%), Positives = 202/335 (60%), Gaps = 26/335 (7%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
VVGDE HH+ + +H F F ++ KTYA+ E++ R +F NLR+ Q + S
Sbjct: 39 VVGDED-HHMLNAEHH--FTLFKKRFGKTYASDEEHHYRFSVFKANLRRAMRHQKLD-PS 94
Query: 84 GVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
V+G+ +FSD++ EF K+LG + PS A+++ ++P LP FDWRE+ AVT
Sbjct: 95 AVHGVTQFSDMTPDEFSQKFLGVNRRLRFPSDANKA--PILPTEDLPSDFDWREHGAVTP 152
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
VK+Q CGS W+FSTTG +EG T KLVSLSEQ+L+DCD E D GC GG
Sbjct: 153 VKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCSGG 212
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKY 250
+++AF+ + GGL E+ YPY G DKA C+ + K+ + VS DE +A
Sbjct: 213 LMNSAFEYTLK--AGGLMREEDYPYTGTDKATCKFDNTKVAAKVANFSVVSLDEEQIAAN 270
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVP 310
LV+NGP+AVAINA +Q YV GVS P + C ++ L H VL+VGYG + K P
Sbjct: 271 LVKNGPLAVAINAVFMQTYVGGVSCP--YIC---SKQLDHGVLLVGYGTGFSPIRMKEKP 325
Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
YWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 326 YWIIKNSWGEKWGESGYYKIRRGRNVCGVDSMVST 360
>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 196/318 (61%), Gaps = 16/318 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F FL +H K Y+ E +SR F NL++I+ E GS YG+ EF+DLS EF+
Sbjct: 50 FENFLLEHPKMYSEQ-ESHSRFQTFWENLKRIKFHNHIEQGSAKYGVTEFTDLSDFEFRR 108
Query: 102 KYLGFKLKPSYADR---------SVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
YLG K + +R S + T FDW E AVT VK+Q MCGS WA
Sbjct: 109 HYLGLKPELKNLNRKKYERKSRNSSKKLKFAKTADETFDWVEKGAVTEVKNQGMCGSCWA 168
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FSTTGNIEG + T L+SLSEQEL+DCDQ+D GC GG + AF+ ++ GGLE E+
Sbjct: 169 FSTTGNIEGAWFKATGDLISLSEQELVDCDQKDSGCNGGLMDQAFEEVIRI--GGLETEQ 226
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
YPY G + C K ++V+I+ ++ + DE ++A+ L E+GP+++AINA+ +QFY G
Sbjct: 227 QYPYDGVQETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGG 286
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVD-RTKFTHK-AVPYWIIKNSWGEGWGEKGYFRL 330
VSHP+ F C + L H VL+VGYGV+ T + H+ PYW IKNSWG WGE GY+R+
Sbjct: 287 VSHPLSFLC--SPDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWGEDGYYRV 344
Query: 331 YRGDGSCGINDYVRSALV 348
RG G CG+N V +++V
Sbjct: 345 ARGKGVCGVNKMVSTSIV 362
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 145/349 (41%), Positives = 195/349 (55%), Gaps = 31/349 (8%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
+SC F G TV V L+ F + K+YA +
Sbjct: 6 VSCLTFLVGCVFAVSTVQVPD---------------SARELYEQFKRDYGKSYAN-DDDE 49
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
R IF NL + Q Q E G+ YG+ +FSDL+ EF AK+L + + D+
Sbjct: 50 KRFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTPEEFAAKFLSSR----FDDQVERVQ 105
Query: 121 IPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
+ ++ P + DWRE AV V+DQ CGS WAFS GN+EG + KT +LVSLS+Q+L+
Sbjct: 106 LNDLKAAPESVDWRELGAVAPVEDQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLV 165
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD +D GC+GG + I+ GGLE ++ YPY G ++ C+L++ KIN +
Sbjct: 166 DCDVQDSGCDGGYPPTTYGEIIRM--GGLEAQRDYPYVGREQPCKLDESKLLAKINSSIV 223
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ +E A Y+ E+GPM+ INA LQFY +G+SHP + C + L+H VL VGYG
Sbjct: 224 LEANEKKQAAYIAEHGPMSSGINAVTLQFYQSGISHPSKSQCQ--PDWLNHGVLSVGYG- 280
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T VPYWIIKNSWG GWGEKGYFRLYRGDG+CGI V SA++
Sbjct: 281 -----TEDGVPYWIIKNSWGTGWGEKGYFRLYRGDGTCGIEKVVSSAII 324
>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
Length = 458
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 189/309 (61%), Gaps = 10/309 (3%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
++F F+ +N+TY T E R+ +F N+ + Q +Q + G+ YG+ +FSDL+ EF
Sbjct: 159 SIFKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKFSDLTEEEF 218
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
+ YL LK + R AM + P +DWR AVT VKDQ MCGS WAFS TGN+
Sbjct: 219 RTIYLNPLLKELRSKRMPLAMSVSGPAPPEWDWRNKGAVTKVKDQGMCGSCWAFSVTGNV 278
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
EG + K L+SLSEQEL+DCD+ D C GG SNA+ I K GGLE E Y Y G
Sbjct: 279 EGQWFLKRGDLLSLSEQELVDCDKLDKACLGGLPSNAYSAI--KTLGGLETEDDYGYNGH 336
Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
+ C + + +V IN V +S++E +A +L +NGP+++AINA+ +QFY G+SHP++
Sbjct: 337 LQTCNFSAEKAKVYINDSVELSQNEQKLAAWLAKNGPISIAINAFGMQFYRHGISHPLRP 396
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
C + H+VL+VGYG +P+W IKNSWG WGE+GY+ L+RG G+CG+
Sbjct: 397 LCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWGTDWGEEGYYYLHRGSGACGV 448
Query: 340 NDYVRSALV 348
N SA+V
Sbjct: 449 NIMASSAVV 457
>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
Length = 366
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 185/307 (60%), Gaps = 7/307 (2%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ + NK Y T + +IF N+ + LQ+ E G+ +YG F+D++ EF+
Sbjct: 66 FKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPTIFADMTPEEFRK 125
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+L F + + A IP + DWR+++AVT VKDQ CGS WAF T NIEG
Sbjct: 126 THLNFNPNNVKKPKRM-ANIPKSNISERMDWRKFNAVTSVKDQGNCGSCWAFCTVANIEG 184
Query: 162 VYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
+A KT +L+SLSEQ+L+DCD+ DDGCEGG NA+ I+ GGLE+E+ Y Y
Sbjct: 185 AWAVKTAQLISLSEQQLVDCDRLDDGCEGGLPVNAYLEIIRL--GGLEKEEDYKYTARSG 242
Query: 222 ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFC 281
C+ N + V IN V + DE +A+Y+ ENGP+AV +NA A+ FY +G++HP + C
Sbjct: 243 KCKFNHTKSAVYINDTVVLPEDEDAIARYVSENGPVAVGLNADAMMFYRSGIAHPSRLMC 302
Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
+ ++H V IVGY V + F + PYWIIKNSWG WGEKGY+ LYRG G CGI+
Sbjct: 303 SP--DGINHGVTIVGYDVKESLFW--STPYWIIKNSWGPNWGEKGYYYLYRGKGVCGIDQ 358
Query: 342 YVRSALV 348
S ++
Sbjct: 359 MASSVVI 365
>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
Length = 459
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 138/332 (41%), Positives = 192/332 (57%), Gaps = 10/332 (3%)
Query: 17 VSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
S S F ++ + L +K +LF +F+ +N+TY T E R+ IF N+ + Q +
Sbjct: 137 TSSSFFPLLNKDPLPQNFSMKMVSLFKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEI 196
Query: 77 QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
Q + G+ YG+ +FSDL+ EF+ YL LK + A + P +DWR
Sbjct: 197 QALDRGTAQYGVTKFSDLTEEEFRTFYLNPLLKEGLGKKMRLAKPVDDPAPPEWDWRNKG 256
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
AVT VK+Q MCGS WAFS TGN+EG + K L+SLSEQEL+DCD D C GG SNA
Sbjct: 257 AVTKVKNQGMCGSCWAFSVTGNVEGQWFLKQGDLLSLSEQELVDCDTLDKACMGGLPSNA 316
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
+ I K GGLE E Y Y G + C + +V IN V +S+DE +A +L + GP
Sbjct: 317 YSAI--KTLGGLETEDDYSYHGHLQTCSFTAEKVKVYINDSVELSKDEQKLAAWLAKKGP 374
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
+++AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKN
Sbjct: 375 ISIAINAFGMQFYRRGISRPLRLLCSPW--FIDHAVLLVGYG------NRSDVPFWAIKN 426
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWG WGE+GY+ L+RG +CG+N SA+V
Sbjct: 427 SWGTDWGEEGYYYLHRGSRACGVNVMASSAVV 458
>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
Length = 410
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/330 (41%), Positives = 195/330 (59%), Gaps = 11/330 (3%)
Query: 20 SSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
S + + D + L +++ +LF YF+ +N+TY T E R+ +F N+ + Q +Q
Sbjct: 90 SPLLPLSDRDPLPQDFYLRMASLFKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQA 149
Query: 79 TEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
+ G+ YG+ +FSDL+ EF+ YL LK + P +DWR+ AV
Sbjct: 150 LDRGTAQYGVTKFSDLTEEEFRTMYLNPLLKEELGKKMRLVKFVGDPAPPEWDWRKKGAV 209
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
T VK+Q MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 210 TKVKNQGMCGSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCDKVDKACMGGLPSNAYS 269
Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMA 258
I K GGLE E Y Y G + C + + +V IN V +S +E ++A +L +NGP++
Sbjct: 270 AI--KTLGGLETEDDYSYSGHLQTCSFSAQKAKVYINDSVELSHNEQELAAWLAKNGPIS 327
Query: 259 VAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
+AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSW
Sbjct: 328 IAINAFGMQFYRHGISRPLRPLCS--RWFIDHAVLLVGYG------NRSDVPFWAIKNSW 379
Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
G WGE+GY+ L+RG G+CG+N SA+V
Sbjct: 380 GTDWGEEGYYYLHRGSGACGVNVMASSAVV 409
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 190/318 (59%), Gaps = 18/318 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F++ ++K Y+T EY RL IF+ N+ K Q + + ++G+ +FSDLS EF+
Sbjct: 55 FKLFMKDYSKKYSTTEEYLLRLGIFAKNMVKAAEHQALDP-TAIHGVTQFSDLSEEEFER 113
Query: 102 KYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
Y GFK S A V + P FDWRE AVTG+K Q CGS WAF+TTG+I
Sbjct: 114 FYTGFKGGFPSSNAAGGVAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSI 173
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE--------DDGCEGGSISNAFDTIMSKLGGGLEEE 211
EG T KLVSLSEQ+L+DCD + D+GC GG ++ A+D +M GGLEEE
Sbjct: 174 EGANFLATGKLVSLSEQQLVDCDNKCDITKTSCDNGCNGGLMTTAYDYLME--AGGLEEE 231
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
+YPY G C+ + V+++ + ++ DE +A YLV +GP+A+A+NA +Q YV
Sbjct: 232 TSYPYTGAQGECKFDPNKVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAVFMQTYVG 291
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH-KAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P+ C L+H VL+VGY + + PYW IKNSWGE WGEKGY++L
Sbjct: 292 GVSCPL--ICS--KRRLNHGVLLVGYNAEGFSILRLRKKPYWTIKNSWGEQWGEKGYYKL 347
Query: 331 YRGDGSCGINDYVRSALV 348
RG G CG+N V +A+V
Sbjct: 348 CRGHGMCGMNTMVSAAMV 365
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 193/324 (59%), Gaps = 20/324 (6%)
Query: 32 HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
HL + +H F F + K YAT E+ R +F NLR+ +L + S V+G+ +F
Sbjct: 45 HLLNAEHH--FASFKAKFGKKYATKEEHDRRFGVFKSNLRRARLHAKLDP-SAVHGVTKF 101
Query: 92 SDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
SDL+ AEF+ ++LGFK A+ ++P LP+ FDWR+ AVT VKDQ CGS W
Sbjct: 102 SDLTPAEFRRQFLGFKPLRLPANAQKAPILPTKDLPKDFDWRDKGAVTNVKDQGACGSCW 161
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMS 202
+FSTTG +EG + T +LVSLSEQ+L+DCD D GC GG ++NAF+ I+
Sbjct: 162 SFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQ 221
Query: 203 KLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
GG+++EK YPY G D C+ +K ++ Y VS DE +A LV+NGP+AV IN
Sbjct: 222 S--GGVQKEKDYPYTGRDGTCKFDKTKVAATVSNYSVVSLDEDQIAANLVKNGPLAVGIN 279
Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEG 321
A +Q Y+ GVS P + C ++L H VLIVGYG K PYWIIKNSWGE
Sbjct: 280 AVFMQTYIGGVSCP--YIC---GKHLDHGVLIVGYGEGAYAPIRFKNKPYWIIKNSWGES 334
Query: 322 WGEKGYFRLYRGDGSCGINDYVRS 345
WGE GY+++ RG CG++ V +
Sbjct: 335 WGENGYYKICRGRNVCGVDSMVST 358
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 149/348 (42%), Positives = 193/348 (55%), Gaps = 29/348 (8%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
+SC F G A TV V L+ F + K YA +
Sbjct: 6 VSCLAFLVGCAFAVSTVPVPD---------------NARELYEQFKRDYGKVYAN-DDDQ 49
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
R IF NL + Q LQ + G+ YG+ +FSDL+ EF AKYL + +R P
Sbjct: 50 KRFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDLTPEEFAAKYLSRPMNDQ-VERVRPTG 108
Query: 121 IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
+ P DWRE+ AV V++Q CGS WAFS GN+EG + KT +LVSLS+Q+L+D
Sbjct: 109 LK--AAPERMDWREWGAVGPVENQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLVD 166
Query: 181 CDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV 240
CD D GC GG +NA+ IM GGLE + YPY G + C LNK+ KI+ + +
Sbjct: 167 CDVMDYGCGGGWPTNAYMEIMRM--GGLELQSDYPYVGVQQQCYLNKEKLLAKIDDLIVL 224
Query: 241 SRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
E + A YL E+GP++ A+NA LQFY +G+SHP C +L+H+VL VGY
Sbjct: 225 GAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECSPA--SLNHAVLTVGYD-- 280
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T VPYWIIKNSWG GWGE GYFRLYRGDG+CGIN + SA++
Sbjct: 281 ----TENGVPYWIIKNSWGTGWGENGYFRLYRGDGTCGINRMITSAII 324
>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
Length = 364
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 193/331 (58%), Gaps = 24/331 (7%)
Query: 25 VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
V D L+ HH F+ F + KTYAT E+ R +F NLR+ +L + S
Sbjct: 39 VEDHLLNAEHH------FSNFKAKFGKTYATKEEHDHRFGVFKSNLRRARLHAQLDP-SA 91
Query: 85 VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
V+G+ +FSDL+ AEFQ ++LG K A+ ++P LP+ FDWR+ AVT VKDQ
Sbjct: 92 VHGVTKFSDLTAAEFQRQFLGLKPLGLPANAQKAPILPTNNLPKDFDWRDKGAVTNVKDQ 151
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISN 195
CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD D GC GG ++N
Sbjct: 152 GACGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNN 211
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
AF+ I+ GG++ E+ YPY G D +C+ +K + Y +S DE +A LV+NG
Sbjct: 212 AFEYILG--AGGVQREEDYPYAGRDSSCKFDKSKIAASVANYSVISLDEDQIAANLVKNG 269
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWII 314
P+AV INA +Q Y+ GVS P + C + L H V IVGYG K PYWII
Sbjct: 270 PLAVGINAVYMQTYIGGVSCP--YIC---AKRLDHGVQIVGYGESGYAPIRFKEKPYWII 324
Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
KNSWGE WGE GY+++ RG +CG++ V +
Sbjct: 325 KNSWGESWGENGYYKICRGQNACGVDSMVST 355
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/333 (43%), Positives = 200/333 (60%), Gaps = 28/333 (8%)
Query: 25 VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
V D L+ HH F+ F + KTYAT E+ R +F N+R+ +L + S
Sbjct: 40 VEDHLLNAEHH------FSTFKAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLD-PSA 92
Query: 85 VYGLNEFSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
V+G+ +FSDL+ AEF K+LG K L+ P++A ++ ++P LP+ FDWR+ AVT VK
Sbjct: 93 VHGVTKFSDLTPAEFHRKFLGLKPLRLPAHAQKA--PILPTNNLPKDFDWRDKGAVTNVK 150
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
DQ CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD D GC GG +
Sbjct: 151 DQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLM 210
Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
+NAF+ ++ GG++ EK YPY G D C+ +K ++ Y +S DE +A LV+
Sbjct: 211 NNAFEYLIGS--GGVQREKDYPYTGRDGTCKFDKSKIAASVSNYSVISLDEEQIAANLVK 268
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYW 312
NGP+AVAINA +Q YV GVS P + C ++L H VL+VGYG K PYW
Sbjct: 269 NGPLAVAINAVYMQTYVGGVSCP--YIC---GKHLDHGVLLVGYGEGAYAPIRFKEKPYW 323
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
IIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 324 IIKNSWGENWGENGYYKICRGRNVCGVDSMVST 356
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 146/331 (44%), Positives = 200/331 (60%), Gaps = 28/331 (8%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D L+ HH F F + KTYAT E+ R +F NLR+ +L + S V+
Sbjct: 47 DNLLNAEHH------FASFKAKFAKTYATKEEHDHRFGVFKSNLRRARLHAKLD-PSAVH 99
Query: 87 GLNEFSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
G+ +FSDL+ AEF+ ++LG K L+ P++A ++ ++P LP+ FDWR+ AVT VKDQ
Sbjct: 100 GVTKFSDLTPAEFRRQFLGLKPLRFPAHAQKA--PILPTKDLPKDFDWRDKGAVTNVKDQ 157
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISN 195
CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD D GC GG ++N
Sbjct: 158 GACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNN 217
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
AF+ I+ GG+++EK YPY G D C+ +K ++ Y VS DE +A LV+NG
Sbjct: 218 AFEYILQS--GGVQKEKDYPYTGRDGTCKFDKTKVAATVSNYSVVSLDEEQIAANLVKNG 275
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWII 314
P+AVAINA +Q YV GVS P + C ++L H VL+VGYG K PYWII
Sbjct: 276 PLAVAINAVFMQTYVGGVSCP--YIC---GKHLDHGVLLVGYGEGAYAPIRFKNKPYWII 330
Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
KNSWGE WGE GY+++ RG CG++ V +
Sbjct: 331 KNSWGESWGENGYYKICRGRNVCGVDSMVST 361
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 143/334 (42%), Positives = 194/334 (58%), Gaps = 21/334 (6%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VV D + HHL + +H F+ F + KTYAT E+ R IF NL + + Q +
Sbjct: 34 QVVPDAEDHHLLNAEHH--FSAFKTKFGKTYATQEEHDHRFRIFKNNLLRAKSHQKLD-P 90
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
S V+G+ FSDL+ AEF+ ++LG K +D ++P LP FDWRE+ AVTGVK
Sbjct: 91 SAVHGVTRFSDLTPAEFRRQFLGLKPLRLPSDAQKAPILPTNDLPTDFDWREHGAVTGVK 150
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
+Q CGS W+FS G +EG + T +LVSLSEQ+L+DCD E D GC GG +
Sbjct: 151 NQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLM 210
Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLV 252
+ AF+ + GGL EK YPY G D+ C+ +K + + VS DE +A LV
Sbjct: 211 TTAFEYTLQ--AGGLMREKDYPYTGRDRGPCKFDKSKVAASVANFSVVSLDEEQIAANLV 268
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPY 311
+NGP+AV INA +Q Y+ GVS P + C ++L H VL+VGYG K PY
Sbjct: 269 QNGPLAVGINAVFMQTYIGGVSCP--YIC---GKHLDHGVLLVGYGSGAYAPIRFKEKPY 323
Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
WIIKNSWGE WGE+GY+++ RG CG++ V +
Sbjct: 324 WIIKNSWGESWGEEGYYKICRGRNVCGVDSMVST 357
>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 374
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 201/344 (58%), Gaps = 32/344 (9%)
Query: 19 VSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
++ + VGD +L ++ F F+E + ++Y+T EY RL IFS N+ L+
Sbjct: 36 IARKLKVGDNEL-----LRTEKKFKVFMENYGRSYSTREEYLRRLGIFSQNM-----LRA 85
Query: 79 TEH----GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWRE 134
EH + V+G+ +FSDL+ EF+ Y G + + P + LP FDWRE
Sbjct: 86 AEHQALDPTAVHGVTQFSDLTEVEFEKLYTGXPSTNTAGGVAPPLEVEG--LPENFDWRE 143
Query: 135 YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------D 185
AVT VK Q CGS WAFSTTG+IEG T KLVSLSEQ+L+DCD + D
Sbjct: 144 KGAVTEVKIQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCEITEKTSCD 203
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
+GC GG ++NA++ ++ GGLEEE +YPY G+ C+ + + V+I + ++ DE
Sbjct: 204 NGCNGGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDPEKITVRITNFTNIPVDEN 261
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A YLV+NGP+A+ +NA +Q Y+ GVS P+ C + L+H VL+VGYG
Sbjct: 262 QIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKRLNHGVLLVGYGAKGFSIL 317
Query: 306 HKA-VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG+ WGE GY++L RG G CGIN V +A+V
Sbjct: 318 RLGNKPYWIIKNSWGKKWGEDGYYKLCRGHGMCGINTMVSAAMV 361
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 202/343 (58%), Gaps = 25/343 (7%)
Query: 19 VSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
++ + +GD +L ++ F F+E + ++Y+T EY RL IF+ N+ + Q
Sbjct: 36 IARKLKLGDNEL-----LRTEKKFKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAEHQA 90
Query: 79 TEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT---LPRAFDWREY 135
+ + V+G+ +FSDL+ EF+ Y G ++ + + P + LP FDWRE
Sbjct: 91 LDP-TAVHGVTQFSDLTEDEFEKLYTGVNGGFPSSNNAAGGIAPPLEVDGLPENFDWREK 149
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DD 186
AVT VK Q CGS WAFSTTG+IEG T KLVSLSEQ+L+DCD + D+
Sbjct: 150 GAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDN 209
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
GC GG ++NA++ ++ GGLEEE +YPY G+ C+ + + VKI + ++ DE
Sbjct: 210 GCNGGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDPEKIAVKITNFTNIPADENQ 267
Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
+A YLV+NGP+A+ +NA +Q Y+ GVS P+ C + L+H VL+VGYG
Sbjct: 268 IAAYLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKRLNHGVLLVGYGAKGFSILR 323
Query: 307 KA-VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWGE WGE GY++L RG G CGIN V +A+V
Sbjct: 324 LGNKPYWIIKNSWGEKWGEDGYYKLCRGHGMCGINTMVSAAMV 366
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 142/325 (43%), Positives = 189/325 (58%), Gaps = 28/325 (8%)
Query: 37 KHTALFNYFLE---QHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
K L+N FL+ + + Y+++ E R + NL ++ LQ E G+ +YG+ +FSD
Sbjct: 162 KTEMLWNSFLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSD 221
Query: 94 LSTAEFQAKYLGFKLKPS-YADRSVPAMIP------NIT---LPRAFDWREYDAVTGVKD 143
+S EFQ L PS + DR V + N+T LP FDWR VT VK+
Sbjct: 222 MSPEEFQKTML-----PSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPVKN 276
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSK 203
Q CGS WAFS TGNIEG++A KT KL+SLSEQELIDCD+ D GC GG NAF I
Sbjct: 277 QGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLPINAFREIQRM 336
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
GGLE E YPY+ + C L + A V I+ V + R+ET M ++V+ GP++V I+A
Sbjct: 337 --GGLEPEDQYPYKARNGTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDA 394
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
L +Y +G+ HP + C + H VLI GYGV+ +PYW IKNSWG+ WG
Sbjct: 395 KLLAYYKSGILHPSRSRCPPS--GIDHGVLITGYGVEN------GLPYWTIKNSWGDQWG 446
Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
E GYFRL G CG++D V SA++
Sbjct: 447 EDGYFRLMLGKDVCGVSDLVSSAII 471
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 142/325 (43%), Positives = 189/325 (58%), Gaps = 28/325 (8%)
Query: 37 KHTALFNYFLE---QHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
K L+N FL+ + + Y+++ E R + NL ++ LQ E G+ +YG+ +FSD
Sbjct: 127 KTEMLWNSFLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSD 186
Query: 94 LSTAEFQAKYLGFKLKPS-YADRSVPAMIP------NIT---LPRAFDWREYDAVTGVKD 143
+S EFQ L PS + DR V + N+T LP FDWR VT VK+
Sbjct: 187 MSPEEFQKTML-----PSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPVKN 241
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSK 203
Q CGS WAFS TGNIEG++A KT KL+SLSEQELIDCD+ D GC GG NAF I
Sbjct: 242 QGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLPINAFREIQRM 301
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
GGLE E YPY+ + C L + A V I+ V + R+ET M ++V+ GP++V I+A
Sbjct: 302 --GGLEPEDQYPYKARNGTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDA 359
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
L +Y +G+ HP + C + H VLI GYGV+ +PYW IKNSWG+ WG
Sbjct: 360 KLLAYYKSGILHPSRSRCPPS--GIDHGVLITGYGVEN------GLPYWTIKNSWGDQWG 411
Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
E GYFRL G CG++D V SA++
Sbjct: 412 EDGYFRLMLGKDVCGVSDLVSSAII 436
>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
Length = 274
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 135/289 (46%), Positives = 173/289 (59%), Gaps = 18/289 (6%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
R +F NL+K + LQD+E G+ YG+ +F DL+ EF+ YL K + A PA I
Sbjct: 1 RYFVFQDNLKKAETLQDSERGTAKYGVTKFMDLTEEEFRRYYLTPVWK-APAKPLPPATI 59
Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
P P AFDWR++ AVT VKDQ CGS WAFSTTGNIEG +A K L LSEQ
Sbjct: 60 PKKDAPTAFDWRDHGAVTEVKDQGQCGSCWAFSTTGNIEGQWAIKKGNLPDLSEQHT--- 116
Query: 182 DQEDDGCEGGSISNAFDTIMSKLGG--GLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
E I+ + G GLE EK YPY D+ C ++ QV IN V+
Sbjct: 117 ----SKIESCHINPIVKRTKRSIDGKSGLESEKAYPYEAKDEQCHMDYSKVQVYINSSVN 172
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+S+DE DMA +L ENGP+++ INA+ +QFY+ G+SHP + FC+ E L H VLIVGYG
Sbjct: 173 ISKDENDMASWLAENGPISIGINAFPMQFYMGGISHPWRIFCN--PEELDHGVLIVGYG- 229
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T PYWIIKNSWG+ WGE+GY+ +YRG G CG+N S++V
Sbjct: 230 -----TKDETPYWIIKNSWGKNWGEEGYYLVYRGGGVCGLNTMCTSSVV 273
>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
Length = 490
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 144/332 (43%), Positives = 202/332 (60%), Gaps = 11/332 (3%)
Query: 18 SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
+ SSF+ ++ + L VK ++F F+ +N+TY T E R+ +F+ N+ + Q +
Sbjct: 168 TFSSFLPLLNKDPLPQDFSVKMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKI 227
Query: 77 QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
Q + G+ YG+ +FSDL+ EF+ YL L+ + A + P +DWR+
Sbjct: 228 QALDTGTARYGVTKFSDLTEEEFRTIYLNPLLQEEPGRKMRLAKSVSSLPPPEWDWRKKG 287
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
AVT VKDQ MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D GC GG SNA
Sbjct: 288 AVTKVKDQGMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGLPSNA 347
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
+ I K GGLE E+ Y YRG + C N + +V IN V +S++E +A +L E GP
Sbjct: 348 YSAI--KTLGGLETEEDYSYRGHLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGP 405
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
++VAINA+ +QFY G+SHP++ C + H+VL+VGYG A P+W IKN
Sbjct: 406 ISVAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSATPFWAIKN 457
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWG WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 458 SWGTDWGEEGYYYLYRGSGACGVNIMASSAVV 489
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 144/333 (43%), Positives = 199/333 (59%), Gaps = 28/333 (8%)
Query: 25 VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
V D L+ HH F+ F + KTYAT E+ R +F N+R+ +L + S
Sbjct: 40 VEDHLLNAEHH------FSTFKSKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDP-SA 92
Query: 85 VYGLNEFSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
V+G+ +FSDL+ AEF K+LG K L+ P++A ++ ++P LP+ FDWR+ AVT VK
Sbjct: 93 VHGVTKFSDLTPAEFHRKFLGLKPLRLPAHAQKA--PILPTNNLPKDFDWRDKGAVTNVK 150
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
DQ CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD D GC GG +
Sbjct: 151 DQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLM 210
Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
+NAF+ ++ GG++ EK YPY G D C+ +K ++ Y +S DE +A LV+
Sbjct: 211 NNAFEYLIGS--GGVQREKDYPYTGRDGTCKFDKSKIAASVSNYSVISLDEEQIAANLVK 268
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYW 312
NGP+AVAINA +Q YV GVS P + C ++L H VL+VGYG K PYW
Sbjct: 269 NGPLAVAINAVYMQTYVGGVSCP--YIC---GKHLDHGVLLVGYGEGAYAPIRFKEKPYW 323
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
IIKNSWGE WG GY+++ RG CG++ V +
Sbjct: 324 IIKNSWGENWGGNGYYKICRGRNVCGVDSMVST 356
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 181/312 (58%), Gaps = 17/312 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+E+H+K Y E R IF NL I+ Q+ + G+ +YG+N+F+DLS EF+
Sbjct: 64 FTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEEFKK 123
Query: 102 KYLGFKLK-PSYADRSV----PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
+L K P + +R V + P LP +FDWRE+ AVT VK + C + WAFS T
Sbjct: 124 THLPHTWKQPDHPNRIVDLAAEGVDPKEPLPESFDWREHGAVTKVKTEGHCAACWAFSVT 183
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIEG + KKLVSLS Q+L+DCD D+GC GG +A+ I+ GGLE E YPY
Sbjct: 184 GNIEGQWFLAKKKLVSLSAQQLLDCDVVDEGCNGGFPLDAYKEIVRM--GGLEPEDKYPY 241
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
+ CRL V ING V + DE M +LV+ GP+++ I +QFY GVS P
Sbjct: 242 EAKAEQCRLVPSDIAVYINGSVELPHDEEKMRAWLVKKGPISIGITVDDIQFYKGGVSRP 301
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
++ H L+VGYGV+ K +PYWIIKNSWG WGE GY+R+ RG+ +
Sbjct: 302 TTCRLS----SMIHGALLVGYGVE------KNIPYWIIKNSWGPNWGEDGYYRMVRGENA 351
Query: 337 CGINDYVRSALV 348
C IN + SA+V
Sbjct: 352 CRINRFPTSAVV 363
>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
Length = 460
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 139/332 (41%), Positives = 201/332 (60%), Gaps = 11/332 (3%)
Query: 18 SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
+ SSF+ ++ + L VK ++F F+ +N+TY + E R+ +F+ N+ + Q +
Sbjct: 138 TFSSFLPLLNKDPLPQDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKI 197
Query: 77 QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
Q + G+ YG+ +FSDL+ EF+ YL LK + PA P +DWR
Sbjct: 198 QALDRGTARYGVTKFSDLTEEEFRTIYLNPLLKDAPGRNMRPAQPVTDVPPPQWDWRNKG 257
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
AVT VKDQ MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA
Sbjct: 258 AVTNVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNA 317
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
+ I + GGLE E Y YRG + C + + +V IN V +S++E +A +L +NGP
Sbjct: 318 YSAI--RTLGGLETEDDYSYRGRLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGP 375
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
+++AINA+ +QFY G+SHP++ C + H+VL+VGYG A+P+W IKN
Sbjct: 376 VSIAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSAIPFWAIKN 427
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWG WGE+GY+ L+RG G+CG+N SA++
Sbjct: 428 SWGTDWGEEGYYYLHRGSGACGVNIMASSAVI 459
>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 198/337 (58%), Gaps = 29/337 (8%)
Query: 24 VVGD---EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
VVGD + L+ HH F F + K YA+ E+ RL +F N+R+ + Q+ +
Sbjct: 36 VVGDGDGDLLNADHH------FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELD 89
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVT 139
+ V+G+ +FSDL+ EF+ K+LG + + AD ++P LP FDWR++ AVT
Sbjct: 90 PAA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVT 148
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VK+Q CGS W+FSTTG +EG T KLVSLSEQ+L+DCD E D GC G
Sbjct: 149 PVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNG 208
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAK 249
G +++AF+ + GGL E+ YPY G+D + CR +K K+ + VS DE +A
Sbjct: 209 GLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAA 266
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
LV+NGP+AVAINA +Q Y+ GVS P + C ++ L H VL+VGYG K
Sbjct: 267 NLVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKE 321
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 322 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 358
>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 198/337 (58%), Gaps = 29/337 (8%)
Query: 24 VVGD---EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
VVGD + L+ HH F F + K YA+ E+ RL +F N+R+ + Q+ +
Sbjct: 36 VVGDGDGDLLNADHH------FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELD 89
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVT 139
+ V+G+ +FSDL+ EF+ K+LG + + AD ++P LP FDWR++ AVT
Sbjct: 90 PAA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVT 148
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VK+Q CGS W+FSTTG +EG T KLVSLSEQ+L+DCD E D GC G
Sbjct: 149 PVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNG 208
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAK 249
G +++AF+ + GGL E+ YPY G+D + CR +K K+ + VS DE +A
Sbjct: 209 GLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAA 266
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
LV+NGP+AVAINA +Q Y+ GVS P + C ++ L H VL+VGYG K
Sbjct: 267 NLVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKE 321
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 322 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 358
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 189/314 (60%), Gaps = 11/314 (3%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
V+ F F+ ++ K Y++ E RL IF NL+ + LQ + GS YG+ +FSDL+
Sbjct: 169 VQLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEYGVTKFSDLT 228
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPR-AFDWREYDAVTGVKDQTMCGSSWAFS 154
EF++ YL L R + P T ++DWR++ AV+ VK+Q MCGS WAFS
Sbjct: 229 EEEFRSTYLNPLLSQWTLHRGMKPAPPAKTPAPDSWDWRDHGAVSPVKNQGMCGSCWAFS 288
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TGNIEG + K L+SLSEQEL+DCD D C GG SNA++ I KL GGLE E Y
Sbjct: 289 VTGNIEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKL-GGLESETDY 346
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
Y G + C + IN V + +DE ++A +L ENGP++VA+NA+A+QFY GVS
Sbjct: 347 SYTGHKQKCDFTNRKVAAYINSSVELPKDEREIAAWLAENGPISVALNAFAMQFYKKGVS 406
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
HP + FC+ + H+VL+VGYG +P+W IKNSWGE +GE+GY+ L RG
Sbjct: 407 HPWKIFCNPW--MIDHAVLLVGYG------ERNGIPFWAIKNSWGEDYGEQGYYYLQRGS 458
Query: 335 GSCGINDYVRSALV 348
+CGIN SA++
Sbjct: 459 NACGINRMGSSAVI 472
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 197/330 (59%), Gaps = 24/330 (7%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
HH+ + +H F F + K+YAT E+ R +F NLR+ +L + S +G+ +
Sbjct: 35 HHMLNAEHH--FTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLHAKLDP-SAEHGVTK 91
Query: 91 FSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
FSDL+ EF+ +YLG K L+ PS A+++ ++P LP FDWR+ AVT VK+Q CG
Sbjct: 92 FSDLTPEEFKRQYLGLKPLRLPSTANKA--PILPTSDLPENFDWRDKGAVTPVKNQGSCG 149
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDT 199
S WAFSTTG +EG + T +LVSLSEQ+L+DCD D GC GG ++NAFD
Sbjct: 150 SCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFDY 209
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I+ GG++ EK YPY G D+ C+ +K + + VS DE +A LV++GP+AV
Sbjct: 210 ILQ--AGGVQTEKDYPYSGRDETCKFDKSKVAATVANFSVVSLDEDQIAANLVKHGPLAV 267
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSW 318
INA +Q Y+ GVS P + C +NL H VL+VGYG K P+WIIKNSW
Sbjct: 268 GINAIFMQTYIGGVSCP--YIC---GKNLDHGVLLVGYGAAGYAPIRFKDKPFWIIKNSW 322
Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
GE WGE GY+++ RG CG++ V S +
Sbjct: 323 GESWGEDGYYKICRGKNVCGVDSMVSSVVA 352
>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 366
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 143/336 (42%), Positives = 198/336 (58%), Gaps = 28/336 (8%)
Query: 24 VVGD--EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
VVGD + L+ HH F F + K YA+ E+ RL +F N+R+ + Q+ +
Sbjct: 35 VVGDGGDLLNADHH------FTVFKRRFGKVYASDEEHDYRLSVFKANMRRAKQHQELDP 88
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVTG 140
+ V+G+ +FSDL+ EF+ K+LG + + AD ++P LP FDWR++ AVT
Sbjct: 89 AA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVTP 147
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
VK+Q CGS W+FSTTG +EG T KLVSLSEQ+L+DCD E D GC GG
Sbjct: 148 VKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGG 207
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAKY 250
+++AF+ + GGL E+ YPY G+D + CR +K K+ + VS DE +A
Sbjct: 208 LMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAAN 265
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAV 309
LV+NGP+AVAINA +Q Y+ GVS P + C ++ L H VL+VGYG K
Sbjct: 266 LVKNGPLAVAINAVFVQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKEK 320
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 321 PYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 356
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 19/318 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F +F+++ NK Y+ E+ R IF NL K Q + ++G+N+FSDL+ EF
Sbjct: 73 AHFAHFVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDR-DAIHGINKFSDLTEEEF 131
Query: 100 QAKYLGFKLKP-SYADRSVPA-MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+YLG P S + R+ PA ++P LP FDWRE AVT VK+Q CGS W FSTTG
Sbjct: 132 HEQYLGLTTPPRSLSQRTQPAPILPTDDLPPDFDWRELGAVTPVKNQGACGSCWTFSTTG 191
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGL 208
+EG KT KL+SLSEQ+L+DCD E D GC GG ++ A+ + GGL
Sbjct: 192 AMEGANFMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALK--AGGL 249
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
+ E+ YPY G D +C+ + + + +VS DE +A LV+NGP+AV INA +Q
Sbjct: 250 QREEDYPYTGIDGSCKFDNTKVAAMVANFSTVSIDEDQIAANLVKNGPLAVGINAAFMQT 309
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
YV GVS P + C+ +NL H VL+VGYG K P+WIIKNSWG WGE GY
Sbjct: 310 YVGGVSCP--YVCN--KQNLDHGVLLVGYGAAGYAPGRLKNKPFWIIKNSWGPDWGEDGY 365
Query: 328 FRLYRGDGSCGINDYVRS 345
++L RG CGIN V +
Sbjct: 366 YKLCRGHNVCGINTMVST 383
>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
Length = 330
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 187/317 (58%), Gaps = 22/317 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ + K YAT Y RL +F NL + Q + S V+G+ +FSDL+ EF+
Sbjct: 21 FKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDP-SAVHGITQFSDLTEEEFKQ 79
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
++LG ++ + + ++P LP FDWRE+ AVT VK+Q CGS WAFSTTG IEG
Sbjct: 80 QFLGLRVPSRLREANKAPVLPTNDLPEDFDWREHGAVTEVKNQGACGSCWAFSTTGAIEG 139
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ +T KL+SLSEQ+L+DCD D GC GG ++NA+D +M GGLE E
Sbjct: 140 AHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKS--GGLETET 197
Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G+ C+ N + + +VS DE +A LV++GP+A+ INA +Q Y+
Sbjct: 198 DYPYTGNSNGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVFMQTYIG 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
GVS PI C ++ H VL+VGYG +FT K PYWIIKNSWG WGE+GY+
Sbjct: 258 GVSCPI--ICS--KHHIDHGVLLVGYGAKGYAPIRFTEK--PYWIIKNSWGATWGEQGYY 311
Query: 329 RLYRGDGSCGINDYVRS 345
++ RG G CG+N V +
Sbjct: 312 KICRGHGMCGMNTMVST 328
>gi|354496134|ref|XP_003510182.1| PREDICTED: cathepsin F [Cricetulus griseus]
gi|344250261|gb|EGW06365.1| Cathepsin F [Cricetulus griseus]
Length = 462
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 200/332 (60%), Gaps = 11/332 (3%)
Query: 18 SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
+ SSF+ ++ +E L VK T +F F+ +N+TY + E RL +F+ N+ K Q +
Sbjct: 140 TFSSFLPLLNEEPLPQDFSVKMTTVFKDFMITYNRTYESREETQWRLTVFTRNMVKAQKI 199
Query: 77 QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
+ + G+ YG+ +FSDL+ EF YL L+ + A N P +DWR+
Sbjct: 200 EALDRGTAQYGITKFSDLTEEEFYTIYLNPLLQKKPGSKMSLAKSINDPAPPEWDWRKKG 259
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
AVT VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA
Sbjct: 260 AVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACLGGMPSNA 319
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
+ I S GGLE E Y Y+G +AC + + +V IN V +S++E+ MA +L + GP
Sbjct: 320 YTAIKSL--GGLETEDDYSYKGYVQACNFSAQKAKVYINDSVELSKNESKMAAWLAQKGP 377
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
++VAINA+ +QFY G++HP++ C + H+VL+VGYG PYW IKN
Sbjct: 378 ISVAINAFGMQFYRHGIAHPLRPLCSPW--LIDHAVLLVGYG------NRSNTPYWAIKN 429
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWG WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 430 SWGSNWGEEGYYYLYRGSGACGVNTMASSAVV 461
>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
Length = 367
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 187/317 (58%), Gaps = 22/317 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ + K YAT Y RL +F NL + Q + S V+G+ +FSDL+ EF+
Sbjct: 58 FKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDP-SAVHGITQFSDLTEEEFKQ 116
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
++LG ++ + + ++P LP FDWRE+ AVT VK+Q CGS WAFSTTG IEG
Sbjct: 117 QFLGLRVPSRLREANKAPVLPTNDLPEDFDWREHGAVTEVKNQGACGSCWAFSTTGAIEG 176
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ +T KL+SLSEQ+L+DCD D GC GG ++NA+D +M GGLE E
Sbjct: 177 AHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKS--GGLETET 234
Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G+ C+ N + + +VS DE +A LV++GP+A+ INA +Q Y+
Sbjct: 235 DYPYTGNSNGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVFMQTYIG 294
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
GVS PI C ++ H VL+VGYG +FT K PYWIIKNSWG WGE+GY+
Sbjct: 295 GVSCPI--ICS--KHHIDHGVLLVGYGAKGYAPIRFTEK--PYWIIKNSWGATWGEQGYY 348
Query: 329 RLYRGDGSCGINDYVRS 345
++ RG G CG+N V +
Sbjct: 349 KICRGHGMCGMNTMVST 365
>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
Length = 368
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 19/318 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F + KTYAT E+ R ++F NLR+ + Q + S V+G+ +FSDL+ AEF+
Sbjct: 52 FEKFKARFQKTYATPEEHDYRFNVFKANLRRAKRHQLLDP-SAVHGVTQFSDLTPAEFRR 110
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
YLG AD ++P LP FDWRE AVT VK+Q CGS W+FST G +EG
Sbjct: 111 DYLGLNPLRFPADAQQAPILPTDNLPTDFDWRENGAVTPVKNQGNCGSCWSFSTIGALEG 170
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ T L SLSEQ+L+DCD+E DDGC GG ++NAF+ I+ GG+E EK
Sbjct: 171 AHFLATGNLESLSEQQLVDCDRECDPEEYDACDDGCNGGLMNNAFEYILKT--GGVEREK 228
Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D++ C+ N+ ++ + VS DE +A LV+NGP+AV INA +Q Y
Sbjct: 229 DYPYTGRDRSPCKFNESKIVASVSNFSVVSIDEDQIAANLVKNGPLAVGINAVFMQTYTA 288
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P F C G L H VL+VGYG + K PYWI+KNSW + WGE GY+R+
Sbjct: 289 GVSCP--FLCSG---ELDHGVLLVGYGSAGYSPIRFKEKPYWILKNSWSKYWGEHGYYRI 343
Query: 331 YRGDGSCGINDYVRSALV 348
RG CG++ V S +
Sbjct: 344 CRGQNMCGVDSMVSSVVA 361
>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
Length = 369
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 141/336 (41%), Positives = 193/336 (57%), Gaps = 30/336 (8%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D L+ HH F F ++ K+YAT E+ RL +F NLR+ + Q + S V+
Sbjct: 38 DALLNADHH------FTLFKSKYGKSYATQEEHDYRLSVFKANLRRAKRHQLLDP-SAVH 90
Query: 87 GLNEFSDLSTAEFQAKYLGFKLKPS-------YADRSVPAMIPNITLPRAFDWREYDAVT 139
G+ +FSDL+ EF+ +LG + S AD ++P LP FDWR+Y AVT
Sbjct: 91 GVTKFSDLTPKEFRRTFLGIRKSSSGKRKLKLPADAHAAEILPTSDLPSDFDWRDYGAVT 150
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
GVKDQ CGS W+FSTTG +EG T +LVSLSEQ+L+DCD D GC G
Sbjct: 151 GVKDQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHLCDPEEAGACDSGCNG 210
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
G ++ A++ ++ GGLE+EK YPY G D C+ +K + + VS DE +A
Sbjct: 211 GLMTTAYEYVLQS--GGLEKEKDYPYTGKDGTCKFDKSKIAAAVANFSVVSLDEDQIAAN 268
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAV 309
LV++GP++V INA +Q Y+ GVS P + C NL H VL+VGYG K
Sbjct: 269 LVKHGPLSVGINAVFMQTYIGGVSCP--YIC--SKRNLDHGVLLVGYGAAGYAPIRFKDK 324
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWI+KNSWGE WGE+GY+++ RG+ CGI+ V +
Sbjct: 325 PYWIVKNSWGENWGEEGYYKICRGNNICGIDSMVST 360
>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
Length = 371
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 185/315 (58%), Gaps = 19/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F + KTYAT E+ R ++F NLR+ + Q + S +G+ +FSDL+ EF+
Sbjct: 56 FAEFKTKFGKTYATAEEHDHRFNVFKANLRRAKRHQLLDP-SAEHGVTQFSDLTPREFRQ 114
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
YLG K AD ++P LP FDWR++ AVT VKDQ CGS W+FST G +EG
Sbjct: 115 NYLGLKRLQLPADAQKAPILPTKDLPTDFDWRDHGAVTAVKDQGYCGSCWSFSTIGALEG 174
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ T LVSLS Q+L+DCD E DDGC GG ++NAF+ I+ GG+ +E+
Sbjct: 175 AHFLATGNLVSLSTQQLLDCDTECDPEEYDACDDGCNGGLMNNAFEYILK--AGGVAQEE 232
Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ CR NK + + VS DE +A LV+NGP+AV INA +Q Y +
Sbjct: 233 DYPYTGTDRGLCRFNKTKIAASVANFSVVSLDEDQIAANLVKNGPLAVGINAVFMQTYKS 292
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P + C + L H VL+VGYG + K PYWIIKNSWGE WGE+GY+++
Sbjct: 293 GVSCP--YIC---SSTLDHGVLLVGYGSAGYSPIRFKEKPYWIIKNSWGESWGEQGYYKI 347
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 348 CRGHNICGVDSMVST 362
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 192/331 (58%), Gaps = 25/331 (7%)
Query: 26 GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
GD+ L H F F + KTY+T+ E+ R +F NLR+ + Q + S V
Sbjct: 42 GDDLLSAEHQ------FGLFKAKFGKTYSTVEEHDYRFSVFEANLRRARRHQLLDP-SAV 94
Query: 86 YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
+G+ FSDL+ EF+ YLG K AD ++P LP FDWR++ AVT VKDQ
Sbjct: 95 HGVTRFSDLTPDEFRRDYLGLKPLRLPADAQKAPILPTNDLPTDFDWRDHGAVTPVKDQG 154
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
CGS W+FS G +EG + T L+S+SEQ+L+DCD E D GC GG +++A
Sbjct: 155 SCGSCWSFSAIGALEGAHFLTTGNLISMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSA 214
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
F+ I+ GG+E E+TYPY G D+ +C+ NK ++ + VS DE +A +V+NG
Sbjct: 215 FEYILK--AGGVEREETYPYIGSDRGSCKFNKSQIVASVSNFSVVSLDEDQIAANMVKNG 272
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWII 314
P+AV INA +Q Y+ GVS P + C + NL H V++VGYG K PYWII
Sbjct: 273 PLAVGINAVFMQTYMKGVSCP--YIC---SRNLDHGVVLVGYGSAGYAPIRFKEKPYWII 327
Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
KNSWGE WGE GY+++ RG +CG++ V +
Sbjct: 328 KNSWGESWGEDGYYKICRGHNACGVDSMVST 358
>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 377
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 148/368 (40%), Positives = 210/368 (57%), Gaps = 32/368 (8%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGD----EKLHHLHHVKHTALFNYFLEQHNKTYATL 56
++CF + + L +LT+S + V D KL ++ FN F+E + K Y+T
Sbjct: 9 LTCFARIS-LVLFALTLSSARQTTVHDIAKKLKLQDNQLLRTEKKFNVFMENYGKKYSTR 67
Query: 57 VEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSY 112
EY RL IF+GN+ + Q + + ++G+ +FSDL+ EFQ Y G F
Sbjct: 68 EEYLQRLEIFAGNMLRAPENQALDP-TAIHGVTQFSDLTEDEFQRHYTGVNGGFPWNNGV 126
Query: 113 ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
D + P + LP FDWRE AVT VK Q CGS WAFSTTG+IEG T KL++
Sbjct: 127 RDVAPPLKVDG--LPEDFDWREKGAVTEVKMQGKCGSCWAFSTTGSIEGANFIATGKLLN 184
Query: 173 LSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
LSEQ+L+DCD + D+GC GG ++NA+ ++ GGLEEE +YPY G C
Sbjct: 185 LSEQQLVDCDSQCDITESTTCDNGCMGGLMTNAYKYLLQS--GGLEEESSYPYTGAKGEC 242
Query: 224 RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDG 283
+ + V+I + ++ DE +A YLV++GP+AV +NA +Q Y+ GVS P+ C
Sbjct: 243 KFDPGKVAVRITNFTNIPVDENQIAAYLVKHGPLAVGLNAIFMQTYIGGVSCPL--ICS- 299
Query: 284 GNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
+ L+H VL+VGY G + +K PYWIIKNSWG+ WG GY++L RG G CG+N
Sbjct: 300 -KKWLNHGVLLVGYRAKGFSILRLGNK--PYWIIKNSWGKRWGVDGYYKLCRGHGMCGMN 356
Query: 341 DYVRSALV 348
V +A+V
Sbjct: 357 TMVSTAMV 364
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 202/336 (60%), Gaps = 25/336 (7%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VV +E+ H L+ H F F + +K+YAT E+ R +F NL K +L Q+ +
Sbjct: 32 QVVDNEEDHLLNAEHH---FTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRD-P 87
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ +G+ +FSDL+ +EF+ ++LG K + P++A ++ ++P LP FDWRE AVT
Sbjct: 88 TAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--PILPTTNLPEDFDWREKGAVT 145
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VKDQ CGS WAFSTTG +EG + T KLVSLSEQ+L+DCD D GC G
Sbjct: 146 PVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNG 205
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
G ++NAF+ ++ GG+ +EK Y Y G D +C+ +K ++ + V+ DE +A
Sbjct: 206 GLMNNAFEYLLE--SGGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVVTLDEDQIAAN 263
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
LV+NGP+AVAINA +Q Y++GVS P + C L H VL+VG+G K
Sbjct: 264 LVKNGPLAVAINAAWMQTYMSGVSCP--YVC--AKSRLDHGVLLVGFGKGAYAPIRLKEK 319
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWG+ WGE+GY+++ RG CG++ V +
Sbjct: 320 PYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 355
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 140/333 (42%), Positives = 193/333 (57%), Gaps = 21/333 (6%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
VV D + HHL + +H F+ F + KTYAT E+ R IF NL + + Q + S
Sbjct: 35 VVPDAEDHHLLNAEHH--FSAFKTKFAKTYATQEEHDHRFRIFKNNLLRAKSHQKLD-PS 91
Query: 84 GVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
V+G+ FSDL+ +EF+ ++LG K +D ++P LP FDWR++ AVTGVK+
Sbjct: 92 AVHGVTRFSDLTPSEFRGQFLGLKPLRLPSDAQKAPILPTSDLPTDFDWRDHGAVTGVKN 151
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
Q CGS W+FS G +EG + T LVSLSEQ+L+DCD E D GC GG ++
Sbjct: 152 QGSCGSCWSFSAVGALEGAHFLSTGGLVSLSEQQLVDCDHECDPEERGACDSGCNGGLMT 211
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
AF+ + GGL E+ YPY G D+ C+ +K + + VS DE +A LV+
Sbjct: 212 TAFEYTLK--AGGLMREEDYPYTGRDRGPCKFDKSKIAASVANFSVVSLDEEQIAANLVK 269
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYW 312
NGP+AV INA +Q Y+ GVS P + C ++L H VL+VGYG K PYW
Sbjct: 270 NGPLAVGINAVFMQTYIGGVSCP--YIC---GKHLDHGVLLVGYGSGAYAPIRFKEKPYW 324
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
IIKNSWGE WGE+GY+++ RG CG++ V +
Sbjct: 325 IIKNSWGESWGEEGYYKICRGRNVCGVDSMVST 357
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 182/308 (59%), Gaps = 13/308 (4%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F + K YA + R IF NL + Q LQ + G+ YG+ +FSDL+ EF
Sbjct: 26 LYEQFKRDYGKVYAN-EDDQKRFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTPEEFA 84
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
AKYL L +R P + P DWR AVT V++Q CGS WAFST GN+E
Sbjct: 85 AKYLSPPLNSDQVERVQPTGLK--AAPERMDWRAKGAVTPVENQGECGSCWAFSTAGNVE 142
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + KT +LVSLS+Q+L+DCD +GC GG S+++ IM GGLE E YPY G +
Sbjct: 143 GQWFIKTGQLVSLSKQQLVDCDMAAEGCNGGWPSSSYLEIMDM--GGLESENDYPYVGVE 200
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ C LNK+ KI+ V + E + YL E+GP++ +NA ALQ Y +G+ HP
Sbjct: 201 QTCALNKEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAVALQHYQSGILHPSHKD 260
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C +++L+H+VL VGY DR +PYWIIKNSWG WGEKGYFRL+RGD CGIN
Sbjct: 261 CP--DDDLNHAVLTVGY--DR----EGDMPYWIIKNSWGTDWGEKGYFRLFRGDCVCGIN 312
Query: 341 DYVRSALV 348
SA++
Sbjct: 313 RMATSAVI 320
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 184/316 (58%), Gaps = 21/316 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+++ K Y T+ EY R +F NL + L + +G+ FSDL+ EF
Sbjct: 56 FESFIKEFGKVYHTVEEYEHRFKVFKSNLLRA-LKHQALDPTASHGVTMFSDLTEEEFAT 114
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+YLG K + + +P LP +FDWRE AV VK+Q CGS WAFSTTG +EG
Sbjct: 115 QYLGLKRPSALSTAPTAEPLPTGDLPPSFDWREKGAVGPVKNQGSCGSCWAFSTTGAVEG 174
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ T KL+SLSEQ+L+DCD + D GC GG ++NA+ + + GGLE E
Sbjct: 175 AHFLATGKLLSLSEQQLVDCDHQCDPEEAQACDAGCGGGLMTNAYKYV--EEAGGLELES 232
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
YPY+G D C+ N K++ + ++ DE +A YL+++GP+A+ INA +Q YV G
Sbjct: 233 DYPYKGRDGKCQFNPNKVAAKVSNFTNIPIDEDQVAAYLIKSGPLAIGINAEFMQTYVAG 292
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
VS PI FC+ NL H VL+VGY G + +K PYWIIKNSWG WG+KGY++
Sbjct: 293 VSCPI--FCN--KRNLDHGVLLVGYAEHGFAPARLAYK--PYWIIKNSWGPMWGDKGYYK 346
Query: 330 LYRGDGSCGINDYVRS 345
+ RG G CG+N V +
Sbjct: 347 ICRGHGECGLNTMVSA 362
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 196/337 (58%), Gaps = 29/337 (8%)
Query: 24 VVGD---EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
VVGD + L+ HH F F + K YA+ E+ RL +F N+R+ + Q +
Sbjct: 34 VVGDGDGDLLNADHH------FAVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQQLD 87
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVT 139
+ V+G+ +FSDL+ EF+ K+LG + + AD ++P LP FDWR+ AVT
Sbjct: 88 PAA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDRGAVT 146
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VK+Q CGS W+FSTTG +EG T KLVSLSEQ+L+DCD E D GC G
Sbjct: 147 PVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNG 206
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAK 249
G +++AF+ + GGL E+ YPY G+D + CR +K K+ + VS DE +A
Sbjct: 207 GLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAA 264
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
LV+NGP+AVAINA +Q Y+ GVS P + C ++ L H VL+VGYG K
Sbjct: 265 NLVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKE 319
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 320 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 356
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 200/336 (59%), Gaps = 25/336 (7%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VV +E+ H L+ H F F + +K+YAT E+ R +F NL K +L Q +
Sbjct: 27 QVVDNEEDHLLNAEHH---FTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKLHQKLDP- 82
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ +G+ +FSDL+ +EF+ ++LG + P++A ++ ++P LP FDWRE AVT
Sbjct: 83 TAEHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKA--PILPTTNLPEDFDWREKGAVT 140
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VKDQ CGS WAFSTTG +EG + T KLVSLSEQ+L+DCD D GC G
Sbjct: 141 PVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNG 200
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
G ++NAF+ ++ GG+ +EK Y Y G D +C+ +K ++ + VS DE +A
Sbjct: 201 GLMNNAFEYLLQ--SGGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVVSLDEEQIAAN 258
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
LV+NGP+AVAINA +Q Y++GVS P + C L H VL+VG+G K
Sbjct: 259 LVKNGPLAVAINAAWMQAYMSGVSCP--YVC--AKARLDHGVLLVGFGKGAYAPIRLKEK 314
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWG+ WGE+GY+++ RG CG++ V +
Sbjct: 315 PYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 350
>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
Length = 371
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 195/338 (57%), Gaps = 32/338 (9%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D L+ HH F F ++ K+YAT E+ RL +F NLR+ + Q + S V+
Sbjct: 38 DALLNADHH------FTLFKSKYGKSYATQEEHDYRLSVFKANLRRAKRHQMLDP-SAVH 90
Query: 87 GLNEFSDLSTAEFQAKYLGFKLKPSY---------ADRSVPAMIPNITLPRAFDWREYDA 137
G+ +FSDL+ EF+ YLG + S AD ++P LP F+WR+Y A
Sbjct: 91 GVTKFSDLTPKEFRRTYLGIRKSSSSKQKLKLKLPADAHAAEILPTSDLPFDFEWRDYGA 150
Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGC 188
VTGVKDQ +CGS W+FSTTG +EG T +L+SL+EQEL+DCD D GC
Sbjct: 151 VTGVKDQGLCGSCWSFSTTGTLEGTNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGC 210
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMA 248
GG ++ A++ ++ GGLE+EK YPY G D C+ +K + + VS DE +A
Sbjct: 211 NGGLMTTAYEYVLQS--GGLEKEKDYPYTGRDGTCKFDKSKIAAAVANFSVVSLDEDQIA 268
Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHK 307
LV++GP++V IN+ +Q Y+ GVS P + C +NL H VLIVGYG K
Sbjct: 269 ANLVKHGPLSVGINSIFMQTYIGGVSCP--YIC--SKKNLDHGVLIVGYGAAGYAPIRFK 324
Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE+GY+++ RG+ CG++ V S
Sbjct: 325 DKPYWIIKNSWGENWGEEGYYKICRGNNICGVDSMVSS 362
>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
Length = 394
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 196/329 (59%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ E L V+ ++F F+ +N+TY + E R+ +FS N+ + Q +Q
Sbjct: 75 SVLPLLNKEPLPQDFSVRMVSIFKEFVTTYNRTYESKEEAEWRMSVFSNNVMRAQKIQAL 134
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ + + A + P +DWR AVT
Sbjct: 135 DRGTAQYGITKFSDLTEEEFRTIYLNPLLRENRGKKMDLAKSIGDSAPPEWDWRNKGAVT 194
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 195 QVKDQGMCGSCWAFSVTGNVEGQWFLKRGALLSLSEQELLDCDKVDKACLGGLPSNAYSA 254
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y YRG + C + K +V IN V +S++E + +L +NGP++V
Sbjct: 255 I--KTLGGLETEDDYSYRGHVQTCSFSSKKARVYINDSVELSQNEQKLVAWLAQNGPISV 312
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+SHP++ C + H+VL+VGYG +P+W IKNSWG
Sbjct: 313 AINAFGMQFYRRGISHPLRPLCSPW--LIDHAVLLVGYG------NRSGIPFWAIKNSWG 364
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+ L+RG G+CG+N SA+V
Sbjct: 365 TDWGEEGYYYLHRGSGACGVNTMASSAVV 393
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 143/356 (40%), Positives = 208/356 (58%), Gaps = 27/356 (7%)
Query: 6 FFAGVALLSL-TVSVSSFMV--VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSR 62
FA VA S + F++ V D + HL + +H F F + +K+Y+T E+ R
Sbjct: 11 LFAAVATSSTDNTNTDDFIIRQVVDNEEDHLLNAEHH--FTSFKSKFSKSYSTKEEHDYR 68
Query: 63 LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPA 119
+F NL K +L Q + + +G+ +FSDL+ +EF+ ++LG K + P++A ++
Sbjct: 69 FGVFKSNLIKAKLHQKLDP-TAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--P 125
Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
++P LP FDWRE AVT VKDQ CGS WAFSTTG +EG + T KLVSLSEQ+L+
Sbjct: 126 ILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLV 185
Query: 180 DCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT 230
DCD D GC GG ++NAF+ ++ GG+ +EK Y Y G D +C+ +K
Sbjct: 186 DCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQ--SGGVVQEKDYAYTGRDGSCKFDKSKV 243
Query: 231 QVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSH 290
++ + VS DE +A LV+NGP+AV INA +Q Y++GVS P + C L H
Sbjct: 244 VASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCP--YVC--AKSRLDH 299
Query: 291 SVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
VL+VG+G K PYWI+KNSWG+ WGE+GY+++ RG CG++ V +
Sbjct: 300 GVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 355
>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
Length = 460
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 195/329 (59%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S+ + + L VK ++F F+ +N+TY + E RL +F+ N+ + Q +Q
Sbjct: 141 STLPALNRDSLPQDFSVKMASIFKKFVRTYNRTYESKEEAQWRLSVFASNMVRAQKIQSL 200
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ + A P +DWR AVT
Sbjct: 201 DRGTAQYGITKFSDLTEEEFRTIYLNPLLRSEPGKKMQLAKPVEDPAPPQWDWRSKGAVT 260
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 261 NVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKLDKACLGGLPSNAYSA 320
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E+ Y Y+G +AC + + +V IN V +S++E +A +L + GP++V
Sbjct: 321 I--KNLGGLETEEDYTYQGHMQACNFSAQKAKVYINDSVELSQNEQKLAAWLAKRGPISV 378
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G++HP++ C + H+VL+VGYG A P+W IKNSWG
Sbjct: 379 AINAFGMQFYRRGIAHPLRPLCSPW--LIDHAVLLVGYG------NRSATPFWAIKNSWG 430
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+ LYRG G CG+N SA+V
Sbjct: 431 ADWGEEGYYYLYRGSGVCGVNTMASSAVV 459
>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
Length = 403
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 144/352 (40%), Positives = 200/352 (56%), Gaps = 35/352 (9%)
Query: 17 VSVSSFMVVGDEKLH--HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
VS F+ EK + HL +++ LF+ F+ +H K Y+T+ EY RL IF NL K
Sbjct: 63 VSEGGFIAQVTEKFNREHLLNLRSKTLFDKFIVEHGKVYSTIEEYVRRLRIFEKNLLKAA 122
Query: 75 LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF--KLKPSYADRSVPAMIPNITLPRAFDW 132
Q + + V+G+ FSDL+ EF+++Y G + ++ ++P LP FDW
Sbjct: 123 ENQALDP-TAVHGITPFSDLTEYEFESRYTGLLGVRQGLVNEKQTAEILPVDDLPANFDW 181
Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-------- 184
RE AVT VK Q CGS WAFSTTG +EG T KL++LSEQ+LIDCD +
Sbjct: 182 REKGAVTEVKTQGNCGSCWAFSTTGVVEGANFLATGKLLNLSEQQLIDCDHKCDPLNTKA 241
Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRD 243
D+GC GG ++NA++ +M GG+EE K YPY G C+ N VK + +V+ D
Sbjct: 242 CDNGCHGGLMTNAYNYLME--AGGIEEAKNYPYTGVQGDCKFNPDLAAVKAINFTTVNLD 299
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A LV++GP+AV +NA +Q Y+ GVS P+ C ++H VL+VGYG
Sbjct: 300 EKQIAANLVKHGPLAVGLNAAFMQTYIGGVSCPL--ICS--KRFINHGVLLVGYG----- 350
Query: 304 FTHKAV--------PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSAL 347
HK PYWIIKNSWG+ WGE GY++L RG G CG+N V + +
Sbjct: 351 --HKGFALLRLGYRPYWIIKNSWGKRWGEHGYYKLCRGHGECGMNKMVSAVI 400
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 132/292 (45%), Positives = 182/292 (62%), Gaps = 22/292 (7%)
Query: 41 LFNYFLEQHNKTYATLV-EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
LF+ FLE++ +TY++ EY R IF N + +Q L + E G+ VYG+ +F D+S E+
Sbjct: 168 LFDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEY 227
Query: 100 QAKYLGFKLKPSYADRSVP------AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
L P + VP A + +P + DWR++ AVT VK+Q CGS WAF
Sbjct: 228 HRT-----LAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAF 282
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
STTGN+EG + K KKL+SLSEQEL+DCD D GC GG SNA+ +I KL GGLE EK
Sbjct: 283 STTGNVEGQWFLKHKKLISLSEQELVDCDTLDSGCGGGLPSNAYKSI-EKL-GGLEPEKD 340
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
YPY G+ + C + + +V +N V++ +DE +A +L +NGP+++ INA +QFY G+
Sbjct: 341 YPYVGEGEKCAIKQSDFKVFVNNSVALPKDEVKLAAWLAQNGPISIGINANLMQFYWGGI 400
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
SHP + FC+ ++L H VLIVGYG T P+WIIKNSWG WGE+
Sbjct: 401 SHPWKIFCNP--KSLDHGVLIVGYG------TENGTPFWIIKNSWGPDWGEE 444
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 80/143 (55%), Gaps = 22/143 (15%)
Query: 108 LKPSYADRSVP------AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
L P + VP A + +P + DWR++ AVT VK+Q CGS WAFSTTGN+EG
Sbjct: 451 LAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTTGNVEG 510
Query: 162 VYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE------------ 209
+ K KKL+SLSEQEL+DCD D GC GG SNA+ +I KL G
Sbjct: 511 QWFLKHKKLISLSEQELVDCDTLDSGCGGGLPSNAYKSI-EKLENGTPFWIIKNSWGPDW 569
Query: 210 -EEKTYP-YRGDDKACRLNKKAT 230
EE Y YRGD +C LN AT
Sbjct: 570 GEEGYYRIYRGDG-SCGLNNMAT 591
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 34/43 (79%)
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
P+WIIKNSWG WGE+GY+R+YRGDGSCG+N+ S++V
Sbjct: 553 ENGTPFWIIKNSWGPDWGEEGYYRIYRGDGSCGLNNMATSSIV 595
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 143/356 (40%), Positives = 208/356 (58%), Gaps = 27/356 (7%)
Query: 6 FFAGVALLSLT-VSVSSFMV--VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSR 62
FA VA S + F++ V D + HL + +H F F + +K+Y+T E+ R
Sbjct: 11 LFAAVATSSTDDTNTDDFIIRQVVDNEEDHLLNAEHH--FTSFKSKFSKSYSTKEEHDYR 68
Query: 63 LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPA 119
+F NL K +L Q + + +G+ +FSDL+ +EF+ ++LG K + P++A ++
Sbjct: 69 FGVFKSNLIKAKLHQKLDP-TAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--P 125
Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
++P LP FDWRE AVT VKDQ CGS WAFSTTG +EG + T KLVSLSEQ+L+
Sbjct: 126 ILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLV 185
Query: 180 DCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT 230
DCD D GC GG ++NAF+ ++ GG+ +EK Y Y G D +C+ +K
Sbjct: 186 DCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQ--SGGVVQEKDYAYTGRDGSCKFDKSKV 243
Query: 231 QVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSH 290
++ + VS DE +A LV+NGP+AV INA +Q Y++GVS P + C L H
Sbjct: 244 VASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCP--YVC--AKSRLDH 299
Query: 291 SVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
VL+VG+G K PYWI+KNSWG+ WGE+GY+++ RG CG++ V +
Sbjct: 300 GVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 355
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 181/308 (58%), Gaps = 13/308 (4%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F + K YA + R IF NL + Q LQ + G+ YG+ +FSDL+ EF
Sbjct: 26 LYEQFKRDYGKVYAN-EDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFA 84
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
AKYL + R P + P DWR AVT V++Q CGS WAFST GN+E
Sbjct: 85 AKYLSAPVNNDQVKRVRPTGLK--AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVE 142
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + KT +LVSLS+Q+L+DCD+ GC GG ++++ IM GGLE E YPY G +
Sbjct: 143 GQWFIKTGQLVSLSKQQLVDCDRAAQGCNGGWPASSYLEIMYM--GGLESESDYPYVGVE 200
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ C LNK+ KI+ + + +E D A YL E+GP++ +NA ALQ+Y +GV P F
Sbjct: 201 QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPT--F 258
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
+ + L+H+VL VGY +PYWIIKNSWG WGEKGYFRL+RGD +CGIN
Sbjct: 259 EECPDTELNHAVLTVGYD------KEGDMPYWIIKNSWGTDWGEKGYFRLFRGDCTCGIN 312
Query: 341 DYVRSALV 348
SA++
Sbjct: 313 RMATSAII 320
>gi|4826565|emb|CAB42884.1| cathepsin F [Mus musculus]
Length = 462
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)
Query: 17 VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
+ SSF+ + D + L VK LF F+ +N+TY + E RL +F+ N+ + Q
Sbjct: 139 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 198
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
+Q + G+ YG+ +FSDL+ EF YL L+ + PA N P +DWR+
Sbjct: 199 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 258
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
AVT VK+Q MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SN
Sbjct: 259 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 318
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
A+ I K GGLE E Y Y+G + C + + +V IN V +SR+E +A +L + G
Sbjct: 319 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 376
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
P++VAINA+ +QFY G++HP + C + H+VL+VGYG +PYW IK
Sbjct: 377 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 428
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 429 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461
>gi|11066228|gb|AAG28508.1|AF197480_1 cathepsin F [Mus musculus]
Length = 462
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)
Query: 17 VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
+ SSF+ + D + L VK LF F+ +N+TY + E RL +F+ N+ + Q
Sbjct: 139 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 198
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
+Q + G+ YG+ +FSDL+ EF YL L+ + PA N P +DWR+
Sbjct: 199 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 258
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
AVT VK+Q MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SN
Sbjct: 259 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 318
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
A+ I K GGLE E Y Y+G + C + + +V IN V +SR+E +A +L + G
Sbjct: 319 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 376
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
P++VAINA+ +QFY G++HP + C + H+VL+VGYG +PYW IK
Sbjct: 377 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 428
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 429 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461
>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
Length = 597
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 196/329 (59%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ + L VK ++F F+ +N+TY T E RL +F+ N+ + Q +Q
Sbjct: 278 SGLPLLTKDPLSQDFSVKMASIFKNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQAL 337
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+HG+ YG+ +FSDL+ EF+ YL L+ + A P +DWR+ AVT
Sbjct: 338 DHGTAQYGVTKFSDLTEEEFRTIYLNPLLREVPGKKMHLAKSIGDPAPPEWDWRKNGAVT 397
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 398 KVKDQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 457
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G +AC + + +V IN V +S++E +A +L + GP++V
Sbjct: 458 I--KNLGGLETEDDYSYQGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISV 515
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G++HP++ C + H+VLIVGYG VP+W IKNSWG
Sbjct: 516 AINAFGMQFYRHGIAHPLRPLCSPW--LIDHAVLIVGYG------NRSEVPFWAIKNSWG 567
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG GSCG+N SA+V
Sbjct: 568 TDWGEKGYYYLHRGSGSCGVNTMASSAVV 596
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 180/310 (58%), Gaps = 18/310 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F + K YA + R IF NL + Q Q E G+ YG+ +FSDL+ EF+
Sbjct: 31 LYEQFKRDYGKAYANEDDQ-KRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPEEFE 89
Query: 101 AKYLGFKLKPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
AKYLG ++ D V + N T P + DWRE AV +++Q CGS WAFS GN
Sbjct: 90 AKYLGLRI-----DEQVDRVQLNDLQTAPASVDWREKGAVGPIENQGSCGSCWAFSVVGN 144
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEG + KT LVSLS+Q+L+DCD D+GC GG + I K GGLE + YPY G
Sbjct: 145 IEGQWFLKTGYLVSLSKQQLVDCDTVDNGCYGGYPPYTYKEI--KRMGGLELQSDYPYTG 202
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
CRL++ KI+ + + DE A +L E+GPM+ +NA LQFY +G+ HP +
Sbjct: 203 WGHGCRLDRSKLFAKIDDSIVLEADEEKQAAWLAEHGPMSTCLNAKYLQFYQSGILHPSK 262
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
C E L+H+VL VGY T +PYWIIKNSWG WGE GYFR+YRGDG+CG
Sbjct: 263 AMC--SPEGLNHAVLTVGYD------TKHGIPYWIIKNSWGTSWGEDGYFRIYRGDGTCG 314
Query: 339 INDYVRSALV 348
I+ SA++
Sbjct: 315 IDRLTTSAII 324
>gi|9845246|ref|NP_063914.1| cathepsin F precursor [Mus musculus]
gi|12643321|sp|Q9R013.1|CATF_MOUSE RecName: Full=Cathepsin F; Flags: Precursor
gi|6467384|gb|AAF13147.1|AF136280_1 cathepsin F precursor [Mus musculus]
gi|7141165|gb|AAF37228.1|AF217224_1 cathepsin F [Mus musculus]
gi|26344728|dbj|BAC36013.1| unnamed protein product [Mus musculus]
gi|37589148|gb|AAH58758.1| Cathepsin F [Mus musculus]
gi|148701127|gb|EDL33074.1| cathepsin F, isoform CRA_b [Mus musculus]
Length = 462
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)
Query: 17 VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
+ SSF+ + D + L VK LF F+ +N+TY + E RL +F+ N+ + Q
Sbjct: 139 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 198
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
+Q + G+ YG+ +FSDL+ EF YL L+ + PA N P +DWR+
Sbjct: 199 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 258
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
AVT VK+Q MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SN
Sbjct: 259 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 318
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
A+ I K GGLE E Y Y+G + C + + +V IN V +SR+E +A +L + G
Sbjct: 319 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 376
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
P++VAINA+ +QFY G++HP + C + H+VL+VGYG +PYW IK
Sbjct: 377 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 428
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 429 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461
>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
Length = 397
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 148/353 (41%), Positives = 202/353 (57%), Gaps = 45/353 (12%)
Query: 31 HHLHH-----VKHTAL-------FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
HH HH H L F F+E++ KTY+T EY RL IF+ NL K Q
Sbjct: 51 HHRHHPGRSSANHRLLGTTTEVHFKSFVEEYEKTYSTHEEYVHRLGIFAKNLIKAAEHQA 110
Query: 79 TEHGSGVYGLNEFSDLSTAEFQAKYLGF----------KLKPSYADRSVPAMIPNIT-LP 127
+ S ++G+ +FSDL+ EF+A Y+G +L D S ++ +++ LP
Sbjct: 111 MD-PSAIHGVTQFSDLTEEEFEATYMGLKGGAGVGGTTQLGKDDGDESAAEVMMDVSDLP 169
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--- 184
+FDWRE AVT VK Q CGS WAFSTTG IEG T KL+SLSEQ+L+DCD
Sbjct: 170 ESFDWREKGAVTEVKTQGRCGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDL 229
Query: 185 ------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYV 238
DDGC GG ++ AF+ ++ GG+EEE TYPY G C+ N + VK+ +
Sbjct: 230 KEKDDCDDGCSGGLMTTAFNYLIE--AGGIEEEVTYPYTGKRGECKFNPEKVAVKVRNFA 287
Query: 239 SVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY- 297
+ DE+ +A +V NGP+A+ +NA +Q Y+ GVS P+ CD + ++H VL+VGY
Sbjct: 288 KIPEDESQIAANVVHNGPLAIGLNAVFMQTYIGGVSCPL--ICD--KKRINHGVLLVGYG 343
Query: 298 --GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
G + +K PYWIIKNSWG+ WGE GY+RL RG CG++ V SA+V
Sbjct: 344 SRGFSILRLGYK--PYWIIKNSWGKRWGEHGYYRLCRGHNMCGMSTMV-SAVV 393
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 181/316 (57%), Gaps = 21/316 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F++ K Y ++ EY R +F NL K L + +G+ FSDL+ EF +
Sbjct: 56 FESFMKDFGKVYHSVEEYEHRFGVFKSNLLK-ALKHQALDPTASHGVTMFSDLTEEEFTS 114
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
KYLG K + +P LP FDWRE AV VKDQ CGS WAFSTTG +EG
Sbjct: 115 KYLGLKRPSVLSSAPQAPPLPTEDLPPNFDWREKGAVGPVKDQGGCGSCWAFSTTGAVEG 174
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ + KLVSLSEQ+L+DCD + D GC GG ++NA+ + + GGLE E
Sbjct: 175 AHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQYV--EAAGGLELES 232
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
YPY G D C+ + VK++ + ++ DE +A YL+++GP+A+ INA +Q Y+ G
Sbjct: 233 DYPYEGRDGKCKFDSNKVAVKVSNFTNIPVDEDQVAAYLIKSGPLAIGINAEFMQTYIAG 292
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
VS PI FC+ NL H VL+VGY G + +K PYWIIKNSWG WG+ GY++
Sbjct: 293 VSCPI--FCN--KRNLDHGVLLVGYAERGFAPARLAYK--PYWIIKNSWGPNWGDNGYYK 346
Query: 330 LYRGDGSCGINDYVRS 345
+ RG G CG+N V +
Sbjct: 347 ICRGHGECGLNTMVSA 362
>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
Length = 459
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 138/313 (44%), Positives = 191/313 (61%), Gaps = 10/313 (3%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
VK ++F +F+ +N+TY T E R+ IF+ N+ + Q +Q + G+ YG+ +FSDL+
Sbjct: 156 VKMASIFKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDLT 215
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF+ YL LK + A + P +DWR AVT VKDQ MCGS WAFS
Sbjct: 216 EEEFRTIYLNPLLKEEPGVKMRRAKSVGDSAPPEWDWRSKGAVTEVKDQGMCGSCWAFSV 275
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+ I K GGLE E Y
Sbjct: 276 TGNVEGQWFLNRGALLSLSEQELLDCDKVDKACMGGLPSNAYSAI--KTLGGLETEDDYS 333
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y G +AC + + +V IN V ++++E +A +L + GP++VAINA+ +QFY G+SH
Sbjct: 334 YHGHLQACSFSAEKAKVYINDSVELTKNEQKLAAWLAKKGPISVAINAFGMQFYRHGISH 393
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
P++ C + H+VL+VGYG AVP+W IKNSWG WGE+GY+ LYRG G
Sbjct: 394 PLRPLCSPW--LIDHAVLLVGYG------NRSAVPFWAIKNSWGTDWGEEGYYYLYRGSG 445
Query: 336 SCGINDYVRSALV 348
+CG+N SA+V
Sbjct: 446 ACGVNTMASSAVV 458
>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
Length = 332
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)
Query: 17 VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
+ SSF+ + D + L VK LF F+ +N+TY + E RL +F+ N+ + Q
Sbjct: 9 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 68
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
+Q + G+ YG+ +FSDL+ EF YL L+ + PA N P +DWR+
Sbjct: 69 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 128
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
AVT VK+Q MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SN
Sbjct: 129 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 188
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
A+ I K GGLE E Y Y+G + C + + +V IN V +SR+E +A +L + G
Sbjct: 189 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 246
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
P++VAINA+ +QFY G++HP + C + H+VL+VGYG +PYW IK
Sbjct: 247 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 298
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 299 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 331
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 134/310 (43%), Positives = 182/310 (58%), Gaps = 18/310 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F + K YA + R IF NL + Q Q E G+ YG+ +FSDL+ EF
Sbjct: 31 LYEQFKRDYGKAYANEDDQ-KRFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDLTNEEFA 89
Query: 101 AKYLGFKLKPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
A YLG ++ D V + N T P + DWRE AV V+ Q CGS WAFS T N
Sbjct: 90 AMYLGSRI-----DERVDRVQLNDLQTAPASVDWREKGAVGPVEHQGSCGSCWAFSVTAN 144
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + KT +LVSLS+Q+L+DCD+ D GC GG + I K GGLE + YPY G
Sbjct: 145 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTG 202
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
++ACRL++ KI+ + + ++E A +L E+GPM+ +NA LQFY G+ HP +
Sbjct: 203 WEQACRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSE 262
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
+ C E L+H+VL VGY T + VPYW ++NSWG WGE GYFR+YRGDG+CG
Sbjct: 263 YACS--PEGLNHAVLTVGYD------TERGVPYWTVRNSWGTRWGENGYFRIYRGDGTCG 314
Query: 339 INDYVRSALV 348
I+ SA++
Sbjct: 315 IDRLTTSAII 324
>gi|77628008|ref|NP_001029282.1| cathepsin F precursor [Rattus norvegicus]
gi|71681040|gb|AAH99780.1| Cathepsin F [Rattus norvegicus]
gi|149062007|gb|EDM12430.1| cathepsin F, isoform CRA_a [Rattus norvegicus]
gi|159895422|gb|ABX09995.1| cathepsin F [Rattus norvegicus]
Length = 462
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 192/325 (59%), Gaps = 10/325 (3%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
++ + L VK LF F+ +N+TY + E RL +F+ N+ + Q +Q + G+
Sbjct: 147 MLDKDPLPQDFSVKMATLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGT 206
Query: 84 GVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
YG+ +FSDL+ EF YL L+ + A N P +DWR+ AVT VKD
Sbjct: 207 AQYGITKFSDLTEEEFHTIYLNPLLQKESGGKMSLAKSINDLAPPEWDWRKKGAVTEVKD 266
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSK 203
Q MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+ I K
Sbjct: 267 QGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGLPSNAYTAI--K 324
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
GGLE E Y Y+G +AC + + +V IN V +SRDE +A +L + GP++VAINA
Sbjct: 325 NLGGLETEDDYGYQGHVQACNFSTQMAKVYINDSVELSRDENKIAAWLAQKGPISVAINA 384
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
+ +QFY G++HP + C + H+VL+VGYG +PYW IKNSWG WG
Sbjct: 385 FGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIKNSWGRDWG 436
Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
E+GY+ LYRG G+CG+N SA+V
Sbjct: 437 EEGYYYLYRGSGACGVNTMASSAVV 461
>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
Length = 236
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 123/238 (51%), Positives = 159/238 (66%), Gaps = 8/238 (3%)
Query: 111 SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
S ++R P +LP +FDWR++ VT VKDQ MCGS WAF+ TGNIEG + KTKKL
Sbjct: 6 SRSNRPKVTSYPTQSLPGSFDWRQHGVVTEVKDQGMCGSCWAFAVTGNIEGQWYKKTKKL 65
Query: 171 VSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT 230
VSLSEQ+L+DCD++D+ C GG A+++I+ GGL EK YPY + C L
Sbjct: 66 VSLSEQQLLDCDKKDEACNGGFPEWAYESIVKM--GGLMSEKDYPYEAHKETCNLKPNNI 123
Query: 231 QVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSH 290
IN V++S+DE ++A +L ENGP++V +NA LQFY GVSHP C + L H
Sbjct: 124 SAYINDSVTLSKDEKELAAWLTENGPISVGMNANFLQFYFGGVSHPPHMLCS--EQGLDH 181
Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+VL+VGYGV T F + PYWI+KNSWG WGEKGYFR+YRGDG+CGIN S++V
Sbjct: 182 AVLLVGYGV--TSFWQR--PYWIVKNSWGRSWGEKGYFRIYRGDGTCGINADATSSIV 235
>gi|148701126|gb|EDL33073.1| cathepsin F, isoform CRA_a [Mus musculus]
Length = 417
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)
Query: 17 VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
+ SSF+ + D + L VK LF F+ +N+TY + E RL +F+ N+ + Q
Sbjct: 94 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 153
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
+Q + G+ YG+ +FSDL+ EF YL L+ + PA N P +DWR+
Sbjct: 154 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 213
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
AVT VK+Q MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SN
Sbjct: 214 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 273
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
A+ I K GGLE E Y Y+G + C + + +V IN V +SR+E +A +L + G
Sbjct: 274 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 331
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
P++VAINA+ +QFY G++HP + C + H+VL+VGYG +PYW IK
Sbjct: 332 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 383
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 384 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 416
>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
familiaris]
Length = 490
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 138/330 (41%), Positives = 199/330 (60%), Gaps = 11/330 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ + L VK ++F F+ +N+TY T E R+ +FS N+ + Q +Q
Sbjct: 170 SVLPLLNKDPLPQDFSVKMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQAL 229
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR-SVPAMIPNITLPRAFDWREYDAV 138
+ G+ YG+ +FSDL+ EF+ YL L+ + + + I + P +DWR AV
Sbjct: 230 DRGTAQYGITKFSDLTEEEFRTIYLNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAV 289
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
T VKDQ MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 290 TKVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCDKVDKACLGGLPSNAYS 349
Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMA 258
IM+ GGLE E Y Y+G +AC + K +V IN + +S++E +A +L + GP++
Sbjct: 350 AIMTL--GGLETEDDYSYQGHLQACSFSAKKARVYINDSMELSQNEQKLAAWLAKKGPIS 407
Query: 259 VAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
VAINA+ +QFY G+SHP++ C + H+VL+VGYG +P+W IKNSW
Sbjct: 408 VAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSGIPFWAIKNSW 459
Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
G WGE+GY+ L+RG G+CG+N SA+V
Sbjct: 460 GTDWGEEGYYYLHRGSGACGVNTMASSAVV 489
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 182/317 (57%), Gaps = 25/317 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+++ + Y+++ E R I+ N+ + LQ E G+ +YG +FSD++ EFQ
Sbjct: 159 FMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQK 218
Query: 102 KYLGFKLKPS-YADRSVPAMIP------NIT---LPRAFDWREYDAVTGVKDQTMCGSSW 151
L PS + DR I N++ LP FDWR VT VKDQ CGS W
Sbjct: 219 IML-----PSIWWDRVESNGITFNLNDFNLSIYNLPSKFDWRTEGVVTPVKDQGSCGSCW 273
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
AFS TGNIE ++A KT KL+SLSEQELIDCD D GC GG NAF I K GGLE E
Sbjct: 274 AFSVTGNIESLWAIKTGKLISLSEQELIDCDVIDKGCNGGLPINAFREI--KRMGGLEPE 331
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY + C L + V I+ V + R+ET M ++ + GP++V I+A L +Y +
Sbjct: 332 DQYPYEAKNGTCHLVRAQIAVSIDDAVEIPRNETVMKAWIAQRGPLSVGIDAELLSYYKS 391
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ HP + C ++H VLI GYG++ +PYW IKNSWGE WGE GYF+L
Sbjct: 392 GILHPSKSRCPPS--KINHGVLITGYGIENN------LPYWTIKNSWGEQWGENGYFQLM 443
Query: 332 RGDGSCGINDYVRSALV 348
RG CG++D V SA++
Sbjct: 444 RGKNICGVSDLVSSAII 460
>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
Length = 365
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 188/329 (57%), Gaps = 24/329 (7%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D+ L HH F F + KTYAT E+ R IF NLR+ + Q + S V+
Sbjct: 43 DDLLSAEHH------FAAFKARFRKTYATAEEHDYRFSIFKANLRRAKRNQLLDP-SAVH 95
Query: 87 GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
G+ FSDL+ AEF+ YLG K D ++P LP FDWR++ AVT VKDQ
Sbjct: 96 GVTRFSDLTPAEFRQNYLGLKPLRFPIDTQQAPILPTNDLPTDFDWRDHGAVTAVKDQGE 155
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAF 197
CGS W+FSTTG +EG + T LVSLSEQ+L+DCD E D GC GG ++ AF
Sbjct: 156 CGSCWSFSTTGALEGAHFLATGNLVSLSEQQLVDCDHECDPEEYGACDRGCNGGLMNTAF 215
Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPM 257
+ I+ GG+ + YPY G D C+ +K ++ + +VS DE +A LV+NGP+
Sbjct: 216 EYILK--AGGVVRGEDYPYTGTDGHCKFDKTKIAASVSNFSTVSIDEDQIAANLVKNGPL 273
Query: 258 AVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKN 316
AV INA +Q Y GVS P F C + +L+H VL+VGYG + K PYW++KN
Sbjct: 274 AVGINAIFMQSYAGGVSCP--FIC---STSLNHGVLLVGYGSAGYSPIRFKEKPYWLLKN 328
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
SWG+ WGE GY+++ RG CG++ V +
Sbjct: 329 SWGQNWGEHGYYKICRGHNICGVDSMVST 357
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 142/348 (40%), Positives = 194/348 (55%), Gaps = 28/348 (8%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
+SCF ++S ++VS+ V + L+ F + K YA +
Sbjct: 6 VSCFAL-----IVSCAIAVSAGRVPDSAR----------ELYEQFKRGYGKVYAN-EDDQ 49
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
R IF NL + Q LQ + G+ YG+ +FSDL+ EF AKYL + R P
Sbjct: 50 KRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAAKYLSAPVNDDQVKRMRPTG 109
Query: 121 IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
+ P DWR AVT V++Q CGS WAFST GN+EG + KT +LVSLS+Q+L+D
Sbjct: 110 LK--AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVD 167
Query: 181 CDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV 240
CD+ GC GG ++++ IM GGLE E YPY G ++ C LNK+ KI+ + +
Sbjct: 168 CDRAAQGCNGGWPASSYLEIMYM--GGLESESDYPYVGVEQTCALNKEKLVAKIDDSIVL 225
Query: 241 SRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
+E D A YL E+GP++ +NA ALQ Y +GV P F + + L+H+VL VGY
Sbjct: 226 GPEEEDHAAYLAEHGPLSTLLNAVALQHYQSGVLKPT--FDECPDTELNHAVLTVGYD-- 281
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+PYWIIKNSWG WGEKGYFRL+RGD +CGIN SA++
Sbjct: 282 ----KEGDMPYWIIKNSWGTDWGEKGYFRLFRGDCTCGINRMATSAII 325
>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
Length = 473
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 136/312 (43%), Positives = 190/312 (60%), Gaps = 10/312 (3%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
K ++F F+ +N+TY T E R+ +F+ N+ + Q LQ + G+ YG+ +FSDL+
Sbjct: 171 KMASIFKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGITKFSDLTE 230
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
EF+ YL L+ + P +P +DWR AVT VKDQ MCGS WAFS T
Sbjct: 231 EEFRTIYLNPLLREDPGQKMRLGKAPKGPVPPDWDWRTKGAVTKVKDQGMCGSCWAFSVT 290
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+EG + L+SLSEQEL+DCD+ D C GG SNA+ I K GGLE E+ Y Y
Sbjct: 291 GNVEGQWFLNRGTLLSLSEQELLDCDKVDKACMGGVPSNAYSAI--KTLGGLETEEDYSY 348
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +AC + + +V IN V +S++E +A +L +NGP++VAINA+ +QFY G++HP
Sbjct: 349 HGHLQACSFSAEKAKVYINDSVELSQNEYKLAAWLAKNGPISVAINAFGMQFYRHGIAHP 408
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
++ C + H+VLIVGYG VP+W IKNSWG WGE+GY+ L+RG G+
Sbjct: 409 LRPLCS--PWLIDHAVLIVGYG------NRSDVPFWAIKNSWGTDWGEEGYYYLHRGSGA 460
Query: 337 CGINDYVRSALV 348
CG+N SA+V
Sbjct: 461 CGVNTMASSAVV 472
>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
Length = 359
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 142/337 (42%), Positives = 195/337 (57%), Gaps = 32/337 (9%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
VV DE L HH F F + K YAT E+ R ++F N+ R+ QLL
Sbjct: 33 VVDDEGLGAEHH------FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-- 84
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
S V+G+ +FSDL+ EFQ LG + +D ++P LP+ FDWRE+ AVT
Sbjct: 85 --SAVHGVTQFSDLTPMEFQHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWREHGAVTP 142
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEG 190
VK+Q CGS W+FS TG +EG + T +LVSLSEQ+L+DCD + D GC G
Sbjct: 143 VKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQQCDPEEAGSCDSGCNG 202
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAK 249
G +++AF+ I++ GG+ E+ YPY G + C+ +K + + VSRDE +A
Sbjct: 203 GLMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIAA 260
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKA 308
LV+NGP+AVAINA +Q YV GVS P + C ++ L+H VL+VGYG + K
Sbjct: 261 NLVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQ 315
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 316 KPYWIIKNSWGENWGENGYYKICRGRNICGVDSMVST 352
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 192/335 (57%), Gaps = 23/335 (6%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VV D++ L H F+ FL ++ K+YA E+ R +F NLR+ + Q +
Sbjct: 29 QVVSDDQQQLLSAEAH---FSSFLSRYGKSYADEAEHAYRFSVFKSNLRRARRHQRLDP- 84
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-MIPNITLPRAFDWREYDAVTGV 141
+ V+G+ F+DL+ +EF+ YLG + +P A + A ++P LP FDWR++ AVT V
Sbjct: 85 TAVHGVTRFADLTPSEFRRTYLGLRRRPRTAGSTHDAPILPTNELPADFDWRDHGAVTPV 144
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGS 192
K+Q CGS W+FS G +EG T LVSLSEQ+L+DCD E D GC GG
Sbjct: 145 KNQGSCGSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGCNGGL 204
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYL 251
++ AF+ I+ GGLE E YPY G D+ C+ NK + + VS DE +A L
Sbjct: 205 MTTAFEYILKS--GGLEREADYPYTGTDRGTCKFNKAKISAVASNFSVVSIDEDQIAANL 262
Query: 252 VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVP 310
V++GP+AV INA +Q YV GVS P + C ++L H VL+VGYG K P
Sbjct: 263 VKHGPLAVGINAVFMQTYVGGVSCP--YIC---GKHLDHGVLLVGYGSAGFAPIRFKEKP 317
Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
YWIIKNSWGE WGE GY+++ RG CG++ V S
Sbjct: 318 YWIIKNSWGENWGENGYYKICRGRNVCGVDSMVSS 352
>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 368
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/337 (41%), Positives = 195/337 (57%), Gaps = 29/337 (8%)
Query: 24 VVGD---EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
VVGD + L+ HH F F + K YA+ E+ RL +F N+R+ + Q+ +
Sbjct: 36 VVGDGDGDLLNADHH------FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELD 89
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVT 139
+ V+G+ +FSD + EF+ K+LG + + AD ++P LP FDWR+ AVT
Sbjct: 90 PAA-VHGVTQFSDSTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDRGAVT 148
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD---------GCEG 190
VK+Q CG W+FSTTG +EG T KLVSLSEQ+L+DCD E D GC G
Sbjct: 149 PVKNQGTCGLCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDFGCNG 208
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAK 249
G +++AF+ + GGL E+ YPY G+D + CR +K K+ + VS DE +A
Sbjct: 209 GLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAA 266
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
LV+NGP+AVAINA +Q Y+ GVS P + C ++ L H VL+VGYG K
Sbjct: 267 NLVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKE 321
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 322 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 358
>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
Length = 460
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 138/332 (41%), Positives = 198/332 (59%), Gaps = 11/332 (3%)
Query: 18 SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
+ SSF+ ++ + L VK ++F F+ +N+TY + E R+ +F+ N+ + Q +
Sbjct: 138 TFSSFLPLLNKDPLPQDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKI 197
Query: 77 QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
Q + G+ YG+ +FSDL+ EF+ YL LK + A P +DWR
Sbjct: 198 QALDRGTAQYGVTKFSDLTEEEFRTIYLNPLLKDAPGRNMRLAQPVTDVPPPQWDWRNKG 257
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
AVT VKDQ MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA
Sbjct: 258 AVTDVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNA 317
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
+ I + GGLE E Y YRG + C + + +V IN V +S++E +A +L + GP
Sbjct: 318 YSAI--RTLGGLETEDDYSYRGHLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGP 375
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
++VAINA+ +QFY G+SHP++ C + H+VL+VGYG A P+W IKN
Sbjct: 376 ISVAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSATPFWAIKN 427
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWG WGE+GY+ L+RG G+CG+N SA++
Sbjct: 428 SWGTNWGEEGYYYLHRGSGACGVNIMASSAVI 459
>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
Length = 363
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 139/337 (41%), Positives = 192/337 (56%), Gaps = 28/337 (8%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDT 79
VV + +H+ + +H F+ F ++ K YA+ E+ RL +F NLR+ QLL T
Sbjct: 30 QVVSETDDNHMLNAEHH--FSLFKSKYGKIYASQEEHDHRLKVFKANLRRARRHQLLDPT 87
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
+G+ +FSDL+ +EF+ YLG K +P + P ++P LP FDWRE AV
Sbjct: 88 AE----HGITQFSDLTPSEFRRTYLGLHKPRPKLNAQKAP-ILPTSDLPEDFDWREKGAV 142
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
TGVK+Q CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC
Sbjct: 143 TGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEKSECDAGCN 202
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
GG ++ AF+ + GGL+ EK YPY G D C +K + + + DE +A
Sbjct: 203 GGLMTTAFEYTLK--AGGLQREKDYPYTGRDGKCHFDKSKIAASVANFSVIGLDEDQIAA 260
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
LV++GP+AV INA +Q Y+ GVS P+ F + H VL+VGYG K
Sbjct: 261 NLVKHGPLAVGINAAWMQTYMRGVSCPLICF-----KRQDHGVLLVGYGSAGFAPIRLKE 315
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 316 KPYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352
>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
Length = 477
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 138/332 (41%), Positives = 198/332 (59%), Gaps = 11/332 (3%)
Query: 18 SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
+ SSF+ ++ + L VK ++F F+ +N+TY + E R+ +F+ N+ + Q +
Sbjct: 155 TFSSFLPLLNKDPLPQDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKI 214
Query: 77 QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
Q + G+ YG+ +FSDL+ EF+ YL LK + A P +DWR
Sbjct: 215 QALDRGTAQYGVTKFSDLTEEEFRTIYLNPLLKDAPGRNMRLAQPVTDVPPPQWDWRNKG 274
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
AVT VKDQ MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA
Sbjct: 275 AVTDVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNA 334
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
+ I + GGLE E Y YRG + C + + +V IN V +S++E +A +L + GP
Sbjct: 335 YSAI--RTLGGLETEDDYSYRGHLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGP 392
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
++VAINA+ +QFY G+SHP++ C + H+VL+VGYG A P+W IKN
Sbjct: 393 ISVAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSATPFWAIKN 444
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWG WGE+GY+ L+RG G+CG+N SA++
Sbjct: 445 SWGTNWGEEGYYYLHRGSGACGVNIMASSAVI 476
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 179/308 (58%), Gaps = 13/308 (4%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F + K YA + R IF NL + Q LQ + G+ YG+ +FSDL+ EF
Sbjct: 26 LYEQFKRDYGKVYAN-EDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFA 84
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
AKYL + R P + P DWR AVT V++Q CGS WAFST GN+E
Sbjct: 85 AKYLSAPVNNDQVKRVRPTGLK--AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVE 142
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + KT +LVSLS+Q+L+DCD+ DGC GG ++++ IM GGLE + YPY G
Sbjct: 143 GQWFIKTGQLVSLSKQQLVDCDRAADGCNGGWPASSYLEIMHM--GGLESQDDYPYAGVK 200
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ C + K+ KI+ +++ E D A YL E+GP++ +NA LQ+Y +G+ HP
Sbjct: 201 EQCFMEKERLLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYEE 260
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C +L+H+VL VGY +PYWIIKNSW WGEKGYFRLYRGDG+CGIN
Sbjct: 261 C--SPVDLNHAVLTVGYD------KEGDMPYWIIKNSWNVEWGEKGYFRLYRGDGTCGIN 312
Query: 341 DYVRSALV 348
SA++
Sbjct: 313 RMPTSAII 320
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 188/314 (59%), Gaps = 11/314 (3%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
V+ LF F+ ++NK Y++ E RL IF NL+ + +Q + GS YG+ +FSDL+
Sbjct: 172 VELLGLFKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSDLT 231
Query: 96 TAEFQAKYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
EF+ YL L R + PA P ++DWR++ AV+ VK+Q +CGS WAFS
Sbjct: 232 EEEFRLTYLNPLLSQWTLRRPMKPASPARSPAPASWDWRDHGAVSPVKNQGLCGSCWAFS 291
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TGNIEG + K KL+SLSEQEL+DCD D C GG SNA++ I GGLE E Y
Sbjct: 292 VTGNIEGQWFLKHGKLLSLSEQELVDCDGLDHACRGGLPSNAYEAIEGL--GGLEAENDY 349
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
Y G + C + IN V + DE +MA +L ENGP++VA+NA+A+QFY GVS
Sbjct: 350 TYSGHKQKCSFATEKVAAYINSSVELPSDENEMAAWLAENGPVSVALNAFAMQFYKKGVS 409
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
HP C+ + H+VL+VGYG +P+W IKNSWGE +GE+GY+ LY+G
Sbjct: 410 HPWMILCNPW--MIDHAVLLVGYG------ERNGIPFWAIKNSWGEDYGEEGYYYLYKGS 461
Query: 335 GSCGINDYVRSALV 348
+CGIN SA++
Sbjct: 462 NACGINKMGSSAVI 475
>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
Length = 459
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ + L VK ++F F+ +N+TY T E RL +FS N+ + Q +Q
Sbjct: 140 SVLPLLNKDPLPQDFSVKMASIFKEFVTTYNRTYGTQEEAQWRLSVFSNNMVRAQKIQAL 199
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+A YL LK + A P +DWR AVT
Sbjct: 200 DRGTAQYGITKFSDLTEEEFRAIYLNPLLKENRNKMMHLAKSIGDHAPPEWDWRTKGAVT 259
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VK+Q MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 260 NVKNQGMCGSCWAFSVTGNVEGQWFLKQGDLLSLSEQELLDCDKVDKACLGGLPSNAYLA 319
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y G + C + K +V IN V +S++E +A +L + GP++V
Sbjct: 320 I--KNLGGLETEDDYSYSGHLQTCSFSAKKAKVYINDSVELSQNEQKLAAWLAKKGPISV 377
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+SHP++ C + H+VL+VGYG +P+W IKNSWG
Sbjct: 378 AINAFGMQFYRRGISHPLRPLCSPW--LIDHAVLLVGYG------NRSGIPFWAIKNSWG 429
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 430 TDWGEEGYYYLYRGSGACGVNAMASSAVV 458
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 181/314 (57%), Gaps = 18/314 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +F+ ++ K Y+ E+ R +F NL + Q + + +G+ +FSDL+ EF+
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRAS-HGVTKFSDLTQEEFRH 115
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+YLG + P P ++P LP FDWRE AVT VK+Q CGS WAFSTTG +EG
Sbjct: 116 QYLGLRAPPLRDAHDAP-ILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEG 174
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
KT +LVSLSEQ+L+DCD E D GC GG +++A+ + GGLE+E+
Sbjct: 175 ANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEE 232
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
YPY G D C NK ++ + VS DE +A LV+NGP++V INA +Q YV G
Sbjct: 233 DYPYTGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGG 292
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
VS P + C NL H VL+VGYG K PYW+IKNSWG WGE GY++L
Sbjct: 293 VSCP--YVC--SKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLC 348
Query: 332 RGDGSCGINDYVRS 345
RG CGIN+ V +
Sbjct: 349 RGHNVCGINNMVST 362
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 181/314 (57%), Gaps = 18/314 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +F+ ++ K Y+ E+ R +F NL + Q + + +G+ +FSDL+ EF+
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRAS-HGVTKFSDLTQEEFRH 115
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+YLG + P P ++P LP FDWRE AVT VK+Q CGS WAFSTTG +EG
Sbjct: 116 QYLGLRAPPLRDAHDAP-ILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEG 174
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
KT +LVSLSEQ+L+DCD E D GC GG +++A+ + GGLE+E+
Sbjct: 175 ANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEE 232
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
YPY G D C NK ++ + VS DE +A LV+NGP++V INA +Q YV G
Sbjct: 233 DYPYTGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGG 292
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
VS P + C NL H VL+VGYG K PYW+IKNSWG WGE GY++L
Sbjct: 293 VSCP--YVC--SKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLC 348
Query: 332 RGDGSCGINDYVRS 345
RG CGIN+ V +
Sbjct: 349 RGHNVCGINNMVST 362
>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
Length = 367
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 142/334 (42%), Positives = 193/334 (57%), Gaps = 29/334 (8%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGS 83
D+ +HL + +H F+ F + K YAT E+ RL +F NLR+ QLL T
Sbjct: 37 DDNNNHLLNAEHH--FSLFKSKFGKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAE-- 92
Query: 84 GVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
+G+ +FSDL+ +EF+ YLG K KP + P ++P LP FDWRE AVTGVK
Sbjct: 93 --HGITKFSDLTPSEFRRTYLGLHKPKPKLSTTKAP-ILPTSDLPEDFDWREKGAVTGVK 149
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
+Q CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC GG +
Sbjct: 150 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLM 209
Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
+ AF+ + GGL+ EK YPY G + C +K + Y V DE +A LV+
Sbjct: 210 TTAFEYTLK--AGGLQREKDYPYTGRNGQCHFDKSKIAASVTNYSVVGLDEDQIAANLVK 267
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
+GP+AV IN+ +Q Y+ GVS P+ F ++ H VL+VGYG KA PYW
Sbjct: 268 HGPLAVGINSAWMQTYIGGVSCPLVCF-----KHQDHGVLLVGYGSAGFAPIRLKAKPYW 322
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGS-CGINDYVRS 345
IIKNSWGE WGE GY+++ RG + CG++ V +
Sbjct: 323 IIKNSWGEHWGEHGYYKICRGQHNICGVDAMVST 356
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 179/310 (57%), Gaps = 18/310 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F + K YA + R IF NL + Q Q E G+ YG+ +FSDL+ EF
Sbjct: 31 LYEQFKRDYGKAYAN-EDDQKRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPEEFA 89
Query: 101 AKYLGFKLKPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
A YLG ++ D V + N T P + DWR+ AV V+DQ CGS WAFS T N
Sbjct: 90 AMYLGSRI-----DERVDRVQLNDLQTAPASVDWRKKGAVGPVEDQGSCGSCWAFSVTAN 144
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + KT +LVSLS+Q+L+DCD+ D GC GG + I K GGLE + YPY
Sbjct: 145 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTS 202
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
+ACR+++ KI+ + + DE A +L E+GPM+ +NA LQFY +G+ HP +
Sbjct: 203 WKQACRIDRSKLVAKIDDSIVLETDEEKQAAWLAEHGPMSTCLNAGPLQFYQSGILHPSK 262
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
C E L+H+VL VGY T VPYW ++NSWG WGE GYFR+YRGDG+CG
Sbjct: 263 AMC--SPEGLNHAVLTVGYD------TEHGVPYWTVRNSWGTRWGENGYFRIYRGDGTCG 314
Query: 339 INDYVRSALV 348
I+ SA++
Sbjct: 315 IDRLTTSAII 324
>gi|91992514|gb|ABE72973.1| cathepsin L [Aedes aegypti]
Length = 265
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 123/269 (45%), Positives = 173/269 (64%), Gaps = 12/269 (4%)
Query: 86 YGLNEFSDLSTAEFQAKYLGFKLKPSYADRS-----VPAMIPNITLPRAFDWREYDAVTG 140
YG+ F+D+++AE++ + G + P DR+ + N+ LP +FDWRE AV+
Sbjct: 2 YGITHFADMTSAEYRQR-TGLVI-PRDEDRNHVGNPKAEIDENMELPESFDWRELGAVSP 59
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
VK+Q CGS WAFS GNIEG++ KTK L SEQEL+DCD D C+GG + +A+ I
Sbjct: 60 VKNQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCDAVDSACQGGYMDDAYKAI 119
Query: 201 MSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
K+ GGLE E YPY K C N V++ G V + ++ET MA+YLV NGP+++
Sbjct: 120 -EKI-GGLELESEYPYLAKKQKTCHFNSTEVHVRVKGAVDLPKNETAMAQYLVANGPISI 177
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
+NA A+QFY G+SHP + C +NL H VLIVGYGV +K +PYWI+KNSWG
Sbjct: 178 GLNANAMQFYRGGISHPWKPLCS--KKNLDHGVLIVGYGVKEYPMFNKTMPYWIVKNSWG 235
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+R++RGD +CG+++ SA++
Sbjct: 236 PKWGEQGYYRIFRGDNTCGVSEMASSAVL 264
>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 138/332 (41%), Positives = 188/332 (56%), Gaps = 25/332 (7%)
Query: 25 VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
V D L HH F F + K YAT E+ R +F NLR+ Q Q + S
Sbjct: 40 VEDYLLSAQHH------FTAFKAKFGKNYATQEEHDYRFKVFKANLRRAQKHQLMDP-SA 92
Query: 85 VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
V+G+ +FSDL+ EF+ +YLG K AD ++P +P FDWR++ AVT VK+Q
Sbjct: 93 VHGVTKFSDLTPREFRRQYLGLKKLRLPADAHEAPILPTDGIPEDFDWRDHGAVTNVKNQ 152
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISN 195
CGS W+FS G +EG + T +LVSLSEQ+L+DCD E D GC GG ++N
Sbjct: 153 GSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLMTN 212
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
AF+ I+ GGLE E+ YPY G D+ C+ + +N + VS DE +A LV+N
Sbjct: 213 AFEYILK--AGGLEREEDYPYTGSDRGPCKFERAKIAASVNNFSVVSVDEDQIAANLVQN 270
Query: 255 GPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWI 313
GP+AV INA +Q Y+ GVS P + C ++ H V++VGYG K P+WI
Sbjct: 271 GPLAVGINAVFMQTYIGGVSCP--YIC---SKRQDHGVVLVGYGSAGYAPVRLKDKPFWI 325
Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
IKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 326 IKNSWGENWGENGYYKICRGRNVCGVDAMVST 357
>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
Length = 408
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 192/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ E L VK ++F F+ +N+TY + E R+ +FS N+ + Q +Q
Sbjct: 90 SVLPLLNKEPLPQDFSVKMASIFKEFVTTYNRTYESKEETQWRMSVFSNNMMRAQKIQAL 149
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ + P +DWR AVT
Sbjct: 150 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREYRGKNMRLDKSTGDSAPSEWDWRRKGAVT 209
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VK+Q MCGS WAFS TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 210 KVKNQGMCGSCWAFSVTGNVEGQWFLKQGALLSLSEQELLDCDKVDKACLGGLPSNAYSA 269
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y YRG + C + K +V IN V +S++E +A +L E GP++V
Sbjct: 270 I--KTLGGLETEDDYSYRGRMQTCGFSPKKARVYINDSVELSQNEETLAAWLAEKGPISV 327
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+SHP++ C + H+VL+VGYG P+W IKNSWG
Sbjct: 328 AINAFGMQFYRHGISHPLRPLCS--PWLIDHAVLLVGYG------NRSGTPFWAIKNSWG 379
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GY+ L+RG G+CG+N SA+V
Sbjct: 380 SDWGEEGYYYLHRGSGACGVNTMASSAVV 408
>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
Length = 366
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 141/336 (41%), Positives = 196/336 (58%), Gaps = 28/336 (8%)
Query: 24 VVGD--EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
VVGD + L+ HH F F + K YA+ E+ RL F N+R+ + Q+ +
Sbjct: 35 VVGDGGDLLNADHH------FTVFKRRFGKVYASDEEHDYRLSEFKANMRRAKQHQELDP 88
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVTG 140
+ V+G+ +FSDL+ EF+ K+LG + + AD ++P LP FDWR++ AVT
Sbjct: 89 AA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVTP 147
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
VK+Q CGS +FSTTG +EG T KLVSLSEQ+L+DCD E D GC GG
Sbjct: 148 VKNQGTCGSCCSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGG 207
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAKY 250
+++AF+ + GGL E+ +PY G+D + CR +K K+ + VS DE +A
Sbjct: 208 LMNSAFEYTLK--AGGLMREEDHPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAAN 265
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAV 309
LV+NGP+AVAINA +Q Y+ GVS P + C ++ L H VL+VGYG K
Sbjct: 266 LVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKEK 320
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 321 PYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 356
>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
Length = 361
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 142/337 (42%), Positives = 194/337 (57%), Gaps = 32/337 (9%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
VV DE L HH F F + K YAT E+ R ++F N+ R+ QLL
Sbjct: 33 VVDDEGLGAEHH------FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-- 84
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
S V+G+ FSDL+ EF+ LG + +D ++P LP+ FDWRE+ AVT
Sbjct: 85 --SAVHGVTRFSDLTPMEFRHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWREHGAVTP 142
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEG 190
VK+Q CGS W+FS TG +EG + T KLVSLSEQ+L+DCD E D GC+G
Sbjct: 143 VKNQGSCGSCWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHEQCDPEEAGSCDSGCKG 202
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAK 249
G +++AF+ I++ GG+ E+ YPY G C+ ++ + + VSRDE +A
Sbjct: 203 GLMNSAFEYILNN--GGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIAA 260
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKA 308
LV+NGP+AVAINA +Q YV GVS P + C ++ L+H VL+VGYG + K
Sbjct: 261 NLVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQ 315
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 316 KPYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVST 352
>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
Length = 356
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 146/361 (40%), Positives = 199/361 (55%), Gaps = 32/361 (8%)
Query: 12 LLSLTVSVSSFMVVGDE---KLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
L+ + + V+SF++ + + L + LF F +H K Y T R IF
Sbjct: 4 LILVVLLVASFILAIEAAKGPFNALPESEMQQLFTQFRRKHVKLYGTKQVQDRRYQIFKQ 63
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYAD------RSVPA--- 119
N+ + + + G+ FSDL+ EF++ +L P A R PA
Sbjct: 64 NVERARFENYLTERDNM-GVTRFSDLTPDEFKSMFLMKSYTPKQARELLSGMRQYPANAK 122
Query: 120 --MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
M P+ FDWRE++AVT VKDQ CGS W FSTTGN+EG+YAAKT KL+SLSEQ+
Sbjct: 123 LTMKQVSDAPKEFDWREHNAVTPVKDQGNCGSCWTFSTTGNVEGMYAAKTGKLISLSEQQ 182
Query: 178 LIDCDQE----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK 227
L+DCD + GC GG + ++F+ I+ GGL E++YPY D CR N
Sbjct: 183 LVDCDHNCVVWEGEKTCNAGCNGGLMWSSFEHIIKT--GGLVTEESYPYEAVDNRCRFNV 240
Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
VKI+ + VS +E +MA +L NGP+A+AINA LQ+Y G+ +P + CD E
Sbjct: 241 SNAVVKISNWTFVSSNEDEMAAWLANNGPIAIAINADYLQYYRKGILNPSR--CDP--EE 296
Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSAL 347
L+H VLIVGYG ++ K YWI+KNSW WGEKGY R+ RG G CG+N SAL
Sbjct: 297 LNHGVLIVGYGEEKAA-NGKVEKYWIVKNSWSASWGEKGYVRVLRGKGVCGLNAVPSSAL 355
Query: 348 V 348
+
Sbjct: 356 I 356
>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
Length = 373
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 134/305 (43%), Positives = 182/305 (59%), Gaps = 19/305 (6%)
Query: 52 TYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPS 111
TYA+ E+ R IF NLR+ + Q + + +G+ +FSDL+ +EF+ ++LG +
Sbjct: 68 TYASQEEHDYRFKIFKSNLRRAERHQKLDP-TATHGVTQFSDLTHSEFRRQFLGLRRLRL 126
Query: 112 YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
D + M+P LP FDWRE AVT VK+Q CGS W+FSTTG +EG T KLV
Sbjct: 127 PKDANEAPMLPTNDLPADFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGKLV 186
Query: 172 SLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK- 221
SLSEQ+L+DCD E D GC GG +++AF+ + GGL E+ YPY G D+
Sbjct: 187 SLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGTDRG 244
Query: 222 ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFC 281
AC+ +K K+ + VS DE +A LV+NGP+AVAINA +Q Y+ GVS P + C
Sbjct: 245 ACQFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YIC 302
Query: 282 DGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
++ L H VL+VGYG K PYWIIKNSWGE WGE GY+++ RG CG++
Sbjct: 303 ---SKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGENWGESGYYKICRGRNICGVD 359
Query: 341 DYVRS 345
V +
Sbjct: 360 SMVST 364
>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
Length = 368
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 144/364 (39%), Positives = 202/364 (55%), Gaps = 31/364 (8%)
Query: 3 CFYFFAGVALLSLTVSVSS----------FMVVGDEKLHHLHHVKHTALFNYFLEQHNKT 52
C F ALLS T++ ++ VV D HL + +H F F + KT
Sbjct: 5 CLISFLVYALLSFTIASTTSPDELDDPLIRQVVPDGDQDHLLNAEHH--FTTFKAKFGKT 62
Query: 53 YATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY 112
YAT E+ R +F NLR+ + Q + + V+G+ FSDL+ EF+ +YLG +
Sbjct: 63 YATQEEHDYRFKLFKANLRRARKHQMMD-PTAVHGVTMFSDLTPREFRRQYLGLRRLRLP 121
Query: 113 ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
AD ++P LP FDWR++ AVT VK+Q CGS W+FS G +EG + T +LVS
Sbjct: 122 ADAHEAPILPTNDLPTDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLATGELVS 181
Query: 173 LSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK-A 222
LSEQ+L+DCD E D GC GG ++ AF+ + GGLE E+ YPY G+D+
Sbjct: 182 LSEQQLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLK--AGGLEREEDYPYTGNDRGP 239
Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
C+ ++ ++ + VS DE +A LV++GP+AV INA +Q Y+ GVS P + C
Sbjct: 240 CKFDRNKIVASVSNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQTYMGGVSCP--YIC- 296
Query: 283 GGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
++ H VL+VGYG K P+WIIKNSWGE WGE GY+R+ RG CG++
Sbjct: 297 --SKRQDHGVLLVGYGSAGYAPIRLKDKPFWIIKNSWGESWGENGYYRICRGRNICGVDA 354
Query: 342 YVRS 345
V S
Sbjct: 355 MVSS 358
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 143/348 (41%), Positives = 196/348 (56%), Gaps = 25/348 (7%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
L+ V + D L HH F F + +TY T E+ RL +F NLR
Sbjct: 26 LIRQVVQNDETEIESDPLLDPEHH------FKLFKNKFGRTYDTEEEHEYRLTVFKSNLR 79
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAF 130
+ + Q + + +G+ +FSDL+ +EF+ KYLG K K AD + ++P LP+ F
Sbjct: 80 RAKRHQVLDP-TAKHGVTKFSDLTPSEFRKKYLGLKSKLKLPADANKAPILPTSNLPQDF 138
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE------ 184
DWR+ AVT VK+Q CGS W+FSTTG +EG + +T +LVSLSEQ+L+DCD E
Sbjct: 139 DWRDKGAVTPVKNQGSCGSCWSFSTTGALEGSHFLQTGELVSLSEQQLVDCDHECDPAEY 198
Query: 185 ---DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
D GC GG ++NAF+ I+ GGL++E YPY G D C+ +K + + VS
Sbjct: 199 NSCDSGCNGGLMNNAFEYILK--AGGLQKEADYPYTGRDGTCKFDKSKIAASVANFSVVS 256
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VD 300
DE +A LV NGP+A+ INA +Q Y+ VS P + C + H VL+VGYG
Sbjct: 257 TDEDQIAANLVTNGPLAIGINAAWMQTYIGQVSCP--YIC--SKTKMDHGVLLVGYGSAG 312
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
K PYWIIKNSWGE WGE GY++L G +CG++ V SA+V
Sbjct: 313 YAPLRFKEKPYWIIKNSWGEDWGEDGYYKLCSGYNACGMDTMV-SAVV 359
>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
Length = 360
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 141/336 (41%), Positives = 194/336 (57%), Gaps = 31/336 (9%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
VV DE L HH F F + K YAT E+ R ++F N+ R+ QLL
Sbjct: 33 VVDDEGLGAEHH------FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-- 84
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
S V+G+ FSDL+ EF+ LG + +D ++P LP+ FDWRE+ AVT
Sbjct: 85 --SAVHGVTRFSDLTPMEFRHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWREHGAVTP 142
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
VK+Q CGS W+FS TG +EG + T +LVSLSEQ+L+DCD + D GC GG
Sbjct: 143 VKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCDSGCNGG 202
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKY 250
+++AF+ I++ GG+ E+ YPY G + C+ +K + + VSRDE +A
Sbjct: 203 LMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIAAN 260
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
LV+NGP+AVAINA +Q YV GVS P + C ++ L+H VL+VGYG + K
Sbjct: 261 LVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQK 315
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 316 PYWIIKNSWGENWGENGYYKICRGRNICGVDSMVST 351
>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
Length = 362
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/331 (41%), Positives = 197/331 (59%), Gaps = 27/331 (8%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D+ L+ HH F F + +K+YAT E+ R +F NL+K +L Q + S +
Sbjct: 38 DQLLNAEHH------FTTFKSKFSKSYATKEEHDYRFGVFKSNLKKAKLHQKLD-PSAEH 90
Query: 87 GLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
G+ +FSDL+ +EF+ ++LG K + P++A ++ ++P LP FDWRE AVT VKD
Sbjct: 91 GVTKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--PILPTNNLPEDFDWREKGAVTPVKD 148
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
Q CGS WAFSTTG +EG T KLVSLSEQ+L+DCD D GC GG ++
Sbjct: 149 QGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPDEYNSCDSGCNGGLMN 208
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
NAF+ ++ GG+ E+ Y Y G D +C+ +K ++ + VS DE +A LV+N
Sbjct: 209 NAFEYLLQ--SGGVVREQDYSYTGRDGSCKFDKSKIAASVSNFSVVSVDEDQIAANLVKN 266
Query: 255 GPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
GP+AVAINA +Q Y++GVS P + C L H VL+VG+G K PYWII
Sbjct: 267 GPLAVAINAAWMQTYMSGVSCP--YIC--AKSRLDHGVLLVGFGNGFAPIRLKEKPYWII 322
Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
KNSWG+ WGE+GY+++ RG CG++ V +
Sbjct: 323 KNSWGQNWGEEGYYKICRGRNICGVDSMVST 353
>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
Length = 319
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/307 (43%), Positives = 183/307 (59%), Gaps = 23/307 (7%)
Query: 51 KTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK- 109
+ YAT E+ R +F NLR+ + V+G+ +FSDL+ AEF+ ++LG K
Sbjct: 15 RPYATKEEHDHRFGVFKSNLRRASCTPSST--PRVHGVTKFSDLTPAEFRRQFLGLKAVR 72
Query: 110 -PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK 168
P++A ++ ++P LP+ FDWR+ AVT VKDQ CGS W+FSTTG +EG Y T
Sbjct: 73 FPAHAQKA--PILPTKDLPKDFDWRDKGAVTNVKDQGGCGSCWSFSTTGALEGAYYLATG 130
Query: 169 KLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
+LVSLSEQ+L+DCD D GC GG ++NAF+ I+ GG+++EK YPY G
Sbjct: 131 ELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQS--GGVQKEKDYPYTGR 188
Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
D C+ +K ++ Y V DE +A LV+NGP+AVAINA +Q YV GVS P +
Sbjct: 189 DGTCKFDKTKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--Y 246
Query: 280 FCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
C ++L H VL+VGYG K PYWIIKNSWGE WGE GY + RG CG
Sbjct: 247 IC---GKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGENGYDEICRGRNVCG 303
Query: 339 INDYVRS 345
++ V +
Sbjct: 304 VDSMVST 310
>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 186/315 (59%), Gaps = 19/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
FN F + K Y++ E+ R IF NL + + Q + S V+G+ FSDL+ EF+
Sbjct: 48 FNLFKHKFGKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDP-SAVHGVTRFSDLTPREFRK 106
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
LG + D + ++P LP+ FDWRE AVT VK+Q CGS W+FSTTG +EG
Sbjct: 107 SVLGLRGVGLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEG 166
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ T KLVSLSEQ+L+DCD E D GC GG +++AF+ I+ GG+ E+
Sbjct: 167 AHFLSTGKLVSLSEQQLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILKS--GGVMREE 224
Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ +C+ +KK + + VS DE +A LV+NGP+A+A+NA +Q YV
Sbjct: 225 DYPYSGTDRGSCKFDKKKIAASVANFSVVSLDEDQIAANLVKNGPLAIALNAVYMQTYVG 284
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P + C ++ L H VL+VGYG + K PYWIIKNSWGE WGE GY+++
Sbjct: 285 GVSCP--YIC---SKRLDHGVLLVGYGSGAYSPIRLKEKPYWIIKNSWGETWGENGYYKI 339
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 340 CRGRNICGVDSMVST 354
>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
Length = 365
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/334 (41%), Positives = 191/334 (57%), Gaps = 25/334 (7%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
+V GD L HH F F + K+YAT ++ R +F NLR+ + Q +
Sbjct: 37 IVDGDHPLSADHH------FRLFKRRFGKSYATQEDHDYRFSVFKTNLRRARHHQRLDP- 89
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
S V+G+ +FSDL+ AEF+ +LG K AD + ++P LP FDWR++ AV VK
Sbjct: 90 SAVHGVTQFSDLTPAEFRRNHLGLKRLRFPADANKAPILPTEDLPADFDWRDHGAVASVK 149
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
+Q CGS W+FSTTG +EG T KLVSLSEQ+L+DCD E D GC GG +
Sbjct: 150 NQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 209
Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLV 252
++A + + GGL E+ YPY G D+ C+ ++ + + VS DE +A LV
Sbjct: 210 NSALEYTLK--AGGLMREEDYPYSGTDRGTCKFDETKIAASVANFSVVSLDENQIAANLV 267
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPY 311
+NGP+AVAINA +Q YV GVS P + C ++ L H VL+VGYG K PY
Sbjct: 268 KNGPLAVAINAVFMQTYVGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKEKPY 322
Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
WIIKNSWGE WGE G++++ +G CG++ V +
Sbjct: 323 WIIKNSWGESWGENGFYKICQGRNVCGVDSMVST 356
>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
Length = 377
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 189/316 (59%), Gaps = 21/316 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F + K+YA+ E+ R +F NLR+ + Q + S +G+ +FSDL+ AEF+
Sbjct: 62 FSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLDP-SATHGVTQFSDLTPAEFRG 120
Query: 102 KYLGFK-LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YLG + LK + + P ++P LP FDWR++ AVT VK+Q CGS W+FSTTG +E
Sbjct: 121 TYLGLRPLKLPHDAQKAP-ILPTNDLPEDFDWRDHGAVTAVKNQGSCGSCWSFSTTGALE 179
Query: 161 GVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEE 211
G T LVSLSEQ+L++CD E D GC GG ++ AF+ + GGL +E
Sbjct: 180 GANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLK--AGGLMKE 237
Query: 212 KTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
+ YPY G D+ +C+ +K ++ + +S DE +A LV+NGP+AVAINA +Q YV
Sbjct: 238 EDYPYTGTDRGSCKFDKTKIAASVSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQTYV 297
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVS P + C ++ L H VL+VGYG K PYWIIKNSWGE WGE G+++
Sbjct: 298 GGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWGENGFYK 352
Query: 330 LYRGDGSCGINDYVRS 345
+ RG CG++ V +
Sbjct: 353 ICRGRNVCGVDSMVST 368
>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
Length = 363
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 186/315 (59%), Gaps = 20/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ + K Y+T EY RL IF+ N+ K Q + S V+G+ +FSDL+ EF+
Sbjct: 51 FRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDP-SAVHGVTQFSDLTEEEFKR 109
Query: 102 KYLGFKLKPSYADRSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
Y G +V A P + LP FDWRE VT VK+Q CGS WAFSTTG
Sbjct: 110 MYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGA 169
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE-----DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
EG + T KL+SLSEQ+L+DCDQ D+GC GG ++NA++ +M GGLEEE++
Sbjct: 170 AEGAHFVSTGKLLSLSEQQLVDCDQADKKACDNGCGGGLMTNAYEYLME--AGGLEEERS 227
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
YPY G C+ + + V++ + ++ DE +A LV +GP+AV +NA +Q Y+ GV
Sbjct: 228 YPYTGKRGHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV 287
Query: 274 SHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
S P+ C N++H VL+VGY G + ++K PYWIIKNSWG+ WGE GY++L
Sbjct: 288 SCPL--ICS--KRNVNHGVLLVGYGSKGFSILRLSNK--PYWIIKNSWGKKWGENGYYKL 341
Query: 331 YRGDGSCGINDYVRS 345
RG CGIN V +
Sbjct: 342 CRGHDICGINSMVSA 356
>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
Length = 484
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 165 SVLSLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y YRG +AC + + +V IN V +S++E +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISV 402
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG +P+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWG 454
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483
>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
Length = 460
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 141 SVLSLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 200
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 201 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 260
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 261 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 320
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y YRG +AC + + +V IN V +S++E +A +L + GP++V
Sbjct: 321 I--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 378
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG +P+W IKNSWG
Sbjct: 379 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWG 430
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 431 TDWGEKGYYYLHRGSGACGVNTMASSAVV 459
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 180/308 (58%), Gaps = 13/308 (4%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F + K YA + R IF NL + Q LQ + G+ YG+ +FSDL+ EF
Sbjct: 26 LYEQFKRDYGKVYAN-EDDQKRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFA 84
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
AKYL + +R P + P DWRE AVT V++Q CGS WAFS GN+E
Sbjct: 85 AKYLRAAVNNDQVERVRPTGLK--AAPERMDWREKGAVTAVENQGSCGSCWAFSAAGNVE 142
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + KT +LVSLS+Q+L+DCD+ +GC GG +++ I K GGLE E YPY G +
Sbjct: 143 GQWFIKTGQLVSLSKQQLVDCDRVAEGCNGGWPVSSYLEI--KHMGGLESESDYPYVGAE 200
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ C LNK+ KI+ + + E + A YL E+GP++ +NA ALQ Y +GV +P
Sbjct: 201 QTCALNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLNPTYEE 260
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C + L+H+VL VGY +PYWIIKNSWG WGEKGYFRL+RGD +CGIN
Sbjct: 261 CP--DTELNHAVLTVGYD------KEGDMPYWIIKNSWGTDWGEKGYFRLFRGDYTCGIN 312
Query: 341 DYVRSALV 348
SA++
Sbjct: 313 RMATSAII 320
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 180/314 (57%), Gaps = 18/314 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +F+ ++ K Y+ E+ R +F NL + Q + + +G+ +FSDL+ F+
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRAS-HGVTKFSDLTQEGFRH 115
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+YLG + P P ++P LP FDWRE AVT VK+Q CGS WAFSTTG +EG
Sbjct: 116 QYLGLRAPPLRDAHDAP-ILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEG 174
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
KT +LVSLSEQ+L+DCD E D GC GG +++A+ + GGLE+E+
Sbjct: 175 ANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEE 232
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
YPY G D C NK ++ + VS DE +A LV+NGP++V INA +Q YV G
Sbjct: 233 DYPYTGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGG 292
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
VS P + C NL H VL+VGYG K PYW+IKNSWG WGE GY++L
Sbjct: 293 VSCP--YVC--SKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLC 348
Query: 332 RGDGSCGINDYVRS 345
RG CGIN+ V +
Sbjct: 349 RGHNVCGINNMVST 362
>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
Length = 381
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 62 SVLSLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 121
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 122 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 181
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 182 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 241
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y YRG +AC + + +V IN V +S++E +A +L + GP++V
Sbjct: 242 I--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISV 299
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG +P+W IKNSWG
Sbjct: 300 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWG 351
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 352 TDWGEKGYYYLHRGSGACGVNTMASSAVV 380
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/334 (41%), Positives = 194/334 (58%), Gaps = 24/334 (7%)
Query: 25 VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
V D H+ + +H F F + +K YAT E+ R +F NL K +L Q + S
Sbjct: 36 VVDTAEDHILNAEHH--FTSFKSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLDP-SA 92
Query: 85 VYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
+G+ +FSDL+ +EF+ ++LG + P++A ++ ++P LP FDWRE AVT V
Sbjct: 93 QHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKA--PILPTNNLPEDFDWREKGAVTPV 150
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGS 192
KDQ CGS WAFSTTG +EG T KL SLSEQ+L+DCD D GC GG
Sbjct: 151 KDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGL 210
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
++NAF+ I+ GG+ EK Y Y G D +C+ +K ++ + VS DE +A LV
Sbjct: 211 MNNAFEYILQS--GGVVSEKDYAYTGRDGSCKFDKSKVVASVSNFSVVSLDEDQIAANLV 268
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV-DRTKFTHKAVPY 311
+NGP+AVAINA +Q Y++GVS P + C L H VL++G+G K PY
Sbjct: 269 KNGPLAVAINAAWMQTYMSGVSCP--YIC--AKARLDHGVLLLGFGQGGYAPIRLKEKPY 324
Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
WIIKNSWG+ WGE+GY+++ RG CG++ V +
Sbjct: 325 WIIKNSWGQNWGEEGYYKICRGRNVCGVDSMVST 358
>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
Length = 379
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 137/327 (41%), Positives = 194/327 (59%), Gaps = 10/327 (3%)
Query: 22 FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
F ++ ++ L VK ++F F+ +N+TY + E RL IF+ N+ + Q +Q +
Sbjct: 62 FSLLNEDPLPQDLTVKMASIFRNFVITYNRTYESKEEAQWRLSIFAHNMVRAQKIQALDR 121
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
G+ YG+ +FSDL+ EF+ YL L+ + A P +DWR AVT V
Sbjct: 122 GTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGKKMKQAKSVGDLAPPEWDWRSKGAVTKV 181
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
KDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG S+A+ I
Sbjct: 182 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKIDKACMGGLPSSAYSAI- 240
Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
K GGLE E Y YRG +AC + + +V IN V +S++E +A +L + GP++VAI
Sbjct: 241 -KNLGGLETEDDYSYRGHMQACSFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 299
Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
NA+ +QFY G+S P++ C + H+VL+VGYG +P+W IKNSWG
Sbjct: 300 NAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWGTD 351
Query: 322 WGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 352 WGEKGYYYLHRGSGACGVNTMASSAVV 378
>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 376
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 145/367 (39%), Positives = 206/367 (56%), Gaps = 33/367 (8%)
Query: 1 MSCFYFFAGVALLSLTV--SVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
++ +GVA LS V + +V GDEK + + A F F+++ NK+Y E
Sbjct: 11 VAAVLLLSGVAALSSPVEDPLIEQVVGGDEK--NELELNAEAHFASFVQRFNKSYRDADE 68
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSY----- 112
+ RL +F+ NLR+ + Q + S V+G+ +FSDL+ EF+ ++LG K + S+
Sbjct: 69 HAHRLSVFTANLRRARRHQRLDP-SAVHGVTKFSDLTPDEFRDRFLGLRKYRRSFLKGLS 127
Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
+ PA+ P LP FDWRE+ AV VKDQ CGS W+FST+G +EG + T KL
Sbjct: 128 GSAHDAPAL-PTDGLPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHYLATGKLE 186
Query: 172 SLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
LSEQ+++DCD E D GC GG ++ AF + GGLE EK YPY G A
Sbjct: 187 VLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAK--AGGLETEKDYPYTGRGGA 244
Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
C+ +K ++ + +V+ DE +A LV++GP+A+ INA +Q Y+ GVS P F C
Sbjct: 245 CKFDKSKIAAQVKNFSTVAVDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCP--FIC- 301
Query: 283 GGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCG 338
+L H VL+VGYG K PYWIIKNSWGE WGE GY+++ RG CG
Sbjct: 302 --GRHLDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGENWGESGYYKICRGAHVKNKCG 359
Query: 339 INDYVRS 345
++ V +
Sbjct: 360 VDSMVST 366
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 202/335 (60%), Gaps = 26/335 (7%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VV D + H L+ H F+ F + +KTYAT E+ R +F N+R+ +L +
Sbjct: 6 QVVDDNEDHVLNAEHH---FSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAKLHAKLD-P 61
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
S V+G+ +FSDL+ +EF+ ++LG K L+ P +A ++ ++P LP FDWR+ AVT
Sbjct: 62 SAVHGVTKFSDLTPSEFRRQFLGLKPLRLPEHAQKA--PILPTHDLPEDFDWRDKGAVTH 119
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
VK+Q CGS WAFSTTG +EG + T +LVSLS+Q+L+DCD D GC GG
Sbjct: 120 VKNQGSCGSCWAFSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSGCNGG 179
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYL 251
++NAF+ I+ GG++ E+ YPY G D+ ++ +A ++ + VS DE ++ L
Sbjct: 180 LMNNAFEYILES--GGVQREEDYPYTGRDRGPAID-EANAASVSNFSVVSLDEDQISANL 236
Query: 252 VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVP 310
V+NGP+A+ INA +Q Y+ GVS P + C +NL H VL+VGYG K P
Sbjct: 237 VKNGPLAIGINAVFMQTYIGGVSCP--YIC---GKNLDHGVLLVGYGKAGYAPIRLKEKP 291
Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
YWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 292 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 326
>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 367
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 186/319 (58%), Gaps = 24/319 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ + K Y+T EY RL IF+ N+ K Q + S V+G+ +FSDL+ EF+
Sbjct: 51 FRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDP-SAVHGVTQFSDLTEEEFKR 109
Query: 102 KYLGFKLKPSYADRSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
Y G +V A P + LP FDWRE VT VK+Q CGS WAFSTTG
Sbjct: 110 MYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGA 169
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLE 209
EG + T KL+SLSEQ+L+DCDQ D+GC GG ++NA++ +M GGLE
Sbjct: 170 AEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLME--AGGLE 227
Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
EE++YPY G C+ + + V++ + ++ DE +A LV +GP+AV +NA +Q Y
Sbjct: 228 EERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTY 287
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
+ GVS P+ C N++H VL+VGY G + ++K PYWIIKNSWG+ WGE G
Sbjct: 288 IGGVSCPL--ICS--KRNVNHGVLLVGYGSKGFSILRLSNK--PYWIIKNSWGKKWGENG 341
Query: 327 YFRLYRGDGSCGINDYVRS 345
Y++L RG CGIN V +
Sbjct: 342 YYKLCRGHDICGINSMVSA 360
>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
Length = 347
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/354 (39%), Positives = 200/354 (56%), Gaps = 26/354 (7%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
L++ + ++ VV ++ L + LF F ++ K Y T E+ +R IF N+
Sbjct: 3 LIAAVLLIACVGVVLAQEYKPLAESEMKKLFIKFSRKYAKVYGT-EEHNNRYQIFKANVE 61
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI------- 124
K + +G+ +FSDL+ EF+ +L P A + + A +
Sbjct: 62 KSRYYNHVGKREN-FGITKFSDLTPEEFKRMFLMKTYTPEEAKKILAAPQHAVLSEKEVQ 120
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
T P +FDWR++ AVT VK+Q CGS W FSTTGN+EG +A K KLVSLSEQ+L+DCD
Sbjct: 121 TAPTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHN 180
Query: 185 ----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI 234
D GC GG + +AF ++ GGL+ E +YPY G D CR NK I
Sbjct: 181 CVTYQNQQACDSGCNGGLMWSAFQYVIKN--GGLDTEDSYPYEGVDDTCRFNKSNVAATI 238
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
+ + S+S DE MA +L NGP+++AINA LQ+Y +G+S P +FC+ ++L H VLI
Sbjct: 239 SSWTSISSDENQMAAWLAANGPISIAINAEWLQYYTSGISDP--WFCN--PQDLDHGVLI 294
Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
VGYGV ++ + YWI+KNSWG WGE GYFR+ RG G CG+N S++V
Sbjct: 295 VGYGVGKSWLGSEE-NYWIVKNSWGSDWGEDGYFRIIRGKGKCGLNSVPSSSIV 347
>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
Length = 490
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 195/329 (59%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL IF N+ + Q +Q
Sbjct: 171 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQAL 230
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ +++ A P +DWR AVT
Sbjct: 231 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPSNKMKQAKSVGDLAPPEWDWRSKGAVT 290
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 291 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 350
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 351 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 408
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 409 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 460
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 461 TDWGEKGYYYLHRGSGACGVNTMASSAVV 489
>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
Length = 361
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/359 (40%), Positives = 209/359 (58%), Gaps = 37/359 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYS 61
V+L+ + VSVS V GDE + V T F F ++ K Y ++ E+Y
Sbjct: 11 VSLIFVFVSVS---VCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYY 67
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
R +F NL + Q + S +G+ +FSDL+ +EF+ K+LG FKL P A+++
Sbjct: 68 RFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHLGVKGGFKL-PKDANQA- 124
Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
++P LP FDWR+ AVT VK+Q CGS W+FSTTG +EG + T KLVSLSEQ+
Sbjct: 125 -PILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 183
Query: 178 LIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNK 227
L+DCD E D GC GG +++AF+ + GGL EK YPY G D +C+L++
Sbjct: 184 LVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLDR 241
Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
++ + VS +E +A L++NGP+AVAINA +Q Y+ GVS P + C +
Sbjct: 242 SKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YIC---SRR 296
Query: 288 LSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
L+H VL+VGYG ++ K PYWIIKNSWGE WGE G++++ +G CG++ V +
Sbjct: 297 LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 355
>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/324 (41%), Positives = 190/324 (58%), Gaps = 32/324 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+++HNK YAT EY R IF NL + Q + + ++G+ F DL+ EF+
Sbjct: 14 FKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDP-TAIHGVTPFMDLTEEEFER 72
Query: 102 KYLGFKLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
Y G +VP +++ LP +FDWRE AVT VK Q CGS WAFST
Sbjct: 73 MYAGV-----LGGGTVPVEKGSVSFMDASGLPDSFDWREKGAVTDVKIQGSCGSCWAFST 127
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
TG++EG T KL++LSEQ+L+DCD+ DDGC GG ++NA+ ++ G
Sbjct: 128 TGSVEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGCGGGLMTNAYRYLIE--AG 185
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
GL+EE +YPY G C+ + + VK+ + S++ DE +A LV +GP+A+ +NA +
Sbjct: 186 GLQEESSYPYTGKSGECKFDPEKIAVKVANFTSIAVDENQIAANLVHHGPLAIGLNAIFM 245
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGEGWG 323
Q Y+ GVS P+ C G + L+H VL+VGYG +F +K PYWIIKNSWG WG
Sbjct: 246 QTYIGGVSCPL--IC--GKKWLNHGVLLVGYGARGYSILRFGYK--PYWIIKNSWGNHWG 299
Query: 324 EKGYFRLYRGDGSCGINDYVRSAL 347
EKGY+RL RG G CG+N V + +
Sbjct: 300 EKGYYRLCRGHGMCGMNKMVSAVV 323
>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 402
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 454
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483
>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
Length = 374
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 181/315 (57%), Gaps = 18/315 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F ++ K Y + E+ R +F NLR+ + Q + S V+G+ +F DL+ AEF+
Sbjct: 58 FSSFKKRFGKAYTSCDEHDRRFGVFKANLRRAKRNQILDP-SAVHGVTQFFDLTPAEFRR 116
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
YLG K AD ++P LP FDWR++ AVT VK+Q CGS W+FS TG +EG
Sbjct: 117 TYLGLKRLRLPADTHEAPILPTNDLPADFDWRDHGAVTPVKNQGSCGSCWSFSATGALEG 176
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
T KLVSLSEQ+L+DCD D GC GG +++AF+ + GGLE E+
Sbjct: 177 ANFLATGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLK--AGGLEREE 234
Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D + C+ +K V + + VS DE +A LV NGP+A+ INA +Q Y+
Sbjct: 235 DYPYTGTDHSKCKFDKTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGINAMFMQTYIG 294
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P + C L H VL+VGYG K PYWIIKNSWGE WGEKGY+++
Sbjct: 295 GVSCP--YIC--SKRLLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGESWGEKGYYKI 350
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 351 CRGRNICGMDSMVSA 365
>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/359 (40%), Positives = 209/359 (58%), Gaps = 37/359 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYS 61
V+LL + VSVS + GDE L V F F ++ K Y ++ E+Y
Sbjct: 10 VSLLFVFVSVS---ICGDEDLLIRQVVDEAEPKVLSSEDHFTLFKKKFGKDYGSIEEHYY 66
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
R +F NLR+ Q + S +G+ +FSDL+ +EF+ K+LG FKL P A+++
Sbjct: 67 RFSVFKANLRRAMRHQKMDP-SARHGVTQFSDLTGSEFRRKHLGVTGGFKL-PKDANQA- 123
Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
++P LP FDWR+ AVT VK+Q CGS W+FSTTG +EG + T KLVSLSEQ+
Sbjct: 124 -PILPTHNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 182
Query: 178 LIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNK 227
L+DCD E D GC GG +++AF+ + GGL E+ YPY G D +C+L++
Sbjct: 183 LVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGLMREEDYPYTGTDGGSCKLDR 240
Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
++ + VS +E +A LV+NGP+AVAINA +Q Y+ GVS P + C +
Sbjct: 241 SKIVASVSNFSVVSINEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCP--YIC---SRR 295
Query: 288 LSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
L+H VL++GYG ++ K PYWIIKNSWGE WGE G++++ +G CG++ V +
Sbjct: 296 LNHGVLLMGYGSSGYSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 354
>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
Length = 364
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/334 (41%), Positives = 197/334 (58%), Gaps = 24/334 (7%)
Query: 26 GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
GDE HL + +H F+ F + +KTYAT E+ R +F NL + + Q+ + S +
Sbjct: 38 GDE---HLLNAEHH--FSAFKTKFSKTYATKEEHDYRFGVFKSNLLRAKSHQELD-PSAI 91
Query: 86 YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
+G+ +FSDL+ +EF++++LG K +D ++P LP+ FDWR++ AVT VK+Q
Sbjct: 92 HGVTKFSDLTPSEFRSQFLGLKPLSLPSDAHNAPILPTDNLPKDFDWRDHGAVTNVKNQG 151
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
GS W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC GG ++ A
Sbjct: 152 TGGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPDLNDACDSGCNGGLMTTA 211
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
F +K GGL E+ Y Y G D+ C+ +K ++ + VS DE +A LV+NG
Sbjct: 212 FG--YTKKAGGLVREEDYLYTGRDRGPCKFDKSKIAASVSNFSVVSLDEDQIAANLVKNG 269
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWII 314
P++V INA +Q Y+ GVS P F C ++L H VL+VGYG K PYWII
Sbjct: 270 PLSVGINAVYMQTYIGGVSCP--FIC---GKHLDHGVLLVGYGAGGYAPIRFKEKPYWII 324
Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
KNSWGE WGE GY+++ RG CG++ V + +
Sbjct: 325 KNSWGENWGENGYYKICRGPNMCGVDSMVSTVIA 358
>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
Length = 392
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 73 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 132
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 133 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 192
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 193 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 252
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 253 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 310
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 311 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 362
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 363 TDWGEKGYYYLHRGSGACGVNTMASSAVV 391
>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
Length = 338
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 19 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 78
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 79 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 138
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 139 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 198
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 199 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 256
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 257 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 308
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 309 TDWGEKGYYYLHRGSGACGVNTMASSAVV 337
>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 144/368 (39%), Positives = 203/368 (55%), Gaps = 39/368 (10%)
Query: 1 MSCFYFFAGV--ALLSLTVSVSSFMVVGDEKL--HHLHHVKHTALFNYFLEQHNKTYATL 56
++C FF V ++ LT+ V DE+ +L + F F+ + K Y+T
Sbjct: 10 ITCIIFFCHVVASVEDLTIR----QVTADERRVRPNLLGTHTESKFRVFMSDYGKNYSTR 65
Query: 57 VEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYA 113
EY RL IF+ N+ K Q++ T V+G+ +FSDL+ EF+ Y G
Sbjct: 66 EEYIHRLGIFAKNVLKAAEHQMMDPT----AVHGVTQFSDLTEEEFKRMYTGVADVGGSR 121
Query: 114 DRSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
+V A P + LP FDWRE VT VK+Q CGS WAFSTTG EG + T KL
Sbjct: 122 GHAVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKL 181
Query: 171 VSLSEQELIDCDQE----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
+SLSEQ+L+DCDQ D+GC GG ++NA++ +M GGLEEE++YPY G
Sbjct: 182 LSLSEQQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLME--AGGLEEERSYPYTGKR 239
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
C+ + + V++ + ++ DE +A LV GP+AV +NA +Q Y+ GVS P+
Sbjct: 240 GHCKFDPEKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPL--I 297
Query: 281 CDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
C ++H VL+VGY G + ++K PYWIIKNSWG+ WGE GY++L RG C
Sbjct: 298 CS--KRKVNHGVLLVGYGSKGFSILRLSNK--PYWIIKNSWGKKWGENGYYKLCRGHDIC 353
Query: 338 GINDYVRS 345
GIN V +
Sbjct: 354 GINSMVSA 361
>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
Length = 517
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 198 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 257
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 258 DRGTAQYGVTKFSDLTEEEFRTIYLNSLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 317
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 318 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 377
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 378 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 435
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 436 AINAFGMQFYRHGISRPLRPLCS--PWLIDHAVLLVGYG------NRSDVPFWAIKNSWG 487
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 488 TDWGEKGYYYLHRGSGACGVNTMASSAVV 516
>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
Length = 373
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 145/371 (39%), Positives = 211/371 (56%), Gaps = 41/371 (11%)
Query: 4 FYFFAGVALLSLTVSVSSF-------------MVVGDEKLHHLHHVKHTALFNYFLEQHN 50
F+F LL++++ + VV +E HL + +H F+ F ++
Sbjct: 6 FFFLIAATLLAVSLGSAVISGEVNYGFVNPIRQVVPEENDEHLLNAEHH--FSLFKSKYE 63
Query: 51 KTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK- 109
KTYAT E+ R +F NLR+ + Q + S V+G+ +FSDL+ EF+ K+LG K +
Sbjct: 64 KTYATQEEHDHRFRVFKANLRRARRNQLLD-PSAVHGVTQFSDLTPKEFRRKFLGLKRRG 122
Query: 110 ---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
P+ D ++P LP FDWRE AVT VK+Q MCGS W+FS G +EG +
Sbjct: 123 FRLPT--DTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLA 180
Query: 167 TKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
TK+LVSLSEQ+L+DCD E D GC GG ++NAF+ + GGL +E+ YPY
Sbjct: 181 TKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALK--AGGLMKEEDYPYT 238
Query: 218 G-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G D+ AC+ +K ++ + VS DE +A LV++GP+A+AINA +Q Y+ GVS P
Sbjct: 239 GRDNTACKFDKSKIAASVSNFSVVSSDEDQIAANLVKHGPLAIAINAMWMQTYIGGVSCP 298
Query: 277 IQFFCDGGNENLSHSVLIVGYGVD-RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+ C +++ H VL+VG+G K PYWIIKNSWG WGE GY+++ RG
Sbjct: 299 --YVC---SKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPH 353
Query: 336 S-CGINDYVRS 345
+ CG++ V +
Sbjct: 354 NMCGMDTMVST 364
>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
Length = 377
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 188/316 (59%), Gaps = 21/316 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F + K+YA+ E+ R +F NLR+ + Q + S +G+ +FSDL+ AEF+
Sbjct: 62 FSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLDP-SATHGVTQFSDLTPAEFRG 120
Query: 102 KYLGFK-LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YLG + LK + + P ++P LP FDWR++ AVT VK+Q CGS W+FSTTG +E
Sbjct: 121 TYLGLRPLKLPHDAQKAP-ILPTNDLPEDFDWRDHGAVTAVKNQGSCGSCWSFSTTGALE 179
Query: 161 GVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEE 211
G T LVSLSEQ+L++CD E D GC GG ++ AF+ + GGL +E
Sbjct: 180 GANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLK--AGGLMKE 237
Query: 212 KTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
+ YPY G D+ +C+ +K ++ + +S DE +A LV+ GP+AVAINA +Q YV
Sbjct: 238 EDYPYTGTDRGSCKFDKTKIAASVSNFSVISLDEDQIAANLVKIGPLAVAINAVFMQTYV 297
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVS P + C ++ L H VL+VGYG K PYWIIKNSWGE WGE G+++
Sbjct: 298 GGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWGENGFYK 352
Query: 330 LYRGDGSCGINDYVRS 345
+ RG CG++ V +
Sbjct: 353 ICRGRNVCGVDSMVST 368
>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 139/336 (41%), Positives = 190/336 (56%), Gaps = 26/336 (7%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VV + HL + +H F+ F + K YA+ E+ R +F NLR+ +L Q +
Sbjct: 30 QVVSETDDSHLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQLLD-P 86
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
S +G+ +FSDL+ +EF+ YLG K KP P ++P LP FDWR++ AVTGV
Sbjct: 87 SAEHGITKFSDLTPSEFRRTYLGLHKPKPKLNAEKAP-ILPTSDLPADFDWRDHGAVTGV 145
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGS 192
K+Q CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC GG
Sbjct: 146 KNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGH 205
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
+ AF+ + GGL+ EK YPY G D C +K + + + DE +A LV
Sbjct: 206 YATAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDKSKICAAVTNFSVIGLDEDQIAANLV 263
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAV 309
++GP+AV INA +Q YV GVS P+ F + H VL+VGY G + KA
Sbjct: 264 KHGPLAVGINAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKA- 317
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
YWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 318 -YWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 191/319 (59%), Gaps = 25/319 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F ++ K+YA+ E+ R +F NL++ Q Q + S +G+ +FSDL+ +EF+
Sbjct: 60 FSVFKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDP-SATHGVTQFSDLTPSEFRR 118
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+LG + + P+ A+++ ++P LP FDWR+ AV+ VK+Q CGS W+FS TG
Sbjct: 119 SFLGLRSRRLGLPADANKA--PILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATG 176
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGL 208
+EG T KLVSLSEQ+L+DCD E D GC GG +++AF+ + GGL
Sbjct: 177 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKS--GGL 234
Query: 209 EEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
+E+ YPY G D+ C+ +K + + VS DE +A LV+NGP+AVAINA +Q
Sbjct: 235 MKEQDYPYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQ 294
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKG 326
Y+ GVS P + C +++L H VL+VGYG D K PYWIIKNSWG WGE G
Sbjct: 295 TYIKGVSCP--YIC---SKHLDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWGENG 349
Query: 327 YFRLYRGDGSCGINDYVRS 345
Y+++ RG CG++ V +
Sbjct: 350 YYKICRGRNICGVDSMVST 368
>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
Length = 373
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 183/315 (58%), Gaps = 19/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
++ F ++ K+Y + E+ R IF NLR+ Q+ + S +G+ +FSDL+ EF+
Sbjct: 58 YSLFKKRFKKSYGSQKEHDYRFKIFQVNLRRAARHQNLDP-SATHGVTQFSDLTPGEFRK 116
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
YLG + D + ++P LP+ FDWRE AVT VK+Q CGS W+FSTTG +EG
Sbjct: 117 AYLGLRRLRLPKDATEAPILPTDNLPQDFDWREKGAVTPVKNQGSCGSCWSFSTTGALEG 176
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
T KLVSLSEQ+L+DCD E D GC GG +++AF+ + GGL E+
Sbjct: 177 ANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 234
Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ C+ + K+ + VS DE +A L +NGP+AVAINA +Q Y+
Sbjct: 235 DYPYTGTDRGTCKFDNTKVAAKVANFSVVSLDEDQIAANLFKNGPLAVAINAVFMQTYIG 294
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P + C ++ L H VL+VGYG K PYWIIKNSWGE WGE G++R+
Sbjct: 295 GVSCP--YIC---SKRLDHGVLLVGYGSAGYAPVRMKDKPYWIIKNSWGENWGENGFYRI 349
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 350 CRGRNICGVDSMVST 364
>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
Length = 350
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 135/333 (40%), Positives = 184/333 (55%), Gaps = 26/333 (7%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L + LF F ++H K Y ++ R IF N+ K + +G+++F
Sbjct: 27 LSEAEMKKLFVKFSKKHAKLYGA-EDHGKRYQIFKSNVEKARYYNHVGKRE-TFGVSKFM 84
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-------PRAFDWREYDAVTGVKDQT 145
DL+ EF+ +L P A + + A + P ++DWR+ AVT VK+Q
Sbjct: 85 DLTPEEFKRMFLMKTYTPEEARKILAAPKEAVVTAQQVKDTPTSWDWRQKGAVTPVKNQG 144
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSISN 195
CGS W FSTTGN+EG++ KT KLVSLSEQ+L+DCD D GC GG + +
Sbjct: 145 ACGSCWTFSTTGNVEGIHQIKTGKLVSLSEQQLVDCDHNCVTYQGQQACDAGCNGGLMWS 204
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
AF ++ GGL E +YPY G D CR NK V IN + S+ DE MA +L NG
Sbjct: 205 AFQYVIKT--GGLVTEDSYPYEGVDDTCRFNKSNVAVTINSWTSIPSDEGKMAAWLAANG 262
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
P+++AINA LQ Y +G+S+P +FC+ ++L H VLIVG+G K YWIIK
Sbjct: 263 PISIAINAEWLQTYTSGISNP--WFCN--PQDLDHGVLIVGFGTGSNWLGEKE-DYWIIK 317
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGE GYFR+ RG G CG+N S+L+
Sbjct: 318 NSWGADWGESGYFRIVRGKGKCGLNSVPSSSLI 350
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 192/321 (59%), Gaps = 30/321 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F ++ K YA+ E+ R +F NLR+ + Q + S +G+ +FSDL+ +EF+
Sbjct: 51 FSLFKKKFGKVYASREEHDYRFSVFKSNLRRARRHQKLDP-SARHGVTQFSDLTRSEFKR 109
Query: 102 KYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
K+LG FKL P A+++ ++P LP FDWRE AVT VK+Q CGS W+FS TG
Sbjct: 110 KHLGVKGGFKL-PKDANKA--PILPTENLPEEFDWRERGAVTPVKNQGSCGSCWSFSATG 166
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGL 208
+EG T KLVSLSEQ+L+DCD E D GC GG +++AF+ + GGL
Sbjct: 167 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGL 224
Query: 209 EEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
E+ YPY G D A C+L+K ++ + +S DE +A LV+NGP+AVAINA +Q
Sbjct: 225 MREEDYPYTGKDGATCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAAYMQ 284
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGEGWGE 324
Y+ GVS P + C L+H VL+VGYG +F K PYWIIKNSWGE WGE
Sbjct: 285 TYIGGVSCP--YIC---MRRLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGETWGE 337
Query: 325 KGYFRLYRGDGSCGINDYVRS 345
G++++ RG CG++ V +
Sbjct: 338 DGFYKICRGRNVCGVDSLVST 358
>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 363
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 186/314 (59%), Gaps = 23/314 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F + K YA+ E+ R +F N+R+ + Q + S +G+ FSDL+ +EF+ K L
Sbjct: 51 FKRRFGKAYASQEEHNYRFEVFKANMRRARRHQSLDP-SAAHGVTRFSDLTASEFRNKVL 109
Query: 105 GFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
G + PS A+++ ++P LP FDWR++ AVT VK+Q CGS W+FSTTG +EG
Sbjct: 110 GLRGVRLPSNANKA--PILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGA 167
Query: 163 YAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+ T +LVSLSEQ+L+DCD E D GC GG +++AF+ I+ GG+ E+
Sbjct: 168 HFLSTGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKS--GGVMREED 225
Query: 214 YPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
YPY G D+ C+ +K + + +S DE +A LV+NGP+AVAINA +Q Y+ G
Sbjct: 226 YPYSGTDRGNCKFDKAKIAASVANFSVISLDEDQIAANLVKNGPLAVAINAAYMQTYIGG 285
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
VS P + C + L H VL+VGYG K P+WIIKNSWGE WGE GY+++
Sbjct: 286 VSCP--YIC---SRRLDHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGENGYYKIC 340
Query: 332 RGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 341 RGRNICGVDSMVST 354
>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 151/371 (40%), Positives = 208/371 (56%), Gaps = 47/371 (12%)
Query: 3 CFYFFAGVALLSLTVSVSSF-----------MVVGDEKLHHLHHVKHTALFNYFLEQHNK 51
CF F L L VSVSS VVG + L H F+ F + K
Sbjct: 7 CFSVFV---LFFLIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSEDH---FSLFKSKFGK 60
Query: 52 TYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FK 107
YA+ E+ R +F NLR+ + Q + S +G+ +FSDL+ +EF+ K+LG FK
Sbjct: 61 VYASNEEHDYRFSVFKANLRRARRHQKLDP-SARHGVTQFSDLTRSEFRKKHLGVRAGFK 119
Query: 108 LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKT 167
L P A+++ ++P LP FDWR+ AVT VK+Q CGS W+FS TG +EG T
Sbjct: 120 L-PKDANKA--PILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 168 KKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
KLVSLSEQ+L+DCD E D GC GG +++AF+ + GGL +E+ YPY G
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGLMKEEDYPYTG 234
Query: 219 DD-KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
D K C+L+K ++ + +S DE +A LV+NGP+AVAINA +Q Y+ GVS P
Sbjct: 235 KDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCP- 293
Query: 278 QFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
+ C L+H VL+VGYG +F K PYWIIKNSWGE WGE G++++ +G
Sbjct: 294 -YIC---TRRLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGETWGENGFYKICKGR 347
Query: 335 GSCGINDYVRS 345
CG++ V +
Sbjct: 348 NICGVDSLVST 358
>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
Length = 489
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 194/327 (59%), Gaps = 11/327 (3%)
Query: 22 FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
F ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q +
Sbjct: 173 FSLLNEDPLPQDLAVKMASIFRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDR 232
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
G+ YG+ +FSDL+ EF+ YL L+ + ++ P +DWR AVT V
Sbjct: 233 GTAQYGVTKFSDLTEEEFRTTYLNPLLREPGKKMKQAKSVGDLAPPE-WDWRSKGAVTKV 291
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
KDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG S+A+ I
Sbjct: 292 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKIDKACMGGLPSSAYSAI- 350
Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
K GGLE E Y YRG +AC + + +V IN V +S++E +A +L + GP++VAI
Sbjct: 351 -KNLGGLETEDDYSYRGHMQACNFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 409
Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
NA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 410 NAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWGTD 461
Query: 322 WGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 462 WGEKGYYYLHRGSGACGVNTMASSAVV 488
>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 190/335 (56%), Gaps = 26/335 (7%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
VV + HL + +H F+ F + K YA+ E+ R +F NLR+ + Q + S
Sbjct: 31 VVSETDDSHLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARRHQLLD-PS 87
Query: 84 GVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
+G+ +FSDL+ +EF+ YLG K KP P ++P LP FDWR++ AVTGVK
Sbjct: 88 AEHGITKFSDLTPSEFRRTYLGLHKPKPKLNAEKAP-ILPTSDLPADFDWRDHGAVTGVK 146
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
+Q CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC GG +
Sbjct: 147 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLM 206
Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
+ AF+ + GGL+ EK YPY G D C +K + + + DE +A LV+
Sbjct: 207 TTAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVK 264
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVP 310
+GP+AV INA +Q YV GVS P+ F + H VL+VGY G + KA
Sbjct: 265 HGPLAVGINAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKA-- 317
Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
YWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 318 YWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352
>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
Length = 379
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 60 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 119
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 120 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 179
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 180 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 239
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 240 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISV 297
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 298 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 349
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 350 TDWGEKGYYYLHRGSGACGVNTMASSAVV 378
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 199/340 (58%), Gaps = 33/340 (9%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VVG + L H F+ F + K YA+ E+ R +F NLR+ + Q +
Sbjct: 35 QVVGGAEPQVLTSEDH---FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP- 90
Query: 83 SGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
S +G+ +FSDL+ +EF+ K+LG FKL P A+++ ++P LP FDWR++ AV
Sbjct: 91 SATHGVTQFSDLTRSEFRKKHLGVRSGFKL-PKDANKA--PILPTENLPEDFDWRDHGAV 147
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
T VK+Q CGS W+FS TG +EG T KLVSLSEQ+L+DCD E D GC
Sbjct: 148 TPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCN 207
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMA 248
GG +++AF+ + GGL +E+ YPY G D K C+L+K ++ + +S DE +A
Sbjct: 208 GGLMNSAFEHTLKT--GGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIA 265
Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFT 305
LV+NGP+AVAINA +Q Y+ GVS P + C L+H VL+VGYG +F
Sbjct: 266 ANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TRRLNHGVLLVGYGAAGYAPARFK 320
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
K PYWIIKNSWGE WGE G++++ +G CG++ V +
Sbjct: 321 EK--PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVST 358
>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
Length = 485
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN + +S++E +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSMELSQNEQKLAAWLAKRGPISV 402
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 454
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483
>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
Length = 484
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN++G + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVKGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 402
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 454
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 199/340 (58%), Gaps = 33/340 (9%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VVG + L H F+ F + K YA+ E+ R +F NLR+ + Q +
Sbjct: 35 QVVGGAEPQVLTSEDH---FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP- 90
Query: 83 SGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
S +G+ +FSDL+ +EF+ K+LG FKL P A+++ ++P LP FDWR++ AV
Sbjct: 91 SATHGVTQFSDLTRSEFRKKHLGVRSGFKL-PKDANKA--PILPTENLPEDFDWRDHGAV 147
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
T VK+Q CGS W+FS TG +EG T KLVSLSEQ+L+DCD E D GC
Sbjct: 148 TPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCN 207
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMA 248
GG +++AF+ + GGL +E+ YPY G D K C+L+K ++ + +S DE +A
Sbjct: 208 GGLMNSAFEYTLKT--GGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIA 265
Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFT 305
LV+NGP+AVAINA +Q Y+ GVS P + C L+H VL+VGYG +F
Sbjct: 266 ANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TRRLNHGVLLVGYGAAGYAPARFK 320
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
K PYWIIKNSWGE WGE G++++ +G CG++ V +
Sbjct: 321 EK--PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVST 358
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 184/315 (58%), Gaps = 19/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F + K+Y + E+ R +F NLR+ Q + + +G+ +FSDL++AEF+
Sbjct: 53 FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRK 111
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+ LG + D + ++P LP FDWRE AV VK+Q CGS W+FSTTG +EG
Sbjct: 112 QVLGLRKLRLPKDANTAPILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEG 171
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ T +LVSLSEQ+L+DCD E D GC GG +++AF+ + GGL E+
Sbjct: 172 AHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 229
Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ AC+ +K + + +VS DE +A LV+NGP+AVAINA +Q Y+
Sbjct: 230 DYPYTGMDRGACKFDKNKVAAGVANFSAVSLDEDQIAANLVKNGPLAVAINAVFMQTYIG 289
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P + C + L H VL+VGYG K PYWIIKNSWGE WGE G++++
Sbjct: 290 GVSCP--YIC---SRRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGENGFYKI 344
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 345 CRGRNICGVDSMVST 359
>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 365
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 188/327 (57%), Gaps = 26/327 (7%)
Query: 32 HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
HL + +H F+ F + K YA+ E+ R +F NLR+ +L Q + S +G+ +F
Sbjct: 41 HLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQLLD-PSAEHGITKF 97
Query: 92 SDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
SDL+ +EF+ YLG K KP P ++P LP +DWR++ AVTGVK+Q CGS
Sbjct: 98 SDLTPSEFRRTYLGLHKPKPKVNAEKAP-ILPTSDLPADYDWRDHGAVTGVKNQGSCGSC 156
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIM 201
W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC GG ++ AF+ +
Sbjct: 157 WSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTL 216
Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
GGL+ EK YPY G D C +K + + + DE +A LV++GP+AV I
Sbjct: 217 K--AGGLQLEKDYPYTGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGI 274
Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSW 318
NA +Q YV GVS P+ F + H VL+VGY G + KA YWIIKNSW
Sbjct: 275 NAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKA--YWIIKNSW 327
Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRS 345
GE WGE GY+++ RG CG++ V +
Sbjct: 328 GENWGEHGYYKICRGHNICGVDAMVST 354
>gi|351710879|gb|EHB13798.1| Cathepsin F [Heterocephalus glaber]
Length = 482
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 191/321 (59%), Gaps = 10/321 (3%)
Query: 28 EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG 87
+ L +K ++F F+ +N+TY + E RL +F+ N+ Q +Q +HG+ YG
Sbjct: 171 DPLPEEFSMKMISIFKNFVATYNRTYESKKEAQWRLSVFTRNMVLAQRIQALDHGTAQYG 230
Query: 88 LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
+ +FSDL+ EF+ YL L+ + A P +DWR+ AVT VK+Q MC
Sbjct: 231 VTKFSDLTEEEFRTIYLNPLLREEPGKKMHLAKAVRDPAPLEWDWRKKGAVTEVKNQGMC 290
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+ I S GG
Sbjct: 291 GSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGFPSNAYLAIKSL--GG 348
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
LE E Y Y+G KAC + K +V IN V +S++E +A +L GP++VAINA+ +Q
Sbjct: 349 LETEDDYSYQGHMKACNFSAKKAKVYINDSVELSKNEQKLAAWLAVKGPISVAINAFGMQ 408
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY G++HP++ C + H++L+VGYG VP+W IKNSWG WGE+GY
Sbjct: 409 FYRHGIAHPLRPLCS--PWFIDHAMLVVGYG------NRSNVPFWAIKNSWGTDWGEEGY 460
Query: 328 FRLYRGDGSCGINDYVRSALV 348
+ L+RG G+CG+N SA+V
Sbjct: 461 YYLHRGSGACGVNIMASSAVV 481
>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
Length = 328
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 180/310 (58%), Gaps = 13/310 (4%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ +YL + P+ ++T+ FDWRE+ AV V DQ CGS WAFS GN
Sbjct: 89 KTRYLRMRFDGPIVSED-PSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGN 147
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + KT L++LSEQ+L+DCD D GC GG + I GGLE YPY G
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKM--GGLELASDYPYTG 205
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
D C +N+ +N + E A+ L E GP++ A+NA LQFY+ G+ PI
Sbjct: 206 VDGICYMNQSKFVAYVNESTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIP 265
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
F C+ L+H+VL VGYG T +PYWI+KNSWG G+GEKGYFR++RG G+CG
Sbjct: 266 FLCN--PHGLNHAVLTVGYG------TEFGIPYWIVKNSWGVGFGEKGYFRIFRGAGTCG 317
Query: 339 INDYVRSALV 348
IN V +A++
Sbjct: 318 INLVVSTAII 327
>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 367
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 134/333 (40%), Positives = 195/333 (58%), Gaps = 30/333 (9%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D+ L+ HH F F + KTYAT E+ R +F NLR+ + Q + + +
Sbjct: 42 DDLLNAEHH------FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDP-TAAH 94
Query: 87 GLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
G+ +FSDL+ EF+ ++LG K + P+ A+++ ++P LP +DWR++ AVT VKD
Sbjct: 95 GVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKA--PILPTTDLPTDYDWRDHGAVTEVKD 152
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
Q CGS W+FS TG +EG + T +L SLSEQ+L+DCD E D GC+GG ++
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
NAF+ + GGLE E+ YPY G D C+ +K ++ + VS DE +A LV+
Sbjct: 213 NAFEYALK--AGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQIAANLVK 270
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
+GP++VAINA +Q YV GVS P + C ++ H VL+VGYG K P+W
Sbjct: 271 HGPLSVAINAAFMQTYVGGVSCP--YIC---SKRQDHGVLLVGYGSAGYAPIRFKEKPFW 325
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
IIKNSWG+ WGE GY+++ RG CG++ V +
Sbjct: 326 IIKNSWGQNWGENGYYKICRGRNICGVDSMVST 358
>gi|348564702|ref|XP_003468143.1| PREDICTED: cathepsin F-like [Cavia porcellus]
Length = 462
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 190/313 (60%), Gaps = 10/313 (3%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K +LF F+ +N+TY + E RL +F+ N+ Q +Q + G+ YG+ +FSDL+
Sbjct: 159 MKIASLFKKFVATYNRTYESKEETQWRLSVFTRNMILAQKIQALDRGTAQYGVTKFSDLT 218
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF+ YL L+ + A I + + P +DWR+ AVT VK+Q MCGS WAFS
Sbjct: 219 EEEFRTIYLNPLLREHPSKTMRQAKIVHDSAPPEWDWRKKGAVTEVKNQGMCGSCWAFSV 278
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
TGN+EG + K L+SLSEQEL+DCD+ D C GG NA+ I S GGLE E Y
Sbjct: 279 TGNVEGQWFLKKGTLLSLSEQELLDCDKVDKACMGGLPINAYSAIKSL--GGLETEDDYS 336
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y+G +AC + K +V IN V +S++E +A +L GP+++AINA+ +QFY G++H
Sbjct: 337 YQGHMEACNFSAKKAKVYINDSVELSKNEQYLAAWLAVKGPISIAINAFGMQFYRHGIAH 396
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
P+Q C + H++LIVGYG VP+W IKNSWG WGE+GY+ L+RG
Sbjct: 397 PLQPLCSPW--FIDHAMLIVGYG------KRSGVPFWAIKNSWGTDWGEEGYYYLHRGSR 448
Query: 336 SCGINDYVRSALV 348
SCG+N SA+V
Sbjct: 449 SCGVNVMASSAVV 461
>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
Length = 491
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 134/313 (42%), Positives = 188/313 (60%), Gaps = 10/313 (3%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
++ ++F FL +N+TY + E RL IF N+ + Q +Q + G+ YG+ +FSDL+
Sbjct: 188 MQMLSVFKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTARYGITKFSDLT 247
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF+ YL L+ + A P +DWR AVT VK+Q MCGS WAFS
Sbjct: 248 EEEFRTIYLNPLLREDPGKKMRVAKPVGDPAPPEWDWRNKGAVTNVKNQGMCGSCWAFSV 307
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
TGN+EG + K L+SLSEQEL+DCD+ D C GG SNA+ I K GGLE E+ Y
Sbjct: 308 TGNVEGQWFLKQGTLLSLSEQELLDCDKMDKACLGGLPSNAYSAI--KNLGGLETEEDYS 365
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y+G +AC + + +V IN V +S +E +A +L + GP++VAINA+ +QFY G+S
Sbjct: 366 YQGQMQACNFSAEKAKVYINDSVELSHNEQKLAAWLAKKGPISVAINAFGMQFYRHGISR 425
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
P++ C + H+VLIVGYG +P+W IKNSWG WGE+GY+ L+RG G
Sbjct: 426 PLRPLCTPW--LIDHAVLIVGYG------NRSDIPFWAIKNSWGTDWGEQGYYYLHRGSG 477
Query: 336 SCGINDYVRSALV 348
+CG+N SA+V
Sbjct: 478 ACGVNTMASSAVV 490
>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 373
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 199/338 (58%), Gaps = 28/338 (8%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
VV +E L + +H F F ++ KTYAT VE+ R +F NLR+ + Q + S
Sbjct: 39 VVPEENDEQLLNAEHH--FTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLD-PS 95
Query: 84 GVYGLNEFSDLSTAEFQAKYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREYDAVT 139
V+G+ +FSDL+ EF+ K+LG K + P+ D ++P LP FDWRE AVT
Sbjct: 96 AVHGVTQFSDLTPKEFRRKFLGLKRRGFRLPT--DTQTAPILPTSDLPTEFDWREQGAVT 153
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VK+Q MCGS W+FS G +EG + TK+LVSLSEQ+L+DCD E D GC G
Sbjct: 154 PVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSG 213
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAK 249
G ++NAF+ + GGL +E+ YPY G D AC+ +K ++ + VS DE +A
Sbjct: 214 GLMNNAFEYALK--AGGLMKEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAA 271
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD-RTKFTHKA 308
LV++GP+A+AINA +Q Y+ GVS P + C +++ H VL+VG+G K
Sbjct: 272 NLVQHGPLAIAINAMWMQTYIGGVSCP--YVC---SKSQDHGVLLVGFGSSGYAPIRLKE 326
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGS-CGINDYVRS 345
PYWIIKNSWG WGE GY+++ RG + CG++ V +
Sbjct: 327 KPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDTMVST 364
>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 144/359 (40%), Positives = 208/359 (57%), Gaps = 37/359 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYS 61
V+L+ + VSVS V GDE + V T F F ++ K Y ++ E+Y
Sbjct: 11 VSLIFVFVSVS---VCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYY 67
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
R +F NL + Q + S +G+ +FSDL+ +EF+ K+LG FKL P A+++
Sbjct: 68 RFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHLGVKGGFKL-PKDANQA- 124
Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
++P LP FDWR+ AVT VK+Q CGS W+FSTTG +EG + T KLVSLSEQ+
Sbjct: 125 -PILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 183
Query: 178 LIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNK 227
L+DCD E D GC G +++AF+ + GGL EK YPY G D +C+L++
Sbjct: 184 LVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLDR 241
Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
++ + VS +E +A L++NGP+AVAINA +Q Y+ GVS P + C +
Sbjct: 242 SKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YIC---SRR 296
Query: 288 LSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
L+H VL+VGYG ++ K PYWIIKNSWGE WGE G++++ +G CG++ V +
Sbjct: 297 LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 355
>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 182/311 (58%), Gaps = 15/311 (4%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+EG + KT L++LSEQ+L+DCD D GC GG + I GGLE YPY
Sbjct: 147 NVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G D C +N+ +N + E A+ L E GP++ A+NA LQFY+ G+ PI
Sbjct: 205 GVDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
F C+ L+H+VL VGYG T +PYWI+KNSWG G+GEKGYFR++RG G+C
Sbjct: 265 PFLCN--PHGLNHAVLTVGYG------TEFGIPYWIVKNSWGVGFGEKGYFRIFRGAGTC 316
Query: 338 GINDYVRSALV 348
GIN V +A++
Sbjct: 317 GINLVVSTAII 327
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 183/315 (58%), Gaps = 19/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F + K+Y + E+ R +F NLR+ Q + + +G+ +FSDL++AEF+
Sbjct: 53 FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRK 111
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+ LG + D + ++P LP FDWRE AV VK+Q CGS W+FSTTG +EG
Sbjct: 112 QVLGLRKLRLPKDANTAPILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEG 171
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ T +LVSLSEQ+L+DCD E D GC GG +++AF+ + GGL E+
Sbjct: 172 AHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 229
Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ AC+ +K + + VS DE +A LV+NGP+AVAINA +Q Y+
Sbjct: 230 DYPYTGMDRGACKFDKNKVAAGVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIG 289
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P + C + L H VL+VGYG K PYWIIKNSWGE WGE G++++
Sbjct: 290 GVSCP--YIC---SRRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGENGFYKI 344
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 345 CRGRNICGVDSMVST 359
>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
Length = 373
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 191/321 (59%), Gaps = 30/321 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F + K YA+ E+ RL +F NLR+ + Q + S +G+ +FSDL+ +EF+
Sbjct: 56 FSLFKRKFGKVYASSEEHDYRLSVFKANLRRARRHQKLDP-SARHGVTQFSDLTRSEFRK 114
Query: 102 KYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
K+LG FKL P A+++ ++P LP FDWR+ AVT VK+Q CGS W+FS TG
Sbjct: 115 KHLGVRGGFKL-PKDANKA--PILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATG 171
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGL 208
+EG T KLVSLSEQ+L+DCD E D GC GG +++AF+ + GGL
Sbjct: 172 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGL 229
Query: 209 EEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
E+ YPY G D C+L+K ++ + +S DE +A LV+NGP+AVAINA +Q
Sbjct: 230 MREEDYPYTGKDGPTCKLDKSKIVASVSNFSVISIDEDQIAANLVKNGPLAVAINAAYMQ 289
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGEGWGE 324
Y+ GVS P + C L+H VL+VGYG +F K PYWIIKNSWGE WGE
Sbjct: 290 TYIGGVSCP--YIC---ARRLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGESWGE 342
Query: 325 KGYFRLYRGDGSCGINDYVRS 345
G++++ +G CG++ V +
Sbjct: 343 NGFYKICKGRNICGVDSLVST 363
>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
Length = 338
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 193/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 19 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 78
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 79 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 138
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 139 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 198
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 199 I--KNLGGLETVDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 256
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 257 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 308
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 309 TDWGEKGYYYLHRGSGACGVNTMASSAVV 337
>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
Length = 367
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/333 (40%), Positives = 195/333 (58%), Gaps = 30/333 (9%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D+ L+ HH F F + KTYAT E+ R +F NLR+ + Q + + +
Sbjct: 42 DDLLNAEHH------FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDP-TAAH 94
Query: 87 GLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
G+ +FSDL+ EF+ ++LG K + P+ A+++ ++P LP +DWR++ AVT VKD
Sbjct: 95 GVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKA--PILPTTDLPTDYDWRDHGAVTEVKD 152
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
Q CGS W+FS TG +EG + T +L SLSEQ+L+DCD E D GC+GG ++
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
NAF+ + GGLE E+ YPY G D C+ +K ++ + VS DE +A LV+
Sbjct: 213 NAFEYALK--AGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQIAANLVK 270
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
+GP++VAINA +Q YV GVS P + C ++ H VL+VGYG K P+W
Sbjct: 271 HGPLSVAINAAFMQTYVGGVSCP--YIC---SKRQDHGVLLVGYGSAGYAPIRFKEKPFW 325
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
IIKNSWG+ WGE GY+++ RG CG++ V +
Sbjct: 326 IIKNSWGQNWGENGYYKICRGRNICGVDSMVST 358
>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 365
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 134/331 (40%), Positives = 188/331 (56%), Gaps = 25/331 (7%)
Query: 26 GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
GD +L HH F F + K Y + E+ R +F N+R+ + Q + S
Sbjct: 40 GDVRLGAEHH------FLEFKRRFGKAYDSEDEHDYRYKVFKANMRRARRHQSLDP-SAA 92
Query: 86 YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
+G+ FSDL+ +EF+ K LG + D + ++P LP FDWR++ AVT VK+Q
Sbjct: 93 HGVTRFSDLTPSEFRNKVLGLRGVRLPLDANKAPILPTDNLPSDFDWRDHGAVTPVKNQG 152
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC GG +++A
Sbjct: 153 SCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 212
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
F+ I+ GG+ E+ YPY G D C+ +K + + VS DE +A LV+NG
Sbjct: 213 FEYILKS--GGVMREEDYPYSGADSGTCKFDKTKIAASVANFSVVSLDEDQIAANLVKNG 270
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWII 314
P+AVAINA +Q Y+ GVS P + C + L+H VL+VGYG K P+WII
Sbjct: 271 PLAVAINAAYMQTYIGGVSCP--YVC---SRRLNHGVLLVGYGSGAYAPIRMKEKPFWII 325
Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
KNSWGE WGE GY+++ RG CG++ V +
Sbjct: 326 KNSWGENWGENGYYKICRGRNICGVDSMVST 356
>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 137/336 (40%), Positives = 189/336 (56%), Gaps = 26/336 (7%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VV + HL + +H F+ F + K YA+ E+ R +F N R+ + Q +
Sbjct: 30 QVVSETDDSHLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANRRRARRHQLLD-P 86
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
S +G+ +FSDL+ +EF+ YLG K KP P ++P LP FDWR++ AVTGV
Sbjct: 87 SAEHGITKFSDLTPSEFRRTYLGLHKPKPKLNAEKAP-ILPTSDLPADFDWRDHGAVTGV 145
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGS 192
K+Q CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC GG
Sbjct: 146 KNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGL 205
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
++ AF+ + GGL+ EK YPY G D C +K + + + DE +A LV
Sbjct: 206 MTTAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLV 263
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAV 309
++GP+AV INA +Q YV GVS P+ F + H VL+VGY G + KA
Sbjct: 264 KHGPLAVGINAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKA- 317
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
YWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 318 -YWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352
>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 361
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/353 (39%), Positives = 196/353 (55%), Gaps = 23/353 (6%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
F F+ S + +V G++ H L+ H F+ F + K YA+ E+ RL
Sbjct: 10 FALFSSAIAFSDDDPLIRQVVSGNDDNHMLNAEHH---FSLFKAKFGKIYASQEEHDHRL 66
Query: 64 HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIP 122
+F NL + + Q + S +G+ +FSDL+ +EF+ YLG K +P+ P ++P
Sbjct: 67 KVFKANLHRAKRHQLLD-PSAEHGITQFSDLTPSEFRRTYLGLNKPRPNLNAEKAP-ILP 124
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
LP FDWRE AVT VK+Q CGS W+FSTTG +EG + T +LVSLSEQ+L+DCD
Sbjct: 125 TKDLPSDFDWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCD 184
Query: 183 QE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVK 233
E D GC GG ++ AF+ + GGL+ EK YPY G + C +K
Sbjct: 185 HECDPVEKNDCDAGCNGGLMTTAFEYTLK--AGGLQLEKDYPYTGRNGKCHFDKSRIAAS 242
Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
++ + V DE +A L+++GP+AV INA +Q YV GVS P+ F + H VL
Sbjct: 243 VSNFSVVGLDEDQIAANLLKHGPLAVGINAAWMQTYVRGVSCPLICF-----KRQDHGVL 297
Query: 294 IVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
+VGYG + K PYWIIKNSWG+ WGE GY+++ RG CG++ V +
Sbjct: 298 LVGYGSEGFAPIRLKNKPYWIIKNSWGKTWGEHGYYKICRGHHICGVDAMVST 350
>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
Length = 363
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/340 (40%), Positives = 194/340 (57%), Gaps = 36/340 (10%)
Query: 19 VSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
++ + +GD +L ++ F F+E + ++Y+T EY RL IF+ N+ + Q
Sbjct: 36 IARKLKLGDNEL-----LRTEKKFKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAEHQA 90
Query: 79 TEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
+ + V+G+ +FS L + A G P D LP FDWRE AV
Sbjct: 91 LDP-TAVHGVTQFS-LPVSNNAA---GGIAPPLEVD----------GLPENFDWREKGAV 135
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
T VK Q CGS WAFSTTG+IEG T KLVSLS+Q+L+DCD + D+GC
Sbjct: 136 TEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSDQQLLDCDNKCDITEKTSCDNGCN 195
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
GG ++NA++ ++ GGLEEE +YPY G+ C+ + + VKI + ++ DE +A
Sbjct: 196 GGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDPEKIAVKITNFTNIPADENQIAA 253
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKA- 308
YLV+NGP+A+ +NA +Q Y+ GVS P+ C + L+H VL+VGYG
Sbjct: 254 YLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKRLNHGVLLVGYGAKGFSILRLGN 309
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWGE WGE GY++L RG G CGIN V +A+V
Sbjct: 310 KPYWIIKNSWGEKWGEDGYYKLCRGHGMCGINTMVSAAMV 349
>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 135/333 (40%), Positives = 196/333 (58%), Gaps = 30/333 (9%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D+ L+ HH F F + KTYAT E+ R +F NLR+ + Q + + +
Sbjct: 42 DDLLNAEHH------FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDP-TAAH 94
Query: 87 GLNEFSDLSTAEFQAKYLGFK--LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
G+ +FSDL+ EF+ ++LG K L+ P+ A+++ ++P LP +DWR++ AVT VKD
Sbjct: 95 GITKFSDLTPKEFRRQFLGLKRWLRLPTDANKA--PILPTTDLPTDYDWRDHGAVTEVKD 152
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
Q CGS W+FS TG +EG + T +L SLSEQ+L+DCD E D GC+GG ++
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
NAF+ + GGLE E+ YPY G D C+ +K ++ + VS DE +A LV+
Sbjct: 213 NAFEYALK--AGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQIAANLVK 270
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
+GP++VAINA +Q YV GVS P + C ++ H VL+VGYG K P+W
Sbjct: 271 HGPLSVAINAAFMQTYVGGVSCP--YIC---SKRQDHGVLLVGYGSAGYAPIRFKEKPFW 325
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
IIKNSWG+ WGE GY+++ RG CG++ V +
Sbjct: 326 IIKNSWGQNWGENGYYKICRGRNICGVDSMVST 358
>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
Length = 313
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 192/316 (60%), Gaps = 26/316 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F ++ K Y ++ E+Y R +F NL + Q + S +G+ +FSDL+ +EF+ K+L
Sbjct: 3 FKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHL 61
Query: 105 G----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
G FKL P A+++ ++P LP FDWR+ AVT VK+Q CGS W+FSTTG +E
Sbjct: 62 GVKGGFKL-PKDANQA--PILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALE 118
Query: 161 GVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEE 211
G + T KLVSLSEQ+L+DCD E D GC GG +++AF+ + GGL E
Sbjct: 119 GAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT--GGLMRE 176
Query: 212 KTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
K YPY G D +C+L++ ++ + VS +E +A L++NGP+AVAINA +Q Y+
Sbjct: 177 KDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYI 236
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GVS P + C + L+H VL+VGYG ++ K PYWIIKNSWGE WGE G+++
Sbjct: 237 GGVSCP--YIC---SRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYK 291
Query: 330 LYRGDGSCGINDYVRS 345
+ +G CG++ V +
Sbjct: 292 ICKGRNICGVDSLVST 307
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 126/315 (40%), Positives = 184/315 (58%), Gaps = 19/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F + K+Y + E+ R +F NLR+ Q+ + + +G+ +FSDL+ AEF+
Sbjct: 53 FSLFKSKFKKSYGSQEEHDYRFSVFKANLRRAARHQELDP-TASHGVTQFSDLTPAEFRK 111
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+ LG + D + ++P LP FDWR+ AV +K+Q CGS W+FS TG +EG
Sbjct: 112 QVLGLRRLRLPKDANEAPILPTSDLPEDFDWRDKGAVGPIKNQGSCGSCWSFSATGALEG 171
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ T +LVSLSEQ+L+DCD E D GC GG +++AF+ + GGL E+
Sbjct: 172 AHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 229
Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ AC+ +K ++ + VS DE +A LV+NGP+AVAINA +Q Y+
Sbjct: 230 DYPYTGTDRDACKFDKNKVAARVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIG 289
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P + C + L H VL+VGYG + K P+WIIKNSWGE WGE G++++
Sbjct: 290 GVSCP--YIC---SRRLDHGVLLVGYGSAGYSPVRMKEKPFWIIKNSWGEKWGENGFYKI 344
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 345 CRGRNVCGVDSMVST 359
>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
Length = 302
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 186/310 (60%), Gaps = 10/310 (3%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
++F F+ +N+TY + E RL +F N+ + Q +Q + G+ YG+ +FSDL+ E
Sbjct: 2 ASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEE 61
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
F+ YL L+ ++ A P +DWR AVT VKDQ MCGS WAFS TGN
Sbjct: 62 FRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGN 121
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + L+SLSEQEL+DCD+ D C GG SNA+ I K GGLE E Y Y+G
Sbjct: 122 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQG 179
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
++C + + +V IN V +S++E +A +L + GP++VAINA+ +QFY G+S P++
Sbjct: 180 HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLR 239
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
C + H+VL+VGYG VP+W IKNSWG WGEKGY+ L+RG G+CG
Sbjct: 240 PLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACG 291
Query: 339 INDYVRSALV 348
+N SA+V
Sbjct: 292 VNTMASSAVV 301
>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
Length = 367
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 135/333 (40%), Positives = 195/333 (58%), Gaps = 30/333 (9%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D+ L+ HH F F + KTYAT E+ R +F NLR+ + Q + + +
Sbjct: 42 DDLLNAEHH------FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDP-TAAH 94
Query: 87 GLNEFSDLSTAEFQAKYLGFK--LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
G+ +FSDL+ EF+ ++LG K L+ P+ A+++ ++P LP +DWR++ AVT VKD
Sbjct: 95 GITKFSDLTPKEFRRQFLGLKRWLRLPTDANKA--PILPTTDLPTDYDWRDHGAVTEVKD 152
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
Q CGS W+FS TG +EG + T +L SLSEQ+L+DCD E D GC+GG ++
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
NAF+ + GGLE E YPY G D C+ +K ++ + VS DE +A LV+
Sbjct: 213 NAFEYALK--AGGLEREADYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQIAANLVK 270
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
+GP++VAINA +Q YV GVS P + C ++ H VL+VGYG K P+W
Sbjct: 271 HGPLSVAINAAFMQTYVGGVSCP--YIC---SKRQDHGVLLVGYGSAGYAPIRFKEKPFW 325
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
IIKNSWG+ WGE GY+++ RG CG++ V +
Sbjct: 326 IIKNSWGQNWGENGYYKICRGRNICGVDSMVST 358
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 185/321 (57%), Gaps = 24/321 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+E++ K Y++ EY RL IF+ N+ + Q + + ++G+ FSDLS EF+
Sbjct: 7 FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXA-LHGVTPFSDLSEEEFER 65
Query: 102 KYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
+ G +P A + LP +FDWRE AVT VK Q CGS WAFSTTG +
Sbjct: 66 MFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAV 125
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEE 210
EG + TKKL++LSEQ+L+DCD D GCEGG ++NA+ ++ GGLEE
Sbjct: 126 EGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIE--AGGLEE 183
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
E +YPY G C+ V++ + V BE +A LV +GP+AV +NA +Q Y+
Sbjct: 184 ESSYPYTGKHGECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFMQTYI 243
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDR---TKFTHKAVPYWIIKNSWGEGWGEKGY 327
GVS P+ C ++H VL+VGYG +F +K PYWIIKNSWG WGE GY
Sbjct: 244 GGVSCPL--ICP--KRWINHGVLLVGYGAKGYSILRFGYK--PYWIIKNSWGXRWGEHGY 297
Query: 328 FRLYRGDGSCGINDYVRSALV 348
+RL RG G CG+N V SA+V
Sbjct: 298 YRLCRGHGMCGMNTMV-SAVV 317
>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
Length = 355
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 184/325 (56%), Gaps = 22/325 (6%)
Query: 32 HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
HL + +H F+ F + K YA+ E+ R +F NLR+ + Q + S +G+ +F
Sbjct: 41 HLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARRHQLLD-PSAEHGITKF 97
Query: 92 SDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
SDL+ +EF+ YLG K KP P ++P LP +DWR++ AVTGVK+Q CGS
Sbjct: 98 SDLTPSEFRRTYLGLHKPKPKLNAEKAP-ILPTSDLPADYDWRDHGAVTGVKNQGSCGSC 156
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIM 201
W+FSTTG +EG + T +LVSLSEQ+L+DCD E D GC GG ++ AF+ +
Sbjct: 157 WSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDSCDAGCSGGLMTTAFEYTL 216
Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
GGL+ EK YPY G C +K + + + DE +A LV++GP+AV I
Sbjct: 217 K--AGGLQREKDYPYTGKXGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGI 274
Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGE 320
NA +Q YV GVS P+ F + H VL+VGYG K YWIIKNSWGE
Sbjct: 275 NAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGE 329
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRS 345
WGE GY+++ RG CG++ V +
Sbjct: 330 NWGEHGYYKICRGHNICGVDAMVST 354
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 184/320 (57%), Gaps = 23/320 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+E++ K Y++ EY RL IF+ N+ + Q + + ++G+ FSDLS EF+
Sbjct: 61 FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDP-TALHGVTPFSDLSEEEFER 119
Query: 102 KYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
+ G +P A + LP +FDWRE AVT VK Q CGS WAFSTTG +
Sbjct: 120 MFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAV 179
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEE 210
EG + TKKL++LSEQ+L+DCD D GCEGG ++NA+ ++ GGLEE
Sbjct: 180 EGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIE--AGGLEE 237
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
E +YPY G C+ V++ + V +E +A LV +GP+AV +NA +Q Y+
Sbjct: 238 ESSYPYTGKHGECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTYI 297
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDR---TKFTHKAVPYWIIKNSWGEGWGEKGY 327
GVS P+ C ++H VL+VGYG +F +K PYWIIKNSWG+ WGE GY
Sbjct: 298 GGVSCPL--ICP--KRWINHGVLLVGYGAKGYSILRFGYK--PYWIIKNSWGKRWGEHGY 351
Query: 328 FRLYRGDGSCGINDYVRSAL 347
+RL RG G CG+N V + +
Sbjct: 352 YRLCRGHGMCGMNTMVSAVV 371
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/308 (41%), Positives = 178/308 (57%), Gaps = 11/308 (3%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F +++ KTY + Y R +F NL + LQ E G+ YG+ +F DL++ EFQ
Sbjct: 306 LYEEFKQKYKKTYVNDDDEY-RFSVFKENLLRAHQLQTMEQGTAEYGVTQFFDLTSQEFQ 364
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
+YLGFK + + + +FDWR++ AV V DQ CGS WAFST GNIE
Sbjct: 365 IQYLGFKYEDMQDTEEMSPSTRVVMDEDSFDWRDHGAVGPVLDQGKCGSCWAFSTIGNIE 424
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + KT +L+SLSEQ+LIDCD D+GC GG + ++ GGLE YPY+
Sbjct: 425 GQWFLKTGELLSLSEQQLIDCDNVDEGCNGGYPPKTYGAVIKM--GGLELNSDYPYKALA 482
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ C ++++ +V IN V R+E A+ L GP++ A+NA L+FY TG+ H
Sbjct: 483 EKCHMDRQKLKVYINDSVVFPRNEHLQAEALKLMGPLSSALNANPLKFYKTGIMHLPVAS 542
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
C L+H+VL VGYG T +PYW +KNSWG +GE GYFR+YRG G+CGIN
Sbjct: 543 C--FPRALNHAVLTVGYG------TENGLPYWTVKNSWGTAFGEDGYFRIYRGGGTCGIN 594
Query: 341 DYVRSALV 348
V +A +
Sbjct: 595 RLVSTAAI 602
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 123/231 (53%), Gaps = 10/231 (4%)
Query: 95 STAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
S EF KYLG +L + V FDWR++ AV V +Q CGS WAFS
Sbjct: 8 SGEEFANKYLGVQLDELATEEEVDPEEDVTVADDNFDWRQHGAVGPVWNQGPCGSCWAFS 67
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIEG + K+ +L+ LS Q+++DCD D GC GG + + GGL+ + Y
Sbjct: 68 AVGNIEGQWFLKSGELLHLSVQQVLDCDHVDHGCNGGYPPQVYRQVNQM--GGLQLDADY 125
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
Y+ C ++ + +N V +S++E A L GP+A +NA LQFY G+
Sbjct: 126 SYKAAVGKCHTDRSKFRAYVNSSVILSQNEQFQANKLKTIGPLASTLNARTLQFYRKGIM 185
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
HP C+ G L+H+VL VGYG T + +PYWI+KNSW G+GE+
Sbjct: 186 HPTPSACNPG--QLNHAVLTVGYG------TEQGMPYWIVKNSWSRGFGEQ 228
>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
Length = 360
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 137/336 (40%), Positives = 191/336 (56%), Gaps = 31/336 (9%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
VV E L HH F F + K Y + E+ R ++F N+ R+ QLL
Sbjct: 33 VVDGEGLGAEHH------FLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDP-- 84
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
S V+G+ FSDL+ EF+ LG + +D ++ LP+ FDWRE+ AVT
Sbjct: 85 --SAVHGVTRFSDLTPMEFRHSVLGLRGVGLPSDADSAPILRTDNLPKDFDWREHGAVTP 142
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
VK+Q CG+ W+FS TG +EG + T KLVSLSEQ+L+DCD E D GC+GG
Sbjct: 143 VKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGG 202
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAKY 250
+++AF+ I++ GG+ E+ YPY G C+ ++ + + VSRDE +A
Sbjct: 203 LMNSAFEYILNN--GGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIAAN 260
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
LV+NGP+AVAINA +Q YV GVS P + C ++ L+H VL+VGYG + K
Sbjct: 261 LVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQK 315
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 316 PYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVST 351
>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
Length = 327
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 191/336 (56%), Gaps = 14/336 (4%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
L V++ F V+G + + + L+ F ++ K+Y+ + Y R +F NL +
Sbjct: 5 LCFLVALGFFGVLG-SNIPESENARQ--LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLR 60
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
I+ Q+ E G+ YG+ +FSDL+ EF+ +YL K DR I FDW
Sbjct: 61 IKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDREPVPFIRMDVDDDNFDW 120
Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
R + AV V DQ CGS WAFS GNIEG + KT L+ LSEQ+L+DCD+ D+GC GG+
Sbjct: 121 RNHGAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGT 180
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
AF I+ GGL+ + YPY G + CR+ +V ING + DE A+ L
Sbjct: 181 PQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLK 238
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
E GP++ A+NA LQFY G+ HP+ CD ++L+H+VL VGYG +PYW
Sbjct: 239 ETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYG------KEGRLPYW 290
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+KNSW +GE GYFR+YRGDG+CGIN V ++++
Sbjct: 291 TVKNSWSTMFGENGYFRIYRGDGTCGINTLVSTSII 326
>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
Length = 316
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 180/308 (58%), Gaps = 11/308 (3%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F ++ K+Y+ + Y R +F NL +I+ Q+ E G+ YG+ +FSDL+ EF+
Sbjct: 19 LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTAQEFK 77
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
+YL K DR I FDWR + AV V DQ CGS WAFS GNIE
Sbjct: 78 VRYLRSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSAVGNIE 137
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + KT L+ LSEQ+L+DCD+ D+GC GG+ AF I+ GGL+ + YPY G +
Sbjct: 138 GQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGTPQQAFKQILGM--GGLQLDSDYPYEGRE 195
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
CR+ +V ING + DE A+ L E GP++ A+NA LQFY G+ HP+
Sbjct: 196 GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPLPAL 255
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
CD ++L+H+VL VGYG +PYW +KNSW +GE GYFR+YRGDG+CGIN
Sbjct: 256 CDA--QSLNHAVLTVGYG------KEGRLPYWTVKNSWSTMFGENGYFRIYRGDGTCGIN 307
Query: 341 DYVRSALV 348
V ++++
Sbjct: 308 TLVSTSII 315
>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
Length = 358
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 193/336 (57%), Gaps = 31/336 (9%)
Query: 24 VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
VV DE L HH F F + K YAT E+ R ++F N+ R+ QLL
Sbjct: 33 VVDDEGLGAEHH------FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-- 84
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
S V+G+ +FSDL+ EFQ LG + +D ++P LP+ FDWR + AVT
Sbjct: 85 --SAVHGVTQFSDLTPMEFQHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWRGHGAVTP 142
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS-------- 192
VK+Q CGS W+FS TG +EG + T +LVSLSEQ+L+DCD + D E GS
Sbjct: 143 VKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCGSGCNGG 202
Query: 193 -ISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKY 250
+++AF+ I++ GG+ E+ YPY G + C+ +K + + VSRDE +A
Sbjct: 203 LMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIAAN 260
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
LV+NGP+AVAINA +Q YV GVS P + C ++ L+H VL+VGYG + K
Sbjct: 261 LVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQK 315
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 316 PYWIIKNSWGENWGENGYYKICRGRNICGVDSMVST 351
>gi|358339355|dbj|GAA47435.1| cathepsin F [Clonorchis sinensis]
Length = 1157
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 123/301 (40%), Positives = 181/301 (60%), Gaps = 11/301 (3%)
Query: 35 HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
+ H L F + + Y T+ + L N+++ + Q E G+ +YG+ +FSDL
Sbjct: 620 EMNHAGLAVGFGFEQDVPYWTIKNSWGMLWGEEDNIKQAEFYQTLERGTALYGVTQFSDL 679
Query: 95 STAEFQAKYLGFKLKPSYA-DRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
+ EFQ +LG +L Y+ +S ++++P +DWR Y AV V DQ CGS WAF
Sbjct: 680 TGEEFQETFLGLRLDEQYSKSQSYVKKKHSVSIPENYDWRPYGAVGPVLDQGHCGSCWAF 739
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + KT +LVSLS+Q+L+DCD+ GC GG +D+I + GGLE E
Sbjct: 740 SVIGNIEGQWFRKTGQLVSLSKQQLVDCDRSSRGCGGGYPPATYDSI--RRIGGLEIELD 797
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
Y Y G D C N + +N V++++DE +A++L +GP+++A+NA LQFYV+G+
Sbjct: 798 YRYTGRDGVCHQNPRKFVAYVNSSVALTKDENTIAEWLSYHGPISMALNARLLQFYVSGI 857
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
HP +C +++SH+VL VG+G T VP+WI+KNSWG WGE+GYFR+YRG
Sbjct: 858 MHPPAAYCP--VKDISHAVLSVGFG------TKGNVPFWIVKNSWGTLWGEEGYFRIYRG 909
Query: 334 D 334
D
Sbjct: 910 D 910
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 99/229 (43%), Positives = 134/229 (58%), Gaps = 11/229 (4%)
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIPNITLPR-AFDWREYDAVTGVKDQTMCGSSWAFSTT 156
EF+A YL ++S + P+ +FDWR+Y AV V DQ CG+SWAFS
Sbjct: 434 EFKALYLTAMYDHRKLNQSKTTEPETVGEPQDSFDWRDYGAVGPVLDQDRCGASWAFSAI 493
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIEG Y + +L+SLSEQ+L+DCD+ D GC GG+ AF+ I GGLE E YPY
Sbjct: 494 GNIEGQYFMRVHRLLSLSEQQLVDCDRIDQGCAGGTPYGAFEGIQQL--GGLELEADYPY 551
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G C+ N V ING V + +DE +A+YL ++GP++V IN LQ+Y +G+ P
Sbjct: 552 LGHQDNCQSNPLRFVVSINGSVQLPKDEDQIAQYLFDHGPLSVGINGALLQYYSSGIMQP 611
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
+ C+ N H+ L VG+G ++ VPYW IKNSWG WGE+
Sbjct: 612 LWDNCNPAEMN--HAGLAVGFGFEQD------VPYWTIKNSWGMLWGEE 652
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 113/205 (55%), Gaps = 31/205 (15%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
LP FDWREY AV V++Q CGS WA S E++DCD D
Sbjct: 218 LPSYFDWREYGAVGPVRNQGQCGSCWAISA---------------------EVVDCDHAD 256
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
GC GG +A++ + GGLE YPY G + C+ + + ING V++ +D
Sbjct: 257 HGCSGGFPIHAYECVQRL--GGLELAVRYPYVGYQQYCQADPRYFVAYINGSVALPKDSE 314
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+AK+L GP++V ++A LQ+Y +G+ +P +C+ E L+H+VL VG+G T
Sbjct: 315 QIAKFLATFGPLSVVLDARLLQYYRSGILNPSVAYCN--PEELNHAVLSVGFG------T 366
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRL 330
+ +PYWIIKNSWGE WGE+ +L
Sbjct: 367 EQGIPYWIIKNSWGEQWGEQHLTKL 391
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 87/223 (39%), Positives = 117/223 (52%), Gaps = 16/223 (7%)
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG--FKLKPSYADRSVP 118
+ L +S LR+ QL ++ + G NE F YLG F +PS A V
Sbjct: 940 TSLAEYSRELRERQLYEEFKLNYGKVYENE------GMFYFLYLGARFDREPSRAGSMVV 993
Query: 119 AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
+ I P FDWRE AV ++DQ CGS WAFST GNIEG + KT +L++LSEQ+L
Sbjct: 994 DDLGEI--PERFDWRELGAVGPIQDQGDCGSCWAFSTIGNIEGQWFKKTGQLLTLSEQQL 1051
Query: 179 IDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYV 238
IDCD DDGC GG + + I+ GGLE YPY D C++ + + +N +
Sbjct: 1052 IDCDSVDDGCGGGYPPDTYGDIVKM--GGLELNADYPYIAADGVCKMERSKFRAYVNKSL 1109
Query: 239 SVSRDETDMAKYLVENGPMAVAINAYALQ----FYVTGVSHPI 277
+ E A +L +NGP++ INA LQ FY V+ PI
Sbjct: 1110 VLPTKEDQQAVWLSKNGPLSAGINADYLQVVILFYERSVNGPI 1152
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 90/150 (60%), Gaps = 10/150 (6%)
Query: 176 QELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKIN 235
Q+L+DCD D GCEGG +AF + GGL+ YPY +AC+ N K +
Sbjct: 23 QQLVDCDHVDRGCEGGFPLDAFMAVQRL--GGLQLSIDYPYIASRQACQFNPKQAVAFVT 80
Query: 236 GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
G+ ++ R+E +A+YL NGP++V +N+ L+FY +G+ + CD E L+H+ L V
Sbjct: 81 GFAALPRNELLIAEYLHRNGPLSVGLNSRTLKFYNSGILNLAAEQCD--PEALNHAALAV 138
Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
G+G D ++ P+WIIKN++G+ WGE+
Sbjct: 139 GFGTD------ESTPFWIIKNTFGKDWGEQ 162
>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
Length = 327
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 190/336 (56%), Gaps = 14/336 (4%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
L V++ F V+G + + + L+ F ++ K+Y+ + Y R +F NL +
Sbjct: 5 LCFLVALGFFGVLG-SNIPESENARQ--LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLR 60
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
I+ Q+ E G+ YG+ +FSDL+ EF+ +YL K DR I FDW
Sbjct: 61 IKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDREPVPFIRMDVDDDNFDW 120
Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
R + AV V DQ CGS WAFS GNIEG + KT L+ LSEQ+L+DCD D+GC GG+
Sbjct: 121 RNHGAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDGVDEGCNGGT 180
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
AF I+ GGL+ + YPY G + CR+ +V ING + DE A+ L
Sbjct: 181 PQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLK 238
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
E GP++ A+NA LQFY G+ HP+ CD ++L+H+VL VGYG +PYW
Sbjct: 239 ETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYG------KEGRLPYW 290
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+KNSW +GE GYFR+YRGDG+CGIN V ++++
Sbjct: 291 TVKNSWSTMFGENGYFRIYRGDGTCGINTLVSTSII 326
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 181/315 (57%), Gaps = 19/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
+ F + K+Y + E+ R +F NLR+ Q + + +G+ +FSDL++AEF+
Sbjct: 59 LSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRK 117
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
+ LG + D + ++P LP FDWRE AV VK+Q CGS W+FSTTG +EG
Sbjct: 118 QVLGLRKLRLPKDANKAPILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEG 177
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ T +LVSLSEQ+L+DCD E D GC GG +++AF+ + GGL E+
Sbjct: 178 AHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 235
Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ AC+ +K + + VS DE +A LV+NGP+AVA NA +Q Y+
Sbjct: 236 DYPYTGMDRGACKFDKDKVAAGVANFSVVSLDEDQIAANLVKNGPLAVATNAVFMQTYIG 295
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P + C + L H VL+VGYG K PYWIIKNSWGE WGE G++++
Sbjct: 296 GVSCP--YIC---SRRLDHGVLLVGYGSAGYAPVRMKEKPYWIIKNSWGESWGENGFYKI 350
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 351 CRGRNICGVDSMVST 365
>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
Length = 327
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 134/336 (39%), Positives = 190/336 (56%), Gaps = 14/336 (4%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
L V++ F V+G + + + L+ F ++ K+Y+ + Y R +F NL +
Sbjct: 5 LCFLVALGFFGVLG-SNIPESENARQ--LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLR 60
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
I+ Q+ E G+ YG+ +FSDL+ EF+ +YL K DR I FDW
Sbjct: 61 IKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDREPVPFIRMDVDDDNFDW 120
Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
R + AV V D+ CGS WAFS GNIEG + KT L+ LSEQ+L+DCD+ D+GC GG+
Sbjct: 121 RNHGAVGPVLDKGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGT 180
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
AF I+ GGL+ + YPY G + CR+ +V ING + DE A+ L
Sbjct: 181 PQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLK 238
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
E GP + A+NA +LQFY G+ HP+ CD ++L+H+VL VGYG +PYW
Sbjct: 239 ETGPFSSALNALSLQFYTEGILHPLPALCDA--QSLNHAVLTVGYG------KEGRLPYW 290
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+KNSW +GE GYFR+YRGDG CGIN V ++++
Sbjct: 291 TVKNSWSTMFGENGYFRIYRGDGPCGINTLVSTSII 326
>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 179/310 (57%), Gaps = 13/310 (4%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ +YL + P+ ++T+ FDWRE+ AV V DQ CGS WAFS GN
Sbjct: 89 KTRYLRMRFDGPIVSED-PSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGN 147
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + KT L++LSEQ+L+DCD + GC GG + I GGLE YPY G
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDHLEKGCNGGYPPKTYGEIEKM--GGLELASDYPYTG 205
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
D C +N+ +N + E A+ L E GP++ A+NA LQFY+ G+ PI
Sbjct: 206 VDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIP 265
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
F C+ L+H+VL VGYG T +PYWI+KNS G G+GEKGYFR++RG G+CG
Sbjct: 266 FLCN--PHGLNHAVLTVGYG------TEFGIPYWIVKNSLGVGFGEKGYFRIFRGAGTCG 317
Query: 339 INDYVRSALV 348
IN V +A++
Sbjct: 318 INLVVSTAII 327
>gi|322801532|gb|EFZ22193.1| hypothetical protein SINV_14496 [Solenopsis invicta]
Length = 781
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 135/274 (49%), Positives = 179/274 (65%), Gaps = 10/274 (3%)
Query: 30 LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
L V+ LF+ F+ +N+TY++ E RL IF NL I+LLQ TE +G YG+N
Sbjct: 512 LQIAEDVRTERLFDDFVATYNRTYSSPDERNLRLQIFRENLGIIELLQKTEQATGRYGVN 571
Query: 90 EFSDLSTAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQT 145
F+D+S EF+ +YLG L+P ++ +P A PNI LP FDWR+ VT VK+Q
Sbjct: 572 MFADMSREEFRTRYLG--LRPDLQSENEIPLQEAKFPNIELPPTFDWRKKGVVTPVKNQG 629
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
CGS WAFS TGN+EG YA K +L+SLSEQEL+DCD DDGC GG NA+ I KL
Sbjct: 630 GCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDDLDDGCGGGLPDNAYRAI-EKL- 687
Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
GGLE E YPY +++ C K +V++ V+V+ DET MA++LV+NGP+++ INA A
Sbjct: 688 GGLELESDYPYEAENEKCHFKKNLVKVELTSAVNVTSDETQMAQWLVQNGPISIGINANA 747
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+QFY+ GVSHP +F C+ +NL H VLIVGYG
Sbjct: 748 MQFYMGGVSHPFKFLCNP--KNLDHGVLIVGYGT 779
>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
Length = 326
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 181/310 (58%), Gaps = 15/310 (4%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRA-FDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ +YL + + P ++T+ + FDWR++ AV V DQ CGS WAFS GN
Sbjct: 89 KTRYLRMRFDEPIVNED-PTPQEDVTMDNSNFDWRDHGAVGPVLDQGDCGSCWAFSVIGN 147
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG + KT L+ LSEQ+LIDCD D GC+GG + I GGLE YPY G
Sbjct: 148 VEGQWFRKTGDLLGLSEQQLIDCDHSDQGCDGGYPPQTYSAIEEM--GGLELRSDYPYTG 205
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
D C +++ +NG + E AK L E GP++ +NA LQ Y G+ P
Sbjct: 206 KDGICYMDQSKFVAYVNGSTRLPWCEKTQAKSLKEIGPLSSGLNAVLLQLYKRGIMRPR- 264
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
+C+ L+H+VL VGYG++ H+ +PYWI+KNSWG+ +GEKGYFR+YRGDG+CG
Sbjct: 265 -WCNPA--ELNHAVLTVGYGME-----HR-MPYWIVKNSWGKRFGEKGYFRIYRGDGTCG 315
Query: 339 INDYVRSALV 348
IN V +A+V
Sbjct: 316 INRAVTTAVV 325
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 129/324 (39%), Positives = 184/324 (56%), Gaps = 27/324 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F F + +TY E R+ +F+ NLR+ + Q + + +G+ +FSDL+ EF
Sbjct: 56 AHFASFERRFGRTYRDAGERAYRMSVFAANLRRARRHQRLDP-TATHGVTKFSDLTPGEF 114
Query: 100 QAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
+ ++LG + +PS + ++P LP FDWRE+ AV VKDQ CGS W+FS
Sbjct: 115 RDRFLGLR-RPSLEGLVGGEPHEAPILPTDGLPDDFDWREHGAVGPVKDQGSCGSCWSFS 173
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLG 205
T+G +EG + T KL LSEQ+++DCD E D GC GG ++ AF +M
Sbjct: 174 TSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKS-- 231
Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
GGL+ EK YPY G + C+ +K ++ + +S +E +A LV++GP+A+AINA
Sbjct: 232 GGLQSEKDYPYAGRENTCKFDKSKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAY 291
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGE 324
+Q Y+ GVS P F C +L H VL+VGYG K PYWIIKNSWGE WGE
Sbjct: 292 MQTYIGGVSCP--FIC---GRHLDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWGE 346
Query: 325 KGYFRLYRG---DGSCGINDYVRS 345
KGY+++ RG CG++ V S
Sbjct: 347 KGYYKICRGPHDKNKCGVDSMVSS 370
>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
Length = 465
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 171/309 (55%), Gaps = 23/309 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD---TEHGSGVYGLNEFSDLSTAE 98
F F ++NK Y T EY R F NL+ I + S +G+NEF+DLS +E
Sbjct: 28 FRQFQIKYNKQY-TSSEYAERFATFKSNLKVIDEKNRDAASRKSSVRFGVNEFADLSQSE 86
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
F+A YL + +V A +P LP AFDWR AVTGVK+Q CGS W+FSTTGN
Sbjct: 87 FRATYLNSVQAVRDPNAAVAADLPVEDLPTAFDWRTKGAVTGVKNQGQCGSCWSFSTTGN 146
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSISNAFDTIMSKLGGGL 208
+EG + L LSEQ L+DCD E D GC GG NA+ I+ GG+
Sbjct: 147 VEGQWFLAGNTLTGLSEQNLVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYIIK--NGGI 204
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
+ E +YPY+G D C KI+ + VS +ET MA YLV NGP+A+A +A QF
Sbjct: 205 DTEASYPYQGVDGTCSFKAANIGAKISNWTYVSSNETQMAAYLVANGPLAIAADAVEWQF 264
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y+ GV F GN L H +LIVGY + T F HK YWI+KNSWG WGE+GY
Sbjct: 265 YLGGV-----FDVPCGN-TLDHGILIVGYSAENTIF-HKDKAYWIVKNSWGATWGEQGYI 317
Query: 329 RLYRGDGSC 337
+ RG+G C
Sbjct: 318 YISRGNGEC 326
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 140/342 (40%), Positives = 188/342 (54%), Gaps = 32/342 (9%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
VALL+ V + F + D+ + A F F + +NK Y++ Y +RL IF N
Sbjct: 7 VALLAACV-FARFSTMQDQDI--------AAAFKKFTQTYNKKYSSEEHYNARLSIFKEN 57
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRA 129
LR+I+L + +G+ +F+DL+ EF YLG+K + + V T P A
Sbjct: 58 LRRIELFNKNDEAQ--HGITQFADLTHEEFADMYLGYKPQLRNSQAKVSLSSTPFTAPTA 115
Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK-LVSLSEQELIDCD-QEDDG 187
DW AVT VK+Q CGS WAFSTTG+IEG Y + K+ L S SEQ+L+DCD +ED G
Sbjct: 116 IDWTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYVLQLKQNLTSFSEQQLVDCDTKEDQG 175
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYV------SVS 241
C GG + NAF + S LE E YPY D +C+ N+ V + +V +V+
Sbjct: 176 CNGGLMDNAFTYLES---AKLETESAYPYTAVDGSCKYNQSLGVVGVASFVDIEQGKTVA 232
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
E M L GP++VAINA LQFY G+S+P+ C+ L+H VLIVG G +
Sbjct: 233 DTENTMGVALDNIGPLSVAINANNLQFYAGGISNPL--ICNP--NGLNHGVLIVGLGSEN 288
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
K +W +KNSWG WGEKGYFR+ RG G CGIN V
Sbjct: 289 GK------DFWKVKNSWGASWGEKGYFRIVRGKGKCGINRAV 324
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 179/311 (57%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFTLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+ G + KT L++LSEQ+L+DCD DDGC+GG + I GGLE YPY
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K +NG + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+CD N H+VL VGYGV K PYWI+KNSWGE +GEKGYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEKGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A++
Sbjct: 315 GINSIVTTAII 325
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 142/375 (37%), Positives = 204/375 (54%), Gaps = 44/375 (11%)
Query: 4 FYFFAGVALLSLTVSVSSFMVV-------GDEKL-----------HHLHHVKHTALFNYF 45
F+ FA + ++ T+ S +V GD + HH +H F+ F
Sbjct: 5 FFLFAVITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNHHALGAEHH--FSLF 62
Query: 46 LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG 105
+ K+YAT E+ R IF N+R+ + Q + S ++G+ +FSDL+ EF+ +LG
Sbjct: 63 KRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFD-PSAIHGVTQFSDLTPFEFRKAFLG 121
Query: 106 FK---LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
+ L+ + P ++P LP FDWR++ VT VK+Q CGS W+FSTTG +EG
Sbjct: 122 LRGHRLRLPVDTNAAP-ILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGA 180
Query: 163 YAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +LVSLSEQ+L+DCD E D GC GG +++AF+ + GGL +E+
Sbjct: 181 NFLATGELVSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLK--AGGLMKEQD 238
Query: 214 YPYRGDDK-ACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ C +K I + V S DE +A LV+NGP+A+AINA +Q Y+
Sbjct: 239 YPYAGIDRNTCNFDKSKIAASIANFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIG 298
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P F C ++ L H VL+VGYG + YWIIKNSWGE WGE GY+++
Sbjct: 299 GVSCP--FIC---SKRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKI 353
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 354 CRGRNICGVDSLVST 368
>gi|1185457|gb|AAA87848.1| cathepsin L, partial [Schistosoma japonicum]
Length = 224
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 116/223 (52%), Positives = 149/223 (66%), Gaps = 9/223 (4%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
+P FDWRE AVT VK+Q MCGS WAFSTTGNIE + KT KL+SLSEQ+L+DCD D
Sbjct: 10 IPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQLVDCDSLD 69
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
DGC GG SNA+++I+ GGL E YPY ++ C L IN V++++DE+
Sbjct: 70 DGCNGGLPSNAYESIIRM--GGLMLEDNYPYDAKNEKCHLKVGNVAAYINSSVNLTQDES 127
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
++A +L + ++V +NA LQFY G+SHP FC L H+VL+VGYGV +
Sbjct: 128 ELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFCS--KYLLDHAVLLVGYGV-----S 180
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
K P+WI+KNSWG WGEKGYFR+YRGDG+CGIN SAL+
Sbjct: 181 EKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTGATSALI 223
>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/331 (41%), Positives = 194/331 (58%), Gaps = 32/331 (9%)
Query: 32 HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTEHGSGVYGL 88
HL + +H F F + K YAT E+ R +F NL +K Q++ T +G+
Sbjct: 43 HLLNAEHH--FTTFKSKFGKNYATQEEHDYRFSVFKANLLRAKKHQIMDPT----AAHGV 96
Query: 89 NEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
+FSDL+ EF+ + LG K + P+ A+++ ++P LP FDWR++ AVT VKDQ
Sbjct: 97 TKFSDLTPKEFRRQLLGLKRRLRLPTDANKA--PILPTGDLPTDFDWRDHGAVTSVKDQG 154
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
CGS W+FS TG +EG + T +LVSLSEQ+L+DCD E D GC GG ++NA
Sbjct: 155 SCGSCWSFSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDSGCSGGLMNNA 214
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
F+ + GGLE EK YPY G+D+ AC+ K ++ + VS DE +A LV++G
Sbjct: 215 FEYALK--AGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSLDEDQIAANLVKHG 272
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWII 314
P++VAINA +Q Y+ GVS P + C +++ H VL+VGYG K P+WII
Sbjct: 273 PLSVAINAVFMQTYIGGVSCP--YIC---SKHQDHGVLLVGYGAAGYAPIRFKEKPFWII 327
Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
KNSWGE WGE GY+++ R CG++ V +
Sbjct: 328 KNSWGENWGENGYYKICRARNICGVDSMVST 358
>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
Length = 343
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 137/355 (38%), Positives = 192/355 (54%), Gaps = 36/355 (10%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
L TV VSS + +E+ L F ++ NK Y+ EY R IF NL
Sbjct: 8 VLAVFTVFVSSRGIPPEEQSQFLE----------FQDKFNKKYSH-EEYLERFEIFKSNL 56
Query: 71 RKIQ---LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN---I 124
KI+ L+ +G+N+F+DLS+ EF+ YL K D V + +
Sbjct: 57 GKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFIN 116
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
++P AFDWR AVT VK+Q CGS W+FSTTGN+EG + KLVSLSEQ L+DCD E
Sbjct: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176
Query: 185 ----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVK 233
D+GC GG NA++ I+ GG++ E +YPY + C N K
Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETGTQCNFNSANIGAK 234
Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
I+ + + ++ET MA Y+V GP+A+A +A QFY+ GV F +L H +L
Sbjct: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGV-----FDIPCNPNSLDHGIL 289
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
IVGY T F K +PYWI+KNSWG WGE+GY L RG +CG++++V ++++
Sbjct: 290 IVGYSAKNTIF-RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 132/325 (40%), Positives = 184/325 (56%), Gaps = 28/325 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+E++ K+Y T EY R IF NL + Q + + V+G+ +FSDLS EF+
Sbjct: 89 FVMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDP-TAVHGVTQFSDLSEEEFER 147
Query: 102 KYLGFKLKPSYADRSVPAMIPNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
++ ++ +P M + LP FDWR+ AVT VK Q CGS WAF
Sbjct: 148 MFM--GVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAF 205
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD---------GCEGGSISNAFDTIMSKL 204
ST G +EG T L++LSEQ+L+DCD D GC GG ++NA+ ++
Sbjct: 206 STCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQS- 264
Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
GGLEEE +YPY G C VK++ + ++ DE +A +LV +GP+AV +NA
Sbjct: 265 -GGLEEESSYPYTGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAV 323
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWG 323
+Q Y+ GVS P+ C G ++H VL+VGYG + + + +PYW+IKNSWGE WG
Sbjct: 324 FMQTYIGGVSCPL--IC--GKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWG 379
Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
E GY+RL RG G CGIN V SA+V
Sbjct: 380 EHGYYRLCRGHGMCGINTMV-SAVV 403
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 178/311 (57%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+EG + KT L++LSEQ+L+DCD D GC+GG + I GGLE YPY
Sbjct: 147 NVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K ING + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP- 263
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
CD N H+VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 264 -RLCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A++
Sbjct: 315 GINSIVTTAII 325
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 132/325 (40%), Positives = 184/325 (56%), Gaps = 28/325 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+E++ K+Y T EY R IF NL + Q + + V+G+ +FSDLS EF+
Sbjct: 89 FVMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDP-TAVHGVTQFSDLSEEEFER 147
Query: 102 KYLGFKLKPSYADRSVPAMIPNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
++ ++ +P M + LP FDWR+ AVT VK Q CGS WAF
Sbjct: 148 MFM--GVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAF 205
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD---------GCEGGSISNAFDTIMSKL 204
ST G +EG T L++LSEQ+L+DCD D GC GG ++NA+ ++
Sbjct: 206 STCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQS- 264
Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
GGLEEE +YPY G C VK++ + ++ DE +A +LV +GP+AV +NA
Sbjct: 265 -GGLEEESSYPYTGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAV 323
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWG 323
+Q Y+ GVS P+ C G ++H VL+VGYG + + + +PYW+IKNSWGE WG
Sbjct: 324 FMQTYIGGVSCPL--IC--GKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWG 379
Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
E GY+RL RG G CGIN V SA+V
Sbjct: 380 EHGYYRLCRGHGMCGINTMV-SAVV 403
>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
Length = 548
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 132/329 (40%), Positives = 191/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 229 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 288
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 289 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 348
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 349 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 408
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 409 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISV 466
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 467 AINAFGMQFYRHGISRPLRPLCS--PWLIDHAVLLVGYG------NRSDVPFWAIKNSWG 518
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+ G +CG+N ++V
Sbjct: 519 TDWGEKGYYYLHCGSEACGVNTMASLSVV 547
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 179/311 (57%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+ G + KT L++LSEQ+L+DCD DDGC+GG + I GGLE YPY
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K +NG + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+CD N H+VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A++
Sbjct: 315 GINSIVTTAII 325
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 179/311 (57%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+EG + KT L++LSEQ+L+DCD D GC+GG + I GGLE YPY
Sbjct: 147 NVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K ING + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+CD N H+VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A++
Sbjct: 315 GINSIVTTAII 325
>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
Length = 343
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 137/355 (38%), Positives = 192/355 (54%), Gaps = 36/355 (10%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
L TV VSS + +E+ L F ++ NK Y+ EY R IF NL
Sbjct: 8 VLAVFTVFVSSRGIPLEEQSQFLE----------FQDKFNKKYSH-EEYLERFEIFKSNL 56
Query: 71 RKIQ---LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN---I 124
KI+ L+ +G+N+F+DLS+ EF+ YL K D V + +
Sbjct: 57 GKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFIN 116
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
++P AFDWR AVT VK+Q CGS W+FSTTGN+EG + KLVSLSEQ L+DCD E
Sbjct: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176
Query: 185 ----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVK 233
D+GC GG NA++ I+ GG++ E +YPY + C N K
Sbjct: 177 CMEYEGEQACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETGTQCNFNSANIGAK 234
Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
I+ + + ++ET MA Y+V GP+A+A +A QFY+ GV F +L H +L
Sbjct: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGV-----FDIPCNPNSLDHGIL 289
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
IVGY T F K +PYWI+KNSWG WGE+GY L RG +CG++++V ++++
Sbjct: 290 IVGYSAKNTIF-RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343
>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
Length = 353
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 138/356 (38%), Positives = 200/356 (56%), Gaps = 30/356 (8%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLE---QHNKTYATLVEYYSRLHIFSGN 69
L+ V ++ ++ D++ + + TA+ ++FL+ + + Y EY RL +F N
Sbjct: 7 LTFLVILACGILAFDQETYQ--PLSETAVRDHFLDFTRKFQRFYKGPEEYEYRLKVFREN 64
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP----AMIPNI- 124
+ + + + G+ YG+ +FSDL++ EF+ YL K P + + M+ N
Sbjct: 65 IETSRRM-NIREGNNNYGITKFSDLTSDEFRKFYLMEKKTPKEIQKMMRMDSNKMVSNSY 123
Query: 125 --TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
P +DWR + A+TGVKDQ CGS WAFS G+IEG YA K K+LVS SEQ+L+DCD
Sbjct: 124 AKPAPDHYDWRNHGAITGVKDQGQCGSCWAFSAIGSIEGSYAIKHKQLVSFSEQQLVDCD 183
Query: 183 QE----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV 232
DDGC GG +A+ +M GG+ EK YPY + C +
Sbjct: 184 NNCVTFENQQSCDDGCNGGLQWSAYQYLMK--AGGVVTEKDYPYYAERYKCEVKPANFVA 241
Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
K++ + +S +ET+MA +L ENGP+AVA+NA LQ Y G++ P +CD L H V
Sbjct: 242 KLSNWTMLSTNETEMANWLAENGPIAVALNADFLQNYNNGIADPA--WCDP--TQLDHGV 297
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
LIVGYG++ T + K PYWI+KNSWG +GE GYFR+ +G G CGIN +A V
Sbjct: 298 LIVGYGLE-TFWFGKPQPYWIVKNSWGYDFGEDGYFRIVKGVGRCGINTVPSAAFV 352
>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
Length = 371
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 133/337 (39%), Positives = 185/337 (54%), Gaps = 30/337 (8%)
Query: 26 GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
GD+ L+ H F F+++ K+Y E+ RL IF NLR+ + Q + S
Sbjct: 35 GDDNELELNAESH---FLSFVQRFGKSYKDAEEHAYRLSIFKANLRRARRHQLLDP-SAE 90
Query: 86 YGLNEFSDLSTAEFQAKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+G+ +FSDL+ AEF+ YLG + L+ + ++P LP FDWR++ AVT
Sbjct: 91 HGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGKSANEAPVLPTDGLPDDFDWRDHGAVT 150
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VK+Q CGS W+FST+G +EG + T KL LSEQ+++DCD D GC G
Sbjct: 151 PVKNQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEPDSCDSGCNG 210
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
G ++NAF + GGLE EK YPY G D C+ +K + + VS DE +A
Sbjct: 211 GLMTNAFSYLQK--AGGLESEKDYPYTGSDDKCKFDKSKIVASVQNFSVVSVDEGQIAAN 268
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
L+++GP+A+ INA +Q Y+ GVS P + C L H VL+VGYG K
Sbjct: 269 LIKHGPLAIGINAAYMQTYIGGVSCP--YIC---GRTLDHGVLLVGYGAAGFAPIRLKDK 323
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGD---GSCGINDYV 343
PYWIIKNSWGE WGE GY+++ RG CG++ V
Sbjct: 324 PYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMV 360
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 178/311 (57%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+ G + KT L++LSEQ+L+DCD DDGC+GG + I GGLE YPY
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K +NG + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+CD N H VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HGVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A++
Sbjct: 315 GINSIVTTAII 325
>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
Length = 346
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 192/334 (57%), Gaps = 32/334 (9%)
Query: 34 HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTE-HGSGV-YGLNE 90
H ++ T F F +++NK Y++ EY ++ F NL I QL Q + H S +G+NE
Sbjct: 22 HTIEQTQ-FVAFQQKYNKVYSS-NEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNE 79
Query: 91 FSDLSTAEFQAKYLGFKL-KPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMC 147
F+DLS AEF+ YL ++ KP + P + + T+P AFDWR AVTGVK+Q C
Sbjct: 80 FADLSAAEFRKYYLNAQVAKPDASLPMAPLLTEEVLETIPTAFDWRTKGAVTGVKNQGQC 139
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSISNAF 197
GS W+FSTTGNIEG + LV LSEQ L+DCD + D GC+GG NA+
Sbjct: 140 GSCWSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQPNAY 199
Query: 198 DTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
++ GGL+ E +YPY GD +C+ KI+ + + ++ET MA YL +
Sbjct: 200 RYVIEN--GGLDSENSYPYLAVTGD--SCKFKSGNVAAKISNFTMIPQNETQMAGYLATH 255
Query: 255 GPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
GP+A+A +A QFY+ GV C ++L H +LIVG+ ++ F H PYWI+
Sbjct: 256 GPLAIAADAAEWQFYIGGV---FDLPC---GQSLDHGILIVGFSAEKNIFGHLK-PYWIV 308
Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
KNSWG WGE+GY L +G CG++D+V ++ +
Sbjct: 309 KNSWGASWGEQGYLYLGKGKNLCGVSDFVSTSTI 342
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 178/311 (57%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 ETRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+ G + KT L++LSEQ+L+DCD DDGC+GG + I GGLE YPY
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K +NG + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+CD N H+VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A +
Sbjct: 315 GINSIVTTARI 325
>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
vulgare]
gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 377
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 181/323 (56%), Gaps = 31/323 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+++ KTY E+ RL +F NLR+ + Q + S +G+ +FSDL+ AEF+
Sbjct: 53 FVGFVQRFGKTYRDAEEHAHRLSVFKANLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRR 111
Query: 102 KYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
YLG K L+ ++P LP FDWR++ AV VK+Q CGS W+FS
Sbjct: 112 TYLGLKTTRRSFLREMAGSAHDAPVLPTDGLPEDFDWRDHGAVGPVKNQGSCGSCWSFSA 171
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
+G +EG + K+ LSEQ+L+DCD E D GC GG +++AF ++ G
Sbjct: 172 SGALEGANYLASGKMEVLSEQQLVDCDHECDPSEPDSCDAGCNGGLMTSAFSYLLKS--G 229
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
GLE EK YPY G D C+ +K + Y V+ DE +A LV+ GP+A+ INA +
Sbjct: 230 GLEREKDYPYTGKDGTCKFDKSKIAASVQNYSVVAVDEEQIAANLVKYGPLAIGINAAYM 289
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWGEGWG 323
Q Y+ GVS P + C +L H VL+VGYG ++F K PYWIIKNSWGE WG
Sbjct: 290 QTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPSRFKEK--PYWIIKNSWGENWG 342
Query: 324 EKGYFRLYRGD---GSCGINDYV 343
+KGY+++ RG CG++ V
Sbjct: 343 DKGYYKICRGSNVRNKCGVDSMV 365
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 134/340 (39%), Positives = 186/340 (54%), Gaps = 29/340 (8%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VVG + + L + A F F+ + K+Y E+ RL +F NLR+ + Q +
Sbjct: 40 QVVGGDAENELE-LNAEAHFASFVRRFGKSYRDADEHEHRLSVFRANLRRARRHQRLD-P 97
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYD 136
S V+G+ +FSDL+ EF+ ++LG + LK +P LP FDWRE+
Sbjct: 98 SAVHGITKFSDLTPDEFRERFLGLRKSRRSFLKGISGSAHDAPALPTDGLPTEFDWREHG 157
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDG 187
AV VKDQ CGS W+FST+G +EG T KL LSEQ+L+DCD E D G
Sbjct: 158 AVGPVKDQGSCGSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHECDPSEPRACDAG 217
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDM 247
C GG ++ AF + GGLE EK YPY G + AC+ +K ++ + +V+ DE +
Sbjct: 218 CNGGLMTTAFSYLAK--AGGLETEKDYPYTGRNSACKFDKSKIAAQVKNFSTVAIDEDQI 275
Query: 248 AKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTH 306
A LV++GP+A+ INA +Q Y+ GVS P + C +L H V +VGYG
Sbjct: 276 AANLVKHGPLAIGINAVFMQTYIGGVSCP--YIC---GRHLDH-VFLVGYGSAGYAPLRF 329
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGINDYV 343
K PYWIIKNSWGE WGE GY+++ RG CG++ V
Sbjct: 330 KEKPYWIIKNSWGENWGESGYYKICRGPHVKNKCGVDSMV 369
>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 377
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 179/321 (55%), Gaps = 27/321 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+++ KTY E+ RL +F NLR+ + Q + S +G+ +FSDL+ AEF+
Sbjct: 53 FTSFVQRFGKTYKDAEEHAHRLSVFKANLRRARRHQLLDP-SAEHGITKFSDLTPAEFRR 111
Query: 102 KYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
+LG K R + ++P LP FDWR++ AV VK+Q CGS W+FS
Sbjct: 112 TFLGLKTSRRSFLREIGGSAHDAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSA 171
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
+G +EG T K+ LSEQ+ +DCD E D GC GG +++AF ++ G
Sbjct: 172 SGALEGANYLATGKMEVLSEQQFVDCDHECDPEEPDSCDAGCNGGLMTSAFSYLLKS--G 229
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
GLE EK YPY G D C+ +K + + VS DE +A LV++GP+A+ INA +
Sbjct: 230 GLEREKDYPYTGRDGTCKFDKSKIVASVQNFSVVSVDEEQIAANLVKHGPLAIGINAAYM 289
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH-KAVPYWIIKNSWGEGWGEK 325
Q Y+ GVS P + C +L H VL+VGYG + K PYW+IKNSWGE WGEK
Sbjct: 290 QTYIGGVSCP--YIC---GRSLDHGVLLVGYGASGFAPSRLKNKPYWVIKNSWGENWGEK 344
Query: 326 GYFRLYRGD---GSCGINDYV 343
GY+++ RG CG++ V
Sbjct: 345 GYYKICRGSNVRNKCGVDSMV 365
>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 371
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 171/315 (54%), Gaps = 18/315 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F + KTY T E+ R +F NLRK + Q + V+G+ FSDL+ +EF+
Sbjct: 58 FQDFKLKFGKTYTTDEEHDYRFRVFKANLRKAKRHQKLDP-DAVHGVTRFSDLTESEFRE 116
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
++G AD ++P L FDWR+ AVT VKDQ CGS W+FS G +EG
Sbjct: 117 NFVGLNRLRLPADAHQAPILPTDNLASDFDWRDQGAVTPVKDQGSCGSCWSFSAVGALEG 176
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
T KL+SLSEQ+L+DCD E D GC GG +++AF+ I+ GGLE E+
Sbjct: 177 ANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVK--AGGLEREE 234
Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ +C+ + +S D +A LV+NGP+A+ INA +Q Y+
Sbjct: 235 DYPYTGTDRGSCKFQNGKIAASAANFSVISNDADQIAANLVKNGPLAIGINAVFMQTYMK 294
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
G+S P + C NL H VL+VGYG K PYWIIKNSWGE WGE GY+ +
Sbjct: 295 GISCP--YIC--SKRNLDHGVLLVGYGAAGFAPIRLKEKPYWIIKNSWGENWGENGYYFI 350
Query: 331 YRGDGSCGINDYVRS 345
+G CG V S
Sbjct: 351 CKGKNICGSESMVSS 365
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 177/311 (56%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+ G + KT L++LSEQ+L+DCD D GC+GG + I GGLE YPY
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K ING + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP- 263
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
CD N H+VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 264 -RLCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A++
Sbjct: 315 GINSIVTTAII 325
>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
Length = 271
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 121/282 (42%), Positives = 166/282 (58%), Gaps = 14/282 (4%)
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAMIPNITLP 127
L + LQ+ E G+ YG+ +FSDL++ EF+ +YL + P ++ P ++T+
Sbjct: 1 QLAAAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYLRMRFDGPIVSEDLTPE--EDVTMD 58
Query: 128 -RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
FDWRE+ AV V DQ CGS WAFS GN+EG + KT L++LSEQ+L+DCD D
Sbjct: 59 NEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDHLDK 118
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
GC GG + I GGLE YPY G D C +N+ +N + E
Sbjct: 119 GCNGGYPPKTYGEIEKM--GGLELASDYPYTGVDGICYMNQSKFVAYVNDSTVLPLSEKI 176
Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
A+ L E GP++ A+NA LQFY+ G+ PI F C+ L+H+VL VGYG T
Sbjct: 177 QAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPFLCN--PHGLNHAVLTVGYG------TE 228
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+PYWI+KNSWG G+GEKGYFR++RG G+CGIN V +A++
Sbjct: 229 FGIPYWIVKNSWGVGFGEKGYFRIFRGAGTCGINLVVSTAII 270
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 174/310 (56%), Gaps = 15/310 (4%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ +YL + P+ ++T+ FDWRE+ AV V DQ CGS WAFS GN
Sbjct: 89 KTRYLRMRFDGPIVSED-PSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGN 147
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+ G + KT L++LSEQ+L+DCD D GC+GG + I GGLE YPY G
Sbjct: 148 VVGQWFRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKM--GGLELASDYPYTG 205
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
C ++K ING + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 206 VGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-- 263
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
CD N H+VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+CG
Sbjct: 264 RLCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTCG 315
Query: 339 INDYVRSALV 348
IN V +A +
Sbjct: 316 INSIVTTARI 325
>gi|255585361|ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis]
Length = 381
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 186/321 (57%), Gaps = 25/321 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAE 98
F F+ +++K Y T EY RL +F+ NL + Q+L T V+G+ F DL+ E
Sbjct: 67 FKMFMIKYDKEYDTREEYMHRLGVFAKNLIRAAEHQVLDPT----AVHGITPFMDLTEEE 122
Query: 99 FQAKYLGFKLKPSYADRSVPA--MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+ Y G + V A + LP +FDWR+ AVT VK Q CGS WAFSTT
Sbjct: 123 FERMYTGVVGGGAVGAEGVTATSFLETAGLPSSFDWRKKGAVTDVKMQGACGSCWAFSTT 182
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGG 207
G IEG T KL++LSEQ+L+DCD+ DDGC GG ++NA+ ++ GG
Sbjct: 183 GAIEGANFIATGKLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIE--AGG 240
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
LE+E +YPY G C+ ++K V++ + S+ DE +A +LV +GP+A+ +NA +Q
Sbjct: 241 LEDEISYPYTGKPGKCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQ 300
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV-PYWIIKNSWGEGWGEKG 326
Y+ GVS P+ C G + ++H VL+VGYG PYWIIKNSWG+ WGE+G
Sbjct: 301 TYIGGVSCPL--IC--GKKWINHGVLLVGYGAKGFSILRLGYKPYWIIKNSWGKRWGEEG 356
Query: 327 YFRLYRGDGSCGINDYVRSAL 347
Y+R+ +G G CG++ V + +
Sbjct: 357 YYRICKGYGMCGMDRMVSAVV 377
>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
Length = 377
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 178/320 (55%), Gaps = 31/320 (9%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+++ KTY E+ RL +F NLR+ + Q + S +G+ +FSDL+ AEF+ +L
Sbjct: 56 FVQRFGKTYRDAEEHAHRLSVFKANLRRARRHQMLDP-SAEHGVTKFSDLTPAEFRRTFL 114
Query: 105 GFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
G K L+ ++P LP FDWR++ AV VK+Q C S W+FS +G
Sbjct: 115 GLKTTRRSFLREMAGSAHDAPVLPTDGLPEDFDWRDHGAVGPVKNQGSCWSCWSFSASGA 174
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLE 209
+EG T K+ LSEQ+L+DCD E D GC GG +++AF ++ GGLE
Sbjct: 175 LEGANYLATGKMEVLSEQQLVDCDHECDPAEPDSCDAGCNGGLMTSAFSYLLKS--GGLE 232
Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
EK YPY G D C+ K + + V+ DE +A LVE GP+A+ INA +Q Y
Sbjct: 233 REKDYPYTGKDGTCKFEKSKIAASVQNFSVVAVDEEQIAANLVEYGPLAIGINAAYMQTY 292
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWGEGWGEKG 326
+ GVS P + C +L H VL+VGYG ++F K PYWIIKNSWGE WG+KG
Sbjct: 293 IGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPSRFKEK--PYWIIKNSWGENWGDKG 345
Query: 327 YFRLYRGD---GSCGINDYV 343
Y+++ RG CG++ V
Sbjct: 346 YYKICRGSNVRNKCGVDSMV 365
>gi|327358519|gb|AEA51106.1| cathepsin F, partial [Oryzias melastigma]
Length = 255
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 120/260 (46%), Positives = 161/260 (61%), Gaps = 11/260 (4%)
Query: 90 EFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPR-AFDWREYDAVTGVKDQTMCG 148
+FSDL+ EF + YL L R + P + ++DWR++ AV+ VK+Q MCG
Sbjct: 5 KFSDLTEEEFHSAYLNPLLSQWTLHREMKPAPPAKSPAPDSWDWRDHGAVSPVKNQGMCG 64
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
S WAFS TGNIEG + K L+SLSEQEL+DCD D C GG SNA++ I KL GGL
Sbjct: 65 SCWAFSVTGNIEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKL-GGL 122
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
E E Y Y G + C + IN V + +DE ++A +L ENGP++VA+NA+A+QF
Sbjct: 123 ETETDYSYTGKKQRCDFTNRKVAAYINSSVELPKDEKEIAAWLAENGPISVALNAFAMQF 182
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GVSHP + FC+ + H+VL+VGYG +P+W IKNSWGE +GE+GY+
Sbjct: 183 YKKGVSHPWKIFCNPW--MIDHAVLLVGYG------ERNGIPFWAIKNSWGEDYGEQGYY 234
Query: 329 RLYRGDGSCGINDYVRSALV 348
L+RG +CGIN SA+V
Sbjct: 235 YLHRGSNACGINKMGSSAVV 254
>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
Length = 365
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 182/324 (56%), Gaps = 31/324 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ-----------LLQDTEHGSGVYGLNE 90
F +FL+Q+NK+Y EY R ++F NL KI D+ S +G+N+
Sbjct: 55 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 114
Query: 91 FSDLSTAEFQAKYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
FSD + E GF L S + + PNI LP +DWR+ + VT +KDQ +
Sbjct: 115 FSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPNIRLPDYYDWRDTNKVTPIKDQGV 174
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
CGS WAF GNIE YA + KL+ LSEQ+L+DCD+ D GC GG + AF ++ L G
Sbjct: 175 CGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELL--LMG 232
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAYA 265
G+E E YPY+G ++ C L+ + VK+N RDE + + + GP+A+A++A
Sbjct: 233 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMD 292
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
+ Y G+ + + +L+H+VL++G+G++ VPYWIIKNSWGE WGE
Sbjct: 293 IINYRRGILNQCHIY------DLNHAVLLIGWGIENN------VPYWIIKNSWGEDWGEN 340
Query: 326 GYFRLYRGDGSCG-INDYVRSALV 348
GY R+ R +CG +N++ S+++
Sbjct: 341 GYLRVRRNVNACGLLNEFGASSVI 364
>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
Group]
gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
Length = 373
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 130/337 (38%), Positives = 184/337 (54%), Gaps = 30/337 (8%)
Query: 26 GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
GD+ L+ +H F F+++ K+Y E+ RL +F NLR+ + Q + S
Sbjct: 37 GDDNELELNAERH---FASFVQRFGKSYRDADEHAYRLSVFKANLRRARRHQLLDP-SAE 92
Query: 86 YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVT 139
+G+ +FSDL+ AEF+ YLG + R + ++P LP FDWR++ AV
Sbjct: 93 HGVTKFSDLTPAEFRRAYLGLRTSRRAFLRGLGGSAHEAPVLPTDGLPDDFDWRDHGAVG 152
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VK+Q CGS W+FS +G +EG T K+ LSEQ+++DCD E D GC G
Sbjct: 153 PVKNQGSCGSCWSFSASGALEGANYLATGKMDVLSEQQMVDCDHECDSSEPDSCDAGCNG 212
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
G ++NAF ++ GGLE EK YPY G D C+ +K + + VS DE +A
Sbjct: 213 GLMTNAFSYLLKS--GGLESEKDYPYTGRDGTCKFDKSKIVTSVQNFSVVSVDEDQIAAN 270
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
LV++GP+A+ INA +Q Y+ GVS P + C +L H VL+VGYG K
Sbjct: 271 LVKHGPLAIGINAAYMQTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPIRLKDK 325
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGD---GSCGINDYV 343
YWIIKNSWGE WGE GY+++ RG CG++ V
Sbjct: 326 AYWIIKNSWGENWGEHGYYKICRGSNVRNKCGVDSMV 362
>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
Australia]
Length = 367
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 182/324 (56%), Gaps = 31/324 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ-----------LLQDTEHGSGVYGLNE 90
F +FL+Q+NK+Y EY R ++F NL KI D+ S +G+N+
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116
Query: 91 FSDLSTAEFQAKYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
FSD + E GF L S + + PNI LP +DWR+ + VT +KDQ +
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPNIRLPDYYDWRDTNKVTPIKDQGV 176
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
CGS WAF GNIE YA + KL+ LSEQ+L+DCD+ D GC GG + AF ++ L G
Sbjct: 177 CGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELL--LMG 234
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAYA 265
G+E E YPY+G ++ C L+ + VK+N RDE + + + GP+A+A++A
Sbjct: 235 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMD 294
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
+ Y G+ + + +L+H+VL++G+G++ VPYWIIKNSWGE WGE
Sbjct: 295 IINYRRGILNQCHIY------DLNHAVLLIGWGIENN------VPYWIIKNSWGEDWGEN 342
Query: 326 GYFRLYRGDGSCG-INDYVRSALV 348
GY R+ R +CG +N++ S+++
Sbjct: 343 GYLRVRRNVNACGLLNEFGASSVI 366
>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 177/311 (56%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+ G + +T L++LS Q+L+DCD DDGC+GG + I GGLE YPY
Sbjct: 147 NVVGQWFRETGHLLALSGQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K +NG + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+CD N H+VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A +
Sbjct: 315 GINSIVTTARI 325
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 178/311 (57%), Gaps = 17/311 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
AL+ F ++ KTY+ + R IF NL + + LQ+ E G+ YG+ +FSDL++ EF
Sbjct: 30 ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88
Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +YL + P ++ P ++T+ FDWRE+ AV V DQ CGS WAFS G
Sbjct: 89 KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
N+ G + KT L++LSEQ L+DCD D GC+GG +T + K+ GGLE YPY
Sbjct: 147 NVVGQWFRKTGHLLALSEQPLVDCDYLDGGCDGGYPPQT-NTAIQKM-GGLELASDYPYT 204
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G C ++K ING + E A+ L GP++ A+NA LQ Y G+ P
Sbjct: 205 GVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP- 263
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
CD N H+VL VGYGV K PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 264 -RLCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314
Query: 338 GINDYVRSALV 348
GIN V +A +
Sbjct: 315 GINSIVTTARI 325
>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
Length = 371
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 176/321 (54%), Gaps = 27/321 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+++ K+Y E+ RL +F NLR+ + Q + S +G+ +FSDL+ AEF+
Sbjct: 48 FLSFVQRFGKSYKDADEHAYRLSVFKANLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRR 106
Query: 102 KYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
YLG + R + ++P LP FDWR++ AV VK+Q CGS W+FS
Sbjct: 107 TYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSA 166
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
+G +EG + T KL LSEQ+ +DCD E D GC GG ++ AF + G
Sbjct: 167 SGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQK--AG 224
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
GLE EK YPY G D C+ +K + + VS DE ++ L+++GP+A+ INA +
Sbjct: 225 GLESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYM 284
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEK 325
Q Y+ GVS P + C +L H VL+VGYG K PYWIIKNSWGE WGE
Sbjct: 285 QTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGEN 339
Query: 326 GYFRLYRGD---GSCGINDYV 343
GY+++ RG CG++ V
Sbjct: 340 GYYKICRGSNVRNKCGVDSMV 360
>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
Length = 373
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/332 (39%), Positives = 185/332 (55%), Gaps = 32/332 (9%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F F+ +H + Y+ EY RL +F+ NL + Q + + +G+ FSDL+ EF
Sbjct: 47 AQFAAFVRRHGRRYSGPEEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEF 105
Query: 100 QAKYLGFK---------LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGS 149
+A+ G + L S A + PA ++ LP +FDWR+ AVTGVK Q CGS
Sbjct: 106 EARLTGVRAGAGGDVQRLVMSGAPAAPPASQEEVSRLPASFDWRDKGAVTGVKMQGACGS 165
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTI 200
WAFSTTG +EG T KL+ LSEQ+L+DCD ++GC GG ++NA+ +
Sbjct: 166 CWAFSTTGAVEGANFLATGKLLELSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYAYL 225
Query: 201 MSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAV 259
M GGL E++ YPY G CR + V++ + +V + DE + LV GP+AV
Sbjct: 226 MKS--GGLMEQRAYPYTGAPGPCRFDPAKAAVRVANFTAVPAGDEAQIRAALVRRGPLAV 283
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKN 316
+NA +Q YV GVS P+ C ++H VL+VGYG + ++ PYWIIKN
Sbjct: 284 GLNAAFMQTYVGGVSCPL--LCP--RAWVNHGVLLVGYGARGFAALRLGYR--PYWIIKN 337
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWGE WGE+GY+RL RG CG++ V + V
Sbjct: 338 SWGERWGEQGYYRLCRGSNVCGVDSMVSAVAV 369
>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
Length = 343
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/347 (38%), Positives = 193/347 (55%), Gaps = 31/347 (8%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLH---HVKHT-ALFNYFLEQHNKTYATLVEYYSRLHI 65
V LL L V SS + K+ + VK F +F+++ K Y T EY RL +
Sbjct: 10 VGLLILVVCCSSSNRLDIGKIRQVTDNLEVKDVEGHFKHFMQKFGKVYGTTEEYVHRLKV 69
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV--PAMIPN 123
F NL + L+ + + ++G+ F+DL+ E +++LGF+ +Y++R V ++P
Sbjct: 70 FQANLAHVMSLKKQDP-TAIHGITSFADLTPEEL-SRFLGFR--KAYSNRVVNQAPLLPT 125
Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
LP AFDWRE+ AVT VK Q CGS W FSTTG +EG KT KL+SLSE++LIDCD
Sbjct: 126 DNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANFLKTGKLISLSEEQLIDCDY 185
Query: 184 EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-------RGDDKACRLNKKATQVKING 236
+D+GCEGG + +A++ + ++ GLE E+ YPY + CR I
Sbjct: 186 KDNGCEGGDMLSAYEYVKAR---GLEAEEDYPYEELGYRHKPVRGPCRYQPSKVVATIAN 242
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
Y VS DE +A LV+NGP+++A+ L Y GV+ P C G ++H VL+VG
Sbjct: 243 YSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYEGGVACP--RICPG---EINHGVLLVG 297
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
YGV+ + YW KN+W + +GE GYFRL RG G C +N V
Sbjct: 298 YGVE------NGLRYWTFKNTWTDEFGENGYFRLCRGVGVCDMNSEV 338
>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
Length = 371
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 176/321 (54%), Gaps = 27/321 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+++ K+Y E+ RL +F NLR+ + Q + S +G+ +FSDL+ AEF+
Sbjct: 48 FLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRR 106
Query: 102 KYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
YLG + R + ++P LP FDWR++ AV VK+Q CGS W+FS
Sbjct: 107 TYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSA 166
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
+G +EG + T KL LSEQ+ +DCD E D GC GG ++ AF + G
Sbjct: 167 SGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQK--AG 224
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
GLE EK YPY G D C+ +K + + VS DE ++ L+++GP+A+ INA +
Sbjct: 225 GLESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYM 284
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEK 325
Q Y+ GVS P + C +L H VL+VGYG K PYWIIKNSWGE WGE
Sbjct: 285 QTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGEN 339
Query: 326 GYFRLYRGD---GSCGINDYV 343
GY+++ RG CG++ V
Sbjct: 340 GYYKICRGSNVRNKCGVDSMV 360
>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
Length = 352
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 134/335 (40%), Positives = 180/335 (53%), Gaps = 41/335 (12%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL--QDTEHGSGV-YGLNEFSDLSTAE 98
F F ++NK Y+ EY + F NL I L Q T GS +G+N+F+DLS E
Sbjct: 27 FIAFQNKYNKIYSA-EEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEE 85
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNIT------LPRAFDWR---------EYDAVTGVKD 143
F+ YL K + +P M+PN++ P AFDWR + VT VK+
Sbjct: 86 FKKYYL--SSKEARLTDDLP-MLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVTAVKN 142
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSI 193
Q CGS W+FSTTGN+EG + T LV LSEQ L+DCD + GC+GG
Sbjct: 143 QGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQ 202
Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
NA++ I+ GG++ E TYPY D C+ N KI+ + V ++ET +A YL
Sbjct: 203 PNAYNYIIKN--GGIQTEATYPYTAVDGECKFNSAQVGAKISSFTMVPQNETQIASYLFN 260
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
NGP+A+A +A QFY+ GV F C + L H +LIVGYG T K PYWI
Sbjct: 261 NGPLAIAADAEEWQFYMGGV---FDFPC---GQTLDHGILIVGYGAQDT-IVGKNTPYWI 313
Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
IKNSWG WGE GY ++ R CG+ ++V S++V
Sbjct: 314 IKNSWGADWGEAGYLKVERNTDKCGVANFVSSSIV 348
>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
Length = 338
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 135/357 (37%), Positives = 193/357 (54%), Gaps = 35/357 (9%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHH--VKHTALFNYFLEQHNKTYATLVEYYS 61
+++ V LL+ T VSS +++L + LF+ F+ ++ K YA E S
Sbjct: 5 IFWYGFVCLLATTPIVSS--------MNNLQYDLSNSEVLFDEFVTKYGKVYANDAERKS 56
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL--------KPSYA 113
R +F NL I ++ + S +G+N +SDLS+ E K GFK K Y
Sbjct: 57 RFDVFKANLAIINE-RNAQEESATFGINFYSDLSSNELLRKQTGFKTALHNDNEKKSKYC 115
Query: 114 DRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
R V LP AF+WR+ DAVT VK Q CGS WAFS NIE Y K K+ V L
Sbjct: 116 TRRVITGPSTRLLPEAFNWRDSDAVTSVKQQRDCGSCWAFSAVANIESQYYIKNKQYVDL 175
Query: 174 SEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVK 233
SEQ+++DCD ++GC GG +S A + +M GG++ E+ Y Y G++ C+ N A V+
Sbjct: 176 SEQQIVDCDPINNGCNGGLMSWAMEYVMRS--GGVQLEEDYQYVGNEGVCK-NNSANVVQ 232
Query: 234 INGYVSVS-RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
I+G VS R+E + + LV NGP++VAI+ + Y +G++ L+H+V
Sbjct: 233 ISGCVSYDLRNEERLRELLVSNGPISVAIDVMDVTNYQSGIAKHCSVA-----HGLNHAV 287
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
L+VGYGV PYW+ KNSWG WGE GYFR+ R SCG +N Y +A++
Sbjct: 288 LLVGYGV------QNNTPYWVFKNSWGSDWGENGYFRVLRDVNSCGMLNQYAATAIL 338
>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
[Cucumis sativus]
Length = 381
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 200/375 (53%), Gaps = 50/375 (13%)
Query: 4 FYFFAGVALLSLTVSVSSFMVV-------GDEKL-----------HHLHHVKHTALFNYF 45
F+ FA + ++ T+ S +V GD + HH +H F+ F
Sbjct: 5 FFLFAVITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNHHALGAEHH--FSLF 62
Query: 46 LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG 105
+ K+YAT E+ R IF N+R+ + Q + S ++G+ +FSDL+ EF+ +LG
Sbjct: 63 KRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFD-PSAIHGVTQFSDLTPFEFRKAFLG 121
Query: 106 FK---LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
+ L+ + P ++P LP FDWR++ VT VK+Q CGS W+FSTTG +EG
Sbjct: 122 LRGHRLRLPVDTNAAP-ILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGA 180
Query: 163 YAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+ LSEQ+L+DCD E D GC GG +++AF+ + GGL +E+
Sbjct: 181 ------NFLXLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLK--AGGLMKEQD 232
Query: 214 YPYRGDDK-ACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D+ C +K I + V S DE +A LV+NGP+A+AINA +Q Y+
Sbjct: 233 YPYAGIDRNTCNFDKSKIAASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIG 292
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GVS P F C ++ L H VL+VGYG + YWIIKNSWGE WGE GY+++
Sbjct: 293 GVSCP--FIC---SKRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKI 347
Query: 331 YRGDGSCGINDYVRS 345
RG CG++ V +
Sbjct: 348 CRGRNICGVDSLVST 362
>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
Length = 367
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 120/324 (37%), Positives = 182/324 (56%), Gaps = 31/324 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ-----------LLQDTEHGSGVYGLNE 90
F +FL+Q+NK+Y EY R ++F NL KI D+ S +G+N+
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116
Query: 91 FSDLSTAEFQAKYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
FSD + E GF L S + + P+I LP +DWR+ + VT +KDQ +
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPDIRLPDYYDWRDTNKVTPIKDQGV 176
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
CGS WAF GNIE YA + KL+ LSEQ+L+DCD+ D GC GG + AF ++ L G
Sbjct: 177 CGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELL--LMG 234
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAYA 265
G+E E YPY+G ++ C L+ + VK+N RDE + + + GP+A+A++A
Sbjct: 235 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMD 294
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
+ Y G+ + + +L+H+VL++G+G++ VPYWIIKNSWGE WGE
Sbjct: 295 IINYRRGILNQCHIY------DLNHAVLLIGWGIENN------VPYWIIKNSWGEDWGEN 342
Query: 326 GYFRLYRGDGSCG-INDYVRSALV 348
G+ R+ R +CG +N++ S+++
Sbjct: 343 GFLRVRRNVNACGLLNEFGASSVI 366
>gi|4972585|gb|AAD34707.1|AF071801_1 cysteine proteinase [Paragonimus westermani]
Length = 229
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 113/222 (50%), Positives = 144/222 (64%), Gaps = 10/222 (4%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P DWRE+ AV V++Q CGS WAFS GN+EG + KT +LVSLS+Q+L+DCD D
Sbjct: 17 PERMDWREWGAVGPVENQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLVDCDVMDY 76
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
GC GG +NA+ IM GGLE + YPY G + C LNK+ KI+ + + E +
Sbjct: 77 GCGGGWPTNAYMEIMRM--GGLELQSDYPYVGVQQQCYLNKEKLLAKIDDLIVLGAYEEE 134
Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
A YL E+GP++ A+NA LQFY +G+SHP C +L+H+VL VGY T
Sbjct: 135 HAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECS--PASLNHAVLTVGYD------TE 186
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
VPYWIIKNSWG GWGE GYFRLYRGDG+CGIN + SA++
Sbjct: 187 NGVPYWIIKNSWGTGWGENGYFRLYRGDGTCGINRMITSAII 228
>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
Length = 367
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 171/313 (54%), Gaps = 18/313 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F +++ ++Y T E RL +F N+R+ ++ + +G+ FSDL+ EF+
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91
Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y + A V ++ P P A DWR AVT VKDQ CGS W+FS GN
Sbjct: 92 TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGN 151
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
IEG +AA L SLSEQ L+ CD +D+GC GG + NAF+ I+ + G + EK+YPY
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
G++ C+ I G+V + DE +AKYL +NGP+AVA++A Y GV
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV-- 269
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+E L+H VL+VGY D +K PYWIIKNSW WGEKGY R+ +G
Sbjct: 270 ----VTSCTSEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319
Query: 336 SCGINDYVRSALV 348
C + SA+V
Sbjct: 320 QCLVAQLASSAVV 332
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 129/329 (39%), Positives = 180/329 (54%), Gaps = 29/329 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F F+ +H + Y+ EY RL +F+ NL + Q + + +G+ FSDL+ EF
Sbjct: 58 AQFAAFVRRHGRRYSGPKEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEF 116
Query: 100 QAKYLGFKLKPSYAD--RSVPAMIPN-----ITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
+A+ G + VPA P LP +FDWR+ AVTGVK Q CGS WA
Sbjct: 117 EARLTGLRAGGDVQRLMSGVPAAPPASKEEVARLPASFDWRDKGAVTGVKTQGACGSCWA 176
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSK 203
FSTTG +EG T +LV LSEQ+L+DCD ++GC GG ++NA+ +M
Sbjct: 177 FSTTGAVEGANFLATGELVDLSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYSYLMES 236
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAIN 262
GGL E+ YPY G CR + V++ + +V + DE + LV GP+AV +N
Sbjct: 237 --GGLMEQSAYPYTGAAGPCRFDPTQVAVRVANFTAVPAGDEAQIRAALVRRGPLAVGLN 294
Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWG 319
A +Q YV GVS P+ C ++H VL+VGYG + ++ PYWIIKNSWG
Sbjct: 295 AAFMQTYVGGVSCPL--ICP--RAWVNHGVLLVGYGARGFAALRLGYR--PYWIIKNSWG 348
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ WGE+GY+RL RG CG++ V + V
Sbjct: 349 KQWGEQGYYRLCRGSNVCGVDSMVSAVAV 377
>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 441
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 171/313 (54%), Gaps = 18/313 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F +++ ++Y T E RL +F N+R+ ++ + +G+ FSDL+ EF+
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91
Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y + A V ++ P P A DWR AVT VKDQ CGS W+FS GN
Sbjct: 92 TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIGN 151
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
IEG +AA L SLSEQ L+ CD +D+GC GG + NAF+ I+ + G + EK+YPY
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
G++ C+ I G+V + DE +AKYL +NGP+AVA++A Y GV
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+E L+H VL+VGY D +K PYWIIKNSW WGEKGY R+ +G
Sbjct: 272 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319
Query: 336 SCGINDYVRSALV 348
C + SA+V
Sbjct: 320 QCLVAQLASSAVV 332
>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 452
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 171/313 (54%), Gaps = 18/313 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F +++ ++Y T E RL +F N+R+ ++ + +G+ FSDL+ EF+
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91
Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y + A V ++ P P A DWR AVT VKDQ CGS W+FS GN
Sbjct: 92 TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIGN 151
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
IEG +AA L SLSEQ L+ CD +D+GC GG + NAF+ I+ + G + EK+YPY
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDSKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
G++ C+ I G+V + DE +AKYL +NGP+AVA++A Y GV
Sbjct: 212 GGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+E L+H VL+VGY D +K PYWIIKNSW WGEKGY R+ +G
Sbjct: 272 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319
Query: 336 SCGINDYVRSALV 348
C + SA+V
Sbjct: 320 QCLVAQLASSAVV 332
>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
Length = 500
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 180/318 (56%), Gaps = 32/318 (10%)
Query: 55 TLVEYYSRLHIFSGNL-RKIQLLQDTEHGSGV--YGLNEFSDLSTAEFQAKYLGF----- 106
T EY R+ IF N R I+ D G G +G+ +F DLS EF+ +YLG
Sbjct: 188 TEEEYEKRMEIFQENWKRAIEREIDDRKGGGSAKHGVTKFFDLSEEEFREQYLGLLSTST 247
Query: 107 ---KLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
K ++ + P+ LP+ +DWR AVT VKDQ CGS W FSTTG IEG
Sbjct: 248 SSSASKDAFRKHQMEAPSEEDLEKLPQYYDWRARGAVTPVKDQGQCGSCWTFSTTGAIEG 307
Query: 162 VYAAKTKKLVSLSEQELIDCD---------QEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
KT KLVSLSEQ+L+DCD D GC GG SNA + I+ GGL+ EK
Sbjct: 308 ANFIKTGKLVSLSEQQLLDCDVGCAPDIPNACDSGCNGGLPSNAMEYIVEH--GGLDTEK 365
Query: 213 TYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
+YPY+ + CR + I+ Y V ++ET MA LV+ GP+++ INA +Q YV
Sbjct: 366 SYPYKAYKEDTCRAKEGKLGATISNYTFVGKNETHMAHALVKYGPLSIGINAAWMQSYVG 425
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVD--RTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV+ P + C+ + L H VLIVGYG + HK PYW+IKNSWG GWGE+GY+R
Sbjct: 426 GVACP--WLCN--KDALDHGVLIVGYGEEGFAPARLHKE-PYWVIKNSWGMGWGEEGYYR 480
Query: 330 LYRGDGSCGINDYVRSAL 347
+ + G+CG+N+ V +AL
Sbjct: 481 ICKDKGNCGVNNMVVAAL 498
>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
Length = 347
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 175/320 (54%), Gaps = 27/320 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAE 98
F F ++NK Y + E+ +L F +L++IQ L D + V +G+N+F+DLS E
Sbjct: 30 FREFQLKYNKHYESH-EFAQKLATFKNSLKRIQELNDMAKRAKVDTEFGVNKFADLSKEE 88
Query: 99 FQAKYLGF---------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
F YL P Y+D+ + LP +FDWR AVT VKDQ CGS
Sbjct: 89 FANYYLNKGGMESTDSETYAPDYSDKEIS------NLPTSFDWRTQGAVTPVKDQGQCGS 142
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
W+FSTTGN+EG + L LSEQ L+DC ++DGC GG + A+D I+ G++
Sbjct: 143 CWSFSTTGNVEGQWFLAGNDLTGLSEQNLVDCSTKNDGCNGGLMPLAYDYIVEN--NGID 200
Query: 210 EEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
E +YPY K C+ N KI+GY +VS +ET M LV NGP+++A +A Q+
Sbjct: 201 TEASYPYLAIQQKNCQFNPANIGAKIDGYYNVSSNETQMQINLVNNGPLSIAADAAEWQY 260
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y G+ I C +NL H +LIVGYG T+F + +WIIKNSW WG G+
Sbjct: 261 YKKGIFSGIFGIC---GKNLDHGILIVGYGQQTTEFGTEL--FWIIKNSWSTDWGLSGFM 315
Query: 329 RLYRGDGSCGINDYVRSALV 348
+ RG G CGIN V SA V
Sbjct: 316 LIKRGTGECGINLAVTSAYV 335
>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
Length = 320
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 132/336 (39%), Positives = 186/336 (55%), Gaps = 21/336 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
L V++ F V+G + + + L+ F ++ K+Y+ + Y R +F NL +
Sbjct: 5 LCFLVALGFFGVLG-SNIPESENARQ--LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLR 60
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
I+ Q+ E G+ YG+ +FSDL+ EF+ +YL K DR I FDW
Sbjct: 61 IKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDREPVPFIRMDVDDDNFDW 120
Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
R + AV V DQ CGS WAFS GNIEG + KT L+ LSEQ+L+DCD D+GC GG+
Sbjct: 121 RNHGAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDGVDEGCNGGT 180
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
AF I+ GGL+ + YPY G + CR+ +V ING + DE A+ L
Sbjct: 181 PQQAFRQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLK 238
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
E GP++ A+NA LQ HP+ CD ++L+H+VL VGYG +PYW
Sbjct: 239 ETGPLSSALNALFLQ-------HPLPALCDA--QSLNHAVLTVGYG------KEGRLPYW 283
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+KNSW +GE GYFR+YRGDG+CGIN V ++++
Sbjct: 284 TVKNSWSTMFGENGYFRIYRGDGTCGINTLVSTSII 319
>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
Length = 343
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 126/314 (40%), Positives = 180/314 (57%), Gaps = 33/314 (10%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +F+++ K Y T EY RL +F NL + L+ + + ++G+ F+DL+ E +
Sbjct: 46 FKHFMQKFGKVYGTTEEYVHRLKVFQANLVHVMSLKKQDP-TAIHGITSFADLTPEEL-S 103
Query: 102 KYLGFKLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
++LGF+ +Y++R V ++P LP AFDWRE+ AVT VK Q CGS W FSTTG +
Sbjct: 104 RFLGFR--KAYSNRVVNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVV 161
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY--- 216
EG KT KL+SLSE++LIDCD +D+GCEGG + +A++ + ++ GLE ++ YPY
Sbjct: 162 EGANFLKTGKLISLSEEQLIDCDYKDNGCEGGDMLSAYEYVKAR---GLEADEDYPYEEL 218
Query: 217 -------RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
RG CR I Y VS DE +A LV+NGP+++A+ L Y
Sbjct: 219 GYRHKPVRG---PCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTY 275
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV+ P C G ++H VL+VGYGV+ + YW KNSW + +GE GYFR
Sbjct: 276 EGGVACP--RICPG---EINHGVLLVGYGVE------NGLRYWTFKNSWTDEFGENGYFR 324
Query: 330 LYRGDGSCGINDYV 343
L RG G C + V
Sbjct: 325 LCRGVGVCDMTSEV 338
>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
Length = 292
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 121/288 (42%), Positives = 168/288 (58%), Gaps = 24/288 (8%)
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV--PAMIPNITLPR 128
R+ Q L T V+G+ +FSDL+ EF+ YLG + + S ++P LP
Sbjct: 5 RRHQQLDPT----AVHGVTQFSDLTPGEFKRTYLGLRKGKKHLVGSAHEAPLLPTNDLPE 60
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---- 184
FDWR+ AVTGVK+Q CGS W+FST+G +EG T KL +LSEQ+++DCD E
Sbjct: 61 DFDWRDKGAVTGVKNQGSCGSCWSFSTSGALEGANFLATGKLETLSEQQMVDCDHECDAE 120
Query: 185 -----DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYV 238
D GC GG ++ AF + GGLE EK YPY G D+ C+ ++ + ++ +
Sbjct: 121 EPDDCDQGCNGGLMNTAFQYLQKV--GGLESEKDYPYTGTDRGTCKFDESKIKASVHNFS 178
Query: 239 SVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
VS DE +A LV++GP+A+AINA +Q Y+ GVS P + C ++L H VL+VGYG
Sbjct: 179 VVSIDEEQIAANLVKHGPLAIAINAVFMQTYIGGVSCP--YIC---GKHLDHGVLLVGYG 233
Query: 299 -VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
K PYWIIKNSWGE WGE GY+++ RG CG++ V +
Sbjct: 234 SAGYAPIRLKEKPYWIIKNSWGETWGENGYYKICRGRNVCGVDSMVST 281
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 129/343 (37%), Positives = 187/343 (54%), Gaps = 23/343 (6%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
A +L + V + F + L ++ +F + +H K+Y++ E RL I
Sbjct: 1 MIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMI 60
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI- 124
FS L I+ + + GLN+FSDL+ AEF+A ++G +P Y DR +PA ++
Sbjct: 61 FSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDR-LPAEDEDVD 119
Query: 125 --TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
+LP + DWR+ AVT +KDQ CGS WAFS +IE + TK+LVSLSEQ+L+DCD
Sbjct: 120 VSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCD 179
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSVS 241
D GC+GG + AF ++ GG+ E YPY G +C NK +V +I G+ V+
Sbjct: 180 TVDAGCDGGLMETAFKFVVKN--GGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVT 237
Query: 242 RDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
D D V P+ V+I + Q Y +G+ + CD ++L H VL++GYG
Sbjct: 238 EDSADALMKAVSKTPVTVSICGSDENFQNYKSGI---LSGKCD---DSLDHGVLLIGYG- 290
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--GDGSCGIN 340
T +PYWIIKNSWG WGE G+ ++ R GDG CG+N
Sbjct: 291 -----TEGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGMCGMN 328
>gi|432091112|gb|ELK24324.1| Cathepsin W [Myotis davidii]
Length = 370
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 176/323 (54%), Gaps = 19/323 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q+N++Y+ EY RL IF+ NL Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 41 VFTLFQIQYNRSYSNPAEYAHRLDIFARNLAHAQRLQEEDLGTAEFGVTAFSDLTEEEFD 100
Query: 101 AKYLGFKL--KPSYADRSVPAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSWAFSTTG 157
Y + + DR V + ++P DWR+ V + VKDQ C WA + G
Sbjct: 101 QLYGNQRAAGRAPNVDREVGSDEWQESVPSTCDWRKAPGVMSPVKDQKTCSCCWAMAAAG 160
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
NIE + KT++ V +S QEL+DC + DGC GG + +AF T+++ GL EK YP++
Sbjct: 161 NIEAQWGIKTRQSVEVSVQELLDCGRCGDGCSGGFVWDAFITVLNN--SGLASEKDYPFQ 218
Query: 218 GDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +A C+ K I ++ +S +E +A YL GP+ V IN LQ Y GV
Sbjct: 219 GAVRAKCQAKKHKKVAWIQDFIMLSDNEQRIAWYLATEGPITVTINKKLLQQYQNGVIKA 278
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRT-----------KFTHKAVPYWIIKNSWGEGWGEK 325
Q CD +N+ H VL+VG+G ++ ++ PYWI+KNSWG WGEK
Sbjct: 279 TQTTCD--PQNVDHVVLLVGFGKTKSVEGRQAKGVPGHSRRRSTPYWILKNSWGANWGEK 336
Query: 326 GYFRLYRGDGSCGINDYVRSALV 348
GYFRL+RG +CGI Y +A V
Sbjct: 337 GYFRLHRGSNACGITKYPITARV 359
>gi|339244637|ref|XP_003378244.1| cathepsin F [Trichinella spiralis]
gi|316972865|gb|EFV56511.1| cathepsin F [Trichinella spiralis]
Length = 317
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 120/282 (42%), Positives = 163/282 (57%), Gaps = 20/282 (7%)
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
LQ E G+ +YG F+D++ EF+ YL + + A++ + P FDWR Y
Sbjct: 3 LQQQEKGTAIYGPTIFADMTQDEFRKTYLNMLETSALLPKQRIALL-KVDRPNKFDWRNY 61
Query: 136 DAVTGVKDQTM----------CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
+ VT VK Q CGSSWAFST NIE +A K L+SLSEQ++IDCD+ +
Sbjct: 62 NVVTKVKRQVWHKMQKKFLGKCGSSWAFSTIANIESAWAIKFGDLISLSEQQIIDCDKIN 121
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
GC GG A+ I+ G++ E YPY G +C+LNK+ +V IN V + ++ET
Sbjct: 122 RGCRGGQPLKAYHEIIRM--SGVQAESDYPYTGLHGSCKLNKEKIKVYINDTVLLHKNET 179
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN-LSHSVLIVGYGVDRTKF 304
+A YL E+GP+AV +NA L Y G+ P + C N N L+H I+GYG + +
Sbjct: 180 TIANYLYEHGPVAVRMNADILMLYRKGIIKPTKSSC---NPNFLNHGATIIGYG--KESW 234
Query: 305 TH-KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
H + PYWIIKNSWG WGE GYFRLYRG+ +CG+N V S
Sbjct: 235 LHWWSNPYWIIKNSWGVDWGENGYFRLYRGNEACGVNRMVTS 276
>gi|149725427|ref|XP_001494683.1| PREDICTED: cathepsin W-like [Equus caballus]
Length = 373
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 125/325 (38%), Positives = 184/325 (56%), Gaps = 22/325 (6%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q+N++Y++ EY RL IF+ NL + Q LQ+ + G+ +G++ FSDL+ EF
Sbjct: 41 VFTLFQIQYNRSYSSPAEYAHRLDIFARNLAQAQRLQEDDLGTAEFGVSPFSDLTEEEFG 100
Query: 101 AKYLGFKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSWAFSTT 156
Y G + + A R V + T+P+ DW++ V + VK+Q MC WA +
Sbjct: 101 QLY-GHRRAAAGAPHVGRKVESEKWEKTVPQTCDWQKAAGVISSVKNQEMCNCCWAMAAA 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++A + V +S Q+L+DCD+ +GC+GG + +AF T+++ GL EK YP+
Sbjct: 160 GNIEALWAITYHQSVEVSIQQLLDCDRCGNGCKGGFVWDAFLTVLNN--SGLASEKDYPF 217
Query: 217 RGDDKACRLNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
RGD K R K +V I ++ + DE +A+YL +GP+ V IN LQ Y GV
Sbjct: 218 RGDAKPHRCQAKKPKVAWIQDFIRLPEDEQKIAEYLATHGPITVTINMKLLQQYQKGVIK 277
Query: 276 PIQFFCDGGNENLSHSVLIVGYG------------VDRTKFTHKAVPYWIIKNSWGEGWG 323
CD ++L HSVL+VG+G V ++ YWI+KNSWG WG
Sbjct: 278 ATPTTCD--PQHLDHSVLLVGFGGGKSVEGRRPGAVSSQSRPRRSSSYWILKNSWGAKWG 335
Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
E+GYFRL+RG +CGI Y +ALV
Sbjct: 336 EEGYFRLHRGSNTCGITKYALTALV 360
>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 329
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 183/335 (54%), Gaps = 33/335 (9%)
Query: 36 VKHTALFNYFLEQHNKTYAT-LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
+H F+ F+ +H KTYA+ EY RL IF+ N+ + + + + YG F+DL
Sbjct: 2 TRHERDFDAFVLEHGKTYASDAKEYAKRLEIFAENMARAKEM--SARDGAEYGATPFADL 59
Query: 95 STAEFQAKYLGF---------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
+ EF + L +LK + R +P +P +P FDWR AVT VK+Q
Sbjct: 60 TEDEFASSLLMREPIDAARVERLKRHESSRVLP-HLPTENIPLNFDWRALGAVTPVKNQG 118
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
MCGS W+FS TG +EG + K+ LVSLSEQ+L+DCD D GC+GG +NA
Sbjct: 119 MCGSCWSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDGGLPANA 178
Query: 197 FDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
++ + GGL+ E YPY RGD + I Y VS DE+ +A LV+
Sbjct: 179 MAYVVKR--GGLDAEAAYPYLGARGDGRCKSKEDGPPAATITNYSFVSADESQIAAALVK 236
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH--KAVPY 311
+GP++V I+A +Q Y GV+ P + CD L H VLIVG+G + + P+
Sbjct: 237 HGPLSVGIDARWMQLYRRGVACP--WACD--KTRLDHGVLIVGFGAEGRAPARGFRREPF 292
Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
W+IKNSWG WGE+GY+++ + GSCG+N V +A
Sbjct: 293 WLIKNSWGARWGEEGYYKICKDKGSCGVNTMVLAA 327
>gi|633096|dbj|BAA04664.1| prepro NTP [Paragonimus westermani]
Length = 245
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 116/253 (45%), Positives = 154/253 (60%), Gaps = 12/253 (4%)
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
T EF AKYL + R P + P DWR AVT V++Q CGS WAFST
Sbjct: 3 TPEFAAKYLSAPVNNDQVKRVRPTGLK--AAPERMDWRAKGAVTPVENQGECGSCWAFST 60
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
GN+EG + KT +LVSLS+Q+L+DCD +GC GG ++++ IM GGLE E YP
Sbjct: 61 AGNVEGQWFIKTGQLVSLSKQQLVDCDMAAEGCNGGWPASSYLEIMYM--GGLESESDYP 118
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y G ++ C LNK+ KI+ + + +E D A YL E+GP++ +NA ALQ+Y +GV
Sbjct: 119 YVGVEQTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLK 178
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
P F + + L+H+VL VGY + +PYWIIKNSWG WGEKGYFRL+RGD
Sbjct: 179 PT--FEECPDTELNHAVLTVGYDKEGD------MPYWIIKNSWGTDWGEKGYFRLFRGDC 230
Query: 336 SCGINDYVRSALV 348
+CGIN SA++
Sbjct: 231 TCGINRMATSAII 243
>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
Length = 293
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 124/293 (42%), Positives = 166/293 (56%), Gaps = 22/293 (7%)
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG-FKLKPSYADR-----SVPAMIPN 123
L + Q + GS +G+ FSDL+ EF +YLG KL + ++ V +P
Sbjct: 3 LIRAATQQANDRGSAKHGVTRFSDLTPEEFAERYLGHVKLSSEHREKVRARGGVIEDLPT 62
Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD- 182
LP FDWR AV+ VKDQ CGS W FSTTG IEG + T KLV LSEQ+L+DCD
Sbjct: 63 KHLPAEFDWRFKGAVSRVKDQGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLLDCDV 122
Query: 183 --------QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI 234
D GC GG SNA + I+ GG++ EK+YPY G+ C+ ++ +
Sbjct: 123 GCDPDVPNACDSGCNGGLPSNAMEYIVEH--GGIDTEKSYPYVGEKGECKADEGTLGATL 180
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
+ VS DE MA LV++GP+++ INA +Q Y+ GV+ P + CD +E L H VLI
Sbjct: 181 KNFSYVSSDEKQMAAALVKHGPLSIGINAAWMQTYIGGVACP--WLCD--SEALDHGVLI 236
Query: 295 VGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
VGYG + PYWI+KNSW WGE GY+R+ + GSCGIN+ V +A
Sbjct: 237 VGYGSSGFAPVRWQQEPYWIVKNSWSPAWGEGGYYRICKDKGSCGINNMVVAA 289
>gi|118197532|ref|YP_874244.1| cathepsin [Ectropis obliqua NPV]
gi|113472527|gb|ABI35734.1| cathepsin [Ectropis obliqua NPV]
Length = 299
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 177/312 (56%), Gaps = 25/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F+ +NK Y +E R IF NLR I + ++ +GS VY +N+FSDLST+E KY
Sbjct: 2 FVANYNKMYDDDLEKTKRYSIFRDNLRDINI-KNKLNGSAVYRINKFSDLSTSEIVLKYT 60
Query: 105 GFKLKPSYADRSVPAMIPNITL-------PRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + P+ +R I L P FDWR + VT +K+Q +CG+ WAF+T
Sbjct: 61 GLSVPPT--ERLTTNFCKTIVLDQPPGKGPLNFDWRHQNKVTSIKNQGVCGACWAFATLA 118
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
+IE YA K ++LSEQ++IDCD D GC+GG + AF+ ++ GG++ E YPY
Sbjct: 119 SIESQYAIKHNVQINLSEQQMIDCDYVDMGCDGGLLHTAFEQMIEM--GGVKHEHEYPYE 176
Query: 218 GDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G + CRLN VKI G Y + E + L GP+ +AI+A + Y GV +
Sbjct: 177 GINMNCRLNDDNFAVKIIGCYRYIVLQEEKLKDLLRAVGPIPIAIDASGIANYYQGVIN- 235
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+C+ N L+H+VL+VGYGV+ +PYW IKN+WGE WGE GYFR+ + +
Sbjct: 236 ---YCE--NHGLNHAVLLVGYGVENN------IPYWTIKNTWGEDWGENGYFRVRQNINA 284
Query: 337 CGINDYVRSALV 348
CG+ + + S+ V
Sbjct: 285 CGMTNELASSAV 296
>gi|398010921|ref|XP_003858657.1| cathepsin L-like protease, partial [Leishmania donovani]
gi|322496866|emb|CBZ31937.1| cathepsin L-like protease, partial [Leishmania donovani]
Length = 345
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 182/319 (57%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213
Query: 214 YPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C +K +I+GYV + +ET MA +L ENGP+A+A++A + Y
Sbjct: 214 YPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + L+H VL+VGY ++T VPYW+IKNSWGE WGEKGY R
Sbjct: 274 QSGVLTS----CAG--DALNHGVLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VAMGRNACLLSEYPVSAHV 340
>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
Length = 364
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 117/281 (41%), Positives = 164/281 (58%), Gaps = 26/281 (9%)
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAFDWREYDA 137
+ +G+ +FSDL+ EF+ ++LG + +PS + ++P LP FDWRE+ A
Sbjct: 81 TATHGVTKFSDLTPGEFRDRFLGLR-RPSLEGLVGGEPHEAPILPTDGLPDDFDWREHGA 139
Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGC 188
V VKDQ CGS W+FST+G +EG + T KL LSEQ+++DCD E D GC
Sbjct: 140 VGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGC 199
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMA 248
GG ++ AF +M GGL+ EK YPY G + C+ +K ++ + +S +E +A
Sbjct: 200 NGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDKSKIVAQVKNFSVISVNEDQIA 257
Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHK 307
LV++GP+A+AINA +Q Y+ GVS P F C +L H VL+VGYG K
Sbjct: 258 ANLVKHGPLAIAINAAYMQTYIGGVSCP--FIC---GRHLDHGVLLVGYGSAGYAPIRFK 312
Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGINDYVRS 345
PYWIIKNSWGE WGEKGY+++ RG CG++ V S
Sbjct: 313 EKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSS 353
>gi|33622213|ref|NP_891858.1| cathepsin [Cryptophlebia leucotreta granulovirus]
gi|33569322|gb|AAQ21608.1| cathepsin [Cryptophlebia leucotreta granulovirus]
Length = 332
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 133/352 (37%), Positives = 203/352 (57%), Gaps = 33/352 (9%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+ LSL + VS+F + + +++L + LF+ F++Q+NKTY T E + F N
Sbjct: 1 MKFLSLFLLVSAFSFI-ESVIYNLE--QSEKLFDSFVKQYNKTYLTEEERMIKFDNFKNN 57
Query: 70 LRKIQLLQDTEHGS--GVYGLNEFSDLSTAEFQAKYLGFKL--KPSYADRSVPAM----- 120
LR ++ + GS V+ +N++SDL+ + GFKL K +Y+ +V
Sbjct: 58 LR---IINEKNRGSKHAVFDINKYSDLNKNDLLRHTTGFKLGLKKNYSFTTVKECGVVEI 114
Query: 121 --IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
P + LP FDWR+ VT VK+Q +CGS WAFST GNIE +Y K K++ LSEQ L
Sbjct: 115 KEEPQVLLPETFDWRDKHGVTPVKNQLICGSCWAFSTIGNIESLYNIKYDKVIDLSEQHL 174
Query: 179 IDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYV 238
I+CD ++GC GG + A + I+ + GGG+ E+ PY G D C+ K ++ I+G
Sbjct: 175 INCDLVNNGCNGGLMHWALENILQE-GGGVVSEENDPYYGLDSVCK--KTPWELNISGCK 231
Query: 239 S-VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
+ ++E + + LV NGP++VAI+ + Y +G++ C+ N L+H+VL+VGY
Sbjct: 232 RYILQNENKLKELLVVNGPISVAIDVSDVINYKSGIAD----ICENNN-GLNHAVLLVGY 286
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
G + VPYWI+KNSWG WGE G+FR+ R SCG +N+Y SA++
Sbjct: 287 G------EYDEVPYWILKNSWGIEWGEDGFFRIQRNKNSCGLLNEYASSAVL 332
>gi|194352748|emb|CAQ00102.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 127/328 (38%), Positives = 178/328 (54%), Gaps = 28/328 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F F+ +H K Y+ EY RL +F+ N+ + Q + G+ +G+ FSDL+ EF
Sbjct: 48 AQFAAFVRRHGKEYSGPEEYARRLRVFAANVARAAAHQALDPGA-RHGVTPFSDLTREEF 106
Query: 100 QAKYLGFK-----LKPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
+A+ G L+ + + LP +FDWR+ AVT VK Q +CGS WA
Sbjct: 107 EARLTGLVGAGDVLRSARRMPAAAPATEEEVAALPASFDWRDKGAVTDVKMQGVCGSCWA 166
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD---------GCEGGSISNAFDTIMSK 203
FSTTG +EG T KL+ LSEQ+L+DCD D GC GG ++NA+ +MS
Sbjct: 167 FSTTGAVEGANFVATGKLLDLSEQQLVDCDHTCDAVAKTECNSGCSGGLMTNAYRYLMSS 226
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
GGL E+ YPY G CR ++ V++ + +V DE M LV GP+AV +NA
Sbjct: 227 --GGLMEQAAYPYTGAQGPCRFDRGKVAVRVANFTAVPLDEDQMRAALVRGGPLAVGLNA 284
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGE 320
+Q YV GVS P+ C ++H VL+VGYG + ++ PYW+IKNSWG
Sbjct: 285 AFMQTYVGGVSCPL--ICP--RAMVNHGVLLVGYGARGFSALRLGYR--PYWLIKNSWGA 338
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE GY++L RG CG++ V + V
Sbjct: 339 QWGEGGYYKLCRGRNVCGVDSMVSAVAV 366
>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 454
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 122/313 (38%), Positives = 170/313 (54%), Gaps = 18/313 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F +++ ++Y T E RL +F N+R+ ++ + +G+ FSDL+ EF+
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91
Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y + A V ++ P P A DW AVT VKDQ CGS W+FS GN
Sbjct: 92 TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWGRKGAVTPVKDQGTCGSCWSFSAIGN 151
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
IEG +AA L SLSEQ L+ CD +D+GC GG + NAF+ I+ + G + EK+YPY
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
G++ C+ I G+V + DE +AKYL +NGP+AVA++A Y GV
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+E L+H VL+VGY D +K PYWIIKNSW WGEKGY R+ +G
Sbjct: 272 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319
Query: 336 SCGINDYVRSALV 348
C + SA+V
Sbjct: 320 QCLVAQRASSAVV 332
>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
Length = 348
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 177/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S LV
Sbjct: 322 VTMGVNACLLTGYPVSVLV 340
>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 177/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYTSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S LV
Sbjct: 322 VTMGVNACLLTGYPVSVLV 340
>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
Length = 348
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 117/281 (41%), Positives = 163/281 (58%), Gaps = 26/281 (9%)
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAFDWREYDA 137
+ +G+ +FSDL+ EF+ + LG + +PS + ++P LP FDWRE+ A
Sbjct: 65 TATHGVTKFSDLTPGEFRDRLLGLR-RPSLEGLVGGEPHEAPILPTDGLPDDFDWREHGA 123
Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGC 188
V VKDQ CGS W+FST+G +EG + T KL LSEQ+++DCD E D GC
Sbjct: 124 VGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGC 183
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMA 248
GG ++ AF +M GGL+ EK YPY G + C+ +K ++ + +S +E +A
Sbjct: 184 NGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDKSKIVAQVKNFSVISVNEDQIA 241
Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHK 307
LV++GP+A+AINA +Q Y+ GVS P F C +L H VL+VGYG K
Sbjct: 242 ANLVKHGPLAIAINAAYMQTYIGGVSCP--FIC---GRHLDHGVLLVGYGSAGYAPIRFK 296
Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGINDYVRS 345
PYWIIKNSWGE WGEKGY+++ RG CG++ V S
Sbjct: 297 EKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSS 337
>gi|354494740|ref|XP_003509493.1| PREDICTED: cathepsin W-like [Cricetulus griseus]
gi|344243260|gb|EGV99363.1| Cathepsin W [Cricetulus griseus]
Length = 376
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 124/369 (33%), Positives = 191/369 (51%), Gaps = 46/369 (12%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+ALL+ + ++ + D L ++ +F F ++N++YA EY RL+IF+ N
Sbjct: 11 LALLTASQGLNDSFLTKDTGPRPLELIE---VFKLFQIKYNRSYANPAEYARRLNIFAHN 67
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT---- 125
L + Q LQ+ + G+ +G FSDL+ EF Y + P IPN+
Sbjct: 68 LAQAQRLQEEDLGTAEFGETPFSDLTEEEFGQ---------LYGQQKAPKRIPNMVKKAG 118
Query: 126 -------LPRAFDWRE-YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
+P DWR+ + ++ +K+Q C WA + NIE ++ KT+ V +S QE
Sbjct: 119 SEKWGQPVPSTCDWRKATNIISSIKNQKTCRCCWAIAAADNIEALWRIKTQHFVEVSVQE 178
Query: 178 LIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG--DDKACRLNKKATQVKIN 235
L+DC++ +GC+GG + +A+ T+++ GL EK YP++G + C N+ I
Sbjct: 179 LLDCERCGNGCDGGFVWDAYMTVLN--NSGLASEKDYPFKGYPNPHGCLANRYKKVAWIQ 236
Query: 236 GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
+ + RDE +A YL +GP+ V IN LQ Y GV CD + + HSVL+V
Sbjct: 237 DFTMLGRDEQVIAGYLATHGPITVTINMKLLQGYQKGVIKATPTTCDP--QQVDHSVLLV 294
Query: 296 GYG----------------VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
G+G + + ++VPYWI+KNSWG WGEKGYFRLYRG+ SCGI
Sbjct: 295 GFGKGKEKEDIQSGTILSQTRKPRKPRRSVPYWILKNSWGAEWGEKGYFRLYRGNNSCGI 354
Query: 340 NDYVRSALV 348
Y +A +
Sbjct: 355 TKYPITACL 363
>gi|290999038|ref|XP_002682087.1| predicted protein [Naegleria gruberi]
gi|284095713|gb|EFC49343.1| predicted protein [Naegleria gruberi]
Length = 349
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 179/342 (52%), Gaps = 47/342 (13%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-------YGLNEFSDL 94
F +F + + K YAT E++ R IF N+ + L + + YG+ +F D+
Sbjct: 15 FQHFKKLYLKRYATEEEHHRRWKIFYDNINLVNQLNIMHKPNEIAGKPVAQYGITQFMDM 74
Query: 95 STAEFQAKYLGFKLKPSYADRSV------PAMIPNI-TLPRAFDWREYDAVTGVKDQTMC 147
S EF KL P + + P I LP +FDWRE+ AVT VKDQ C
Sbjct: 75 SPNEFAR----VKLLPPTKQKDINHTPTAPKEKYQIDALPESFDWREHGAVTAVKDQASC 130
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAFST NIEG Y L S Q+L+DCD + GC GG A I + GG
Sbjct: 131 GSCWAFSTVENIEGAYFLAGHNLTKFSPQQLVDCDNLNCGCFGGFPFIAMQYIQKR--GG 188
Query: 208 LEEEKTYPY----RGDDKACRLNKKATQ-----------------VKINGYVSVSRDETD 246
L E +YPY G+ C NK K+ GY +VS++E D
Sbjct: 189 LATESSYPYCIPPLGNCFPCNTNKTYCPSGEYCNRTCSVQNYQLVAKVAGYENVSQNEDD 248
Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
+A YLV+NGP+++ +NA LQFY +G+S P+ +C ++ H+VL+VG+G T +
Sbjct: 249 IAAYLVKNGPLSICLNAMWLQFYHSGISDPM--YCP---PDIDHAVLLVGFGT-HTNWLG 302
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ YWI+KNSWGE WGEKGYFRL RG CGIN V +A+V
Sbjct: 303 EKTNYWIVKNSWGESWGEKGYFRLIRGKDKCGINTMVANAIV 344
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 126/336 (37%), Positives = 179/336 (53%), Gaps = 30/336 (8%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
L SL V S ++ ++ +H F F +H KTY E R IF NLR
Sbjct: 6 LASLLVVAVSATLLKEDGVH----------FQSFKLKHGKTYKNQAEETKRFAIFRENLR 55
Query: 72 KIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKY-LGFKLKPSYADRSVPAMIPNITL 126
KI+ + E+ G++ G+N+F+D++ AEF+A K KPS + +++
Sbjct: 56 KIEA-HNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKTFQLADGVSV 114
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P + DWR + VT +KDQ CGS W+F+ G+ EG YA T KL SEQ+L+DC + +
Sbjct: 115 PESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLN 174
Query: 187 -GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
GC+GG + + F I + GLE E YPY G D +C + K++ YVSV +E
Sbjct: 175 YGCDGGYLDDTFPYIQTN---GLELESDYPYTGYDGSCSYDSSKVVTKVSSYVSVPANEQ 231
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+ + + GP+A+AINA LQFY +G+ +CD E L H VL VGY +
Sbjct: 232 ALLEAVGTAGPVAIAINADDLQFYFSGIID--DKYCD--PEWLDHGVLAVGYN------S 281
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
+ YW+IKNSWG WGE GYFR RG CG+ +
Sbjct: 282 ENGLDYWLIKNSWGADWGESGYFRFLRGQNICGVKE 317
>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
Length = 348
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKACADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVSTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 129/345 (37%), Positives = 187/345 (54%), Gaps = 25/345 (7%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
A +L + V + F + L ++ +F + +H K+Y++ +E RL I
Sbjct: 5 MIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLMI 64
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI- 124
FS L I+ + + GLN+FSDL+ AEF+A ++G +P Y DR +PA ++
Sbjct: 65 FSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDR-LPAEDEDVD 123
Query: 125 --TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
+LP + DWR+ AVT +KDQ CGS WAFS +IE + TK+LVSLSEQ+L+DCD
Sbjct: 124 VSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCD 183
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVK---INGYVS 239
D GC+GG + AF ++ GG+ E +YPY G +C NK A K I G+
Sbjct: 184 TVDAGCDGGLMETAFKFVVKN--GGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFKV 241
Query: 240 VSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
V+ D D V P+ V+I + Q Y +G+ + C ++L H VL++GY
Sbjct: 242 VTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGI---LSGQC---GDSLDHGVLLIGY 295
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--GDGSCGIN 340
G T +PYWIIKNSWG WGE G+ ++ R GDG CG+N
Sbjct: 296 G------TEGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGICGMN 334
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 123/320 (38%), Positives = 176/320 (55%), Gaps = 26/320 (8%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F H + YA+ E R IF+ N++K L + ++ +G NEF+D+S+ EFQ
Sbjct: 24 LFRDFKTTHARNYASADEERKRFEIFAANMKKAAEL-NRKNPMATFGPNEFADMSSEEFQ 82
Query: 101 AK------YLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
+ Y +P ++ N + + DWR AVT VK+Q CGS W+FS
Sbjct: 83 TRHNAARHYAAVMARPPKNTKTFTEEEINAAVGQKVDWRLKGAVTPVKNQGSCGSCWSFS 142
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TTGNIEG +A T +LVSLSEQEL+ CD DDGC GG + NAF ++S G + E +Y
Sbjct: 143 TTGNIEGQHAIATGQLVSLSEQELVSCDTVDDGCSGGLMDNAFGWLLSAHNGQITTEASY 202
Query: 215 PY---RGDDKACRLNKKATQV--KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
PY G AC N + V I + + + E DMA ++ + GP+++ ++A + Q Y
Sbjct: 203 PYVSGNGIVPACTFNSNSNPVGATITSFHDIPKTERDMAAFVFKYGPLSIGVDASSWQSY 262
Query: 270 VTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
+ G+ SH C + + H VLIVG+ D T T PYWIIKNSW WGE+GY
Sbjct: 263 IGGILSH-----CS--DVQIDHGVLIVGF--DDTAST----PYWIIKNSWSSMWGEQGYI 309
Query: 329 RLYRGDGSCGINDYVRSALV 348
R+ +G CG+ + S++V
Sbjct: 310 RVAKGSNQCGLTSFPSSSVV 329
>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
distachyon]
Length = 373
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/332 (39%), Positives = 187/332 (56%), Gaps = 35/332 (10%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSR-LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
A F F+ +H K Y+ E Y+R L +F+ NL + Q + G+ +G+ FSDL+ E
Sbjct: 52 AKFAAFVRRHGKEYSGGAEEYARRLRVFAANLARAAAHQALDPGA-RHGVTPFSDLTPEE 110
Query: 99 FQAKYLGFKLK------PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
FQA+ G + + P+ A + + TLP +FDWR AVT VK Q MCGS WA
Sbjct: 111 FQARLTGLQQQGTNNNMPAAARATAEELA---TLPASFDWRAKGAVTEVKMQGMCGSCWA 167
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSK 203
FSTTG +EG + T KL++LSEQ+L+DCD D GC GG ++NA+ ++
Sbjct: 168 FSTTGAVEGAHFVATGKLLNLSEQQLVDCDHTCDAVAKNECDSGCSGGLMTNAYTYLIR- 226
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY-LVENGPMAVAIN 262
GGL E+ YPY G CR + V++ + +V D+ D + LV GP+AV +N
Sbjct: 227 -AGGLMEQAAYPYTGAQGTCRFDANKVAVRVTSFTAVPPDDEDQIRASLVRAGPLAVGLN 285
Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWG 319
A +Q Y+ GVS P+ C + ++H VL+VGY G+ + ++ PYWIIKNSWG
Sbjct: 286 AAFMQTYLGGVSCPL--LCP--RKLINHGVLLVGYGARGLAPLRLGYR--PYWIIKNSWG 339
Query: 320 EGWGEKGYFRLYRGDGS---CGINDYVRSALV 348
+ WGE GY+RL RG + CG++ V + V
Sbjct: 340 KEWGEGGYYRLCRGARNRNVCGVDSMVSAVAV 371
>gi|15824691|gb|AAL09443.1| cysteine protease [Leishmania donovani]
Length = 443
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 180/319 (56%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWARVGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213
Query: 214 YPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C +K +I+GYV + +ET MA +L ENGP+A+A++A + Y
Sbjct: 214 YPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV + L+H VL+VGY ++T VPYW+IKNSWGE WGEKGY R
Sbjct: 274 QSGV------LTSCAGDALNHGVLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VAMGKNACLLSEYPVSAHV 340
>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
Length = 347
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 176/335 (52%), Gaps = 49/335 (14%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAE 98
F F ++NK Y + E+ + F NL +I L SG +G+NEF+DLS E
Sbjct: 27 FRDFQVKYNKVYGSH-EFSQKFVTFKDNLNRIDTLNANAAASGSDTKFGVNEFADLSVQE 85
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPN-------------ITLPRAFDWREYDAVTGVKDQT 145
F+ Y+ +VPA +P+ ++P +FDWR AVT VK+Q
Sbjct: 86 FRKFYM----------NAVPASVPSDAQVAGDYSDETLASIPSSFDWRTKGAVTPVKNQG 135
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSISN 195
CGS W+FSTTGN+EG + L LSEQ L+DCD DDGC GG N
Sbjct: 136 QCGSCWSFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHHCMTYDGQQSCDDGCNGGLQPN 195
Query: 196 AFDTIMSKLGGGLEEEKTYPYR--GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
AF I+ GG++ E +YPY DK C+ KI+ + +S +ET +A YL
Sbjct: 196 AFQYIIGN--GGIDTETSYPYLAVAQDK-CQFKASNIGAKISNWQMLSTNETQIAAYLAL 252
Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
NGP+++A +A QFY+ GV C + L H +LIVGY + F H A PYW
Sbjct: 253 NGPVSIAADAAEWQFYIGGV---FDLPC---GKALDHGILIVGYDTETNIFGH-AKPYWW 305
Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+KNSWG WGE+GY ++ RG G CG+N +V ++ V
Sbjct: 306 VKNSWGASWGEQGYLKVLRGAGECGLNTFVSTSCV 340
>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
Length = 327
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 186/321 (57%), Gaps = 30/321 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F++++NK+Y++ E + F N+R I +++ S VY +N +SD++ E
Sbjct: 24 LFEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINE-KNSLSNSAVYDINFYSDMNKNELL 82
Query: 101 AKYLGFK-------LKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSS 150
K GFK L S+ + +I P + LP +FDWR+ +T VK+Q CGS
Sbjct: 83 RKQTGFKINLKKNNLDLSWNIKCNKKLINGNPAVLLPDSFDWRDRHVITSVKNQRDCGSC 142
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFST NIE +YA K KL+ LSEQ+L++CD++++GC GG + A + I+ + GG+
Sbjct: 143 WAFSTIANIESLYAIKYNKLLDLSEQQLVNCDEQNNGCNGGLMHWAMEEIIRQ--GGVSN 200
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQFY 269
E +PY D C+ +K V ING + +E + + L+ NGP+++AI+ + Y
Sbjct: 201 ETDFPYTASDGFCK--RKQGFVNINGCNQFILSNEDRLRELLIFNGPISIAIDVIDVIDY 258
Query: 270 VTGVSHPIQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
G+S + N+N L+H+VL+VGYGV +PYWI+KNSWG WGE GYF
Sbjct: 259 SQGISSTCR------NDNGLNHAVLLVGYGVKNN------IPYWILKNSWGSQWGENGYF 306
Query: 329 RLYRGDGSCG-INDYVRSALV 348
R+ R SCG INDY SA++
Sbjct: 307 RVQRNINSCGMINDYAASAIL 327
>gi|339896953|ref|XP_003392238.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|14349351|gb|AAC38832.2| cysteine protease [Leishmania chagasi]
gi|17384031|emb|CAD12393.1| cysteine proteinase [Leishmania infantum]
gi|321398984|emb|CBZ08377.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 443
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 180/319 (56%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213
Query: 214 YPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C +K +I+GYV + +ET MA +L ENGP+A+A++A + Y
Sbjct: 214 YPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV + L+H VL+VGY ++T VPYW+IKNSWGE WGEKGY R
Sbjct: 274 QSGV------LTSCAGDALNHGVLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGLNACLLSEYPVSAHV 340
>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/326 (37%), Positives = 185/326 (56%), Gaps = 24/326 (7%)
Query: 30 LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
LH ++ F F ++++++Y E R +F ++ + + + + +G+
Sbjct: 31 LHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKE-EAAANPYATFGVT 87
Query: 90 EFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQT 145
+FSD+S E +A YL G K + R P + N++ P A DWR+ AVT VKDQ
Sbjct: 88 QFSDMSPEELRATYLNGAKYYAAALKR--PRKVVNVSTGKAPPAVDWRKKGAVTPVKDQR 145
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
CGS WAFS TGNIEG + +L SLSEQ L+ CD DDGC+GG + A I+S
Sbjct: 146 KCGSCWAFSATGNIEGQWKVAGHELTSLSEQMLVSCDNMDDGCQGGLMDRALKWIVSSNK 205
Query: 206 GGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
G + E++YPY GD C ++ K KI+G++++ +DE +A++L +NGP+A+A++
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNMSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVD 265
Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
A + Y GV ++ L+H VL+VGY D +K PYWIIKNSWG+ W
Sbjct: 266 ASSFLDYKGGV------LTSCSSDALNHDVLLVGYD-DTSK-----PPYWIIKNSWGKKW 313
Query: 323 GEKGYFRLYRGDGSCGINDYVRSALV 348
GE+GY R+ +G C + +Y RSA+V
Sbjct: 314 GEEGYIRVEKGTNQCLMKEYARSAVV 339
>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/318 (38%), Positives = 177/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A +L +LSEQ+L+ CD +D GC GG ++ AF+ ++ + G + E +Y
Sbjct: 155 AVGNIESQWAVAGHRLTALSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMXTEDSY 214
Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD AC + + +I+GYV++ ET MA +L ++GP+++A++A + Y
Sbjct: 215 PYVSSTGDVPACTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYX 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY + VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGKXLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|402585860|gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
Length = 242
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/224 (47%), Positives = 138/224 (61%), Gaps = 10/224 (4%)
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
LP FDW VT VK+Q CGS WAFS TGNIE ++A KT L+SLSEQELIDCD
Sbjct: 28 NLPNKFDWNTKGVVTPVKNQGSCGSCWAFSVTGNIESLWAIKTGNLISLSEQELIDCDVI 87
Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDE 244
D+GC GG NAF I K GGLE E YPY+ + C L + V I+ + + R+E
Sbjct: 88 DNGCNGGLPINAFREI--KRMGGLEPEDQYPYKAKNGTCHLVRAQIAVTIDDAIEIPRNE 145
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
T M ++ + GP++V I+A L +Y +G+ HP + C ++H VLI GYG++
Sbjct: 146 TVMKAWIAQRGPLSVGIDAELLAYYKSGILHPSKSRCPP--SKINHGVLITGYGIE---- 199
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+PYW IKNSWGE WGE GYFRL RG CG++D V SA++
Sbjct: 200 --NGLPYWTIKNSWGEEWGENGYFRLMRGKDICGVSDLVSSAII 241
>gi|394331739|gb|AFN27092.1| cysteine protease [Leishmania major]
Length = 348
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHCRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 447
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 170/313 (54%), Gaps = 18/313 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F +++ ++Y T E RL +F N+R+ ++ + +G+ FSDL+ EF+
Sbjct: 25 LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 83
Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y + A V ++ P P A DWR AVT VKDQ CGS W+FS GN
Sbjct: 84 TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIGN 143
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEG +AA L SLSEQ L+ CD +D+GC GG + NAF+ I+ + G + EK+YPY
Sbjct: 144 IEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 203
Query: 219 DDKA---CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
+D + C I G+V + DE +AKYL +NGP+AVA++A Y GV
Sbjct: 204 EDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 263
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+E L+H VL+VGY D +K PYWIIKNSW WGEKGY R+ +G
Sbjct: 264 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 311
Query: 336 SCGINDYVRSALV 348
C + SA+V
Sbjct: 312 QCLVAQLASSAVV 324
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/345 (37%), Positives = 187/345 (54%), Gaps = 25/345 (7%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F +L + + M LH ++ T ++ +H K Y E R I
Sbjct: 3 FLCKGKILPIALFFVLAMCADQAASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQI 62
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV-PAMIPNI 124
F N+ I+ + S + G+N+F+DL+ EF+A + G+K +P A R + P N+
Sbjct: 63 FKSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYK-RPLGASRKITPFKYENV 121
Query: 125 T-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD- 182
T LP + DWR AVT +KDQ +CGS WAFS EG++ +T KLVSLSEQEL+DCD
Sbjct: 122 TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDV 181
Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
+D GC+GG + +AF I K GG+ E YPY+G D C K+A++ VKI GY +V
Sbjct: 182 KGQDKGCQGGLMVDAFKFI--KRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239
Query: 241 SRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
++ V N P++VAI+A +L QFY +G+ F ++++H V VGYG
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGI------FTGICGKDINHGVAAVGYG 293
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
++ YWI+KNSWG WGEKGY R+ R +G CGI
Sbjct: 294 R-----SNSGSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGI 333
>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
Length = 245
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/243 (46%), Positives = 144/243 (59%), Gaps = 17/243 (6%)
Query: 113 ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
AD + +P LP FDWRE AVT VK+Q CGS W+FSTTG +EG T +L+S
Sbjct: 1 ADENKAPKLPTSNLPEEFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGELIS 60
Query: 173 LSEQELIDCDQE----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
LSEQ+L+DCD E D GC GG ++NAF+ + GGL++EK YPY G D
Sbjct: 61 LSEQQLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALK--AGGLQKEKDYPYTGKDGT 118
Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
C+ +K ++ + VS DE +A LV+ GP+AV INA +Q Y+ GVS P + C
Sbjct: 119 CKFDKTKIAASVHNFSVVSIDEDQIAANLVKYGPLAVGINAAWMQTYIGGVSCP--YIC- 175
Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
++L H VLIVGYG K PYWIIKNSWGE WGE GY+++ RG CG+
Sbjct: 176 --GKSLDHGVLIVGYGTGYAPVRLKNKPYWIIKNSWGESWGESGYYKICRGRNVCGVESM 233
Query: 343 VRS 345
V S
Sbjct: 234 VSS 236
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 122/308 (39%), Positives = 167/308 (54%), Gaps = 20/308 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLS 95
A F F +H KTY E R IF NLRKI+ + E+ G++ G+N+F+D++
Sbjct: 24 AHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEA-HNAEYKQGIHSYTQGINKFADMT 82
Query: 96 TAEFQAKY-LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
AEF+A K KPS + +++P + DWR + VT +KDQ CGS WAF+
Sbjct: 83 RAEFKAMLATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFA 142
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKT 213
G+ EG YA T KL SEQ+L+DC + + GC+GG + + F I + GLE E
Sbjct: 143 VVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYIQTN---GLELESD 199
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
YPY G D C K++ YVSV +E + + + GP+A+AINA LQFY +G+
Sbjct: 200 YPYTGYDGYCSYESSKVVTKVSSYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSGI 259
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+CD E L H VL VGY + + YW+IKNSWG WGE GYFR RG
Sbjct: 260 ID--DKYCD--PEYLDHGVLAVGYDSENGR------DYWLIKNSWGADWGESGYFRFLRG 309
Query: 334 DGSCGIND 341
CG+ +
Sbjct: 310 QNICGVKE 317
>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
Length = 335
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 176/320 (55%), Gaps = 28/320 (8%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
+ LF F + + + YATL E RL F NL ++ Q + H +G+ +F DLS
Sbjct: 27 SVLFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHAR--FGITKFFDLSEE 84
Query: 98 EFQAKYLG----FKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
EF +YL F +A R V A + T P A DWRE AVT VKDQ MCGS
Sbjct: 85 EFATRYLSGATHFAKAKKFASQYYRKVGADLS--TAPAAVDWREKGAVTPVKDQGMCGSC 142
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS GNIE + T L+SLSEQEL+ CD D+GC GG + AFD +++ G +
Sbjct: 143 WAFSAIGNIESKWYLATHSLISLSEQELVSCDDVDEGCNGGLMGQAFDWLLNNRNGAVYT 202
Query: 211 EKTYPY-RGDDKACRLNKKATQV---KINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
+YPY G+ ++ + V I+G+V++ +E MA +L NGP+A+A++A A
Sbjct: 203 GASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAF 262
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y GV CDG + L+H VL+VGY + VPYW+IKNSWGE WGEKG
Sbjct: 263 MSYTGGVLTS----CDG--KQLNHGVLLVGYNMT------GEVPYWVIKNSWGENWGEKG 310
Query: 327 YFRLYRGDGSCGINDYVRSA 346
Y R+ +G C I +Y SA
Sbjct: 311 YVRVRKGTNECLIQEYPVSA 330
>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
Length = 348
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
Length = 348
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 126/313 (40%), Positives = 174/313 (55%), Gaps = 22/313 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDY 342
+ G +C + Y
Sbjct: 322 VTMGVNACLLTGY 334
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 189/344 (54%), Gaps = 28/344 (8%)
Query: 16 TVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
TV V++ ++V + + LF F H + YA+ E R IF+GN++K +
Sbjct: 3 TVIVAALLMV----CNAMGAPTTEVLFGNFKAAHARNYASPDEERKRFEIFAGNMKKAAV 58
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEFQAKY------LGFKLKPSYADRSVPAMIPNITLPRA 129
L + ++ +G NEF+D+++ EFQ ++ K +P ++ A + +
Sbjct: 59 L-NRKNPMATFGPNEFADMTSEEFQTRHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQ 117
Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCE 189
DWR AVT VK+Q CGS W+FSTTGNIEG +A T +LV++SEQEL+ CD DDGC
Sbjct: 118 IDWRLKGAVTPVKNQGACGSCWSFSTTGNIEGQHAIATGQLVAVSEQELVSCDPIDDGCN 177
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQV--KINGYVSVSRDE 244
GG + NAF ++S G + E YPY G AC + ++ V I+ + ++R E
Sbjct: 178 GGLMDNAFGWLISAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTE 237
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
DMA ++ ++GP+++ ++A Q Y G I +C + + H VLIVG+ D T
Sbjct: 238 EDMAAFVFKHGPLSIGVDASTWQSYAGG----IMSYCP--QDQIDHGVLIVGF--DDTAS 289
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T PYWIIKNSW WGE+GY R+ +G CG+ + S++V
Sbjct: 290 T----PYWIIKNSWTANWGEEGYIRVAKGSNQCGLTSHPSSSVV 329
>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+Y
Sbjct: 155 AVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY 214
Query: 215 PY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYH 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + Y S V
Sbjct: 323 TMGVNACLLTGYPVSVHV 340
>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVSTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E M +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 124/343 (36%), Positives = 188/343 (54%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F ++ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ +FSD+S EF+A YL G K + R P + ++ P
Sbjct: 72 AKE-EAAANPYATFGVTQFSDMSPEEFRATYLNGAKYYAAALKR--PRKVVTVSTGKAPP 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + +L SLSEQ L+ CD DDGC
Sbjct: 129 AIDWRKKGAVTPVKDQRKCGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDNMDDGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDET 245
+GG + A I+S G + E++YPY GD C + K KI+G +++ +DE
Sbjct: 189 QGGLMDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Y GV ++ L+H VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPIAIAVDASSFLDYTGGV------LTSCSSDALNHDVLLVGYD-DSSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG+ WGE+GY R+ +G C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWGKKWGEEGYIRVEKGTNQCLMKEYARSAVV 339
>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
Length = 443
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+Y
Sbjct: 155 AVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY 214
Query: 215 PY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYH 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + Y S V
Sbjct: 323 TMGVNACLLTGYPVSVHV 340
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 125/308 (40%), Positives = 176/308 (57%), Gaps = 21/308 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLST 96
A F F +H KTY E R +IF+ N+R I+ E G Y G+N+F+D+S
Sbjct: 24 AKFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQ 83
Query: 97 AEFQAKY-LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF+ L KP+ S + + +P + DWR+ VTGVKDQ CGS WAFS
Sbjct: 84 EEFKTMLTLSASRKPTLETTSY--VKTGVEIPSSVDWRKEGRVTGVKDQGDCGSCWAFSI 141
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TG+ EG YA K+ KLVSLSEQ+LIDC + GC+GGS+ + F +M GL+ E++Y
Sbjct: 142 TGSTEGAYARKSGKLVSLSEQQLIDCCTDTSAGCDGGSLDDNFKYVMKD---GLQSEESY 198
Query: 215 PYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
Y+G+D AC+ N + K++ Y S+ + DE + + + GP++V ++A L Y +G+
Sbjct: 199 TYKGEDGACKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGMDASYLSSYDSGI 258
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
D L+H++L VGYG + K YWIIKNSWG WGE+GYFRL RG
Sbjct: 259 YEDQ----DCSPAGLNHAILAVGYGTENGK------DYWIIKNSWGASWGEQGYFRLARG 308
Query: 334 DGSCGIND 341
CGI++
Sbjct: 309 KNQCGISE 316
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 120/319 (37%), Positives = 179/319 (56%), Gaps = 24/319 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F H + YA+ E R IF+GN++K +L + ++ +G NEF+D+++ EFQ
Sbjct: 9 LFGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVL-NRKNPMATFGPNEFADMTSEEFQ 67
Query: 101 AKY------LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
++ K +P ++ A + + DWR AVT VK+Q CGS W+FS
Sbjct: 68 TRHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSCWSFS 127
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TTGNIEG +A T +LV++SEQEL+ CD DDGC GG + NAF ++S G + E Y
Sbjct: 128 TTGNIEGQHAIATGQLVAVSEQELVSCDPIDDGCNGGLMDNAFGWLISAHKGQIATEANY 187
Query: 215 PY---RGDDKACRLNKKATQV--KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
PY G AC + ++ V I+ + ++R E DMA ++ ++GP+++ ++A Q Y
Sbjct: 188 PYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGPLSIGVDASTWQSY 247
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
G I +C + + H VLIVG+ D T T PYWIIKNSW WGE+GY R
Sbjct: 248 AGG----IMSYCP--QDQIDHGVLIVGF--DDTAST----PYWIIKNSWTANWGEEGYIR 295
Query: 330 LYRGDGSCGINDYVRSALV 348
+ +G CG+ + S++V
Sbjct: 296 VAKGSNQCGLTSHPSSSVV 314
>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAFS
Sbjct: 95 AARYLNGAAYFAAVKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+Y
Sbjct: 155 AVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY 214
Query: 215 PY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYH 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + Y S V
Sbjct: 323 TMGVNACLLTGYPVSVHV 340
>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 176/320 (55%), Gaps = 28/320 (8%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
+ LF F + + + YATL E RL F NL ++ Q + H +G+ +F DLS
Sbjct: 35 SVLFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHAR--FGITKFFDLSEE 92
Query: 98 EFQAKYLG----FKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
EF +YL F +A R V A + T P A DWRE AVT VKDQ MCGS
Sbjct: 93 EFATRYLSGATHFAKAKKFASQYYRKVGADLS--TAPAAVDWREKGAVTPVKDQGMCGSC 150
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS GNIE + T L+SLSEQEL+ CD D+GC GG + AFD +++ G +
Sbjct: 151 WAFSAIGNIESKWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYT 210
Query: 211 EKTYPY-RGDDKACRLNKKATQV---KINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
+YPY G+ ++ + V I+G+V++ +E MA +L NGP+A+A++A A
Sbjct: 211 GASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAF 270
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y GV CDG + L+H VL+VGY + VPYW+IKNSWGE WGEKG
Sbjct: 271 MSYTGGVLTS----CDG--KQLNHGVLLVGYNMT------GEVPYWLIKNSWGENWGEKG 318
Query: 327 YFRLYRGDGSCGINDYVRSA 346
Y R+ +G C I +Y SA
Sbjct: 319 YVRVRKGTNECLIQEYPVSA 338
>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
Length = 348
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWR+ AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWG+ WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGKDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 24/318 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
+ LF F + + + YATL E RL F NL ++ Q + H +G+ +F DLS
Sbjct: 35 SVLFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHAR--FGITKFFDLSEE 92
Query: 98 EFQAKYLG----FKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF +YL F +A + + ++ T P A DWRE AVT VKDQ MCGS WA
Sbjct: 93 EFATRYLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWA 152
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS GNIE + T L+SLSEQEL+ CD D+GC GG + AFD +++ G +
Sbjct: 153 FSAIGNIESQWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGV 212
Query: 213 TYPY-RGDDKACRLNKKATQVK---INGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
+YPY G+ ++ + V I+G+V++ +E MA +L NGP+A+A++A A
Sbjct: 213 SYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMS 272
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV CDG + L+H VL+VGY + VPYW+IKNSWGE WGEKGY
Sbjct: 273 YTGGVLTS----CDG--KQLNHGVLLVGYNMT------GEVPYWLIKNSWGENWGEKGYV 320
Query: 329 RLYRGDGSCGINDYVRSA 346
R+ +G C I +Y SA
Sbjct: 321 RVRKGTNECLIQEYPVSA 338
>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E M +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 376
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 178/335 (53%), Gaps = 35/335 (10%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F F+ +H + Y+ EY RL +F+ NL + Q + + +G+ FSDL+ EF
Sbjct: 46 AQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEF 104
Query: 100 QAKYLGFK--LKPSYADRSVPAMIPNIT-----LPRAFDWREYDAVTGVKDQTMCGSSWA 152
+A+ G + R +P+ P LP +FDWR+ AVT VK Q CGS WA
Sbjct: 105 EARLTGLAADVGDDVRRRPMPSAAPATEEEVSGLPASFDWRDRGAVTDVKMQGACGSCWA 164
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSK 203
FSTTG +EG T L+ LSEQ+L+DCD D GC GG ++NA+ +MS
Sbjct: 165 FSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYAYLMSS 224
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRD-------ETDMAKYLVENGP 256
GGL E+ YPY G CR + V++ + V+ + M LV +GP
Sbjct: 225 --GGLMEQSAYPYTGAQGTCRFDANRVAVRVANFTVVAPPGGNDGDGDAQMRAALVRHGP 282
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWI 313
+AV +NA +Q YV GVS P+ C ++H VL+VGY G + H+ PYWI
Sbjct: 283 LAVGLNAAYMQTYVGGVSCPL--VCP--RAWVNHGVLLVGYGERGFAALRLGHR--PYWI 336
Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
IKNSWG+ WGE+GY+RL RG CG++ V + V
Sbjct: 337 IKNSWGKAWGEQGYYRLCRGRNVCGVDTMVSAVAV 371
>gi|114679921|ref|YP_758371.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
gi|39598652|gb|AAR28838.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
Length = 359
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/328 (39%), Positives = 175/328 (53%), Gaps = 44/328 (13%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ +N+TY VE R F NL+ I L S Y +N+FSDL+ E A
Sbjct: 54 FERFVRDYNRTYIDSVEREQRYETFVQNLKNINRLNQKSQAS--YDINKFSDLTKDEVVA 111
Query: 102 KYLGFKLKPS-----YADRS--------------VPAMIPNITLPRAFDWREYDAVTGVK 142
++ G L PS Y D + P +P++ +DWR VT VK
Sbjct: 112 RFTG--LDPSLAAAAYTDNNGTQYQLCKVVVVDGTPGRVPDL-----WDWRNSQKVTSVK 164
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMS 202
Q +CGS WAF++ NIE YA + +L+ LSEQ+L+DCDQ D GC GG + AF I+
Sbjct: 165 QQGVCGSCWAFASVANIESQYAIRHDRLLDLSEQQLVDCDQIDQGCSGGLMHLAFQEILQ 224
Query: 203 KLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAI 261
GGLE E YPY+G D ACRLN + VK++ RDE + + + GP+AVAI
Sbjct: 225 M--GGLESELVYPYQGVDYACRLNPRKFDVKLSDCHRYDLRDERKLRELVYTVGPIAVAI 282
Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
+ + Y +G+ C+ N L+H+VL+VG+G++ PYWI+KNSWG
Sbjct: 283 DCIDIIDYKSGIVS----MCN--NNGLNHAVLLVGFGIEFD------TPYWILKNSWGND 330
Query: 322 WGEKGYFRLYRGDGSCG-INDYVRSALV 348
WGEKGYFRL R CG +N+ SA V
Sbjct: 331 WGEKGYFRLKRNINGCGMMNELAASATV 358
>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|13625989|gb|AAK35220.1|AF362769_1 pre-procathepsin L [Paragonimus westermani]
Length = 235
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 104/224 (46%), Positives = 142/224 (63%), Gaps = 10/224 (4%)
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
T P + DWR+ AV V+ Q CGS WAFS T N+EG + KT +LVSLS+Q+L+DCD+
Sbjct: 21 TAPASVDWRKKGAVGPVEHQGSCGSCWAFSVTANVEGQWFLKTGRLVSLSKQQLVDCDRL 80
Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDE 244
D GC GG + I K GGLE + YPY G ++ACRL++ KI+ + + ++E
Sbjct: 81 DHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTGWEQACRLDRSKLFAKIDDSIVLEKNE 138
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
A +L E+GPM+ +NA LQFY G+ HP ++ C E L+H+VL VGY +R
Sbjct: 139 EKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSEYACS--PEGLNHAVLTVGYDTER--- 193
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
VPYW ++NSWG WGE GYFR+YRGDG+CGI+ SA++
Sbjct: 194 ---GVPYWTVRNSWGTRWGENGYFRIYRGDGTCGIDRLTTSAII 234
>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 175/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A +L +LSEQ+L+ CD +D GC GG ++ AF+ ++ + G + E +Y
Sbjct: 155 AVGNIESQWAVAGHRLTALSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSY 214
Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + +I+GYV++ ET MA +L ++GP+++A++A + Y
Sbjct: 215 PYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYE 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGYNXT------GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 124/307 (40%), Positives = 172/307 (56%), Gaps = 23/307 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F + +H K+Y++ E RL IFS L I+ + + GLN+FSDL+ AEF+
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 101 AKYLGFKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
A Y+G P Y DR PA ++ +LP + DWR+ AVT +KDQ CGS WAFS
Sbjct: 61 ANYVGKFKPPRYQDRR-PAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
+IE + TK+LVSLSEQ+LIDCD D GC+GG +AF ++ GG+ E+ YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVEN--GGVTTEEAYPYT 177
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
G +C NK V+I GY V++D D V P+ V I + Q Y +G+
Sbjct: 178 GFAGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI-- 234
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--G 333
+ C + H+VL++GYG T +PYWIIKNSWG WGE G+ R+ + G
Sbjct: 235 -LSGHCSNSRD---HAVLVIGYG------TEGGMPYWIIKNSWGTSWGEDGFMRIKKKDG 284
Query: 334 DGSCGIN 340
+G CG+N
Sbjct: 285 EGMCGMN 291
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 124/307 (40%), Positives = 172/307 (56%), Gaps = 23/307 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F + +H K+Y++ E RL IFS L I+ + + GLN+FSDL+ AEF+
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 101 AKYLGFKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
A Y+G P Y DR PA ++ +LP + DWR+ AVT +KDQ CGS WAFS
Sbjct: 61 ANYVGKFKPPRYQDRR-PAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
+IE + TK+LVSLSEQ+LIDCD D GC+GG +AF ++ GG+ E+ YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVEN--GGVTTEEAYPYT 177
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
G +C NK V+I GY V++D D V P+ V I + Q Y +G+
Sbjct: 178 GFAGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI-- 234
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--G 333
+ C + H+VL++GYG T +PYWIIKNSWG WGE G+ R+ + G
Sbjct: 235 -LSGHCSNSRD---HAVLVIGYG------TEGGMPYWIIKNSWGTSWGEDGFMRIKKEDG 284
Query: 334 DGSCGIN 340
+G CG+N
Sbjct: 285 EGMCGMN 291
>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAIAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|309752918|gb|ADO85436.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 184/323 (56%), Gaps = 33/323 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS--GVYGLNEFSDLSTAE 98
+F F++++NK+YAT E + F NL+ ++ D +GS V+ +N FSDL+ +
Sbjct: 35 IFEDFIKKYNKSYATDQERAIKYENFKNNLK---MINDKNNGSKDAVFDINAFSDLNKND 91
Query: 99 FQAKYLGFKL---KPSY--------ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
+ GF++ K SY + V P I LP +FDWR+ VT VK+Q C
Sbjct: 92 LLRRTTGFRMGLKKNSYYTPDVSKECNVQVIKSEPQIILPESFDWRDKHGVTPVKNQLEC 151
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAFS NIE +Y K K + LSEQ LI+CD ++GC GG + A +TI+ + GG
Sbjct: 152 GSCWAFSAIANIESLYNIKHNKELDLSEQHLINCDSINNGCGGGLMHWALETILQQ--GG 209
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYAL 266
+ EK PY G D C+ K V I+G V ++E + + L+ NGP+++A++ +
Sbjct: 210 IVSEKDEPYYGLDAVCK--PKQFNVSISGCTRYVLKNENKLRELLIANGPISMAVDIIDV 267
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y G++ C+ N L+H+VL+VGYGV H +PYWI+KNSWGE WGEKG
Sbjct: 268 IDYKEGITD----ICENMN-GLNHAVLLVGYGV------HNNIPYWIMKNSWGEEWGEKG 316
Query: 327 YFRLYRGDGSCGI-NDYVRSALV 348
Y R+ R SCG+ N++ SA++
Sbjct: 317 YLRVQRNINSCGLMNEFASSAIL 339
>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 389
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/313 (38%), Positives = 169/313 (53%), Gaps = 18/313 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F +++ ++Y T E RL +F N+R+ ++ + +G+ FSDL+ EF+
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91
Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y + A V ++ P P A DWR AVT VKDQ CGS W+FS GN
Sbjct: 92 TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGN 151
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEG +AA L SLSEQ L+ CD +D+GC GG + NAF+ I+ + G + K+YPY
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTGKSYPYVS 211
Query: 219 DDKA---CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
+D + C I G+V + DE +AKYL +NGP+AVA++A Y GV
Sbjct: 212 EDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+E L+H VL+VGY D +K PYWIIKNSW WGEKGY R+ +G
Sbjct: 272 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319
Query: 336 SCGINDYVRSALV 348
C + SA+V
Sbjct: 320 QCLVAQLASSAVV 332
>gi|288804650|ref|YP_003429335.1| cathepsin [Pieris rapae granulovirus]
gi|270161225|gb|ACZ63497.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 184/323 (56%), Gaps = 33/323 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS--GVYGLNEFSDLSTAE 98
+F F++++NK+YAT E + F NL+ ++ D +GS V+ +N FSDL+ +
Sbjct: 35 IFEDFIKKYNKSYATDQERAIKYENFKNNLK---MINDKNNGSKYAVFDINAFSDLNKND 91
Query: 99 FQAKYLGFKL---KPSY--------ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
+ GF++ K SY + V P I LP +FDWR+ VT VK+Q C
Sbjct: 92 LLRRTTGFRMGLKKNSYYTPDVSKECNVQVIKSEPQIILPESFDWRDKHGVTPVKNQLEC 151
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAFS NIE +Y K K + LSEQ LI+CD ++GC GG + A +TI+ + GG
Sbjct: 152 GSCWAFSAIANIESLYNIKHNKELDLSEQHLINCDSINNGCGGGLMHWALETILQQ--GG 209
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYAL 266
+ EK PY G D C+ K V I+G V ++E + + L+ NGP+++A++ +
Sbjct: 210 IVSEKDEPYYGLDAVCK--PKQFNVSISGCTRYVLKNENKLRELLIANGPISMAVDIIDV 267
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y G++ C+ N L+H+VL+VGYGV H +PYWI+KNSWGE WGEKG
Sbjct: 268 IDYKEGITD----ICENMN-GLNHAVLLVGYGV------HNNIPYWIMKNSWGEEWGEKG 316
Query: 327 YFRLYRGDGSCGI-NDYVRSALV 348
Y R+ R SCG+ N++ SA++
Sbjct: 317 YLRVQRNINSCGLMNEFASSAIL 339
>gi|15824693|gb|AAL09444.1| cysteine protease [Leishmania donovani]
Length = 394
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 179/320 (55%), Gaps = 24/320 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213
Query: 214 YPY---RGDDKACRLN--KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
YPY GD C LN K +I+GYV + +ET MA +L ENGP+A+ ++A +
Sbjct: 214 YPYTSGNGDVAEC-LNSSKLVPGARIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMS 272
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +GV C G + L+H VL+VGY T VPY +IKNSWGE WGEKGY
Sbjct: 273 YQSGVLTS----CAG--DALNHGVLLVGYN------TTGGVPYCVIKNSWGEDWGEKGYV 320
Query: 329 RLYRGDGSCGINDYVRSALV 348
R+ G +C +++Y SA V
Sbjct: 321 RVAMGLNACLLSEYPVSAHV 340
>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
Length = 325
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/318 (38%), Positives = 174/318 (54%), Gaps = 19/318 (5%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ +NK Y E R IF NL +I + E V+ +N+FSD+S
Sbjct: 21 LKAPDYFESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVED-HAVFSINKFSDMS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
+E +KY G L + A+I P P FDWR+Y+AVT V+ Q CGS WA
Sbjct: 80 KSEIISKYTGLSLPSLMQENFCRAIILDGPPNKAPINFDWRQYNAVTPVRVQGNCGSCWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FST IE Y+ K K +SLS Q+L+DCD + GC GG + A + I++ GGG+ +E+
Sbjct: 140 FSTLAGIESQYSIKYNKQISLSVQQLVDCDTSNMGCAGGLLHTALEQIINA-GGGVLQEE 198
Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY+G DK C L V++ G Y + +E + L GP+ VAI+A ++ Y
Sbjct: 199 DYPYKGVDKQCNLPHNNFAVQVLGCYRYIVMNEEKLKDVLRAVGPIPVAIDAASIVDYSR 258
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ ++ L+H+VL+VGYGV VPYW +KN+WG+ WGE GYFR+
Sbjct: 259 GIIRTCTYY------GLNHAVLLVGYGV------QDGVPYWTLKNTWGDDWGEHGYFRVR 306
Query: 332 RGDGSCG-INDYVRSALV 348
+ SCG IND +A++
Sbjct: 307 QNVNSCGIINDLASTAVI 324
>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
Length = 337
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/348 (36%), Positives = 188/348 (54%), Gaps = 29/348 (8%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTA--LFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
++L + + +V + HL H A F F+ +NK Y R IF NL
Sbjct: 1 MTLLMIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNL 60
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMI-------- 121
I ++ + S +Y +N+FSDLS E KY G KPS RS
Sbjct: 61 EDINE-KNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKKPSNMVRSTSNFCNVIHLDAP 119
Query: 122 PNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
P++ LP+ FDWR + +T VKDQ CGS WA + G +E +YA K L++LSEQ+LI
Sbjct: 120 PDVHDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLI 179
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD + C+GG + AF+ +M+ GGL EE YPY+G C+++ K + ++
Sbjct: 180 DCDSANMACDGGLMHTAFEQLMN--AGGLMEEIDYPYQGTKGVCKIDNKKFALSVSSCKR 237
Query: 240 -VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ ++E ++ K L+ GP+A+AI+A ++ Y G+ H FC+ N L+H+VL+VGYG
Sbjct: 238 YIFQNEENLKKELITMGPIAMAIDAASISTYSKGIIH----FCE--NLGLNHAVLLVGYG 291
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
T V YW +KNSWG WGE GYFR+ R +CG+N+ + ++
Sbjct: 292 ------TEGGVSYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAAS 333
>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
Length = 336
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 173/307 (56%), Gaps = 22/307 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS--GVYGLNEFSDLSTAE 98
LF F+ ++NK Y + E R IF NL++I D H S V+G+N+F+DLS E
Sbjct: 40 LFENFIREYNKKYDSK-EKEERFKIFVNNLKRIN---DLNHKSTNAVHGINKFTDLSKEE 95
Query: 99 FQAKYLGFKLKPSYADRSV--PAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
F+ Y GFK S+ D ++ P+ + NIT P AFDWR+ VT VK+Q CGS WAFST
Sbjct: 96 FKKFYTGFKPDKSFLDDNIKKPSQLSFNITAPPAFDWRDKGVVTRVKNQGTCGSCWAFST 155
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
GN+E V A K LV LSEQ+L+DCD +D+ C+ G NA ++S G E++YP
Sbjct: 156 IGNVESVNAIKHGNLVELSEQQLVDCDSKDEACDSGLPDNAQQYLVSH---GAISEQSYP 212
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y+G C + V+++ + V E MA+ L P+++ I A L Y G+
Sbjct: 213 YKGYAANCTYDSSQVVVRLSNFEKVVLSECQMAEKLYSTAPLSIVIAAEVLGTYTKGI-- 270
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+ C+ +++L+H+VL+VGYG +WI+KNSWG WGE GYFR+ RG
Sbjct: 271 -LVNECE-QSQDLNHAVLLVGYG------NEGGTNFWILKNSWGTNWGEGGYFRIKRGVN 322
Query: 336 SCGINDY 342
I DY
Sbjct: 323 CLMITDY 329
>gi|332326589|gb|AEE42618.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 178/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHHRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A +L +LSEQ+L+ CD +D GC GG ++ AF+ ++ + G + E +Y
Sbjct: 155 AVGNIESQWAVAGHRLTALSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSY 214
Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + +I+GYV++ ET MA +L ++GP+++A++A + Y
Sbjct: 215 PYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYE 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY +RT VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 363
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 125/343 (36%), Positives = 186/343 (54%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + ++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVTVSTGKAPD 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT V+D+ +C SSWAFS GNIEG + +L SLSEQ L+ CD +DGC
Sbjct: 129 AVDWRKKGAVTPVRDERLCDSSWAFSAIGNIEGQWKVAGHELTSLSEQMLLSCDTREDGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
GG + AF I+S G + E++YPY GD C + K KI+ YV + +DE
Sbjct: 189 GGGLMDRAFQWIVSSNKGNVFTEQSYPYASTDGDVPRCNKSGKVVGAKISDYVDLPQDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A+ A +LQ Y GV +E L H VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVEATSLQRYTGGV------LTSCISEQLDHGVLLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG+GWGE+GY R+ +G C + +Y SA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCLMKNYASSAVV 339
>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
Length = 359
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNI-----TLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL + A R P P +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FCARYLNGAAYFAAAKRHTPQHYPKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + +K +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + ++H+VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGV----LTACIG--KQVNHAVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
Length = 337
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 126/348 (36%), Positives = 187/348 (53%), Gaps = 29/348 (8%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTA--LFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
++L + + +V + HL H A F F+ +NK YA R IF NL
Sbjct: 1 MTLLMIFTILLVASSQIEGHLKFDIHDAQHYFETFIVNYNKQYADTKTKNYRFKIFVQNL 60
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNI----- 124
I ++ + S +Y +N+FSDLS E KY G KPS +S I
Sbjct: 61 EYINE-KNKLNDSAIYNINKFSDLSKNELLTKYTGLTSRKPSNMVKSTSNFCNVIHLDAP 119
Query: 125 -----TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
LP+ FDWR + +T VKDQ CGS WA + G +E +YA K L++LSEQ+LI
Sbjct: 120 PDARDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLI 179
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD + C+GG + AF+ +M+ GGL EE YPY+G C+++ K + ++
Sbjct: 180 DCDSANMACDGGLMHTAFEQLMN--AGGLMEEIDYPYQGTKGICKIDNKKFALSVSSCKR 237
Query: 240 -VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ ++E ++ K L+ GP+A+AI+A ++ Y G+ H FC+ N L+H+VL+VGYG
Sbjct: 238 YIFQNEENLKKELITTGPIAMAIDAASISTYSKGIIH----FCE--NLGLNHAVLLVGYG 291
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
T V YW +KNSWG WGE GYFR+ R +CG+N+ + ++
Sbjct: 292 ------TEGGVSYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAAS 333
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 172/307 (56%), Gaps = 23/307 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F + +H K+Y++ E RL IFS L I+ + + GLN+FSDL+ AEF+
Sbjct: 1 MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 101 AKYLGFKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
A Y+G P Y DR PA ++ +LP + DWR+ AVT +KDQ CGS WAFS
Sbjct: 61 ANYVGKFKSPRYQDRR-PAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
+IE + TK+LVSLSEQ+LIDCD D GC+GG +AF ++ GG+ E+ YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVEN--GGVTTEEAYPYT 177
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
G +C NK V+I GY V++D D V P+ V I + Q Y +G+
Sbjct: 178 GFAGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI-- 234
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--G 333
+ C + H+VL++GYG T +PYWIIKNSWG WGE G+ ++ + G
Sbjct: 235 -LSGQCSNSRD---HAVLVIGYG------TEGGMPYWIIKNSWGTSWGENGFMKIKKKDG 284
Query: 334 DGSCGIN 340
+G CG+N
Sbjct: 285 EGMCGMN 291
>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + V + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRVRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS A
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQRRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAV 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C + + A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 126/315 (40%), Positives = 175/315 (55%), Gaps = 19/315 (6%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLN 89
+H + + F ++NK+Y +E R IF G+LRKI+ D +HG + G+
Sbjct: 14 VHALSDKEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVT 73
Query: 90 EFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
+F+DL+ EF + LG + S R + ++ P LP FDWRE AVT VKDQ CG
Sbjct: 74 KFADLTEKEF-SDMLGISRSTKSSRPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCG 132
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGG 207
S W+FSTTG +EG Y KT KLVSLSEQ L+DC +ED GC GG + A + I + GG
Sbjct: 133 SCWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKEDCYGCSGGYMDKALEYI--ETAGG 190
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA-YA 265
+ E YPY G D CR + KI+ + + + DE D+ ++ GP++VAI+A +
Sbjct: 191 IMSENDYPYEGIDDKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPISVAIDASFN 250
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
Q Y +G+ + D +L+H VL+VGYG T K YWI+KNSWG WG
Sbjct: 251 FQLYDSGILDDSSCYSDFN--SLNHGVLVVGYG------TEKEQDYWIVKNSWGADWGMD 302
Query: 326 GYFRLYRG-DGSCGI 339
GY + R + CGI
Sbjct: 303 GYIWMSRNKNNQCGI 317
>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
Length = 259
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 117/271 (43%), Positives = 158/271 (58%), Gaps = 16/271 (5%)
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDA 137
E G+ YG+ +FSDL++ EF+ +YL + P ++ P ++T+ FDWRE+ A
Sbjct: 2 EQGTAHYGVTQFSDLTSEEFKTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGA 59
Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAF 197
V V DQ CGS WAFS GN+ G + KT L++LSEQ+L+DCD DDGC+GG +
Sbjct: 60 VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTY 119
Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPM 257
I GGLE YPY G C ++K +NG + E A+ L GP+
Sbjct: 120 TAIQKM--GGLELASDYPYTGVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPL 177
Query: 258 AVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNS 317
+ A+NA LQ Y G+ P +CD N H+VL VGYGV K PYWI+KNS
Sbjct: 178 SSALNADTLQLYKGGIMRPK--WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNS 227
Query: 318 WGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE +GE+GYFR+YRGDG+CGIN V +A++
Sbjct: 228 WGEDFGEEGYFRIYRGDGTCGINSIVTTAII 258
>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 127/318 (39%), Positives = 176/318 (55%), Gaps = 24/318 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
+ LF F + + + YATL E RL F NL ++ Q + H +G+ +F DLS
Sbjct: 35 SVLFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHAR--FGITKFFDLSEE 92
Query: 98 EFQAKYLG----FKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF +YL F +A + + ++ T P A DWRE AVT VKDQ MCGS WA
Sbjct: 93 EFATRYLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWA 152
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS GNIE + T L+SLSEQEL+ CD D+GC GG + AFD +++ G +
Sbjct: 153 FSAIGNIESQWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGV 212
Query: 213 TYPY-RGDDKACRLNKKATQVK---INGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
+YPY G+ ++ + V I+G+V++ +E MA +L NGP+A+A++A A
Sbjct: 213 SYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMS 272
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV CDG + L+H VL+VGY + VPYW+IKNSWG+ WGEKGY
Sbjct: 273 YTGGVLTS----CDG--KQLNHGVLLVGYNMT------GEVPYWLIKNSWGKNWGEKGYV 320
Query: 329 RLYRGDGSCGINDYVRSA 346
R+ +G C I +Y SA
Sbjct: 321 RVRKGTNECLIQEYPVSA 338
>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
Length = 343
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 121/314 (38%), Positives = 180/314 (57%), Gaps = 23/314 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+ Q+NK Y E R +IF N+ +I +++ + S VY +N F+D++ E
Sbjct: 45 FEQFISQYNKQYKNEAEKRHRFNIFMHNIEEINQ-KNSRNDSAVYKINRFADMTKNEVVI 103
Query: 102 KYLGF----KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
++ G +L ++ + V P +FDWR Y+ VT VKDQ+MCG+ WAF++ G
Sbjct: 104 RHTGLASIGELNSNFCETVVVDGPGQRQRPSSFDWRTYNKVTSVKDQSMCGACWAFASLG 163
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
+E YA K +L+ L+EQ+L+DCD D GC+GG I A++ IM GG+E+E YPYR
Sbjct: 164 ALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMQM--GGVEQEFDYPYR 221
Query: 218 GDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
+ + C L +K A V+ + V R+E + L GP+A+A++A L Y G+
Sbjct: 222 AERQPCALKPHKFAAGVR-KCFRYVLRNEERLEDLLRHVGPIAIAVDAVDLTDYYGGIVS 280
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
FC+ N L+H+VL+VGYGV+ VP+W +KNSWG +GE GY R+ RG
Sbjct: 281 ----FCE--NNGLNHAVLLVGYGVENN------VPFWTLKNSWGSDYGEDGYVRVRRGVN 328
Query: 336 SCG-INDYVRSALV 348
SCG +N+ SA V
Sbjct: 329 SCGLVNELASSAQV 342
>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 124/343 (36%), Positives = 187/343 (54%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F ++ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ +FSD+S EF+A YL G K + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTQFSDMSPEEFRATYLNGAKYYAAALKR--PRKVVNVSTGKAPP 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AIDWRKKGAVTPVKDQGKCGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDNMDYGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDET 245
GG + A I+S G + E++YPY GD C + K KI+G +++ +DE
Sbjct: 189 RGGFLDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Y GV ++ L+H VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPIAIAVDASSFLDYTGGV------LTSCSSDALNHGVLLVGYD-DSSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG+ WGE+GY R+ +G C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWGKKWGEEGYIRVEKGTNQCLMKEYARSAVV 339
>gi|332326583|gb|AEE42615.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 177/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHHRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A +L LSEQ+L+ CD +D GC GG ++ AF+ ++ + G + E +Y
Sbjct: 155 AVGNIESQWAVADHRLXXLSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSY 214
Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + +I+GYV++ ET MA +L ++GP+++A++A + Y
Sbjct: 215 PYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYE 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY +RT VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|44844204|emb|CAF32698.1| cysteine proteinase [Leishmania infantum]
Length = 443
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKXXGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEXLLRHMYGIVFTEKS 213
Query: 214 YPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY GD C +K +I+GYV + +ET MA +L ENGP+A+A++A + Y
Sbjct: 214 YPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV + L+H VL+VGY ++T VPYW+IKNSWGE WGEKGY R
Sbjct: 274 QSGV------LTSCAGDALNHGVLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + + SA V
Sbjct: 322 VVMGXNACLLXEXPXSAHV 340
>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
partial [Trypanosoma vivax Y486]
Length = 323
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 120/302 (39%), Positives = 165/302 (54%), Gaps = 18/302 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF F +++ ++Y T E RL +F N+R+ ++ + +G+ FSDL+ EF+
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91
Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y + A V ++ P P A DWR AVT VKDQ CGS W+FS GN
Sbjct: 92 TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGRCGSCWSFSAIGN 151
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEG +AA L SLSEQ L+ CD +D+GC GG + NAF+ I+ + G + EK+YPY
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 219 DDKA---CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
+D + C I G+V + DE +AKYL +NGP+AVA++A Y GV
Sbjct: 212 EDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV-- 269
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+E L+H VL+VGY D +K PYWIIKNSW WGEKGY R+ +G
Sbjct: 270 ----VTSCTSEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319
Query: 336 SC 337
C
Sbjct: 320 QC 321
>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
Length = 709
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 177/331 (53%), Gaps = 39/331 (11%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F F+ +H + Y+ EY RL +F+ NL + Q + + +G+ FSDL+ EF
Sbjct: 46 AQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEF 104
Query: 100 QAKYLGF---------KLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGS 149
+A+ G + + + PA ++ LP +FDWR+ AVTGVK Q CGS
Sbjct: 105 EARLTGLATDVGDDDVRRRRLPMPSAAPATEEEVSGLPSSFDWRDRGAVTGVKMQGACGS 164
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTI 200
WAFSTTG +EG T L+ LSEQ+L+DCD D GC GG ++NA+ +
Sbjct: 165 CWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYAYL 224
Query: 201 MSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR--------DETDMAKYLV 252
MS GGL E+ YPY G ACR + V++ + V+ + M LV
Sbjct: 225 MSS--GGLMEQSAYPYTGAQGACRFDANRVAVRVANFTVVAPAAGPGGNDGDAQMRAALV 282
Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAV 309
+GP+AV +NA +Q YV GVS P+ C N H VL+VGY G + H+
Sbjct: 283 RHGPLAVGLNAAYMQTYVGGVSCPL--VCPRAWVN--HGVLLVGYGERGFAALRLGHR-- 336
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
PYWIIKNSWG+ WGE+GY+RL RG CG++
Sbjct: 337 PYWIIKNSWGKAWGEQGYYRLCRGRNVCGVD 367
>gi|441611591|ref|XP_003273955.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Nomascus leucogenys]
Length = 548
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 127/329 (38%), Positives = 183/329 (55%), Gaps = 16/329 (4%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 235 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 294
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 295 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 354
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 355 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 414
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 415 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 472
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +Q V H + + + H G D +P+W IKNSWG
Sbjct: 473 AINAFGMQ--VRPXPHCSAWIINSPDSCTLHCT----PGSD--------IPFWAIKNSWG 518
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 519 TDWGEKGYYYLHRGSGACGVNTMASSAVV 547
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 128/336 (38%), Positives = 172/336 (51%), Gaps = 29/336 (8%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
+L L++ S M +LH + +++++ K Y E RL IF N+
Sbjct: 14 VLLLSICTSQVMS------RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVE 67
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAF 130
I+ + G+N +D + EF A + G+K K S++ P N+T +P A
Sbjct: 68 FIESFNAAGNKPYKLGINHLADQTNEEFVASHNGYKHKASHSQ--TPFKYENVTGVPNAV 125
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
DWRE AVT VKDQ CGS WAFST EG+Y T L+SLSEQEL+DCD D GC+G
Sbjct: 126 DWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHGCDG 185
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAK 249
G + F+ I+ GG+ E YPY D C NK+A+ +I GY +V + D +
Sbjct: 186 GYMEGGFEFIIKN--GGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDALQ 243
Query: 250 YLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHK 307
V N P++V I+A A QFY +GV F L H V VGYG T
Sbjct: 244 KAVANQPVSVTIDAGGSAFQFYSSGV------FTGQCGTQLDHGVTAVGYGS-----TDD 292
Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YWI+KNSWG WGE+GY R+ RG +G CGI
Sbjct: 293 GTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGI 328
>gi|395852405|ref|XP_003798729.1| PREDICTED: cathepsin W [Otolemur garnettii]
Length = 367
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 121/323 (37%), Positives = 176/323 (54%), Gaps = 24/323 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q N++Y+ E+ RL IF+ NL K Q LQ+ + G+ +G+ SDL+ EF
Sbjct: 41 VFKLFQVQFNRSYSNPAEHSRRLDIFAHNLAKAQQLQEEDLGTAEFGMTSLSDLTEEEF- 99
Query: 101 AKYLGFKLKPSYADRSVPAMIPNI-------TLPRAFDWR-EYDAVTGVKDQTMCGSSWA 152
K G + A VP M + TLPR DWR + ++ +K+Q C WA
Sbjct: 100 GKIFGHQ----KAVGEVPRMGRKVGSEQQGETLPRTCDWRNKAGIISRIKNQENCKCCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+ NIE ++ K + V +S QEL+DC++ DGC+GG + +AF T+++ GL EK
Sbjct: 156 MAAADNIEALWGIKYHQSVEVSVQELLDCNRCGDGCQGGFVWDAFITVLN--NSGLASEK 213
Query: 213 TYPYRGDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
YP++ K C NK I ++ + +E +A+YL +GP+ V IN LQ Y
Sbjct: 214 DYPFKASVKTHRCLANKYRKVAWIQDFIMLEDNEHKIAQYLATHGPITVTINMKLLQHYK 273
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK-----FTHKAVPYWIIKNSWGEGWGEK 325
GV CD + ++HSVL+VG+G + H++ PYWI+KNSWG WGE+
Sbjct: 274 KGVIKAKPTTCDP--QLVNHSVLLVGFGAETVSSQSHLRPHRSTPYWILKNSWGAHWGEE 331
Query: 326 GYFRLYRGDGSCGINDYVRSALV 348
GYFRL+RG SCGI Y +A V
Sbjct: 332 GYFRLHRGSNSCGITKYPFTARV 354
>gi|161598418|gb|ABX74953.1| cysteine protease [Leishmania panamensis]
Length = 441
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 122/318 (38%), Positives = 175/318 (55%), Gaps = 24/318 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
+ LF F + + + YATL E R+ F NL ++ Q + H +G+ +F DLS A
Sbjct: 35 SVLFEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANNPHAR--FGITKFFDLSEA 92
Query: 98 EFQAKYLG----FKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF +YL F +A + + ++ T P A DWR+ AVT V DQ CGS WA
Sbjct: 93 EFATRYLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWRQMGAVTPVNDQGACGSCWA 152
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS GNIE + T L++LSEQEL+ CD D+GC GG + AFD +++ G +
Sbjct: 153 FSAIGNIESQWYVTTHSLITLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNKNGAVYTGA 212
Query: 213 TYPYRGDDKACRLNKKATQVK----INGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
+YPY + + +++++ I+G+V++ +E MA +L NGP+A+A++A A
Sbjct: 213 SYPYVSGNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMS 272
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y G I CDG L+H VL+VGY + VPYW+IKNSWGE WGEKGY
Sbjct: 273 YTGG----ILTSCDG--RQLNHGVLLVGYNMT------GEVPYWLIKNSWGENWGEKGYV 320
Query: 329 RLYRGDGSCGINDYVRSA 346
R+ +G C I +Y SA
Sbjct: 321 RVRKGTNECLIQEYPASA 338
>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
Length = 324
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 119/315 (37%), Positives = 182/315 (57%), Gaps = 20/315 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K + F FL + NK Y++ E R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEI-INKNQNDTSAQYEINKFSDLS 80
Query: 96 TAEFQAKYLGFKL---KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L K ++ + V P+ P FDWR + VT VK+Q MCG+ WA
Sbjct: 81 KDETISKYTGLSLPLQKQNFCEVVVLDRPPDKG-PLEFDWRRLNKVTSVKNQGMCGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ+LIDCD D GC+GG + A++ +M+ GG++ E
Sbjct: 140 FATLGSLESQFAIKHDQLINLSEQQLIDCDFVDVGCDGGLLHTAYEAVMNM--GGIQAEN 197
Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY ++ CR+N V++ Y V+ E + L GP+ VAI+A + Y
Sbjct: 198 DYPYEANNGPCRVNAAKFVVRVKKCYRYVTLFEEKLKDLLRIVGPIPVAIDASDIVGYKR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ +C+ N L+H+VL+VGYGV+ +P+WI+KN+WG WGE+GYFR+
Sbjct: 258 GIIR----YCE--NHGLNHAVLLVGYGVE------NGIPFWILKNTWGADWGEQGYFRVQ 305
Query: 332 RGDGSCGINDYVRSA 346
+ +CGI + + S+
Sbjct: 306 QNINACGIKNELPSS 320
>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 450
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 451
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 121/306 (39%), Positives = 174/306 (56%), Gaps = 19/306 (6%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSD 93
KH ALF F ++ K+Y VE R +IF N+ +I+ E G Y +N+F+D
Sbjct: 21 KHQALFETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTD 80
Query: 94 LSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
L+ EF+A YLG +KP + ++ + + +P + DWR VTGVK+Q CGS W+F
Sbjct: 81 LTQEEFKA-YLGLHVKP-VLNNTIQYELKGLEVPTSVDWRSAGQVTGVKNQGSCGSCWSF 138
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEK 212
+ TG+ EG Y K K+LVSLSEQ+L+DC + GC GG + F I GL+ E
Sbjct: 139 ALTGSTEGAYYRKHKQLVSLSEQQLVDCSTSINYGCNGGFLDATFPYIEQY---GLQTES 195
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
+YPY G D +C+ + KI+ YVS+ E+ + + + GP+A+ ++A L Y +G
Sbjct: 196 SYPYTGVDGSCKYDSSKVVTKISNYVSLHGSESKVLEPVGSIGPVAITMDASYLSSYSSG 255
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
+ + C NL+H+VL+VGYG + YWI+KNSWG GWGE+GYFRL R
Sbjct: 256 IYAANK--CT--TTNLNHAVLVVGYG------SQNGQNYWIVKNSWGSGWGEQGYFRLLR 305
Query: 333 GDGSCG 338
G CG
Sbjct: 306 GSNECG 311
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 173/307 (56%), Gaps = 23/307 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F + +H+K+Y++ E RL +FS L I+ + + GLN+FSDL+ AEF+
Sbjct: 1 MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 101 AKYLGFKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
A Y+G P Y DR PA ++ +LP + DWR+ AVT +KDQ CGS WAFS
Sbjct: 61 ANYVGKFKPPRYQDRR-PAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
+IE + TK+LVSLSEQ+LIDCD D GC+GG +AF ++ GG+ E+ YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPDDAFKFVVEN--GGVTTEEAYPYT 177
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
G +C NK V+I GY V++D D V P+ V I + Q Y +G+
Sbjct: 178 GFAGSCNTNKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI-- 234
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--G 333
+ C + H+VL++GYG T +PYWIIKNSWG WGE G+ ++ + G
Sbjct: 235 -LSGQCCNSRD---HAVLVIGYG------TEGGMPYWIIKNSWGTSWGEDGFMKIKKKDG 284
Query: 334 DGSCGIN 340
+G CG+N
Sbjct: 285 EGMCGMN 291
>gi|241062152|gb|ACS66748.1| cysteine protease [Leishmania guyanensis]
Length = 441
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/318 (38%), Positives = 175/318 (55%), Gaps = 24/318 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
+ LF F + + + YATL E R+ F NL ++ Q + H +G+ +F DLS A
Sbjct: 35 SVLFEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANNPHAR--FGITKFFDLSEA 92
Query: 98 EFQAKYLG----FKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF +YL F +A + + ++ T P A DWR+ AVT VKDQ CGS WA
Sbjct: 93 EFATRYLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWRQMGAVTPVKDQGACGSCWA 152
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
S GNIE + T L++LSEQEL+ CD D+GC GG + AFD +++ G +
Sbjct: 153 LSAIGNIESQWYVTTHSLITLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNKNGAVYTGA 212
Query: 213 TYPYRGDDKACRLNKKATQVK----INGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
+YPY + + +++++ I+G+V++ +E MA +L NGP+A+A++A A
Sbjct: 213 SYPYVSGNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMS 272
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y G I CDG L+H VL+VGY + VPYW+IKNSWGE WGEKGY
Sbjct: 273 YTGG----ILTSCDG--RQLNHGVLLVGYNMT------GEVPYWLIKNSWGENWGEKGYV 320
Query: 329 RLYRGDGSCGINDYVRSA 346
R+ +G C I +Y SA
Sbjct: 321 RVRKGTNECLIQEYPVSA 338
>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
Length = 450
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 179/346 (51%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VKDQ CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ +E L H VL+VGY
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDSSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/343 (35%), Positives = 184/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + ++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVTVSTGKAPE 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + L SLSEQ L+ CD ED GC
Sbjct: 129 AVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVTGHNLTSLSEQMLVSCDTEDLGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
GG + NAF I+S + E++YPY G+ CR++ K KI +V + +DE
Sbjct: 189 AGGLMDNAFKWIVSSNRHNVFTEESYPYASKGGNVPPCRMSGKVVGAKIRDHVDLPKDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A+++ + Q Y GV ++ L H VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVDSTSFQSYTGGV------LTSCISKQLDHGVLLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW +GWGE+GY R+ +G C + +Y SA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLVKNYATSAVV 339
>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
Length = 257
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 112/249 (44%), Positives = 145/249 (58%), Gaps = 18/249 (7%)
Query: 107 KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
K KP + P ++P LP FDWRE AVTGVK+Q CGS W+FSTTG +EG +
Sbjct: 6 KAKPKLSTDKAP-ILPTSDLPDDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLA 64
Query: 167 TKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
T +LVSLSEQ+L+DCD E D GC GG ++ AF+ + GGL+ EK YPY
Sbjct: 65 TGELVSLSEQQLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLK--AGGLQREKDYPYT 122
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G D C +K + + V DE +A LV++GP+AV INA +Q YV GVS P+
Sbjct: 123 GRDGKCHFDKSKIAASVANFSVVGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPL 182
Query: 278 QFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
F + H VL+VGYG K PYWIIKNSWGE WGE+GY+++ RG
Sbjct: 183 ICF-----KRQDHGVLLVGYGSAGFAPIRLKEKPYWIIKNSWGESWGEQGYYKICRGRNI 237
Query: 337 CGINDYVRS 345
CG++ V +
Sbjct: 238 CGVDAMVST 246
>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 174/319 (54%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A KLV LSEQ+L+ CD D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213
Query: 214 YPYRGD----DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + ++ A +I+GYVS+ E MA +L +NGP+++A++A + Y
Sbjct: 214 YPYTSTFGYVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G E L+H VL+VGY + VPYW+IKNSWG+ WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGKDWGEKGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C + Y S V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340
>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
Length = 344
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/312 (39%), Positives = 174/312 (55%), Gaps = 20/312 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F ++ K YA E R IF NL I L ++ ++ S VY +N+F+DL+ E A
Sbjct: 47 FETFQTKYKKVYADDNERDYRYKIFKTNLEIINL-KNQQNDSAVYNINKFADLTKNEVIA 105
Query: 102 KYLGFKLK-PSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
K+ G ++ P+ + P ++ P+ FDWR+++ +T VKDQ CGS WAFST
Sbjct: 106 KFTGLGIRSPALKNSCEPVIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAG 165
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+E YA K + V LSEQ+L+DCD D GC GG + A++ IM+ GGLE E+ YPYR
Sbjct: 166 LESQYAIKYNEHVDLSEQQLVDCDTIDMGCAGGLLHTAYEEIMAM--GGLEYEEDYPYRS 223
Query: 219 DDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
CRL +V + N Y V E + L E GP+AVA++A L Y G+
Sbjct: 224 VQGPCRLQSDKFEVSVDNCYRYVLYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITSC 283
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+ N L+H+VL+VGYG++ VP+W++KNSWG +GE G+ R+ R SC
Sbjct: 284 K------NYGLNHAVLLVGYGIEN------GVPFWVLKNSWGSDYGENGFVRVKRNVNSC 331
Query: 338 G-INDYVRSALV 348
G IN+ SA +
Sbjct: 332 GMINELAASARI 343
>gi|15617524|ref|NP_258322.1| cathepsin-like cysteine proteinase [Spodoptera litura NPV]
gi|37077642|sp|Q91BH1.1|CATV_NPVST RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15553260|gb|AAL01738.1|AF325155_50 cathepsin-like cysteine proteinase [Spodoptera litura NPV]
Length = 337
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 120/316 (37%), Positives = 173/316 (54%), Gaps = 27/316 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F++QHNK Y T + + F NL + + + + VYG+N+FSD+ F ++
Sbjct: 36 FIKQHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSN-QAVYGINKFSDIDKITFVNEHA 94
Query: 105 GF----------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
G P V P+ P +FDWR+ + VT VK+Q +CGS WAF+
Sbjct: 95 GLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQGVCGSCWAFA 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE YA L+ LSEQ+L+DCD+ D GC+GG + AF I+ GG+E E Y
Sbjct: 155 AIGNIESQYAIMHDSLIDLSEQQLLDCDRVDQGCDGGLMHLAFQEIIRI--GGVEHEIDY 212
Query: 215 PYRGDDKACRLNKKATQVKIN-GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
PY+G + ACRL V+++ Y RDE + + L +NGP+AVAI+ + Y +G+
Sbjct: 213 PYQGIEYACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCVDIIDYRSGI 272
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+ D G L+H+VL+VGYG++ PYWI KNSWG WGE GYFR R
Sbjct: 273 ATVCN---DNG---LNHAVLLVGYGIEND------TPYWIFKNSWGSNWGENGYFRARRN 320
Query: 334 DGSCG-INDYVRSALV 348
+CG +N++ SA++
Sbjct: 321 INACGMLNEFAASAVL 336
>gi|355681666|gb|AER96819.1| cathepsin W [Mustela putorius furo]
Length = 373
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/331 (35%), Positives = 178/331 (53%), Gaps = 22/331 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K +F F Q+N++Y+ EY RL IF+ NL + Q ++ + + +G+ FSDL+
Sbjct: 36 LKLEQVFELFRAQYNRSYSNPKEYAHRLEIFAHNLAQAQKMEVEDLATAEFGMTPFSDLT 95
Query: 96 TAEFQAKYLGFKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSW 151
EF+ + K+ P R V + + ++P + DWR+ V + +K+Q C W
Sbjct: 96 EEEFEQLHGHQKITPGETPAVGRKVGSEVVMESVPASCDWRKLKGVKSPIKEQGNCNCCW 155
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
A + GNIE +++ + + V +S QEL+DC++ DGC+GG + +AF T+++ GL E
Sbjct: 156 AMAAAGNIEALWSIRYNQSVQVSVQELLDCNRCGDGCKGGFVWDAFVTVLN--NSGLASE 213
Query: 212 KTYPYRGDDK--ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
K YP+RG K C + I ++ + +E MA YL +GP+ V IN LQ Y
Sbjct: 214 KDYPFRGSLKRHKCLASNYKKVAWIQDFIMLQNNEQTMANYLATHGPITVTINMKLLQQY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT------------KFTHKAVPYWIIKNS 317
GV CD N HSVL+VG+G + H+ +PYWI+KNS
Sbjct: 274 KKGVIKATPATCDPYLVN--HSVLLVGFGKTNSSERRRAKGGHFWPHPHRPIPYWILKNS 331
Query: 318 WGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WG WGE+GYFRL+RG +CGI Y +A V
Sbjct: 332 WGAEWGEEGYFRLHRGSNTCGITKYPLTARV 362
>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 272
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 122/273 (44%), Positives = 152/273 (55%), Gaps = 23/273 (8%)
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPA-------MIPNITLPRAFDWREYDAVTGVKD 143
FSDL+ EF A+YLG S A +P LP FDWR AVT VKD
Sbjct: 2 FSDLTAEEFAARYLGHVRLSSEEREKRKARGGETLETLPVEHLPEEFDWRFKGAVTRVKD 61
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD---------QEDDGCEGGSIS 194
Q CGS W FSTTG IEG + T KLV LSEQ+L+DCD D GC GG S
Sbjct: 62 QGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDSGCNGGLPS 121
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
NA + I+ GG++ EK+YPY G+ C+ K + + VS DE MA LV+
Sbjct: 122 NAMEYIVEH--GGIDTEKSYPYVGEKGECKAKKGKLGATLKNFSFVSDDEKQMAAALVKY 179
Query: 255 GPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV-PYWI 313
GP+++ INA +Q Y+ GV+ P + CD E+L H VLIVGYG A PYWI
Sbjct: 180 GPLSIGINAAWMQSYIGGVACP--WLCDA--ESLDHGVLIVGYGSSGFAPVRWAPEPYWI 235
Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
+KNSW WGE GY+R+ + GSCGIN+ V +A
Sbjct: 236 VKNSWSPAWGEGGYYRICKDKGSCGINNMVVAA 268
>gi|332326585|gb|AEE42616.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKBQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKBQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A +L LSEQ+L+ CD +D GC GG ++ AF+ ++ + G + E +Y
Sbjct: 155 AVGNIESQWAVAGHRLXXLSEQQLVSCDDKDSGCXGGLMTQAFEWLLRXMNGTMFTEDSY 214
Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + +I+GYV + +ET MA +L ++GP+++ ++A + Y
Sbjct: 215 PYVSSTGDVPECTNSSELVPGARIDGYVMIESNETVMAAWLAKSGPISIGVDASSFMSYE 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV ++L+H VL+VGY + VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGKHLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLXEYPVSAHV 340
>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 380
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 125/343 (36%), Positives = 185/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTNDFGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
EGG + +AF I+S G + E++YPY G+ AC + K KI +V + DE
Sbjct: 189 EGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPACDKSGKVVGAKIRDHVDLPEDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Q Y GV +E+L H VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW +GWGE+GY R+ +G C + + SA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNLPSSAVV 339
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 132/342 (38%), Positives = 174/342 (50%), Gaps = 26/342 (7%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
ALLS+ + + S + +L++ + H+ L + R ++F
Sbjct: 7 ALLSVVLVLGSVALAQSIPFDEKDLASEESLWSLYEKWRAHHAVSRDLDDTDKRFNVFKE 66
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP-----AMIPN 123
N++ I + + LN+F D++ EF++ Y G K+ R V +
Sbjct: 67 NVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKF 126
Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
LP + DWRE AVTGVKDQ CGS WAFST +EG+ KT +LVSLSEQ+L+DCD
Sbjct: 127 HDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDT 186
Query: 184 EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRD 243
++ GC GG + AFD I K GGL E +YPY + K+C + V I+GY V R+
Sbjct: 187 KNSGCNGGLMDYAFDFI--KNNGGLSSEDSYPYLAEQKSCGSEANSAVVTIDGYQDVPRN 244
Query: 244 ETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
V N P++VAI A YA QFY GV F G E L H V VGYGVD
Sbjct: 245 NEAALMKAVANQPVSVAIEASGYAFQFYSQGV-----FSGHCGTE-LDHGVAAVGYGVD- 297
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YWI+KNSWGEGWGE GY R+ RG G CGI
Sbjct: 298 ----DDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRGKCGI 335
>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 164/315 (52%), Gaps = 18/315 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93
Query: 99 FQAKYLGFKLKPSYADRS--VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y + + A VP + + P A DWR AVT VKDQ CGS WAFS
Sbjct: 94 FRSRYHNGAVHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG ++NAF+ I+ + G + E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY AVPYWIIKNSW WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321
Query: 334 DGSCGINDYVRSALV 348
C + + SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336
>gi|395545396|ref|XP_003774588.1| PREDICTED: cathepsin W [Sarcophilus harrisii]
Length = 358
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 178/320 (55%), Gaps = 30/320 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF-- 99
F F Q+NK+Y E RL IF+ NL + Q L + G +G+ FSDL+ EF
Sbjct: 44 FKAFQIQYNKSYPDAAEQECRLKIFADNLARAQQLTEEHQGLAQFGVTRFSDLTEEEFRR 103
Query: 100 -----QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
Q YLG ++K P + T R+ DWR+ +T V+DQ C S WA S
Sbjct: 104 LYQPSQPNYLGLRVKTEGG--GYPRLQRLKT--RSCDWRKARVLTPVRDQKNCNSCWAIS 159
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GN+E ++A ++L LS QEL+DC + GCEGG + +A+ TI+++ GL EE+ Y
Sbjct: 160 AVGNVEALWAINYQQLFKLSVQELLDCRRCGQGCEGGFVWDAYMTILNQ--SGLAEEQDY 217
Query: 215 PYRGD-DKACRLNKKATQVKINGYVSVSRDET-----DMAKYLVENGPMAVAINAYALQF 268
PYR K C+ KK + I+ ++ + ++E DMA+YL E GP+ V IN+ L+
Sbjct: 218 PYRPQLSKGCQ--KKKKRAWIHDFLMLHKEENSPSPPDMAQYLAEKGPITVTINSRLLKS 275
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y+ GV P CD + + H V +VG+G FT YWI+KNSWG WGEKGYF
Sbjct: 276 YIRGVIKPGN-NCDP--KYVDHVVQLVGFGQIHN-FT-----YWILKNSWGSSWGEKGYF 326
Query: 329 RLYRGDGSCGINDYVRSALV 348
RL+RG +CGI + +A++
Sbjct: 327 RLHRGRNACGITKFPLTAVL 346
>gi|332326593|gb|AEE42620.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 174/318 (54%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A +L +LSEQ+L+ CD +D GC GG ++ AF+ ++ + G + E +Y
Sbjct: 155 AVGNIESQWAVAGHRLTALSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + +I+GYV++ ET MA +L ++GP+++ ++A + Y
Sbjct: 215 PYVSSXGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIGVDASSFMSYE 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGBXLNHGVLLVGYNXT------GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 AMGVNACLLTEYPVSAHV 340
>gi|146078033|ref|XP_001463431.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|134067516|emb|CAM65796.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 381
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 176/320 (55%), Gaps = 37/320 (11%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213
Query: 214 YPY---RGDDKACRLN--KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
YPY GD C LN K +I+GYV + +ET MA +L ENGP+A+A++A +
Sbjct: 214 YPYTSGNGDVAEC-LNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMS 272
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G VL+VGY ++T VPYW+IKNSWGE WGEKGY
Sbjct: 273 YQSG-------------------VLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYV 307
Query: 329 RLYRGDGSCGINDYVRSALV 348
R+ G +C +++Y SA V
Sbjct: 308 RVAMGLNACLLSEYPVSAHV 327
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 177/354 (50%), Gaps = 29/354 (8%)
Query: 1 MSCFYFFAGVALLSLTVSVS--SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
M FA +AL ++ S S F ++G + L+ +L QH K Y L E
Sbjct: 1 MGILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGE 60
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
+R +F N I + + S GLN+F+DLS EF+A YLG KL + P
Sbjct: 61 KQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNSP 120
Query: 119 AMIPNIT----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
+ + LP + DWRE AVT VKDQ CGS WAFST +EG+ T L SLS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQV 232
EQEL+DCD + GC GG + AF I++ GGL+ E YPY+ +D +C K A V
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIINN--GGLDSEDDYPYKANDGSCDAYRKNAHVV 238
Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSH 290
I+ Y V ++ K N P++VAI A A QFY +GV F L H
Sbjct: 239 TIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGV------FTSTCGTQLDH 292
Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
V +VGYG + YWI+KNSWG+ WGEKG+ RL R G CGI
Sbjct: 293 GVTLVGYG------SESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGI 340
>gi|31981819|ref|NP_034115.2| cathepsin W preproprotein [Mus musculus]
gi|341940311|sp|P56203.2|CATW_MOUSE RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
Precursor
gi|26353368|dbj|BAC40314.1| unnamed protein product [Mus musculus]
gi|44890089|gb|AAS48498.1| cathepsin W precursor [Mus musculus]
gi|148701190|gb|EDL33137.1| cathepsin W, isoform CRA_b [Mus musculus]
gi|162317774|gb|AAI56226.1| Cathepsin W [synthetic construct]
gi|162318342|gb|AAI56999.1| Cathepsin W [synthetic construct]
Length = 371
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 122/333 (36%), Positives = 177/333 (53%), Gaps = 38/333 (11%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F + N++Y EY RL IF+ NL + Q LQ + G+ +G FSDL+ EF
Sbjct: 39 VFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFG 98
Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWRE-YDAVTGVKDQTMCG 148
Y P PN+T +PR DWR+ + ++ VK+Q C
Sbjct: 99 Q---------LYGQERSPERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCK 149
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
WA + NI+ ++ K ++ V +S QEL+DC++ +GC GG + +A+ T+++ GL
Sbjct: 150 CCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLN--NSGL 207
Query: 209 EEEKTYPYRGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
EK YP++GD K R L KK +V I + +S +E +A YL +GP+ V IN L
Sbjct: 208 ASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLL 267
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR------TKFTHK-----AVPYWIIK 315
Q Y GV CD + HSVL+VG+G ++ T +H + PYWI+K
Sbjct: 268 QHYQKGVIKATPSSCDP--RQVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILK 325
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGEKGYFRLYRG+ +CG+ Y +A V
Sbjct: 326 NSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358
>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
Length = 320
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 124/325 (38%), Positives = 173/325 (53%), Gaps = 19/325 (5%)
Query: 22 FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-E 80
F ++G + + F F + NKTY T VE +R IF L +I+ E
Sbjct: 3 FFILGSLFVAAVAASLEQDAFQAFKLKQNKTYKTPVEETTRYGIFQAKLLEIEEHNSRFE 62
Query: 81 HGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
G Y G+N+FSD + EF A YLG KP+ + +P + +++P + DWR V
Sbjct: 63 QGLETYKKGVNKFSDWTQDEFNA-YLGLHPKPAKLGKGIPYVKTGVSVPASVDWRTEGYV 121
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNA 196
TGVK+Q CGS WAFS TG++EG T KLVSLSEQ+L+DC + GC+GG +
Sbjct: 122 TGVKNQGDCGSCWAFSLTGSVEGALFKSTGKLVSLSEQQLVDCTYGTVNFGCDGGYLEET 181
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
F I GLE E +YPY+ D C+ + KIN YV DE + + GP
Sbjct: 182 FPYIQET---GLEAEASYPYKARDGTCKFDASKVVTKINDYVYWYGDEEALLEATATIGP 238
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
++VA++A + Y +GV C +++L+H VL+VGYG + V YW++KN
Sbjct: 239 ISVAMDANYIDSYASGVFS--SRLCS--SDDLNHGVLVVGYG------SENGVNYWLVKN 288
Query: 317 SWGEGWGEKGYFRLYRGDGSCGIND 341
SW E WGE GY +L RG CGI +
Sbjct: 289 SWAEDWGESGYLKLLRGQNECGIAE 313
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 127/336 (37%), Positives = 171/336 (50%), Gaps = 29/336 (8%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
+L L++ S M +LH + +++++ K Y E RL IF N+
Sbjct: 14 VLLLSICTSQVMS------RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVE 67
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAF 130
I+ + +N +D + EF A + G+K K S++ P N+T +P A
Sbjct: 68 FIESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKHKGSHSQ--TPFKYENVTGVPNAV 125
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
DWRE AVT VKDQ CGS WAFST EG+Y T L+SLSEQEL+DCD D GC+G
Sbjct: 126 DWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHGCDG 185
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAK 249
G + F+ I+ GG+ E YPY D C NK+A+ +I GY +V + D +
Sbjct: 186 GYMEGGFEFIIKN--GGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDALQ 243
Query: 250 YLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHK 307
V N P++V I+A A QFY +GV F L H V VGYG T
Sbjct: 244 KAVANQPVSVTIDAGGSAFQFYSSGV------FTGQCGTQLDHGVTAVGYGS-----TDD 292
Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YWI+KNSWG WGE+GY R+ RG +G CGI
Sbjct: 293 GTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGI 328
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 137/353 (38%), Positives = 186/353 (52%), Gaps = 43/353 (12%)
Query: 5 YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
Y FA +AL+++ +VS V+ +E + F +H K Y E RL
Sbjct: 4 YIFALLALVAVAQAVSFADVIKEE-------------WQTFKLEHRKQYQDETEERFRLK 50
Query: 65 IFSGNLRKI----QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
IF+ N KI QL E S GLN+++D+ EF GF R+ A
Sbjct: 51 IFNENKHKIAKHNQLYAAGE-VSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDAT 109
Query: 121 IPNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
+T LP++ DWR AVTGVKDQ CGS WAFS+TG +EG + KT L+S
Sbjct: 110 FTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLIS 169
Query: 173 LSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT 230
LSEQ L+DC + ++GC GG + NAF I K GG++ EK+YPY G D +C NK
Sbjct: 170 LSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEGIDDSCHFNKGTI 227
Query: 231 QVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNEN 287
G+ + + DE +A+ + GP++VAI+A + QFY TGV Q CD +N
Sbjct: 228 GATDRGFTDIPQGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQ--CD--PQN 283
Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
L H VL+VGYG D YW++KNSWG WG+KG+ ++ R D CGI
Sbjct: 284 LDHGVLVVGYGTDEN-----GKDYWLVKNSWGTTWGDKGFIKMARNDDNQCGI 331
>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
Length = 500
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 164/315 (52%), Gaps = 18/315 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 68 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 126
Query: 99 FQAKYLGFKLKPSYADRS--VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y + + A VP + + P A DWR AVT VKDQ CGS WAFS
Sbjct: 127 FRSRYHNGAVHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 186
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG ++NAF+ I+ + G + E +YPY
Sbjct: 187 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 246
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 247 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 306
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY AVPYWIIKNSW WGE+GY R+ +G
Sbjct: 307 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 354
Query: 334 DGSCGINDYVRSALV 348
C + + SA+V
Sbjct: 355 LNQCLVKEEASSAVV 369
>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 119/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y + E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ D G L+H+VL+VGYGV+ VPYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------VPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 18/315 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93
Query: 99 FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y G + +R+ VP + + P A DWR AVT VKDQ CGS WAFS
Sbjct: 94 FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG ++NAF+ I+ + G + E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY AVPYWIIKNSW WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321
Query: 334 DGSCGINDYVRSALV 348
C + + SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 172/314 (54%), Gaps = 30/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
++ +++ +H K Y L E R IF NL+ I + ++ + GLN F+DL+ E++
Sbjct: 45 MYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDE-HNAQNRTYKVGLNRFADLTNEEYR 103
Query: 101 AKYLGFKLKP----SYADRSVP--AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A YLG + P + + P A++P LP + DWRE AV VKDQ CGS WAFS
Sbjct: 104 AIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCGSCWAFS 163
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +EG+ T +L+SLSEQEL+DCD E D GC GG + AFD I+ GGL+ EK
Sbjct: 164 TVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKN--GGLDTEKD 221
Query: 214 YPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
YPY G D C L+ K+++ V I+GY V + + V + P++VA+ A ALQ YV
Sbjct: 222 YPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRALQLYV 281
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G+ F L H ++ VGYG T YWI++NSWG WGE GY R+
Sbjct: 282 SGI------FTGECGTALDHGIVAVGYG------TENGTDYWIVRNSWGSSWGENGYIRM 329
Query: 331 YRG-----DGSCGI 339
R G CGI
Sbjct: 330 ERNMADAFSGKCGI 343
>gi|449139100|gb|AGE89905.1| cathepsin-like cysteine proteinase [Spodoptera littoralis NPV]
Length = 336
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 122/318 (38%), Positives = 173/318 (54%), Gaps = 26/318 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F++QHNK Y T + F NL + + + + VYG+N+FSD+ F
Sbjct: 33 FENFIKQHNKEYTTPDQRDDAFVNFKRNLVNMNAMNNISN-HAVYGINKFSDIDKITFAN 91
Query: 102 KYLGFKLKPSYADRS---------VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
+ G L + D + V P+ P +FDWR+ VT VK+Q +CGS WA
Sbjct: 92 VHAGLVLTLNATDSNFDPYRLCEFVTVAGPSARTPESFDWRKLHKVTKVKEQGVCGSCWA 151
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+ GNIE YA L+ LSEQ+L+DCD+ D GC+GG + AF IM GG+E E
Sbjct: 152 FAAIGNIESQYAILHDSLIDLSEQQLLDCDRIDQGCDGGLMHLAFQEIMRI--GGVEHEI 209
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY+G + ACR V++ + Y RDE + + L +NGP+AVAI+ + Y +
Sbjct: 210 DYPYQGIEYACRSAPSKFAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCRDIIDYRS 269
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G++ D G L+H+VL+VGYG++ PYWI KNSWG WGE GYFR
Sbjct: 270 GIATVCN---DNG---LNHAVLLVGYGIEND------TPYWIFKNSWGSNWGENGYFRAR 317
Query: 332 RGDGSCG-INDYVRSALV 348
R +CG +N++ SA++
Sbjct: 318 RNINACGMLNEFAASAVL 335
>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
Full=Major cysteine proteinase; Flags: Precursor
gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
Length = 467
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 18/315 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREE 93
Query: 99 FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y G + +R+ VP + + P A DWR AVT VKDQ CGS WAFS
Sbjct: 94 FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG ++NAF+ I+ + G + E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY AVPYWIIKNSW WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321
Query: 334 DGSCGINDYVRSALV 348
C + + SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 178/311 (57%), Gaps = 28/311 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F+ +LE H++ Y +L E + R IF N I + + S GLN+FSDL+ EF+
Sbjct: 48 VFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHA-HNKQQKSYWLGLNKFSDLTHQEFR 106
Query: 101 AKYLGFKLKPSYADRSVPA-MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
A+YLG KP R M ++ DWR AVT VKDQ CGS WAFS G++
Sbjct: 107 AQYLG--TKPVNRQRKEANFMYEDVEAEPKVDWRLKGAVTDVKDQGACGSCWAFSAVGSV 164
Query: 160 EGVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
EGV A KT +LVSLSEQEL+DCD +++ GC GG + AF+ I+ GG++ EK YPY+
Sbjct: 165 EGVNAIKTGELVSLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKN--GGIDTEKDYPYKA 222
Query: 219 DDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVS 274
D C ++ ++ V I+ Y V ++ E+ + K L +N P++VAI A Q Y GV
Sbjct: 223 RDGRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKN-PVSVAIEAGGRDFQHYQGGV- 280
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-- 332
F G+E L H VL VGYG D V YWI+KNSWG GWGEKGY R+ R
Sbjct: 281 ----FTGPCGSE-LDHGVLAVGYGTD-----DDGVNYWIVKNSWGPGWGEKGYIRMERFG 330
Query: 333 ---GDGSCGIN 340
DG CGIN
Sbjct: 331 SDSTDGKCGIN 341
>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 123/343 (35%), Positives = 184/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + ++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVTVSTGKAPE 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + +L SLSEQ L+ CD + C
Sbjct: 129 AVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAGHELTSLSEQTLVSCDPTEYAC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK---ACRLNKKATQVKINGYVSVSRDET 245
EGG + NAF I+S G + E++YPY + AC ++ K I+ YV + +DE
Sbjct: 189 EGGFMDNAFRWIISSNKGKVFTEQSYPYSSGGRNVPACNMSGKVVGANISDYVDLPQDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP++V ++A + Q Y GV ++ L+H+VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPVSVIVDATSFQSYTGGV------LTSCLSKILNHAVLLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW E WGEKGY R+ +G C + +Y SALV
Sbjct: 300 ---PPYWIIKNSWSEKWGEKGYIRIEKGTNQCLVQEYASSALV 339
>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
Length = 323
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 118/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y + E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ D G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 125/346 (36%), Positives = 179/346 (51%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VK Q CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ ++ L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDAESFMDYNGGI------LTSCTSKQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 134/354 (37%), Positives = 192/354 (54%), Gaps = 43/354 (12%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYSRL 63
LL L ++SS + +H HH + + +++N++L +H+KTY L E R
Sbjct: 10 LLFLFFTLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLGEREKRF 69
Query: 64 HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN 123
IF NLR I ++++ + GL F+DL+ E++AK+LG K P R + + P+
Sbjct: 70 EIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTKSDPKR--RLMKSKNPS 127
Query: 124 I--------TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSE 175
LP + DWR+ AV+ +KDQ CGS WAFST +EGV T +L+SLSE
Sbjct: 128 QRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSE 187
Query: 176 QELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVK 233
QEL+DCD+ + GC GG + NAF I++ GG++ +K YPY+ D C K K V
Sbjct: 188 QELVDCDRSYNAGCNGGLMDNAFQFIINN--GGIDTDKDYPYQAVDGKCDTTKVKNKAVT 245
Query: 234 INGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSH 290
I+G+ V + DE + K V + P++VAI A ALQFY +GV F L H
Sbjct: 246 IDGFEDVMAFDEMALQK-AVAHQPVSVAIEASGMALQFYQSGV------FTGECGSALDH 298
Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
V+IVGYG T + YW+++NSWG WGE GY ++ R G CGI
Sbjct: 299 GVVIVGYG------TEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGI 346
>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 183/340 (53%), Gaps = 24/340 (7%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F ++ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ +FSD+S EF+A YL G K + +R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTQFSDMSPEEFRATYLNGAKYYAAALER--PRKVVNVSTGKAPP 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAF+ TGNIEG + +L SLSEQ L+ CD +D C
Sbjct: 129 AVDWRKKGAVTPVKDQGSCGSCWAFAATGNIEGQWKIAGHELTSLSEQMLVSCDTTEDNC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD---KACRLNKKATQVKINGYVSVSRDET 245
GG AF I+S G + E++YPY D C + K KI+G++++ +DE
Sbjct: 189 RGGFADRAFKWIVSSNKGNVFTEESYPYASTDGYVPPCNKSGKVVGAKISGHINLPKDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L NGP+A+A++A Y GV +E LSH VL+VGY D +K
Sbjct: 249 AIAEWLARNGPVAIAVDASTFLDYKGGV------LTSCSSEGLSHDVLLVGYN-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSW + WGE+GY R+ +G C + +Y RS
Sbjct: 300 ---PPYWIIKNSWDKEWGEEGYIRIEKGTNLCLMKEYARS 336
>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
Length = 227
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 104/234 (44%), Positives = 151/234 (64%), Gaps = 20/234 (8%)
Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
++P LP++FDWRE+ A+T VK+Q CGS W FS+TG +EG + K+++L+SL E++L+
Sbjct: 3 LLPTDNLPKSFDWREHGAMTPVKNQGSCGSCWTFSSTGAVEGAHFLKSRELISLREEQLV 62
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD--------KACRLNKKATQ 231
DCD+ D GC+GG + NA++ I +K GLE E+ YPY+ ++ C
Sbjct: 63 DCDRMDGGCKGGDMLNAYEYIKAK---GLEAEEDYPYQEENYKEYMFPHHRCHFRPSKVA 119
Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHS 291
I Y +VS DE +A LV+NGP+++A+NA + Y+ GV+ P C GG +N++H+
Sbjct: 120 ATIANYSTVSEDEDQIAANLVKNGPLSIALNANYIMDYMGGVACP--RICPGG-DNMNHA 176
Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
VL+VGYG+D K PYWI+KNSW E +GE GYFRL RG G CG+N V +
Sbjct: 177 VLLVGYGMDGDK------PYWILKNSWSENYGEDGYFRLCRGFGVCGMNTRVST 224
>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
Length = 344
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 121/312 (38%), Positives = 173/312 (55%), Gaps = 20/312 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F ++ K YA E R IF NL I L ++ ++ S VY +N+F+DL+ E A
Sbjct: 47 FETFQTKYKKVYADDNERDYRYKIFKTNLEIINL-KNQQNDSAVYNINKFADLTKNEVIA 105
Query: 102 KYLGFKLK-PSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
K+ G +K P+ + P ++ P+ FDWR+++ +T VKDQ CGS WAFST
Sbjct: 106 KFTGLGVKSPNLKNFCDPLIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAG 165
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+E YA K + + LSEQ+L+DCD D GC GG + A++ IMS GG+E E+ YPYR
Sbjct: 166 LESQYAIKYNEHIDLSEQQLVDCDTIDMGCAGGLLHTAYEEIMSM--GGVEYEEDYPYRS 223
Query: 219 DDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
CR+ QV + N Y + E + L E GP+AVA++A L Y G+
Sbjct: 224 VQGPCRIENDKFQVSVDNCYRYILYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITSC 283
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+ N L+H+VL+VGYG T +P+W++KNSWG +GE G+ R+ R SC
Sbjct: 284 K------NYGLNHAVLLVGYG------TENGIPFWVLKNSWGTDYGENGFVRVKRNVNSC 331
Query: 338 G-INDYVRSALV 348
G IN+ SA +
Sbjct: 332 GMINELAASARI 343
>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 118/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y + E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ D G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 175/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYWRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKBQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKBQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A LV LSEQ+L+ CD +D GC GG ++ AF+ ++ + G + E +Y
Sbjct: 155 AVGNIESQWAVAXHGLVRLSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY GD C + + +I+GYV + ET MA +L ++GP+++A++A Y
Sbjct: 215 PYVSSTGDVPECTNSSELVPGARIDGYVMIESXETVMAAWLAKSGPISIAVDASPFMSYE 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV C G + L+H VL+VGY + VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV----LTSCVG--KXLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 179/308 (58%), Gaps = 27/308 (8%)
Query: 48 QHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYL 104
QH K YA VE R+ IF+ N KI + Q G Y GLN+++D+ EF+
Sbjct: 34 QHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMN 93
Query: 105 GFK--LKPSYADRS--VPAM-IP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G+ L+ +R+ V A IP ++T+P++ DWRE+ AVTGVKDQ CGS WAFS+TG
Sbjct: 94 GYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTG 153
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
+EG + K LVSLSEQ L+DC + ++GC GG + NAF I K GG++ EK+YP
Sbjct: 154 ALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYP 211
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTG 272
Y G D +C NK G+V + DE M K + GP++VAI+A + Q Y G
Sbjct: 212 YEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEG 271
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V + + CD +NL H VL+VGYG D + + YW++KNSWG WGE+GY ++ R
Sbjct: 272 VYNEPE--CD--EQNLDHGVLVVGYGTDES-----GMDYWLVKNSWGTTWGEQGYIKMAR 322
Query: 333 G-DGSCGI 339
+ CGI
Sbjct: 323 NQNNQCGI 330
>gi|2582055|gb|AAB82455.1| lymphopain [Mus musculus]
Length = 371
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 122/333 (36%), Positives = 176/333 (52%), Gaps = 38/333 (11%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F + N++Y EY RL IF+ NL + Q LQ + G+ +G FSDL+ EF
Sbjct: 39 VFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFG 98
Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWRE-YDAVTGVKDQTMCG 148
Y P PN+T +PR DWR+ + ++ VK+Q C
Sbjct: 99 Q---------LYGQERSPERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCK 149
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
WA + NI+ ++ K ++ V +S QEL+DC++ +GC GG + +A+ T+++ GL
Sbjct: 150 CCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLN--NSGL 207
Query: 209 EEEKTYPYRGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
EK YP++GD K R L KK +V I + +S +E +A YL +GP+ V IN L
Sbjct: 208 ASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLL 267
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR------TKFTHK-----AVPYWIIK 315
Q Y GV CD + HSVL+VG+G + T +H + PYWI+K
Sbjct: 268 QHYQKGVIKATPSSCDP--RQVDHSVLLVGFGKKKEGMQTGTVLSHSRKRRHSSPYWILK 325
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGEKGYFRLYRG+ +CG+ Y +A V
Sbjct: 326 NSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358
>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
Length = 323
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 117/317 (36%), Positives = 176/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y ++ E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ D G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/318 (39%), Positives = 176/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 126 ALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAEF 184
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAFS
Sbjct: 185 AARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 244
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +Y
Sbjct: 245 AVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSY 304
Query: 215 PYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 305 PYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYK 364
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R+
Sbjct: 365 SGV----LTACIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVRV 412
Query: 331 YRGDGSCGINDYVRSALV 348
G +C +++Y SA V
Sbjct: 413 VMGVNACLLSEYPVSAHV 430
>gi|12024965|gb|AAG45727.1| cathepsin L-like cysteine protease [Leishmania chagasi]
Length = 381
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 176/320 (55%), Gaps = 37/320 (11%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213
Query: 214 YPY---RGDDKACRLN--KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
YPY GD C LN K +I+GYV + +ET MA +L ENGP+A+A++A +
Sbjct: 214 YPYTSGNGDVAEC-LNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMS 272
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G VL+VGY ++T VPYW+IKNSWGE WGEKGY
Sbjct: 273 YQSG-------------------VLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYV 307
Query: 329 RLYRGDGSCGINDYVRSALV 348
R+ G +C +++Y SA V
Sbjct: 308 RVAMGLNACLLSEYPVSAHV 327
>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
Length = 323
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 118/316 (37%), Positives = 175/316 (55%), Gaps = 25/316 (7%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ Q+NK Y + E R IF NL I + + + VY +N+FSDLS
Sbjct: 22 LKAPNYFEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDI--ITKNRNDTAVYKINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P + ++ P P FDWR ++ +T VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PLHTQNFCEVVVLDRPPGKGPLEFDWRRFNKITSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A +L++LSEQ++IDCD D GCEGG + AF+ I+S GG++ E
Sbjct: 139 FATLASLESQFAIAHDRLINLSEQQMIDCDSVDVGCEGGLLHTAFEAIISM--GGVQIEN 196
Query: 213 TYPYRGDDKACRLNKKATQVKI---NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + CR++ V + N Y+++ E + L GP+ VAI+A + Y
Sbjct: 197 DYPYESSNNYCRMDPTKFVVGVKQCNRYITIY--EEKLKDVLRLAGPIPVAIDASDILNY 254
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
G+ +C N L+H+VL+VGYGV+ VPYWI+KNSWG WGE+G+F+
Sbjct: 255 EQGIIK----YC--ANNGLNHAVLLVGYGVENN------VPYWILKNSWGTDWGEQGFFK 302
Query: 330 LYRGDGSCGINDYVRS 345
+ + +CGI + + S
Sbjct: 303 IQQNVNACGIKNELAS 318
>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
Length = 337
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 183/323 (56%), Gaps = 23/323 (7%)
Query: 33 LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
L+++ L F F+ Q+NK Y T E R +IF N+ I +++ + S +Y +N F
Sbjct: 30 LYNINSAPLYFEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINH-KNSRNDSAIYKINRF 88
Query: 92 SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
+D++ E ++ G +L ++ + V P +FDWR + VT VKDQ MCG
Sbjct: 89 ADMTKNEVVIRHTGLASGELGANFCETIVVDGPAQRQRPTSFDWRTLNKVTSVKDQGMCG 148
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
+ WAF+ G +E YA K +L+ L+EQ+L+DCD D GC+GG I A++ IM GG+
Sbjct: 149 ACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDSVDMGCDGGLIHTAYEQIMHM--GGV 206
Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
E+E YPYR + + C L +K A V+ + Y V +E + L GP+A+A++A L
Sbjct: 207 EQEFDYPYRAERQPCALKPHKFAAGVR-SCYRYVLLNEERLEDLLRYVGPIAIAVDAVDL 265
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y G+ FC+ N L+H+VL+VGYGV+ VP+WIIKNSWG +GE G
Sbjct: 266 TDYYGGIVS----FCE--NNGLNHAVLLVGYGVENN------VPFWIIKNSWGSDYGEDG 313
Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
Y R+ RG SCG IN+ SA V
Sbjct: 314 YVRVRRGVNSCGMINELASSAQV 336
>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
Length = 323
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 118/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ +I P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVIILDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y + E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|228861649|ref|YP_002854669.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
gi|226425097|gb|ACO53509.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
Length = 334
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 180/318 (56%), Gaps = 20/318 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ +NK Y +E R HIF NL +I ++ + + VY +N+FSDLS
Sbjct: 31 LKAADYFELFVANYNKNYTDPLEKTKRYHIFKDNLEEINN-KNKSNDTAVYRINKFSDLS 89
Query: 96 TAEFQAKYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
T E +KY G + A+ + V P P FDWR+ + VT +K+Q CG+ WAF
Sbjct: 90 TNELISKYTGLNVPGETANFCKIVVLDQPPGKGPLNFDWRQQNKVTPIKNQGACGACWAF 149
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+T +IE YA + + LSEQ++IDCD D GC GG + AF+ ++ GG+EEE+
Sbjct: 150 ATLASIESQYAIRNNVHLDLSEQQMIDCDYVDMGCYGGLLHTAFEQMIQM--GGVEEERQ 207
Query: 214 YPYRGDDKACRL-NKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G + CRL + + VK+ G Y + E + L GP+ +AI+A ++ Y
Sbjct: 208 YPYEGVNNNCRLKSDERFVVKVKGCYRYLVMREEKLKDLLRAVGPLPMAIDASSIFNYYR 267
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV + +C GN L+H+VL+VGYGV+ VP+W KN+WG+ WGE GYFR+
Sbjct: 268 GVIN----YC--GNNGLNHAVLLVGYGVE------NGVPFWTFKNTWGDDWGEDGYFRVR 315
Query: 332 RGDGSCG-INDYVRSALV 348
+ +CG +N+ SA++
Sbjct: 316 QNVDACGMLNELTSSAVI 333
>gi|401430108|ref|XP_003879535.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491914|emb|CBZ40911.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 359
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 178/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGECGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S+ GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +
Sbjct: 154 SSVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + +K +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + ++H+VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQVNHAVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>gi|444724527|gb|ELW65130.1| Cathepsin W [Tupaia chinensis]
Length = 491
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/317 (35%), Positives = 175/317 (55%), Gaps = 14/317 (4%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q+N++Y++ E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 167 VFALFQIQYNRSYSSPAEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTDEEFS 226
Query: 101 AKYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
Y K+ R V ++ +P DWR+ ++ +++Q C WA + N
Sbjct: 227 QVYKQPKVPGEVPRMVRKVRSLKQGKPVPPTCDWRKARIISPIRNQKNCSCCWAMAAADN 286
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IE + + + V +S QEL+DC + DGC+GG + +AF T+++ GL EK YPY+
Sbjct: 287 IEAQWGIRYNQSVKVSVQELLDCGRCGDGCKGGWVWDAFITVLNN--SGLASEKDYPYQS 344
Query: 219 --DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
D + CR+ K+ I ++ + +E +A+YL +GP+ V IN L+ Y GV
Sbjct: 345 NVDPQRCRV-KRNKVAWIQDFIMLQDNEQIIAQYLASHGPITVTINMKPLKQYRKGVFEA 403
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRT-----KFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
CD + HSVL+VG+G ++ T + PYWI+KNSWG WGEKGYFRL+
Sbjct: 404 TPATCDPW--LVDHSVLLVGFGSSKSVKGMRAGTASSKPYWILKNSWGAKWGEKGYFRLH 461
Query: 332 RGDGSCGINDYVRSALV 348
RG +CGI Y +A V
Sbjct: 462 RGSNTCGIAKYPLTARV 478
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 129/315 (40%), Positives = 168/315 (53%), Gaps = 33/315 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F +L +H K+Y + E R IF NL+ I E+ S GLN F+D++ E++
Sbjct: 49 MFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYR 108
Query: 101 AKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
YLG K + S +DR P + +LP + DWRE AVTGVKDQ CGS WAFS
Sbjct: 109 TGYLGAKRDASRNMVKSKSDRYAP--VAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFS 166
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +EGV T L+SLSEQEL+DCD++ + GC GG + AF I+ GG++ E+
Sbjct: 167 TIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKN--GGIDSEED 224
Query: 214 YPYRGDDKAC---RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
YPY G D C R N A I+GY V + + V N P++VAI A Y Q
Sbjct: 225 YPYTGKDGKCDSYRQN-NAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQL 283
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F +L H V VGYG T V YWI+KNSWG+ WGEKGY
Sbjct: 284 YSSGI------FTGSCGTDLDHGVAAVGYG------TENGVDYWIVKNSWGDYWGEKGYV 331
Query: 329 RLYRG----DGSCGI 339
R+ R G CGI
Sbjct: 332 RMQRNVKAKTGLCGI 346
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
Length = 467
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 18/315 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93
Query: 99 FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y G + +R+ VP + + P A DWR AVT VKDQ CGS WAFS
Sbjct: 94 FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG ++NAF+ I+ + G + E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY AVPYWIIKNSW WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321
Query: 334 DGSCGINDYVRSALV 348
C + + SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 177/320 (55%), Gaps = 23/320 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213
Query: 214 YPY---RGDDKACRLNKKATQV--KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
YPY G C + + V +I+G+V + E MA +L +NGP+A+A++A +
Sbjct: 214 YPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMS 273
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY
Sbjct: 274 YKSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYV 321
Query: 329 RLYRGDGSCGINDYVRSALV 348
R+ G +C +++Y SA V
Sbjct: 322 RVVMGVNACLLSEYPVSAHV 341
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/304 (40%), Positives = 169/304 (55%), Gaps = 21/304 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ---LLQDTEHGSGVYGLNEFSDLSTAE 98
F F +H KTY VE +R +IF NLR I+ +L + S G+N F+D++ E
Sbjct: 25 FQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQEE 84
Query: 99 FQA-KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
F+A L KP + + ++ + +P + DWR VTGVKDQ CGS WAFS TG
Sbjct: 85 FRAFLTLSSSKKPHF--NTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTG 142
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+ E Y K KLVSLSEQ+L+DC + + GC GG + F + SK GLE E TYPY
Sbjct: 143 STEAAYYRKAGKLVSLSEQQLVDCSTDINAGCNGGYLDETFTYVKSK---GLEAESTYPY 199
Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
+G D +C+ + K++G+ S+ S DE + + GP++VAI+A L Y +G+
Sbjct: 200 KGTDGSCKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYLSSYESGIYE 259
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+C L+H VL+VGYG K YWI+KNSWG +GE GYFRL RG
Sbjct: 260 --DDWCS--PSELNHGVLVVGYGTSNGK------KYWIVKNSWGGSFGESGYFRLLRGKN 309
Query: 336 SCGI 339
CG+
Sbjct: 310 ECGV 313
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD +DGC GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLYTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD D+GC GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLHTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 175/315 (55%), Gaps = 20/315 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F FL + NK+Y++ E R IF NL +I + ++ + Y +N+F+DLS
Sbjct: 22 LKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI-INKNHNDSTAQYEINKFADLS 80
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L P ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 81 KDETISKYTGLSL-PLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K + ++LSEQ+LIDCD D GC+GG + AF+ +M+ GG++ E
Sbjct: 140 FATLGSLESQFAIKHNQFINLSEQQLIDCDFVDAGCDGGLLHTAFEAVMNM--GGIQAES 197
Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY ++ CR N VK+ Y ++ E + L GP+ VAI+A + Y
Sbjct: 198 DYPYEANNGDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYKR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ +C N L+H+VL+VGY V+ VP+WI+KN+WG WGE+GYFR+
Sbjct: 258 GIMK----YC--ANHGLNHAVLLVGYAVE------NGVPFWILKNTWGADWGEQGYFRVQ 305
Query: 332 RGDGSCGINDYVRSA 346
+ +CGI + + S+
Sbjct: 306 QNINACGIQNELPSS 320
>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 117/317 (36%), Positives = 176/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y + E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
Length = 340
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/323 (39%), Positives = 181/323 (56%), Gaps = 23/323 (7%)
Query: 33 LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
L+++ L F F+ Q+NK Y + E R +IF N+ I +++ + S VY +N F
Sbjct: 33 LYNINSAPLYFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRF 91
Query: 92 SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
+D++ E ++ G +L ++ + V P FDWR + VT VKDQ MCG
Sbjct: 92 ADMTKNEIVIRHTGLASGELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCG 151
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
+ WAF+ G +E YA K +L+ L+EQ+L+DCD D GC+GG I A++ IM GG+
Sbjct: 152 ACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMRM--GGV 209
Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
E+E YPY+ + + C L +K A V+ N Y V +E + L GP+A+A++A L
Sbjct: 210 EQEFDYPYKAERQPCALKPHKFAAGVR-NCYRYVLMNEERLEDLLRYVGPIAIAVDAVDL 268
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y G+ FC N L+H+VL+VGYGV+ VPYWIIKNSWG +GE G
Sbjct: 269 TDYYGGIVS----FCK--NNGLNHAVLLVGYGVENN------VPYWIIKNSWGSDYGEDG 316
Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
Y R+ RG SCG IN+ SA V
Sbjct: 317 YVRVRRGVNSCGMINELASSAQV 339
>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
Length = 339
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/323 (39%), Positives = 181/323 (56%), Gaps = 23/323 (7%)
Query: 33 LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
L+++ L F F+ Q+NK Y + E R +IF N+ I +++ + S VY +N F
Sbjct: 32 LYNINSAPLYFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRF 90
Query: 92 SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
+D++ E ++ G +L ++ + V P FDWR + VT VKDQ MCG
Sbjct: 91 ADMTKNEIVIRHTGLASGELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCG 150
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
+ WAF+ G +E YA K +L+ L+EQ+L+DCD D GC+GG I A++ IM GG+
Sbjct: 151 ACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMRM--GGV 208
Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
E+E YPY+ + + C L +K A V+ N Y V +E + L GP+A+A++A L
Sbjct: 209 EQEFDYPYKAERQPCALKPHKFAAGVR-NCYRYVLMNEERLEDLLRYVGPIAIAVDAVDL 267
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y G+ FC N L+H+VL+VGYGV+ VPYWIIKNSWG +GE G
Sbjct: 268 TDYYGGIVS----FCK--NNGLNHAVLLVGYGVENN------VPYWIIKNSWGSDYGEDG 315
Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
Y R+ RG SCG IN+ SA V
Sbjct: 316 YVRVRRGVNSCGMINELASSAQV 338
>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
Length = 324
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 175/316 (55%), Gaps = 22/316 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI--QLLQDTEHGSGVYGLNEFSD 93
+K + F FL NK Y++ E R IF NL +I + L DT S Y +N+FSD
Sbjct: 22 LKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDT---SAQYEINKFSD 78
Query: 94 LSTAEFQAKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
LS E +KY G L + ++ P P FDWR + VT VK+Q CG+ W
Sbjct: 79 LSKDETISKYTGLSLPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGACW 138
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
AF+T G++E +A K +L++LSEQ+LIDCD D GC+GG + A++ +M+ GG++ E
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVMNM--GGIQAE 196
Query: 212 KTYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
YPY ++ CRLN VK+ Y V E + L GP+ VAI+A + Y
Sbjct: 197 NDYPYEANNGDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPLPVAIDASDIVNYK 256
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GV +C N L+H+VL+VGY V+ VP+WI+KN+WG WGE+GYFR+
Sbjct: 257 RGVIR----YC--ANHGLNHAVLLVGYAVEN------GVPFWILKNTWGTDWGEQGYFRV 304
Query: 331 YRGDGSCGINDYVRSA 346
+ +CGI + + S+
Sbjct: 305 QQNINACGIQNELPSS 320
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 123/308 (39%), Positives = 167/308 (54%), Gaps = 22/308 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF + EQ+ KTY++ E SRL +F N + + S LN F+DL+ EF+
Sbjct: 28 LFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFK 87
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
A LGF + + RSV + + +P A DWR+ AVTGVKDQ CG W+FSTTG IE
Sbjct: 88 ASRLGFSPGRAQSIRSVGTPVQELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIE 147
Query: 161 GVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
G+ T LVSLSEQEL+DCD+ + GCEGG + A+ ++ G++ E YPY G
Sbjct: 148 GINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKNQ--GIDSEADYPYVGM 205
Query: 220 DKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSHP 276
DK C K K V I+GY + ++ +V P++V I + Q Y GV
Sbjct: 206 DKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGV--- 262
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ + L H+VLIVGYG T V +WI+KNSWGE WG +GY + R +G+
Sbjct: 263 ---YTGPCSSTLDHAVLIVGYG------TEDGVDFWIVKNSWGEHWGMRGYIHMLRNNGT 313
Query: 337 ----CGIN 340
CGIN
Sbjct: 314 AEGICGIN 321
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 171/305 (56%), Gaps = 27/305 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG----LNEFSDLSTA 97
F F +H+K+Y+ VE RL IF+ NLR I+ + + +G+ +N+F+DL+
Sbjct: 25 FQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEE-HNALYAAGLVSYNKSVNQFTDLTID 83
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
EF+A YL KP+ +VP + + +P DWR VTGVKDQ CGS WAFS G
Sbjct: 84 EFKA-YLTLHSKPTL--NTVPYVRTGLQVPTTLDWRSQGYVTGVKDQGDCGSCWAFSVVG 140
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+ EG Y T KLVSLSEQ+LIDC +DGC+GG + F + GL E +YPY
Sbjct: 141 STEGAYYKSTGKLVSLSEQQLIDCTTNVNDGCDGGYLEETFPYVQQT---GLVSESSYPY 197
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV--S 274
G D CR+++ K++ YV + E D+ + + GP++VA++A + Y +GV S
Sbjct: 198 TGRDGNCRISESDVVTKVSKYVLLG-GEADLLEAVGSVGPVSVAMDATYIYSYASGVYES 256
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
+ +L+H VL+VGYG T YW+IKNSWG WGE+GY +L RG
Sbjct: 257 SLCSLY------SLNHGVLVVGYG------TQDGKDYWLIKNSWGNTWGEQGYLKLLRGT 304
Query: 335 GSCGI 339
CGI
Sbjct: 305 NECGI 309
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 125/318 (39%), Positives = 176/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 96 ALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAEF 154
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAFS
Sbjct: 155 AARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 214
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +Y
Sbjct: 215 AVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSY 274
Query: 215 PYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 275 PYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYK 334
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R+
Sbjct: 335 SGV----LTACIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVRV 382
Query: 331 YRGDGSCGINDYVRSALV 348
G +C +++Y SA V
Sbjct: 383 VMGVNACLLSEYPVSAHV 400
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 133/355 (37%), Positives = 179/355 (50%), Gaps = 35/355 (9%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTA-----LFNYFLEQHNKTYATLVE 58
F F A LS+ +++ M + D L H + T L+ +L ++ K Y L E
Sbjct: 8 FAFLATFYFLSVCLAID--MSIIDYNLKHGQVPERTEAETLRLYEMWLVKYGKAYNALGE 65
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
R IF NL+ + + S GLN+F+DLS E++A YLG ++ P
Sbjct: 66 KERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGP 125
Query: 119 AMIPNI-----TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
+ LP + DWRE AV VKDQ CGS WAFST G +EG+ T L SL
Sbjct: 126 KSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSL 185
Query: 174 SEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQ 231
SEQEL+DCD+ + GC GG + AF+ IM GG++ E+ YPY+ D C N+K A
Sbjct: 186 SEQELVDCDKVYNQGCNGGLMDYAFEFIMKN--GGIDTEEDYPYKAVDSMCDPNRKNARV 243
Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLS 289
V I+GY V +++ + V N P++VAI A A Q Y +GV F L
Sbjct: 244 VTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGV------FTGSCGTQLD 297
Query: 290 HSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
H V+ VGYG T V YW+++NSWG WGE GY R+ R G CGI
Sbjct: 298 HGVVAVGYG------TENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGI 346
>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 118/318 (37%), Positives = 176/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A +L +LSEQ+L+ CD +D+GC GG + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDNGCAGGLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I+GY+++ ET MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSSTGYVPECSNSSQLVPGARIDGYLTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY +RT VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/314 (39%), Positives = 166/314 (52%), Gaps = 29/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
++ ++L +H K Y + E R IF NLR + + GL +F+DL+ E++
Sbjct: 51 MYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYR 110
Query: 101 AKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A YLG K L+ + R + + LP DWRE AVT VKDQ CGS WAFS
Sbjct: 111 AMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFS 170
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T G++EG+ T L+SLSEQEL+DCD+ + GC GG + AF+ I+ GG++ E
Sbjct: 171 TVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKN--GGIDSEAD 228
Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
YPYR D C N+K A V I+GY V ++ + K V N P++VAI A Q Y
Sbjct: 229 YPYRASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQ 288
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV F NL H V+ VGYG T + YWI++NSWG WGE GY R+
Sbjct: 289 SGV------FTGRCGTNLDHGVVAVGYG------TENGIDYWIVRNSWGPKWGESGYIRM 336
Query: 331 YRG-----DGSCGI 339
R G CGI
Sbjct: 337 ERNVASTDTGKCGI 350
>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
Length = 367
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 120/319 (37%), Positives = 177/319 (55%), Gaps = 16/319 (5%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF- 99
+F F Q+N++Y+ E+ RL IF+ NL K Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 41 VFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFG 100
Query: 100 --QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSWAFSTT 156
+ G PS + V + T+P++ DWR+ V + +K Q C WA +
Sbjct: 101 QLHGHHWGAGKAPSMGIK-VGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAV 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
N+E +A K + V LS Q+++DCD+ +GC GG + +AF T+++ GL E+ YPY
Sbjct: 160 DNVEAQWAIKYHQAVQLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNT--SGLASEQDYPY 217
Query: 217 RGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G K R L K+ +V I ++ + E +A+YL GP+ V INA LQ Y GV
Sbjct: 218 KGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVI 277
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVD-----RTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
CD ++HSVL+VG+G R ++PYWI+KNSWG WGE+GYFR
Sbjct: 278 RATPATCD--PHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGEEGYFR 335
Query: 330 LYRGDGSCGINDYVRSALV 348
L+RG +CGI Y +A V
Sbjct: 336 LHRGSNTCGITKYPVTARV 354
>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 176/345 (51%), Gaps = 20/345 (5%)
Query: 9 GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
++L ++ V ++ + LH + + F F ++H + Y + E RL +F
Sbjct: 7 ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYESAAEEAFRLSVFRE 64
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
NL + L + +G+ FSDL+ EF+++Y G + +R+ VP + +
Sbjct: 65 NLF-LARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
GC GG ++NAF+ I+ + G + E +YPY G C + I G+V + +D
Sbjct: 184 GCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A +L NGP+AVA++A + Y GV +E L H VL+VGY
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYWIIKNSW WGE GY R+ +G C + + SA+V
Sbjct: 293 -DSAAVPYWIIKNSWTAQWGEDGYIRIAKGSNQCLVKEEASSAVV 336
>gi|348565006|ref|XP_003468295.1| PREDICTED: cathepsin W-like [Cavia porcellus]
Length = 375
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 121/326 (37%), Positives = 173/326 (53%), Gaps = 22/326 (6%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q N++Y+ EY RL IF NL Q LQ+ E G+ +G+ FSDL+ EF
Sbjct: 41 VFKLFQIQFNRSYSNQAEYARRLDIFVHNLATAQRLQEEELGTAEFGVTPFSDLTEEEFG 100
Query: 101 AKYLGFKL--KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
Y ++ K R V + ++ DWR+ ++ VK+Q C WA + GN
Sbjct: 101 QLYGNRRVARKDLRVARKVSFDKQEELMSQSCDWRKAHIISPVKNQGNCRCCWAIAAAGN 160
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IE ++ + K V+LS QEL+DC + +DGC GG I +AF T+++ GL EK YP+RG
Sbjct: 161 IEAMWNIRYKVSVTLSVQELLDCARCEDGCAGGYIWDAFITVLNY--SGLASEKDYPFRG 218
Query: 219 --DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
+ C + I Y+ + RDE +A+Y+ GP+ V IN+ LQ Y G+
Sbjct: 219 HANIHKCLASNYRKVAWIYDYIMLPRDEQGIARYVATQGPITVIINSKILQHYKKGIIKG 278
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDR--------TKFTHK------AVPYWIIKNSWGEGW 322
CD + H VL+VGYG + T +H ++PYWI+KNSWG W
Sbjct: 279 TSSKCDPW--FVDHYVLLVGYGRSKAEEEKWTETDLSHSNRPPRHSIPYWILKNSWGANW 336
Query: 323 GEKGYFRLYRGDGSCGINDYVRSALV 348
GE+GYFRL+RG +CGI Y +A V
Sbjct: 337 GEEGYFRLHRGSNTCGITKYPITARV 362
>gi|393904668|gb|EFO15826.2| hypothetical protein LOAG_12683 [Loa loa]
Length = 202
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 102/202 (50%), Positives = 130/202 (64%), Gaps = 10/202 (4%)
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
CGS WAFS TGNIEG +A K KL+SLSEQELIDCD D GC+GG NA+ I+ G
Sbjct: 10 CGSCWAFSVTGNIEGAWAIKKGKLISLSEQELIDCDVIDQGCKGGLPLNAYKEIIRM--G 67
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
GLE EK YPY G + C L +K V IN + + DE +A ++ + GP+++ +NA L
Sbjct: 68 GLESEKDYPYDGHGEKCHLVRKEIAVYINDSIQLPDDEIKIAAWVAKKGPVSIGVNAGPL 127
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY G+SHP + FC +++H VLIVGYG + K PYWIIKNSWG WGE G
Sbjct: 128 QFYRHGISHPWKAFCL--PSHINHGVLIVGYGQEANK------PYWIIKNSWGTKWGENG 179
Query: 327 YFRLYRGDGSCGINDYVRSALV 348
Y+RLYRG CG+ + +A+V
Sbjct: 180 YYRLYRGKNVCGVKEMATTAIV 201
>gi|14602252|ref|NP_148795.1| ORF11 cathepsin [Cydia pomonella granulovirus]
gi|13124000|sp|O91466.1|CATV_GVCPM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|14591773|gb|AAK70678.1| ORF11 cathepsin [Cydia pomonella granulovirus]
Length = 333
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 189/348 (54%), Gaps = 29/348 (8%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
LL+ + S V + L++ LF F ++NKTY + E +L F NL+
Sbjct: 4 LLNFVILASVLTVTAHALTYDLNNSDE--LFKNFAIKYNKTYVSDEERAIKLENFKNNLK 61
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL----KPSYADRSVPAMI-----P 122
I ++ V+ +NE+SDL+ + GF+L PS + +++ P
Sbjct: 62 MINE-KNMASKYAVFDINEYSDLNKNALLRRTTGFRLGLKKNPSAFTMTECSVVVIKDEP 120
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
LP DWR+ VT VK+Q CGS WAFST NIE +Y K K ++LSEQ L++CD
Sbjct: 121 QALLPETLDWRDKHGVTPVKNQMECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNCD 180
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS-VS 241
++GC GG + A ++I+ + GG+ + PY G D C+ K ++ I+G V
Sbjct: 181 NINNGCAGGLMHWALESILQE--GGVVSAENEPYYGFDGVCK--KSPFELSISGSRRYVL 236
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
++E + + LV NGP++VAI+ L Y G++ C+ NE L+H+VL+VGYGV
Sbjct: 237 QNENKLRELLVVNGPISVAIDVSDLINYKAGIAD----ICE-NNEGLNHAVLLVGYGVKN 291
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
VPYWI+KNSWG WGE+GYFR+ R SCG +N+Y SA++
Sbjct: 292 D------VPYWILKNSWGAEWGEEGYFRVQRDKNSCGMMNEYASSAIL 333
>gi|312095086|ref|XP_003148243.1| hypothetical protein LOAG_12683 [Loa loa]
Length = 195
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 102/202 (50%), Positives = 130/202 (64%), Gaps = 10/202 (4%)
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
CGS WAFS TGNIEG +A K KL+SLSEQELIDCD D GC+GG NA+ I+ G
Sbjct: 3 CGSCWAFSVTGNIEGAWAIKKGKLISLSEQELIDCDVIDQGCKGGLPLNAYKEIIRM--G 60
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
GLE EK YPY G + C L +K V IN + + DE +A ++ + GP+++ +NA L
Sbjct: 61 GLESEKDYPYDGHGEKCHLVRKEIAVYINDSIQLPDDEIKIAAWVAKKGPVSIGVNAGPL 120
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY G+SHP + FC +++H VLIVGYG + K PYWIIKNSWG WGE G
Sbjct: 121 QFYRHGISHPWKAFCL--PSHINHGVLIVGYGQEANK------PYWIIKNSWGTKWGENG 172
Query: 327 YFRLYRGDGSCGINDYVRSALV 348
Y+RLYRG CG+ + +A+V
Sbjct: 173 YYRLYRGKNVCGVKEMATTAIV 194
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 168/320 (52%), Gaps = 25/320 (7%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
LH ++ +H K Y E R IF N+ I+ + S + G+N
Sbjct: 28 RELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSNAAGNNSYMLGINR 87
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSV-PAMIPNIT-LPRAFDWREYDAVTGVKDQTMCG 148
F+DL+ EF+A + G+K +P A R V P N+T LP + DWR AVT +KDQ CG
Sbjct: 88 FADLTNEEFRASWNGYK-RPLDASRIVTPFKYENVTALPYSMDWRRKGAVTSIKDQRECG 146
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGG 206
S WAFS EGV+ +T KLVSLSEQEL+DCD ED GC+GG + +AF I K G
Sbjct: 147 SCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGEDKGCQGGLMEDAFKFI--KRNG 204
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSVSRDETDMAKYLVENGPMAVAINA-- 263
G+ E Y YRG D C K+A+ V KI GY V + V + P++V+I+A
Sbjct: 205 GITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEAALLKAVAHQPVSVSIDAGS 264
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
+ QFY +G+ + +L+H V VGYG + YWI+KNSWG WG
Sbjct: 265 MSFQFYQSGI------YAGSCGSDLNHGVAAVGYGT-----SSSGSKYWIVKNSWGPEWG 313
Query: 324 EKGYFRLYRG----DGSCGI 339
E+GY R+ R G CGI
Sbjct: 314 ERGYVRMKRDITSRKGLCGI 333
>gi|66730453|ref|NP_001019413.1| cathepsin W precursor [Rattus norvegicus]
gi|62531092|gb|AAH93401.1| Cathepsin W [Rattus norvegicus]
gi|149062072|gb|EDM12495.1| cathepsin W [Rattus norvegicus]
Length = 371
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 118/325 (36%), Positives = 180/325 (55%), Gaps = 22/325 (6%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q N++Y+ EY RL IF+ NL + Q LQ+ + G+ +G FSDL+ EF
Sbjct: 39 VFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDLTEEEFG 98
Query: 101 AKYLGFKLKPSY---ADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G + P + V + ++P DWR+ + ++ +K+Q C WA +
Sbjct: 99 QLY-GHQRAPERILNMAKKVKSERWGESVPPTCDWRKVKNIISSIKNQGNCRCCWAIAAA 157
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
NI+ ++ KT++ V +S QEL+DCD+ +GC GG + +A+ T+++ GL E+ YP+
Sbjct: 158 DNIQTLWRIKTQQFVDVSVQELLDCDRCGNGCNGGFVWDAYITVLN--NSGLASEEDYPF 215
Query: 217 RGDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G K C +K I + +S +E +A YL +GP+ V IN LQ+Y GV
Sbjct: 216 QGHQKPHRCLADKYRKVAWIQDFTMLSSNEQVIAGYLAIHGPITVTINMKLLQYYQKGVI 275
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDR------TKFTH-----KAVPYWIIKNSWGEGWG 323
CD ++HSVL+VG+G ++ T +H ++ PYWI+KNSWG WG
Sbjct: 276 KATPSTCDP--HLVNHSVLLVGFGKEKGGMQTGTLLSHSRKPRRSTPYWILKNSWGAEWG 333
Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
EKGYFRLYRG+ +CGI Y +A V
Sbjct: 334 EKGYFRLYRGNNTCGIAKYPITARV 358
>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 176/345 (51%), Gaps = 20/345 (5%)
Query: 9 GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
++L ++ V ++ + LH + + F F ++H + Y + E RL +F
Sbjct: 7 ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYGSAAEEAFRLSVFRE 64
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
NL + L + +G+ FSDL+ EF+++Y G + +R+ VP + +
Sbjct: 65 NL-FLARLHAAANPHATFGVTAFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
GC GG ++NAF+ I+ + G + E +YPY G C + I G+V + +D
Sbjct: 184 GCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A +L NGP+AVA++A + Y GV +E L H VL+VGY
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYW+IKNSW WGE GY R+ +G C + + SA+V
Sbjct: 293 -DSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQCLVKEEASSAVV 336
>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 176/345 (51%), Gaps = 20/345 (5%)
Query: 9 GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
++L ++ V ++ + LH + + F F ++H + Y + E RL +F
Sbjct: 7 ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYGSAAEEAFRLSVFRE 64
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
NL + L + +G+ FSDL+ EF+++Y G + +R+ VP + +
Sbjct: 65 NL-FLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
GC GG ++NAF+ I+ + G + E +YPY G C + I G+V + +D
Sbjct: 184 GCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A +L NGP+AVA++A + Y GV +E L H VL+VGY
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYW+IKNSW WGE GY R+ +G C + + SA+V
Sbjct: 293 -DSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQCLVKEEASSAVV 336
>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
Length = 257
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 109/235 (46%), Positives = 143/235 (60%), Gaps = 20/235 (8%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
LP +FDWRE AVT VK Q CGS WAFSTTG +EG + TKKL++LSEQ+L+DCD
Sbjct: 17 LPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMC 76
Query: 185 --------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
D GCEGG ++NA+ ++ GGLEEE +YPY G C+ V++
Sbjct: 77 DIRDKTACDSGCEGGLMTNAYKYLIE--AGGLEEESSYPYTGKHGECKFKPDRVAVRVVN 134
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
+ V +E +A LV +GP+AV +NA +Q Y+ GVS P+ C ++H VL+VG
Sbjct: 135 FTEVPINENQIAANLVCHGPLAVGLNAIFMQTYIGGVSCPL--ICP--KRWINHGVLLVG 190
Query: 297 YGVDR---TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
YG +F +K PYWIIKNSWG+ WGE GY+RL RG G CG+N V + V
Sbjct: 191 YGAKGYSILRFGYK--PYWIIKNSWGKRWGEHGYYRLCRGHGMCGMNTMVSAVNV 243
>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
Length = 323
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 117/317 (36%), Positives = 175/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ++I CD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIGCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y + E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ D G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 175/345 (50%), Gaps = 20/345 (5%)
Query: 9 GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
++L ++ V ++ + LH + + F F ++H + Y + E RL +F
Sbjct: 7 ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYESAAEEAFRLSVFRE 64
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
NL + L + +G+ FSDL+ EF+++Y G + +R+ VP + +
Sbjct: 65 NLF-LARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
GC GG ++NAF I+ + G + E +YPY G C + I G+V + +D
Sbjct: 184 GCGGGLMNNAFGWIVQENNGAVYTENSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A +L NGP+AVA++A + Y GV +E L H VL+VGY
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYWIIKNSW WGE GY R+ +G C + + SA+V
Sbjct: 293 -DSAAVPYWIIKNSWTAQWGEDGYIRIAKGSNQCLVKEEASSAVV 336
>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
Length = 364
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 124/323 (38%), Positives = 182/323 (56%), Gaps = 23/323 (7%)
Query: 33 LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
L+++ L F F+ Q+NK Y E R +IF N+ I +++ + S VY +N F
Sbjct: 57 LYNINSAPLYFEKFISQYNKHYKNEDEKKYRYNIFRHNIESINH-KNSRNDSAVYKINRF 115
Query: 92 SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
+D++ E ++ G +L ++ + V P +FDWR + VT VKDQ MCG
Sbjct: 116 ADMTKNEVVIRHTGLASGELGVNFCETIVVDGPGQRQRPTSFDWRTLNKVTSVKDQGMCG 175
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
+ WAF+ G +E YA K +L+ LSEQ+L+DCD D GC+GG I A++ IM GG+
Sbjct: 176 ACWAFAGLGALESQYAIKYDRLIDLSEQQLVDCDHVDMGCDGGLIHTAYEEIMRM--GGV 233
Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
E++ YPYR + + C L +K A V+ + Y V +E + L GP+A+A++A +
Sbjct: 234 EQDFDYPYRAERQPCALKPHKFAAGVR-SCYRYVLLNEERLEDLLRHVGPIAIAVDAVDI 292
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y G+ FC+ N L+H+VL+VGYGV+ VPYWI+KNSWG +GE G
Sbjct: 293 TDYYGGIVS----FCE--NNGLNHAVLLVGYGVENN------VPYWILKNSWGSDYGEDG 340
Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
Y R+ RG SCG IN+ SA V
Sbjct: 341 YVRVRRGVNSCGMINELASSAQV 363
>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 426
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 164/313 (52%), Gaps = 18/313 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93
Query: 99 FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y G + +R+ VP + + P A DWR AVT VKDQ CGS WAFS
Sbjct: 94 FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG ++NAF+ I+ + G + E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY AVPYWIIKNSW WGE+GY R+ +G
Sbjct: 274 MTSCV------SEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321
Query: 334 DGSCGINDYVRSA 346
C + + SA
Sbjct: 322 LNQCLVKEEASSA 334
>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
Length = 338
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 118/329 (35%), Positives = 174/329 (52%), Gaps = 24/329 (7%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
L+ ++ +G +L+ L LF F++ +NK Y E R IF NL
Sbjct: 12 VLVLFSIDQCKVRELGQRRLYSLEEA--PTLFEQFIKDYNKEYDE-SEKEERFKIFVNNL 68
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIP--NITL 126
+ I + + + VYG+N+FSDLS EF Y G K + PS D + N+T
Sbjct: 69 KDINAMNE-RSSNAVYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKKTDLPESFNVTA 127
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P FDWR+ V+ +K+Q CGS WAFS N+E ++A KT KL+ +SEQ+L+DCD+ D
Sbjct: 128 PDQFDWRKKGVVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDS 187
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
GC GG +D + + G K+YPY + CR + ++++ GY S+ D
Sbjct: 188 GCSGGL---PWDALRYFVANGAMSLKSYPYVAKEGKCRYDSSKVEIRLKGYKIFSKISED 244
Query: 247 MAK-YLVENGPMAVAINAYALQFYVTG-VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
K +L GP+++AI+ ++ YV G V C ++H+VL+VGYG + +
Sbjct: 245 QIKEHLYNIGPLSIAIDVSPIKPYVGGIVMEECHEVC-----QVNHAVLLVGYGKEYS-- 297
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
V YWI+KNSWG WGE GYFR+ RG
Sbjct: 298 ----VEYWIVKNSWGPNWGENGYFRMERG 322
>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 120/344 (34%), Positives = 176/344 (51%), Gaps = 20/344 (5%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
++L ++ V ++ + LH + + F F ++H + Y + E RL +F N
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYESAAEEAFRLSVFREN 65
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITLP 127
L + L + +G+ FSDL+ EF ++Y G + +R+ VP + + P
Sbjct: 66 LF-LARLHAAANPHATFGVTPFSDLTREEFWSRYHNGAAHFAAAQERARVPVNVEVVGAP 124
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D G
Sbjct: 125 AAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSG 184
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDE 244
C GG ++NAF+ I+ + G + E +YPY G C + I G+V + +DE
Sbjct: 185 CGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVEIPQDE 244
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+A +L NGP+AVA++A + Y GV +E L H VL+VGY
Sbjct: 245 AQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN------ 292
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYW+IKNSW WGE GY R+ +G C + + V SA+V
Sbjct: 293 DSAAVPYWVIKNSWTTHWGEGGYIRIAKGSNQCLVKEGVSSAVV 336
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 126/339 (37%), Positives = 186/339 (54%), Gaps = 25/339 (7%)
Query: 28 EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSR------LHIFSGNLRKIQLLQDTEH 81
+K L H K+ + ++ +++++NK + V+ YS +F NL I + + E+
Sbjct: 13 DKSAALAHQKYLSAWSSWVKEYNKEH--WVDPYSSPESTRAFEVFQKNLDMI-MKHNEEY 69
Query: 82 GSGV----YGLNEFSDLSTAEFQAKYLGFK----LKPSYADRSVPAMIPNITLPRAFDWR 133
G+ GLN F+ L+ EF A+YLG+ +P +P + DWR
Sbjct: 70 NQGLQSYEMGLNGFAHLTFEEFSAQYLGYGGAEVEQPKTRRAGKHERKSRSEIPASVDWR 129
Query: 134 EYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGG 191
E AV VK+Q CGS WAFS +EG + + +L+SLSEQ+L+DC ++ + GC GG
Sbjct: 130 EKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELISLSEQQLVDCSKKFGNHGCAGG 189
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKY 250
+ NAF+ M+ G G + EK YPY+G D C+ + + I+GY V + +ETD+
Sbjct: 190 YMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSADGVRATISGYNDVKQGNETDLLDA 249
Query: 251 LVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
+ GP++VAI+A ALQFY+ GV + + C G L+H V VGYG +F K +
Sbjct: 250 VANVGPVSVAIHAGAALQFYLRGVFNGVAGTCFG---PLNHGVTAVGYGTASLRFGRK-M 305
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
YWIIKNSWG GWGEKG+ R RG CG+ + LV
Sbjct: 306 DYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPLV 344
>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
Length = 337
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 183/343 (53%), Gaps = 21/343 (6%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
LL V S VV +L+++ L F F+ Q+NK Y++ E R +IF N+
Sbjct: 9 LLVSAVLTSHDQVVAVTIKPNLYNINSAPLYFEKFISQYNKQYSSEDEKKYRYNIFRHNI 68
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLP 127
I +++ + S VY +N F+D++ E ++ G + ++ + V P
Sbjct: 69 ESINA-KNSRNDSAVYKINRFADMTKNEVVNRHTGLASGDIGANFCETIVVDGPGQRQRP 127
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
FDWR Y+ VT VKDQ MCG+ WAF+ G +E YA K +L+ L+EQ+L+DCD D G
Sbjct: 128 ANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMG 187
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETD 246
C+GG I A++ IM GG+E+E YPY+ C + V + N Y V E
Sbjct: 188 CDGGLIHTAYEQIMHI--GGVEQEYDYPYKAVRLPCAVKPHKFAVGVRNCYRYVLLSEER 245
Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
+ L GP+A+A++A L Y GV FC+ N L+H+VL+VGYG++
Sbjct: 246 LEDLLRHVGPIAIAVDAVDLTDYYGGVIS----FCE--NNGLNHAVLLVGYGIENN---- 295
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
VPYW IKNSWG +GE GY R+ RG SCG IN+ SA +
Sbjct: 296 --VPYWTIKNSWGSDYGENGYVRIRRGVNSCGMINELASSAQI 336
>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
Length = 337
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 182/323 (56%), Gaps = 23/323 (7%)
Query: 33 LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
L+++ L F F+ Q+NK Y + E R +IF N+ I +++ + S VY +N F
Sbjct: 30 LYNINSAPLYFEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRF 88
Query: 92 SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
+D+ E ++ G +L ++ + V P +FDWR + +T VKDQ MCG
Sbjct: 89 ADMPKNEIVIRHTGLASGELGLNFCETIVVDGPAQRQRPVSFDWRSMNKITSVKDQGMCG 148
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
+ W F++ G +E YA K +L+ LSEQ+L+DCD D GC+GG I A++ IM GG+
Sbjct: 149 ACWRFASLGALESQYAIKYDRLIDLSEQQLVDCDFVDMGCDGGLIHTAYEQIMKM--GGV 206
Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
E+E Y Y+ + + C L +K AT V+ N Y V +E + L GP+A+A++A L
Sbjct: 207 EQEFDYSYKAERQPCALKPHKFATGVR-NCYRYVILNEERLEDLLRYVGPIAIAVDAVDL 265
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y G+ FC+ N L+H+VL+VGYGV+ VPYWIIKNSWG +GE G
Sbjct: 266 TDYYGGIVS----FCE--NNGLNHAVLLVGYGVENN------VPYWIIKNSWGSDYGEDG 313
Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
Y R+ RG SCG IN+ SA V
Sbjct: 314 YVRVRRGVNSCGMINELASSAQV 336
>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 119/316 (37%), Positives = 174/316 (55%), Gaps = 22/316 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI--QLLQDTEHGSGVYGLNEFSD 93
+K + F FL NK Y++ E R IF NL +I + L DT S Y +N+FSD
Sbjct: 22 LKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDT---SAQYEINKFSD 78
Query: 94 LSTAEFQAKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
LS E +KY G L + ++ P P FDWR + VT VK+Q CG+ W
Sbjct: 79 LSKDETISKYTGLSLPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGACW 138
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
AF+T G++E +A K +L++LSEQ+LIDCD D GC+GG + A++ +M+ GG++ E
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVMNM--GGIQAE 196
Query: 212 KTYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
YPY ++ CR N VK+ Y ++ E + L GP+ VAI+A + Y
Sbjct: 197 NDYPYEANNGDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYK 256
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
G+ +C N L+H+VL+VGY V VP+WI+KN+WG WGE+GYFR+
Sbjct: 257 RGIMK----YC--ANHGLNHAVLLVGYAV------QNGVPFWILKNTWGADWGEQGYFRV 304
Query: 331 YRGDGSCGINDYVRSA 346
+ +CGI + + S+
Sbjct: 305 QQNINACGIQNELPSS 320
>gi|259016196|sp|P56202.2|CATW_HUMAN RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
Precursor
Length = 376
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 119/328 (36%), Positives = 176/328 (53%), Gaps = 27/328 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G++ PS R + + P ++P + DWR+ A++ +KDQ C WA +
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAA 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++ V +S QEL+DC + DGC GG + +AF T+++ GL EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217
Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G +A R + K Q I ++ + +E +A+YL GP+ V IN LQ Y GV
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVI 277
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
CD + + HSVL+VG+G +++ PYWI+KNSWG
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGYFRL+RG +CGI + +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363
>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
Length = 467
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 22/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL-LQDTEHGSGVYGLNEFSDLSTAEFQ 100
F F ++H K Y + E RL +F NL +L H S +G+ FSDL+ EF+
Sbjct: 38 FAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHAS--FGVTPFSDLTREEFR 95
Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
++Y + A + V + P A DWR AVT +KDQ CGS WAFST
Sbjct: 96 SRYHNAAAHFAAAQKRVRVPVEVEVEVGGAPAAVDWRARGAVTAIKDQGGCGSCWAFSTI 155
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIEG + L LSEQ L+ CD D+GC+GG + +AFD I+ + G + E +Y Y
Sbjct: 156 GNIEGQWHLAGNPLTGLSEQMLVSCDNADNGCDGGLMDSAFDWIVGQNNGSVYTEASYSY 215
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
GD + C ++ I+G+V + +DE MA +L NGP+A+A++A + Y GV
Sbjct: 216 VSGGGDSQTCNMSSHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTGGV 275
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+ ++ L H V++VGY PYWIIKNSWG WGE+GY R+ +G
Sbjct: 276 ------LTNCVSDQLDHGVVLVGYNDSSNP------PYWIIKNSWGADWGEEGYIRIQKG 323
Query: 334 DGSCGINDYVRSALV 348
C + +Y SA+V
Sbjct: 324 TNQCLVKNYACSAVV 338
>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 344
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 169/325 (52%), Gaps = 30/325 (9%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F F + NK Y E++S H + + I + E+ + +G +FSD+S EF
Sbjct: 31 AEFEEFKSKFNKYYHNEHEHHSSFHNYKTSREHI-VKHQMENPNAKFGHTKFSDMSPEEF 89
Query: 100 QAKYLGF---------------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+ K L F K +P + N LP +FDWR+ +T K Q
Sbjct: 90 ENKMLNFDFSLFKKAKSQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQ 149
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKL 204
CGS W F+TTG IE YA K +L+ SEQ L+DCD + GC GG +++A+ +
Sbjct: 150 NTCGSCWTFATTGVIESQYALKYGELLHFSEQMLLDCDNINQGCRGGLMTDAYQFLQQ-- 207
Query: 205 GGGLEEEKTY-PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
GG++ TY Y+ C +K + K+ + + +E + + LV+NGP+AV INA
Sbjct: 208 SGGIQTADTYGDYKNKKDICNFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVAVGINA 267
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
LQFY G+ P CD + ++H+VLIVGYGV+ + +PYW+IKN WG WG
Sbjct: 268 RTLQFYEGGIVDPKN--CD---DKINHAVLIVGYGVE------EGIPYWLIKNQWGAEWG 316
Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
KG+F+L RG CGI+ Y A V
Sbjct: 317 IKGFFKLIRGKKQCGIHTYASIAYV 341
>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
Length = 383
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 118/343 (34%), Positives = 174/343 (50%), Gaps = 20/343 (5%)
Query: 9 GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
++L ++ V ++ + LH + + F F ++H + Y + E RL +F
Sbjct: 7 ALSLAAVLVVMACLVPAATASLHAEETL--ASQFAEFKQKHGRVYGSAAEEAFRLSVFRA 64
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
NL + L + +G+ FSDL+ EF+++Y G + +R+ VP + +
Sbjct: 65 NLF-LARLHAAANPHATFGVTAFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
GC GG ++NAF+ I+ + G + E +YPY G C + I G+V + +D
Sbjct: 184 GCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A +L NGP+AVA++A + Y GV +E L H VL+VGY
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
AVPYW+IKNSW WGE GY R+ +G C + + SA
Sbjct: 293 -DSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQCLVKEEASSA 334
>gi|375073984|gb|AFA34859.1| cathepsin L-like protein [Trypanosoma rangeli]
Length = 467
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 120/315 (38%), Positives = 166/315 (52%), Gaps = 22/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD-TEHGSGVYGLNEFSDLSTAEFQ 100
F F ++H K Y + E RL +F NL +L H S +G+ FSDL+ EF+
Sbjct: 38 FAAFKQRHGKVYRSAAEEAFRLGVFKENLLLARLHAAANPHAS--FGVTPFSDLTREEFR 95
Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
++Y + A + + P A DWR AVT VKDQ CGS WAFST
Sbjct: 96 SRYHNAAAHFAAAQKRARVPVEVEVEVGGAPAAVDWRARGAVTAVKDQGECGSCWAFSTI 155
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIEG + L SLSEQ L+ CD D+GC+GG + NAFD I+ K G + E +Y Y
Sbjct: 156 GNIEGQWHLAGNPLTSLSEQMLVSCDNADNGCDGGLMDNAFDWIVGKNNGTVYTEASYSY 215
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G+ + C ++ I+G+V + +DE MA +L NGP+A+A++A + Y GV
Sbjct: 216 VSGGGNSQKCDMSGHVVGAVISGHVDLPKDEDKMAAWLAANGPLAIAVDATSFMSYTGGV 275
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+ ++ L H V++VGY PYWIIKNSWG WGE GY R+ +G
Sbjct: 276 ------LTNCISDQLDHGVVLVGYNDSSNP------PYWIIKNSWGADWGEGGYIRIQKG 323
Query: 334 DGSCGINDYVRSALV 348
C +N+Y SA+V
Sbjct: 324 TNQCLVNNYACSAVV 338
>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 175/345 (50%), Gaps = 20/345 (5%)
Query: 9 GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
++L ++ V ++ + LH + + F F ++H + Y + E RL +F
Sbjct: 7 ALSLAAVLVVMACLVPAATASLHAEETL--ASQFAEFKQKHGRVYESAAEEAFRLSVFRE 64
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
NL + L + +G+ FSDL+ EF+++Y G + +R+ VP + +
Sbjct: 65 NL-FLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
GC GG ++NAF+ I+ + G + E +YPY G C + I G+V + +D
Sbjct: 184 GCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A +L NGP+AV ++A + Y GV +E L H VL+VGY
Sbjct: 244 EAQIAAWLAVNGPVAVGVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYWIIKNSW WGE GY R+ +G C + + SA+V
Sbjct: 293 -DSAAVPYWIIKNSWTTQWGEGGYIRVAKGSNQCLVKEEASSAVV 336
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 138/347 (39%), Positives = 191/347 (55%), Gaps = 34/347 (9%)
Query: 8 AGVALLSL-TVSVSSFMVVGDEKLHHLH-HVKHTAL---FNYFLEQHNKTYATLVEYYSR 62
AG+ L++L T+ + S + ++H L TA+ ++ +LEQ+ + Y T EY R
Sbjct: 10 AGLMLITLCTLWIPS---IARSEIHSLPIDSAPTAMKVRYDKWLEQYGRKYDTKDEYLLR 66
Query: 63 LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP 122
I+ N++ I+ + ++++ S N+F+DL+ EF + YLG++++ SY R++ M
Sbjct: 67 FGIYHSNIQFIEYI-NSQNLSFKLTDNKFADLTNDEFNSIYLGYQIR-SYKRRNLSHMHE 124
Query: 123 NIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
N T LP A DWRE AVT +KDQ CGS WAFS +EG+ KT LVSLSEQEL+DC
Sbjct: 125 NSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGNLVSLSEQELVDC 184
Query: 182 DQEDD--GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYV 238
D D GC GG + AF I S GGL E YPY+G D +C K V I GY
Sbjct: 185 DVNGDNKGCNGGFMEKAFTFIKSI--GGLTTENDYPYKGTDGSCEKAKTDNHAVIIGGYE 242
Query: 239 SVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
+V + + K V P++VAI+A Y Q Y GV +C L+H V IVG
Sbjct: 243 TVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGV---FSGYC---GIQLNHGVTIVG 296
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD----GSCGI 339
YG + YW++KNSWG+GWGE GY R+ R G CGI
Sbjct: 297 YG------DNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGI 337
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/336 (36%), Positives = 175/336 (52%), Gaps = 28/336 (8%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
+L L++ S M +LH + +++++ K Y E RL IF N+
Sbjct: 14 VLLLSICTSQVMS------RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVE 67
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAF 130
I+ + +N +D + EF A + G+K K S++ P N+T +P A
Sbjct: 68 FIESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKYKGSHSQ--TPFKYGNVTDIPTAV 125
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
DWR+ AVT VKDQ CGS WAFST EG+Y T L+SLSEQEL+DCD D GC+G
Sbjct: 126 DWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHGCDG 185
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAK 249
G + + F+ I+ GG+ E YPY D C +K+A+ +I GY +V + + +
Sbjct: 186 GLMEDGFEFIIKN--GGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEALQ 243
Query: 250 YLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHK 307
V N P++V+I+A QFY +GV F L H V +VGYG TH+
Sbjct: 244 QAVANQPVSVSIDAGGSGFQFYSSGV------FTGQCGTQLDHGVTVVGYGT-TDDGTHE 296
Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YWI+KNSWG WGE+GY R+ RG +G CGI
Sbjct: 297 ---YWIVKNSWGTQWGEEGYIRMQRGIDAQEGLCGI 329
>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 120/343 (34%), Positives = 183/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTTDYGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
GG + + I+S G + ++YPY G C + K KI+G++++ +DE
Sbjct: 189 RGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Y GV ++ L H VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFLGYKGGV------LTSCISKGLDHDVLLVGYN-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW +GWGE+GY R+ +G C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNYARSAVV 339
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 172/315 (54%), Gaps = 23/315 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD--TEHGSGVYGLNEFSDLSTAEF 99
F F+E +NK Y + E R IF NL +I T+ + YG+N+FSDLS +E
Sbjct: 35 FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGINKFSDLSKSEL 94
Query: 100 QAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
AK+ G + P A ++ P P FDWRE + VT +K+Q CG+ WAF+T
Sbjct: 95 IAKFTGLSI-PQRASNFCKTIVLNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATL 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
++E +A + +LV LSEQ+LIDCD D GC GG + AF+ I+ GG++ E YP+
Sbjct: 154 ASVESQFAMRHNRLVDLSEQQLIDCDSVDMGCNGGLLHTAFEEIIRM--GGVQAELDYPF 211
Query: 217 RGDDKACRLNKKATQVK--INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
G D+ C +++ V + Y V +E + L GP+ +AI+A + Y GV
Sbjct: 212 VGRDRRCGVDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVI 271
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
+ N L+H+VL+VGYGV+ VPYW KN+WG+ WGE GYFR+ +
Sbjct: 272 SSCE------NNGLNHAVLLVGYGVE------NGVPYWAFKNTWGDDWGENGYFRVRQNI 319
Query: 335 GSCG-INDYVRSALV 348
+CG +ND +A++
Sbjct: 320 NACGMVNDLASTAVL 334
>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 179/318 (56%), Gaps = 21/318 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K + F FL + NK Y++ E R IF NL +I ++++ + Y +N+FSDLS
Sbjct: 22 LKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEI-IIKNQNDTTAQYEINKFSDLS 80
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L P ++ P P FDWR + VT VK+Q +CG+ WA
Sbjct: 81 KDETISKYTGLAL-PLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ+LIDCD D GC GG + A++ +M GG++ E
Sbjct: 140 FATLASLESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTAYEAVMQM--GGVQAEN 197
Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D CR++ VK+ Y ++ E + L GP+ VAI+A + Y
Sbjct: 198 DYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAIDASDIVNYRR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ +C N L+H+VL+VGYGV+ VPYWI+KN+WGE WGE+GYFR+
Sbjct: 258 GIMR----YC--SNYGLNHAVLLVGYGVENN------VPYWILKNTWGEDWGEQGYFRVQ 305
Query: 332 RGDGSCGI-NDYVRSALV 348
+ +CGI N+ + SA +
Sbjct: 306 QNINACGIRNELLASAEI 323
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 129/333 (38%), Positives = 175/333 (52%), Gaps = 30/333 (9%)
Query: 21 SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
S + GD +L + A++ +L +H K+Y L E R IF NLR I+
Sbjct: 34 SIISYGD-RLEKRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVN 92
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP-----AMIPNITLPRAFDWREY 135
V GLN F+DL+ E++++YLG + + R+ + LP + DWRE
Sbjct: 93 RTYKV-GLNRFADLTNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREK 151
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSIS 194
AV VKDQ CGS WAFST +EG+ T L+SLSEQEL+DCD+ + GC GG +
Sbjct: 152 GAVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMD 211
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVE 253
AF+ I++ GG++ E+ YPYR D C N+K A V I+GY V +++ K V
Sbjct: 212 YAFEFIINN--GGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVA 269
Query: 254 NGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
N P++VAI A A Q Y +GV F L H V+ VGYG T +V Y
Sbjct: 270 NQPVSVAIEAGGRAFQLYQSGV------FTGQCGTQLDHGVVAVGYG------TENSVDY 317
Query: 312 WIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
WI++NSWG WGE GY +L R G CGI
Sbjct: 318 WIVRNSWGPNWGESGYIKLERNLAGTETGKCGI 350
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 169/303 (55%), Gaps = 20/303 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQ 100
F ++NK+Y + VE +R IF NLRKI+ + ++ +G +G+ +F+DL+ EF
Sbjct: 26 FKVKNNKSYKSYVEEQTRFRIFQENLRKIEN-HNEKYNNGESTFKFGVTKFTDLTEKEFL 84
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
+ K + + P LP AFDWR+ AVT VKDQ MCGS W FSTTG++E
Sbjct: 85 DLLVLSKNARPNRTHATHLLAPLRDLPSAFDWRDKGAVTEVKDQGMCGSCWTFSTTGSVE 144
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
+ KT LVSLSEQ L+DC ++ GC GG + A + I GG+ EK YPY G
Sbjct: 145 AAHFLKTGNLVSLSEQNLVDCAKDTCYGCGGGWMDKALEYIEK---GGIMSEKDYPYEGV 201
Query: 220 DKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
D CR + KI+ + + + DE D+ + GP++VAI+A A Q YV+G+
Sbjct: 202 DDNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPISVAIDASATFQLYVSGILDDT 261
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
+ C ++L+H VL+VGYG + K YWIIKNSWG WG GY R+ R +
Sbjct: 262 E--CSNEFDSLNHGVLVVGYGTENGK------DYWIIKNSWGVNWGMDGYIRMSRNKNNQ 313
Query: 337 CGI 339
CGI
Sbjct: 314 CGI 316
>gi|394331814|gb|AFN27126.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 116/318 (36%), Positives = 173/318 (54%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPYAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A +L +LSEQ+L+ CD +D GC GG + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDSGCGGGLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I+GY+++ ET MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYE 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY + VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGDTLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|332249835|ref|XP_003274061.1| PREDICTED: cathepsin W [Nomascus leucogenys]
Length = 403
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/328 (36%), Positives = 174/328 (53%), Gaps = 27/328 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 69 FKLFQIQFNRSYLSPEEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 128
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G++ PS R + + P ++P DWR+ A++ +KDQ C WA +
Sbjct: 129 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFTCDWRKVAGAISPIKDQKNCNCCWAMAAA 186
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++ V +S QEL+DC + DGC GG + +AF T+++ GL EK YP+
Sbjct: 187 GNIEALWRINFWDFVDVSVQELLDCSRCGDGCHGGFVWDAFITVLN--NSGLASEKDYPF 244
Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G +A R + K Q I ++ + E +A+YL GP+ V IN LQ Y GV
Sbjct: 245 QGKVRAHRCHPKKYQKVAWIQDFIMLQNSEHRIAQYLATYGPITVTINMKPLQLYRKGVI 304
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
CD + + HSVL+VG+G +++ PYWI+KNSWG
Sbjct: 305 KATSTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 362
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGYFRL+RG +CGI + +A V
Sbjct: 363 QWGEKGYFRLHRGSNTCGITKFPLTARV 390
>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
Length = 323
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 174/317 (54%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y + VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y ++ E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|23110964|ref|NP_001326.2| cathepsin W preproprotein [Homo sapiens]
gi|29476894|gb|AAH48255.1| Cathepsin W [Homo sapiens]
gi|119594870|gb|EAW74464.1| cathepsin W (lymphopain), isoform CRA_b [Homo sapiens]
Length = 376
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/328 (36%), Positives = 176/328 (53%), Gaps = 27/328 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G++ PS R + + P ++P + DWR+ A++ +KDQ C WA +
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAA 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++ V +S QEL+DC + DGC GG + +AF T+++ GL EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217
Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G +A R + K Q I ++ + +E +A+YL GP+ V IN LQ Y GV
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVI 277
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
CD + + HSVL+VG+G +++ PYWI+KNSWG
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGYFRL+RG +CGI + +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363
>gi|44844206|emb|CAF32699.1| cathepsin L-like cysteine proteinase [Leishmania infantum]
Length = 381
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 174/320 (54%), Gaps = 37/320 (11%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKXXGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ ++ + G + EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213
Query: 214 YPY---RGDDKACRLN--KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
YPY GD C LN K +I+GYV + +ET MA +L ENGP+A+A++A +
Sbjct: 214 YPYTSGNGDVAEC-LNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMS 272
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G VL+VGY ++T VPYW+IKNSWGE WGEKGY
Sbjct: 273 YQSG-------------------VLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYV 307
Query: 329 RLYRGDGSCGINDYVRSALV 348
R+ G +C +++Y SA V
Sbjct: 308 RVAMGLNACLLSEYPVSAHV 327
>gi|82659048|gb|ABB88697.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 118/318 (37%), Positives = 175/318 (55%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A L +LSEQ+L+ CD +D+GC GG + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHGLTALSEQQLVSCDDKDNGCGGGLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I+GY+++ ET MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY +RT VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGEDWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 114/314 (36%), Positives = 173/314 (55%), Gaps = 21/314 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD--TEHGSGVYGLNEFSDLSTAEF 99
F F+E +NK Y + E R IF NL +I T+ + Y +N+FSDLS +E
Sbjct: 56 FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115
Query: 100 QAKYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
AK+ G + ++ +++ P P FDWRE + VT +K+Q CG+ WAF+T
Sbjct: 116 IAKFTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLA 175
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++E +A + +L+ LSEQ+LIDCD D GC GG + AF+ IM GG++ E YP+
Sbjct: 176 SVESQFAMRHNRLIDLSEQQLIDCDSVDMGCNGGLLHTAFEEIMRM--GGVQTELDYPFV 233
Query: 218 GDDKACRLNKKATQVK--INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
G ++ C L++ V + Y V +E + L GP+ +AI+A + Y GV
Sbjct: 234 GRNRRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVIS 293
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+ N L+H+VL+VGYGV+ VPYW+ KN+WG+ WGE GYFR+ +
Sbjct: 294 SCE------NNGLNHAVLLVGYGVE------NGVPYWVFKNTWGDDWGENGYFRVRQNVN 341
Query: 336 SCG-INDYVRSALV 348
+CG +ND +A++
Sbjct: 342 ACGMVNDLASTAVL 355
>gi|397516975|ref|XP_003828695.1| PREDICTED: cathepsin W [Pan paniscus]
Length = 376
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 118/328 (35%), Positives = 177/328 (53%), Gaps = 27/328 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G++ PS R + + P ++P + DWR+ A++ +KDQ C WA +
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAA 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++ V +S QEL+DC + DGC+GG + +AF T+++ GL EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCSRCGDGCQGGFVWDAFITVLNN--SGLASEKDYPF 217
Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G +A R + K Q I ++ + +E +A+YL GP+ V IN L+ Y GV
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVI 277
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
CD + + HSVL+VG+G +++ PYWI+KNSWG
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGYFRL+RG +CGI + +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363
>gi|114638622|ref|XP_001170363.1| PREDICTED: cathepsin W [Pan troglodytes]
Length = 376
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 118/328 (35%), Positives = 177/328 (53%), Gaps = 27/328 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G++ PS R + + P ++P + DWR+ A++ +KDQ C WA +
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAA 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++ V +S QEL+DC + DGC+GG + +AF T+++ GL EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCSRCGDGCQGGFVWDAFITVLN--NSGLASEKDYPF 217
Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G +A R + K Q I ++ + +E +A+YL GP+ V IN L+ Y GV
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVI 277
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
CD + + HSVL+VG+G +++ PYWI+KNSWG
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAERVSSQSQPQPPHPTPYWILKNSWGA 335
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGYFRL+RG +CGI + +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363
>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 337
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 118/303 (38%), Positives = 171/303 (56%), Gaps = 14/303 (4%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQ 100
FN +++++ KTY+T+ EY RL +++ N I+ L + EHG Y LN+FSDL+ AEF+
Sbjct: 35 FNMWMKKYEKTYSTMEEYNERLRVYTSNYYYIEQL-NKEHGPHTEYELNQFSDLTFAEFK 93
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YL S + + + N P A DWRE + +T VKDQ CGS W FSTTG +E
Sbjct: 94 KIYLTEPQHCSATNGNFQKPV-NARDPVAVDWREKNVITPVKDQGKCGSCWTFSTTGCLE 152
Query: 161 GVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+A KT +L+SLSEQ+L+DC + GC GG S AF+ I K GG+E E Y Y
Sbjct: 153 AHHAIKTGQLISLSEQQLVDCAGAFNNHGCNGGLPSQAFEYI--KYNGGIESESNYNYTA 210
Query: 219 DDKACRLNKKATQVKINGYVSVSRD-ETDMAKYLVENGPMAVAINAY-ALQFYVTGVSHP 276
D CR N ++ V++++D E D+ + GP+++A + Q Y GV
Sbjct: 211 KDGVCRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVSIAFEVTKSFQHYKKGVYQG 270
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
C + ++H+VL+VGY ++TK + YWI+KNSW WG GYF + RG +
Sbjct: 271 EIEVCSQSPDKVNHAVLVVGY--NQTKLGEE---YWIVKNSWSASWGMDGYFWIRRGHNA 325
Query: 337 CGI 339
CG+
Sbjct: 326 CGL 328
>gi|401416326|ref|XP_003872658.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|14348750|emb|CAC41275.1| CPB2 protein [Leishmania mexicana]
gi|322488882|emb|CBZ24132.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 359
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 123/319 (38%), Positives = 177/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGECGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S+ GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +
Sbjct: 154 SSVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + ++ +I+ +V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYLPECSNSSELVVGAQIDSHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + ++H+VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KEVNHAVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
Length = 323
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 175/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y + VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIII--KNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y ++ E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNTNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE+G+FR+
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEEGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|442736236|gb|AGC65593.1| cathepsin [Achaea janata granulovirus]
Length = 338
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 123/329 (37%), Positives = 177/329 (53%), Gaps = 44/329 (13%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF+ F+ +++K Y + +E ++ +F NL + D + + + +N ++D S E
Sbjct: 33 LFDLFMIKYHKVYRSELERAAKFEVFKRNLATLNDKNDKDE-NATFDINAYTDRSRNELL 91
Query: 101 AKYLGFKLKPSYADRSVP-------------AMIPNITLPRAFDWREYDAVTGVKDQTMC 147
GF+ ++A + P A P LP +FDWR+ + VT VKDQ C
Sbjct: 92 RTQTGFQ--SNFARNASPFTQKKGMCITRVVAGTPPCLLPESFDWRDKNVVTPVKDQLEC 149
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
GS WAF+ N E YA K K V SEQ L+DCDQ + GC+GG + AF+ I+ GG
Sbjct: 150 GSCWAFTAIANFESQYAIKHGKHVDFSEQHLLDCDQLNYGCDGGLMHWAFEEIIRM--GG 207
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-------RDETDMAKYLVENGPMAVA 260
+ E YPY G + C N +N Y ++S RDE + + LV NGP+AVA
Sbjct: 208 VVLEYDYPYTGVESFCANN-------VNMYTTISGCVQYDLRDEEKLRELLVTNGPIAVA 260
Query: 261 INAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
++ + Y +GV FC G N L+H+VL+VGYGVD+T + YW++KNSWG
Sbjct: 261 LDIVDIVDYKSGVVS----FC-GTNNGLNHAVLLVGYGVDKT------IEYWLLKNSWGT 309
Query: 321 GWGEKGYFRLYRGDGSCGI-NDYVRSALV 348
WGE+GYFR+ R SCGI N Y S ++
Sbjct: 310 DWGEEGYFRIKRNRNSCGILNSYAASVIL 338
>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
Length = 323
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 174/317 (54%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y + VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--INKDQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y ++ E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|71084302|gb|AAZ23596.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 116/318 (36%), Positives = 171/318 (53%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE +A +L +LSEQ+L+ CD D GC GG ++ AF+ ++ + G + E +Y
Sbjct: 155 VVGNIESQWAVAGHRLTALSEQQLVSCDDMDSGCGGGLMTQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGD----DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY + ++ +I+GYV + +ET MA +L ++GP+++ ++A + Y
Sbjct: 215 PYVSTFGYVPECTNSSQLVPGARIDGYVMIESNETVMAAWLAKSGPISIGVDASSFMSYH 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GV + L+H VL+VGY + VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 GGV------LTSCAGKQLNHGVLLVGYNMT------GEVPYWVIKNSWGENWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
Length = 443
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 175/319 (54%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VK+Q CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD D+GC GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLYTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>gi|340503366|gb|EGR29962.1| hypothetical protein IMG5_145110 [Ichthyophthirius multifiliis]
Length = 1095
Score = 202 bits (513), Expect = 3e-49, Method: Composition-based stats.
Identities = 111/277 (40%), Positives = 157/277 (56%), Gaps = 24/277 (8%)
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGF------KLKPSYADRSVPAMIPNITL----PRAFDW 132
S V+G +FSDLS +F K+L ++K + P + +IT+ P FDW
Sbjct: 831 SAVFGHTKFSDLSPQQFAQKHLKLNQKKLLQVKKETKKLTTP-IQQDITVEENVPEQFDW 889
Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
R+ + VT K Q CGS W FSTTG IE YA K +KLV SEQ+L+DCD +DGC GG
Sbjct: 890 RDRNVVTEPKYQNTCGSCWTFSTTGVIESQYAIKHQKLVPFSEQQLVDCDDINDGCHGGL 949
Query: 193 ISNAFDTIMSKLGGGLEEEKTY-PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYL 251
+++A+ + GGLE + Y Y+ + C+ + Q KI + + DE + K L
Sbjct: 950 MTDAYKYLQQS--GGLEFAEDYGDYKNKKEKCKFDLNKVQAKIKEWQQIDEDEEIIKKQL 1007
Query: 252 VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
+NGP+A +NA LQFY +G+ P + CD +++H++LIVGYGV++ Y
Sbjct: 1008 YQNGPIAAGVNARLLQFYKSGIFDPKE--CD---SDINHAILIVGYGVEK-----DGQKY 1057
Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WIIKN WG+ WG GYF+L RG CGI+ Y A +
Sbjct: 1058 WIIKNQWGKDWGMDGYFKLARGKKQCGIHTYASIAFI 1094
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 121/308 (39%), Positives = 168/308 (54%), Gaps = 22/308 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD +DGC GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLHTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSC 337
+ G +C
Sbjct: 322 VVMGVNAC 329
>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 120/343 (34%), Positives = 183/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPP 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTTDYGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
GG + + I+S G + ++YPY G C + K KI+G++++ +DE
Sbjct: 189 RGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Y GV ++ L H VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFLGYKGGV------LTSCISKGLDHDVLLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW +GWGE+GY R+ +G C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNYARSAVV 339
>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
Length = 442
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 120/343 (34%), Positives = 183/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 9 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 66
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 67 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPP 123
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 124 AVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTTDYGC 183
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
GG + + I+S G + ++YPY G C + K KI+G++++ +DE
Sbjct: 184 RGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDEN 243
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Y GV ++ L H VL+VGY D +K
Sbjct: 244 AIAEWLAKNGPVAIAVDATSFLGYKGGV------LTSCISKGLDHDVLLVGYD-DTSK-- 294
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW +GWGE+GY R+ +G C + +Y RSA+V
Sbjct: 295 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNYARSAVV 334
>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 165/315 (52%), Gaps = 18/315 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREE 93
Query: 99 FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y G + +R+ VP + + P A DWR AVT VKDQ CGS WAFS
Sbjct: 94 FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG ++NAF+ I+ + G + E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVGLPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY AVPYWIIKNS WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSRTTQWGEEGYIRIAKG 321
Query: 334 DGSCGINDYVRSALV 348
C + + SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336
>gi|73983670|ref|XP_540846.2| PREDICTED: cathepsin W [Canis lupus familiaris]
Length = 374
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 188/366 (51%), Gaps = 35/366 (9%)
Query: 13 LSLTVSVSSFMVVGDEKLHH------------LHHVKHTALFNYFLEQHNKTYATLVEYY 60
++LT+ +S + + L H ++ +F F Q+N++Y+ EY
Sbjct: 1 MALTIYLSCLLALSVASLAHGIKRSLKNQDPGPQPLELKQVFALFQIQYNRSYSNPEEYA 60
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSV 117
RL IF+ NL + Q L+D + G+ +G+ FSDL+ EF Y ++ PS R V
Sbjct: 61 RRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTEEEFGQFYGHQRMAGEAPSVG-RKV 119
Query: 118 PAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ +P DWR+ + + +K Q C WA + GNIE ++ + + V +S Q
Sbjct: 120 ESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAGNIEALWGIRYHQPVEVSVQ 179
Query: 177 ELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVK-I 234
EL+DC + DGC+GG +AF T+++ GL K YP+ G+ K R L KK +V I
Sbjct: 180 ELLDCGRCGDGCKGGFTWDAFITVLNN--SGLASAKDYPFLGNTKPHRCLAKKYKKVAWI 237
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
++ + +E +A YL GP+ V IN LQ Y GV CD + + HSVL+
Sbjct: 238 QDFIMLQGNEQAIAWYLATKGPITVTINMKLLQHYQKGVIQATHTTCD--PQRVDHSVLL 295
Query: 295 VGYGVDRT------------KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
VG+G ++ H +PYWI+KNSWG WGE+GYFRL+RG+ +CGI Y
Sbjct: 296 VGFGKSKSVAGKQAEGGSSRPRPHHPIPYWILKNSWGAEWGEEGYFRLHRGNNTCGITKY 355
Query: 343 VRSALV 348
+A V
Sbjct: 356 PVTARV 361
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 122/314 (38%), Positives = 170/314 (54%), Gaps = 31/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
++N +L +H K+Y L E +R IF NLR I S GLN F+DL+ E++
Sbjct: 48 MYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNEEYR 107
Query: 101 AKYLGFKLKPSY-------ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
AKYLG K + S +DR P + LP + DWRE AV VKDQ CGS WAF
Sbjct: 108 AKYLGTKSRESRPKLSKGPSDRYAP--VEGEELPDSIDWREKGAVAAVKDQGSCGSCWAF 165
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
S G +EG+ T +L++LSEQEL+DCD+ ++GCEGG + AF+ I+ GG++ +
Sbjct: 166 SAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKN--GGIDSDL 223
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFY 269
YPY G D C NK+ A V I+ Y V + + N P++VAI A + Q Y
Sbjct: 224 DYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQLY 283
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
V+G+ F + H V++VGYG + + + YWI++NSWG WGE GY +
Sbjct: 284 VSGI------FTGKCGTAVDHGVVVVGYG------SEEGMDYWIVRNSWGAAWGEAGYLK 331
Query: 330 LYRG----DGSCGI 339
+ R G CGI
Sbjct: 332 MQRNVGKSSGLCGI 345
>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 121/345 (35%), Positives = 177/345 (51%), Gaps = 20/345 (5%)
Query: 9 GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
++L ++ V ++ + LH + + F F ++H + Y + E RL +F
Sbjct: 7 ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYGSAAEEAFRLSVFRA 64
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
NL + L + +G+ FSDL+ EF+++Y G + +R+ VP + +
Sbjct: 65 NL-FLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVDVEFVGA 123
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWRE AVT VK+Q MCGS WAF+ GNIE + L LSEQ L+ CD +
Sbjct: 124 PAAKDWREEGAVTAVKNQGMCGSCWAFAAIGNIECQWFLAGNPLTRLSEQMLVSCDNTNS 183
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRD 243
GC GG AF I+ + G + E++YPY G C + I GYV++ RD
Sbjct: 184 GCGGGWPLVAFKWIVDRNNGTVYTEESYPYHSCIGISPPCTTSGHTVGATITGYVTIPRD 243
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A +L NGP+AV ++A + FY GV ++ LSH+VL+VGY T
Sbjct: 244 ENGIAAWLAVNGPVAVVVDASSWIFYTGGV------MTSCVSKQLSHAVLLVGYNDSAT- 296
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
VP+WIIKNSW WGE GY R+ +G C + + V SA+V
Sbjct: 297 -----VPHWIIKNSWTTHWGEDGYIRIAKGSNQCLVKEGVSSAVV 336
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 168/319 (52%), Gaps = 40/319 (12%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLS 95
AL+ +L +H KTY L E R IF NLR I EH SG + GLN+F+DL+
Sbjct: 50 ALYESWLVKHGKTYNALGEKDRRFQIFKDNLRFID-----EHNSGDHTYKLGLNKFADLT 104
Query: 96 TAEFQAKYLGFK-------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
E++ Y G K L +DR A +LP DWRE AVT VKDQ CG
Sbjct: 105 NEEYRMTYTGIKTIDDKKKLSKMKSDRY--AYRSGDSLPEYVDWREQGAVTDVKDQGSCG 162
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGG 207
S WAFSTTG++EGV T L+S+SEQEL++CD + GC GG + AF+ I+ GG
Sbjct: 163 SCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKN--GG 220
Query: 208 LEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--Y 264
++ E+ YPY G D C NKK A V I+ Y V ++ K V N P+AVAI A
Sbjct: 221 IDTEEDYPYTGKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGR 280
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
QFY +G+ F L H VL GYG + K YW++KNSWG WGE
Sbjct: 281 DFQFYTSGI------FTGSCGTALDHGVLAAGYGTEDGK------DYWLVKNSWGAEWGE 328
Query: 325 KGYFRLYRG----DGSCGI 339
GY ++ R G CGI
Sbjct: 329 GGYLKMERNIADKSGKCGI 347
>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 178/318 (55%), Gaps = 21/318 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K + F FL + NK Y++ E R IF NL +I ++++ + Y +N+FSDLS
Sbjct: 22 LKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEI-IIKNQNDTTAQYEINKFSDLS 80
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L P ++ P P FDWR + VT VK+Q +CG+ WA
Sbjct: 81 KDETISKYTGLAL-PLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ+LIDCD D GC GG + A++ +M GG++ E
Sbjct: 140 FATLASLESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTAYEAVMQM--GGVQAEN 197
Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D CR++ VK+ Y ++ E + L GP+ VAI+A + Y
Sbjct: 198 DYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAIDASDIVNYRR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ +C N +H+VL+VGYGV+ VPYWI+KN+WGE WGE+GYFR+
Sbjct: 258 GIMR----YC--SNYGFNHAVLLVGYGVENN------VPYWILKNTWGEDWGEQGYFRVQ 305
Query: 332 RGDGSCGI-NDYVRSALV 348
+ +CGI N+ + SA +
Sbjct: 306 QNINACGIRNELLASAEI 323
>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 524
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 119/343 (34%), Positives = 184/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F ++ +
Sbjct: 93 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMER 150
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ +FSD+S EF+A YL G K + R P + N++ P
Sbjct: 151 AKE-EAAANPYATFGVTQFSDMSPEEFRATYLNGAKYYAAALKR--PRKVVNVSTGKAPP 207
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAF+ GNIEG + +L SLSEQ L+ CD +D C
Sbjct: 208 AVDWRKKGAVTPVKDQGSCGSCWAFAAIGNIEGQWKIAGHELTSLSEQMLVSCDTTEDNC 267
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD---KACRLNKKATQVKINGYVSVSRDET 245
GG AF I+S G + E++YPY D C + K KI+G++++ +DE
Sbjct: 268 GGGFADRAFKWIVSSNKGNVFTERSYPYASIDGYVPPCNKSGKVVGAKISGHINLPKDEN 327
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L NGP+A+A++A Y GV +++++H VL+VGY D +K
Sbjct: 328 AIAEWLARNGPVAIAVDASTFLDYKGGV------LTSCSSKHVNHEVLLVGYN-DTSK-- 378
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW + WGE+GY R+ +G C + +Y RS +V
Sbjct: 379 ---PPYWIIKNSWDKEWGEEGYIRIEKGTNLCLMKEYARSVVV 418
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 140/348 (40%), Positives = 182/348 (52%), Gaps = 32/348 (9%)
Query: 11 ALLSLTVSV-----SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
ALL L V S F +VG + + + LF +L +H K YA+ E R +
Sbjct: 13 ALLLLCVGACVARNSDFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEEKLHRFEV 72
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT 125
F NL+ I + + E S GLNEF+DL+ EF+A YLG P+ S +++
Sbjct: 73 FKDNLKHIDKI-NREVTSYWLGLNEFADLTHDEFKAAYLGLDAAPARRGSSRSFRYEDVS 131
Query: 126 ---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
LP++ DWR+ AVT VK+Q CGS WAFST +EG+ A T L +LSEQELIDC
Sbjct: 132 ASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCS 191
Query: 183 QE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYVS 239
+ + GC GG + AF I S GGL E+ YPY ++ +C KKA V I+GY
Sbjct: 192 VDGNSGCNGGLMDYAFSYIASS--GGLHTEEAYPYLMEEGSCGDGKKAESEAVTISGYED 249
Query: 240 V-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
V + DE + K L P++VAI A QFY GV F L H V VG
Sbjct: 250 VPANDEQALIKALAHQ-PVSVAIEASGRHFQFYSGGV------FDGPCGAQLDHGVAAVG 302
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGIN 340
YG D+ K Y I++NSWG WGEKGY R+ R G+G CGIN
Sbjct: 303 YGSDKG----KGHDYIIVRNSWGAQWGEKGYIRMKRGTSNGEGLCGIN 346
>gi|47779249|gb|AAT38521.1| cysteine protease [Bombyx mori NPV]
Length = 323
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 174/317 (54%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ++IDCD D GC GG + AF+ GG++ E
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEANCRM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y + E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 121/312 (38%), Positives = 166/312 (53%), Gaps = 26/312 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A + +L +H K+Y L E R IF N I + S GLN F+DL+ E+
Sbjct: 42 AAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEEY 101
Query: 100 QAKYLGFKLKPSYADRSVP----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
++KY G + K S S A + +LP + DWRE+ AV VKDQ CGS WAFST
Sbjct: 102 RSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWAFST 161
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
+EG+ T KL++LSEQEL+DCD+ ++GC GG + +AF I++ GG++ + Y
Sbjct: 162 ISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINN--GGIDSDADY 219
Query: 215 PYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVT 271
PY G D C + K A V I+ Y V + + N P++VAI A QFY +
Sbjct: 220 PYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDS 279
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ F +L H V++VGYG + K YWI++NSWG WGEKGY R+
Sbjct: 280 GI------FTGKCGTDLDHGVVVVGYGTENGK------DYWIVRNSWGADWGEKGYLRME 327
Query: 332 RG----DGSCGI 339
RG G CGI
Sbjct: 328 RGISSKAGICGI 339
>gi|394331822|gb|AFN27130.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 116/318 (36%), Positives = 172/318 (54%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPYAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A +L +LSEQ+L+ CD +D GC GG + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDSGCGGGLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I+GY+++ ET MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYE 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV L+H VL+VGY + VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGITLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 173/317 (54%), Gaps = 26/317 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
F H K+Y + +E R IFS N + + ++ G+ G+N+F DL EF
Sbjct: 30 FKATHKKSYQSNMEELLRFKIFSENSLLVAR-HNEKYARGLVSYKLGMNQFGDLLPHEFA 88
Query: 101 AKYLGFKLKPSYADRSV---PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ G++ + S PA + +LP++ DWRE AVT VK+Q CGS WAFSTTG
Sbjct: 89 RMFNGYRGARTAGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTG 148
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
++EG + KT LVSLSEQ L+DC + + GCEGG + NAF I K GG++ EK+YP
Sbjct: 149 SLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYI--KANGGIDTEKSYP 206
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y +D CR K+ G+V + + E D+ K + GP++VAI+A + Q Y G
Sbjct: 207 YEAEDGECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEG 266
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V + C +E L H VL+VGYGV+ K YW++KNSW E WG+ GY ++ R
Sbjct: 267 VYDETE--CS--SEQLDHGVLVVGYGVEDGK------KYWLVKNSWAESWGDNGYIKMSR 316
Query: 333 G-DGSCGINDYVRSALV 348
D CGI LV
Sbjct: 317 DKDNQCGIASAASYPLV 333
>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
Length = 324
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 115/317 (36%), Positives = 171/317 (53%), Gaps = 20/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ Q NK Y + +E R IF NL +I + ++ + Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEI-INKNQNDSAAKYEINKFSDLS 80
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P ++ P P FDWR + VT VK+Q +CG+ WA
Sbjct: 81 KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPFEFDWRRLNKVTNVKNQGVCGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+ ++E +A K +L+ LSEQ++IDCD D GC GG + AF+ ++ GG++ EK
Sbjct: 140 FAALASLESQFAMKHNQLIDLSEQQMIDCDSVDAGCNGGLLHTAFEAVIKM--GGVQLEK 197
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY + CR+N VK+ + Y + E + L GP+ +AI+A + Y
Sbjct: 198 DYPYEAANNNCRMNSNKFLVKVKDCYRYIIVYEEKLKDLLRSVGPIPMAIDAADIVNYKQ 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE GYFRL
Sbjct: 258 GI---IKYCLNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGESGYFRLQ 305
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 306 QNINACGMRNELASTAV 322
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 126/314 (40%), Positives = 167/314 (53%), Gaps = 31/314 (9%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A++ +L +H K+Y + E R IF NLR I + E + GLN F+DL+ E+
Sbjct: 44 AMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDE-HNAESRTYKVGLNRFADLTNDEY 102
Query: 100 QAKYLGFKL-------KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
++ YLG + +DR VP + +LP + DWRE AV GVKDQ CGS WA
Sbjct: 103 RSMYLGARTGSRRRLSTQKRSDRYVP--VAGESLPDSVDWREKGAVVGVKDQGSCGSCWA 160
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FST +EG+ T L+SLSEQEL+DCD ++GC GG + AF+ I+ GG++ E
Sbjct: 161 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTE 218
Query: 212 KTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
+ YPY D C + K A V I+ Y V + + V N P++VAI A A QF
Sbjct: 219 EDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQF 278
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +GV F L H V VGYG T +V YWI+KNSWG WGE GY
Sbjct: 279 YESGV------FTGNCGTALDHGVTAVGYG------TENSVDYWIVKNSWGSSWGESGYI 326
Query: 329 RLYRGDGS---CGI 339
R+ R G+ CGI
Sbjct: 327 RMERNTGATGKCGI 340
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 129/325 (39%), Positives = 174/325 (53%), Gaps = 40/325 (12%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF+ +L +H K Y + E RL IF NL+ I + S GLN+F+DL+ EF+
Sbjct: 42 LFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFK 101
Query: 101 AKYLGFKLKPSYADR----------------SVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+Y G K + DR +V + + ++ + DWR+ AVTGVKDQ
Sbjct: 102 TRYFG-KNSKQWRDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQ 160
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKL 204
CGS WAFSTTG IEGV T KLVSLSEQEL+ CD + GCEGG + AF ++
Sbjct: 161 AQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDATNYGCEGGDMDYAFTWVIQN- 219
Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENG--PMAVAI 261
GG++ EK Y Y G D C NK+A + V I+GY VS D++ + L G P++V I
Sbjct: 220 -GGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYTDVSPDDSAL---LCAAGSQPVSVGI 275
Query: 262 NAYAL--QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
+ A+ Q Y G+ C G +++ H+VL+VGY K YWI+KNSWG
Sbjct: 276 DGSAIDFQLYTGGI---YDGDCSGNPDDIDHAVLVVGYSAKNGK------DYWIVKNSWG 326
Query: 320 EGWGEKGYFRLYRGD----GSCGIN 340
WG +GYF + R G C IN
Sbjct: 327 TDWGLEGYFYILRNTELPYGVCAIN 351
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 167/303 (55%), Gaps = 26/303 (8%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ +L E R ++F NL+ I + + + LN F+D++ EF Y G K+
Sbjct: 46 HHTVSRSLAEKQERFNVFKENLKHIHKVNHKDRPYKLK-LNSFADMTNHEFLQHYGGSKV 104
Query: 109 KPSYADRS----VPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
R +M + + LP + DWR+ AVTG+KDQ CGS WAFST +EG+
Sbjct: 105 SHYRVLRGQRQGTGSMHEDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGIN 164
Query: 164 AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
KT +L+SLSEQEL+DCD ++ GC GG + +AF+ I K GGL E TYPYR ++ C
Sbjct: 165 KIKTGELISLSEQELVDCDSDNHGCNGGLMEDAFNFI--KQIGGLTSENTYPYRAKEEPC 222
Query: 224 RLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFF 280
NK + V I+GY V ++ + V N P+A+A++A LQFY + F
Sbjct: 223 DSNKMNSPVVNIDGYEMVPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAI-----FT 277
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGS 336
D G E L+H V +VGYG T YWI+KNSWG WGEKGY R+ RG +G
Sbjct: 278 GDCGTE-LNHGVALVGYGT-----TQDGTKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGL 331
Query: 337 CGI 339
CGI
Sbjct: 332 CGI 334
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 169/312 (54%), Gaps = 28/312 (8%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
++ +L +H K Y L E R IF NLR I + V GLN F+DL+ E++
Sbjct: 50 MYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKV-GLNRFADLTNEEYK 108
Query: 101 AKYLGFKLKPS---YADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
A +LG K++ RS + + LP DWRE AV VKDQ CGS WAFST
Sbjct: 109 AMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTV 168
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
G +EG+ T +L+SLSEQEL+DCD+ + GC GG + AF+ I++ GG++ E+ YP
Sbjct: 169 GAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINN--GGIDTEEDYP 226
Query: 216 YRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y+ D C N+K A V I+GY V ++ + K V + P++VAI A A Q Y +G
Sbjct: 227 YKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQLYKSG 286
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V F G E L H V+ VGYG T V YWI++NSWG WGE GY R+ R
Sbjct: 287 V-----FTGRCGTE-LDHGVVAVGYG------TENGVNYWIVRNSWGSAWGESGYIRMER 334
Query: 333 G-----DGSCGI 339
G CGI
Sbjct: 335 NVANTKTGKCGI 346
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 118/334 (35%), Positives = 178/334 (53%), Gaps = 21/334 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L ++V++ +V ++TA + + + K Y++ E R I+ N +K
Sbjct: 1 MKLLIAVAALIVCATA-------FEYTAEWELWKRTNGKDYSSEKEELYRQTIWEAN-KK 52
Query: 73 IQLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAF 130
I L + + L N F+DL ++EF A Y G++ ++ + + LP
Sbjct: 53 IVLEHNANADKWGWTLEMNAFADLESSEFAAMYNGYRRSARKSNATRYHVPTGNALPDTV 112
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGC 188
DWR AVT VK+Q CGS WAFSTTG++EG K L SLSEQ+L+DC + + GC
Sbjct: 113 DWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYGNHGC 172
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMA 248
+GG + NAF I + GG++ E +YPY + CR + A GY + D+ D
Sbjct: 173 QGGLMDNAFKYI--EANGGIDSEASYPYEAKNGKCRFQQSAVAATCTGYKDIPHDDIDGL 230
Query: 249 KYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+ V N GP++VA++A + Q Y GV P+ C + L H VL VGYG + +
Sbjct: 231 QDAVANVGPISVAMDASHSSFQLYAAGVYDPL--LCS--STRLDHGVLAVGYGTEPSGLF 286
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
H+ PYW++KNSWG WG++GYF++ R D CGI
Sbjct: 287 HEEKPYWLVKNSWGPDWGQQGYFKIVRKDNKCGI 320
>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 200 bits (509), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 117/318 (36%), Positives = 174/318 (54%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A +L +LSEQ+L+ CD +D+GC GG + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDNGCRGGLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I+GY+++ ET MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSSTGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV L+H VL+V Y +RT VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGMPLNHGVLLVWY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|2582045|gb|AAB82449.1| lymphopain [Homo sapiens]
gi|2582181|gb|AAB82457.1| lymphopain [Homo sapiens]
gi|3033547|gb|AAC32181.1| cathepsin W [Homo sapiens]
Length = 376
Score = 200 bits (509), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 118/328 (35%), Positives = 175/328 (53%), Gaps = 27/328 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G++ PS R + + P ++P + DWR+ A++ +KDQ C WA +
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAA 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++ V +S EL+DC + DGC GG + +AF T+++ GL EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVHELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217
Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G +A R + K Q I ++ + +E +A+YL GP+ V IN LQ Y GV
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVI 277
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
CD + + HSVL+VG+G +++ PYWI+KNSWG
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGYFRL+RG +CGI + +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363
>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
Length = 324
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 116/318 (36%), Positives = 172/318 (54%), Gaps = 21/318 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F FL + NK Y++ E R IF NL +I + ++ + Y +N+FSDLS
Sbjct: 22 LKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEI-INKNQNDSTAQYEINKFSDLS 80
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L P +I P P FDWR+++ VT VK+Q +CG+ WA
Sbjct: 81 KEEAISKYTGLSL-PHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ+ IDCD+ + GC+GG + AF++ M GG++ E
Sbjct: 140 FATLGSLESQFAIKYNRLINLSEQQFIDCDRVNAGCDGGLLHTAFESAMEM--GGVQMES 197
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY + CR+N V + + E + L GP+ VAI+A + Y
Sbjct: 198 DYPYETANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ N L+H+VL+VGY V+ +PYWI+KN+WG WGE GYFR+
Sbjct: 258 GIMRQC------ANHGLNHAVLLVGYAVENN------IPYWILKNTWGTDWGEDGYFRVQ 305
Query: 332 RGDGSCGI-NDYVRSALV 348
+ +CGI N+ V SA +
Sbjct: 306 QNINACGIRNELVSSAEI 323
>gi|403293523|ref|XP_003937763.1| PREDICTED: cathepsin W [Saimiri boliviensis boliviensis]
Length = 373
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 115/328 (35%), Positives = 177/328 (53%), Gaps = 30/328 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +F Q N++Y T E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKFFQRQFNRSYLTPEEHARRLDIFAHNLAQAQQLQEEDFGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLKPSYADRSVPAM-------IPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAF 153
Y G + A VP M P ++P DWR+ A++ +++Q C WA
Sbjct: 102 LY-GHR----RAAGGVPGMGRVVGPEEPEESVPHTCDWRKVAGAISSIRNQGNCNCCWAM 156
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+ GNIE ++ K V++S QEL+DC + +GC GG + AF T+++ G+ E+
Sbjct: 157 AAAGNIEALWGINFLKFVNVSVQELLDCGRCGNGCYGGYVWEAFLTVLNN--SGVASERD 214
Query: 214 YPYRGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YP+R + + R + K + I ++ + +E +A+YL GP+ V IN L+ Y
Sbjct: 215 YPFRANFRPHRCHAKTSNKVAWIQDFIFLPDNEQRIAQYLATYGPITVTINMKYLKLYQK 274
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT-----------HKAVPYWIIKNSWGE 320
GV CD + + HSVL+VG+G D+++ ++ PYWI+KNSWG
Sbjct: 275 GVIKASPTTCD--PQFVDHSVLLVGFGSDKSEGMGAETVSSPSRHPRSTPYWILKNSWGA 332
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GYFRL+RG +CGI Y +A V
Sbjct: 333 QWGEEGYFRLHRGSNTCGITKYPVTARV 360
>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
Length = 467
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 118/345 (34%), Positives = 177/345 (51%), Gaps = 20/345 (5%)
Query: 9 GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
++L ++ V ++ + LH + + F F ++H + Y + E RL +F
Sbjct: 7 ALSLAAVLVVMACLVPAATASLHAEETL--ASQFAEFKQKHGRVYGSAAEEAFRLSVFRA 64
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS--VPAMIPNITL 126
NL + L + +G+ FSDL+ EF+++Y + A+ VP + +
Sbjct: 65 NL-FLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAEERARVPVDVEVVGA 123
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWRE AVT VK+Q +CGS WAF+ GNIEG + L LSEQ L+ CD +
Sbjct: 124 PAAKDWREEGAVTAVKNQGICGSCWAFAAIGNIEGQWFLAGNPLTRLSEQMLVSCDNTNS 183
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRD 243
GC GG S AF+ I+ + G + E +YPY G C+ + + I G+V + +D
Sbjct: 184 GCGGGLSSKAFEWIVQENNGAVYTEDSYPYHSCIGIKLPCKDSDRTVGATITGHVELPQD 243
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A GP++VA++A + FY GV + ++ LSH+VL+VGY
Sbjct: 244 EAQIAASGAVKGPLSVAVDASSWFFYTGGV------LTNCVSKRLSHAVLLVGYN----- 292
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYWIIKNSW WGE GY R+ +G C + + V SA+V
Sbjct: 293 -DSAAVPYWIIKNSWTTHWGEGGYIRIAKGSNQCLVKEEVSSAVV 336
>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
Length = 341
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 122/323 (37%), Positives = 177/323 (54%), Gaps = 21/323 (6%)
Query: 32 HLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
+L+++ L F F+ Q+NK Y++ E R +IF N+ I +++ + S VY +N
Sbjct: 33 NLYNINSAPLYFEKFITQYNKQYSSEDEKKYRYNIFRHNIESINA-KNSRNDSAVYKINR 91
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP---NITLPRAFDWREYDAVTGVKDQTMC 147
F+D++ E ++ G + A+ ++ P FDWR Y+ VT VKDQ MC
Sbjct: 92 FADMTKNEVVNRHTGLASGDTGANFCETIVVDGPGQRQRPANFDWRNYNKVTSVKDQGMC 151
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
G+ WAF+ G +E YA K +L+ L+EQ+L+DCD D GC+GG I A++ IM GG
Sbjct: 152 GACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMHI--GG 209
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
+E+E YPY+ C + V + N Y V E + L GP+A+A++A L
Sbjct: 210 VEQEYDYPYKAVRLPCAVKPHKFAVGVRNCYRYVLLSEERLEDLLRHVGPIAIAVDAVDL 269
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y GV FC+ N L+H+VL+VGYGV+ VPYW IKNSWG +GE G
Sbjct: 270 TDYYGGVIS----FCE--NNGLNHAVLLVGYGVENN------VPYWTIKNSWGPDYGENG 317
Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
Y R+ RG SCG IN+ SA +
Sbjct: 318 YVRIRRGVNSCGMINELASSAQI 340
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 169/314 (53%), Gaps = 30/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF + +QH KTYA+ E RL +F N + + S LN F+DL+ EF+
Sbjct: 29 LFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFK 88
Query: 101 AKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A LG L ++R +P + ++ P + DWR+ AVT VKDQ CG+ W+FS
Sbjct: 89 ASRLGLSSAASASLNVDRSNRQIPDFVADV--PASVDWRKNGAVTQVKDQGNCGACWSFS 146
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
TG IEG+ T LVSLSEQEL+DCD+ ++GCEGG + AF ++ G++ E+
Sbjct: 147 ATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN--HGIDTEED 204
Query: 214 YPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYV 270
YPY+G D++C K K V I+GYV V ++ V N P++V I + A Q Y
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
G+ F + +L H+VLIVGYG + V YWI+KNSWG WG GY +
Sbjct: 265 KGI------FTGPCSTSLDHAVLIVGYG------SENGVDYWIVKNSWGSYWGMDGYMHM 312
Query: 331 YRGDGS----CGIN 340
R GS CGIN
Sbjct: 313 QRNSGSSRGLCGIN 326
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 141/352 (40%), Positives = 186/352 (52%), Gaps = 37/352 (10%)
Query: 1 MSC-FYFFAGVALLSLTVSVSSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLV 57
++C F FA +A+ F +VG E L + K LF ++ +H K Y ++
Sbjct: 11 LACSFCLFASLAV------AGDFSIVGYSSEDLKSMD--KLIELFESWMSRHGKIYQSIE 62
Query: 58 EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
E R IF NL+ I + GLNEF+DLS EF+ KYLG K+ S S
Sbjct: 63 EKLHRFDIFKDNLKHIDERNKVVSNYWL-GLNEFADLSHQEFKNKYLGLKVDYSRRRESP 121
Query: 118 PAMI-PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ LP++ DWR+ AVT VK+Q CGS WAFST +EG+ T L SLSEQ
Sbjct: 122 EEFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 181
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKI 234
ELIDCD+ ++GC GG + AF I+ GGL +E+ YPY ++ C + K+ T+ V I
Sbjct: 182 ELIDCDRTYNNGCNGGLMDYAFSFIVEN--GGLHKEEDYPYIMEEGTCEMTKEETEVVTI 239
Query: 235 NGYVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHS 291
+GY V + +E + K LV N P++VAI A QFY GV F +L H
Sbjct: 240 SGYHDVPQNNEQSLLKALV-NQPLSVAIEASGRDFQFYSGGV------FDGHCGSDLDHG 292
Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
V VGYG T K V Y I+KNSWG WGEKGY R+ R +G CGI
Sbjct: 293 VAAVGYG------TSKGVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGICGI 338
>gi|426369199|ref|XP_004051582.1| PREDICTED: cathepsin W [Gorilla gorilla gorilla]
Length = 376
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 117/328 (35%), Positives = 173/328 (52%), Gaps = 27/328 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G++ PS R + + P ++P DWR+ A++ +KDQ C WA +
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFTCDWRKVAGAISPIKDQKNCNCCWAMAAA 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++ V +S QEL+DC + DGC GG + +AF T+++ GL EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217
Query: 217 RGDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G +A C K I ++ + +E +A+YL GP+ V IN L+ Y GV
Sbjct: 218 QGKVRAHSCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVI 277
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
CD + + HSVL+VG+G +++ PYWI+KNSWG
Sbjct: 278 KATPITCD--PQLVDHSVLLVGFGSIKSEEGILAETVSSQSQPQPPHPTPYWILKNSWGA 335
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGYFRL+RG +CGI + +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 171/343 (49%), Gaps = 31/343 (9%)
Query: 7 FAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIF 66
FA V L L S + D +H H ++ ++ K Y L E R +IF
Sbjct: 12 FALVLCLGLWAFQVSSRTLQDASMHERHE--------QWMARYGKVYKDLQEKEKRFNIF 63
Query: 67 SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYA-DRSVPAMIPNIT 125
N++ I+ + + G+N+F+DL+ EF A FK S + R+ N+T
Sbjct: 64 QENVKYIEASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTTFKYENVT 123
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
P DWR+ AVT VK+Q CG WAFS EG++ T LVSLSEQEL+DCD
Sbjct: 124 APSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSG 183
Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSVSR 242
D GC+GG + +AF I+ GGL E YPY+G D C N++ T V I GY V
Sbjct: 184 ADQGCQGGLMDDAFKFIIQN--GGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPS 241
Query: 243 DETDMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
+ + V N P++VAI+A F Y +GV F L H V +VGYGV
Sbjct: 242 NNEQALQQAVANQPISVAIDASGSDFQNYQSGV------FTGSCGTQLDHGVAVVGYGV- 294
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ YW++KNSWGE WGE+GY R+ R +G CGI
Sbjct: 295 ----SDDGTKYWLVKNSWGEDWGEEGYIRMQRDVEAPEGLCGI 333
>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
Length = 440
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 124/343 (36%), Positives = 181/343 (52%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLHAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPP 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ C SSWAFS GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AIDWRKKGAVTPVKDQGQCHSSWAFSAIGNIEGQWKIAGHELTSLSEQMLVSCDTNDFGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
GG AF I+S G + E++YPY G+ C + K KI V + RDE
Sbjct: 189 GGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L + GP+A+A++A + Q Y GV +E+L H VL+VGY D +K
Sbjct: 249 AIAEWLAKKGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG+GWGE+GY R+ +G C + + SA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCLMKNLPSSAVV 339
>gi|394331826|gb|AFN27132.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 117/318 (36%), Positives = 173/318 (54%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A L +LSEQ+L+ CD +D+GC GG + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHGLTALSEQQLVSCDDKDNGCSGGLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I GY+++ ET +L +NGP+++A++A + Y
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIEGYMTIESSETVKGAWLAKNGPISIAVDASSFMSYQ 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV + L+H VL+VGY +RT VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGEDWGEKGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 168/316 (53%), Gaps = 32/316 (10%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
L+ ++ QH K Y + E R IF NLR I + + GLN+F+DL+ E+
Sbjct: 43 GLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEY 102
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNI--------TLPRAFDWREYDAVTGVKDQTMCGSSW 151
+AK+LG + P R + + IP+ LP + DWR++ AV+ VKDQ CGS W
Sbjct: 103 RAKFLGTRTDPRR--RLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSCGSCW 160
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
AFST +EG+ + +LVSLSEQEL+DCD+ D GC GG + AF IM GG++
Sbjct: 161 AFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDN--GGIDT 218
Query: 211 EKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQ 267
EK YPY G + C KK A V I+GY V +E + K V + P+++AI A A Q
Sbjct: 219 EKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVSIAIEAGGRAFQ 277
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
Y +GV F L H V+ VGYG D YWI++NSWG WGE GY
Sbjct: 278 LYESGV------FNGECGLALDHGVVAVGYGTD-----DNGQDYWIVRNSWGSNWGENGY 326
Query: 328 FRLYR----GDGSCGI 339
R+ R G CGI
Sbjct: 327 IRMERNINANTGKCGI 342
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 125/341 (36%), Positives = 177/341 (51%), Gaps = 25/341 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
VA +L +S+ ++ K + A++ +L +H K+Y L E R IF N
Sbjct: 18 VASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDN 77
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLP 127
LR I E+ S GLN F+DL+ E+++ YLG K KP + P + +LP
Sbjct: 78 LRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVKSDRYAPRVGDSLP 137
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DD 186
+ DWR AV +KDQ CGS WAFST +EG+ T +L++LSEQEL+DCD+ ++
Sbjct: 138 ESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNE 197
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDET 245
GC+GG + F+ I++ GG++ +K YPY G D C + K A V I+ Y V +
Sbjct: 198 GCDGGLMDYGFEFIINN--GGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNE 255
Query: 246 DMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
+ K V + P++V I A QFY +G+ F L H V +VGYG
Sbjct: 256 EALKKAVASQPVSVGIEGGGRAFQFYDSGI------FTGKCGTALDHGVNVVGYG----- 304
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
T K YWI++NSWG WGE GY R+ R G CGI
Sbjct: 305 -TEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGI 344
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 132/354 (37%), Positives = 177/354 (50%), Gaps = 29/354 (8%)
Query: 1 MSCFYFFAGVALLSLTVSVS--SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
M FA +AL ++ S S F ++ + + L+ +L QH K Y L E
Sbjct: 1 MGILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDE 60
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL--KPSYADRS 116
+ +F N I + + S GLN+F+DLS EF+A YLG KL K +
Sbjct: 61 KQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSP 120
Query: 117 VPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
P ++ LP + DWRE AVT VK+Q CGS WAFST +EG+ T L SLS
Sbjct: 121 SPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQV 232
EQEL+DCD + GC GG + AF I+S GGL+ E YPY+ ++ +C K A V
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIISN--GGLDSEDDYPYKANNGSCDAYRKNAHVV 238
Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSH 290
I+ Y V ++ K N P++VAI A A QFY +GV F L H
Sbjct: 239 TIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGV------FTSNCGTQLDH 292
Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
V +VGYG + + YW++KNSWG WGEKG+ +L R G CGI
Sbjct: 293 GVTLVGYG------SESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGI 340
>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 125/309 (40%), Positives = 165/309 (53%), Gaps = 20/309 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAE 98
+ F H KTY +LVE R +F NL IQ E G + + +F+D++ E
Sbjct: 23 WQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEE 82
Query: 99 FQ--AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F K G PS A ++ A DWRE AVT VKDQ CGS WAFS
Sbjct: 83 FLDLLKLQGVPALPSNAVHFDNFEDIDMEEKDAIDWREEGAVTPVKDQANCGSCWAFSAV 142
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKT 213
G IEG + K LVSLS QEL+DC ED +GC+GG + AFD + + G++ E++
Sbjct: 143 GAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEES 199
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
YPY G +C+ + + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 200 YPYEGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGI 257
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+ C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL +
Sbjct: 258 VDE-RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKD 310
Query: 334 DGSCGINDY 342
+CGI Y
Sbjct: 311 VKACGIGTY 319
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 127/307 (41%), Positives = 170/307 (55%), Gaps = 32/307 (10%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ +L E + R ++F N+ + + + LN+F+D++ EF+ Y G K+
Sbjct: 44 HHTVSRSLDEKHKRFNVFKANVHYVHNFNKKDKPYKL-KLNKFADMTNHEFRQHYAGSKI 102
Query: 109 KPSY----ADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
K A R+ + +P + DWR+ AVT VKDQ CGS WAFST +EG+
Sbjct: 103 KHHRTLLGASRANGTFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGI 162
Query: 163 YAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
KTKKLVSLSEQEL+DCD E+ GC GG + AFD I + GG+ E+ YPY+ +D
Sbjct: 163 NQIKTKKLVSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKR--GGITTEERYPYKAEDD 220
Query: 222 ACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQ 278
C + K+ T V I+G+ V ++ D V N P++VAI+A QFY GV
Sbjct: 221 KCDIQKRNTPVVSIDGHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGV----- 275
Query: 279 FFCDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG--- 333
F + G E L H V IVGYG VD TK YWI+KNSWG GWGEKGY R+ R
Sbjct: 276 FTGECGTE-LDHGVAIVGYGTTVDGTK-------YWIVKNSWGAGWGEKGYIRMQRKVDA 327
Query: 334 -DGSCGI 339
+G CGI
Sbjct: 328 EEGLCGI 334
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 171/311 (54%), Gaps = 27/311 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +H K Y + VE R+ IF+ N KI L S GLN+++D+ EF+
Sbjct: 30 FKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLHHEFKE 89
Query: 102 KYLGF------KLKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
G+ +L+ + + P N+ +P+A DWR++ AVT VKDQ CGS W+FS
Sbjct: 90 TMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHCGSCWSFS 149
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
+TG++EG + K LVSLSEQ L+DC + ++GC GG + NAF I K GG++ EK
Sbjct: 150 STGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGVDTEK 207
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFY 269
+YPY G D +C NK G+V + + DE M K + GP+AVAI+A + Q Y
Sbjct: 208 SYPYEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASNESFQLY 267
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV + D NL H VL+VGYG D+ YW++KNSWG WG++GY +
Sbjct: 268 SEGVYNDPNCSSD----NLDHGVLVVGYGTDK-----DGQDYWLVKNSWGTTWGDQGYIK 318
Query: 330 LYRG-DGSCGI 339
+ R D CGI
Sbjct: 319 MARNQDNQCGI 329
>gi|261328618|emb|CBH11596.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
gambiense DAL972]
Length = 404
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 115/306 (37%), Positives = 162/306 (52%), Gaps = 20/306 (6%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
+ K Y E R F N+ + ++ Q + +G+ FSD++ EF+A+Y
Sbjct: 2 YGKVYKDAKEEAFRFRAFEENMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGAS 60
Query: 109 KPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAA 165
+ A + + + N+T P A DWRE AVT +KDQ CGS WAF + GNIEG +
Sbjct: 61 YFAAAQKRLRKTV-NVTTGRAPAAVDWREKGAVTPMKDQGQCGSCWAFYSIGNIEGQWQV 119
Query: 166 KTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKA 222
LVSLSEQ L+ CD D GC GG + NAF+ I++ GG + E +YPY G+
Sbjct: 120 AGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQ 179
Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
C++N I +V + +DE +A YL ENGP+A+A++A + Y G+
Sbjct: 180 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTS 233
Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
+E L H VL+VGY + PYWIIKNSW WGE GY R+ +G C +N
Sbjct: 234 CTSEQLDHGVLLVGYNDNSNP------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQA 287
Query: 343 VRSALV 348
V SA+V
Sbjct: 288 VSSAVV 293
>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
Length = 443
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 169/324 (52%), Gaps = 34/324 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF+ F H + Y + E R IF+ N++K L + ++ +G NEF+D+S+ EFQ
Sbjct: 24 LFSDFKATHARNYVSPGEERKRFEIFAANMKKAAEL-NRKNPMATFGPNEFADMSSEEFQ 82
Query: 101 AKY-----------LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
++ K S+ + A + DWR AVT VK+Q CGS
Sbjct: 83 TRHNAARHYAAAKARRAKHTKSFTKEEIKAADG-----QKIDWRLKGAVTSVKNQGSCGS 137
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
W+FSTTGNIEG A T LVSLSEQEL+ CD D+GC GG + NAF ++S GG +
Sbjct: 138 CWSFSTTGNIEGQNAIATGNLVSLSEQELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIA 197
Query: 210 EEKTYPY---RGDDKAC--RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
E +YPY G AC L+ K I+ + ++ E DMA ++ GP+++ ++A
Sbjct: 198 TEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAFVFNYGPLSIGVDAS 257
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
Q Y G I +C + + H VLIVGY D T T PYWIIKNSW WGE
Sbjct: 258 TWQSYAGG----IITYCP--DVQIDHGVLIVGY--DDTAPT----PYWIIKNSWTANWGE 305
Query: 325 KGYFRLYRGDGSCGINDYVRSALV 348
GY R+ +G CG+ S++V
Sbjct: 306 DGYIRVAKGSNMCGLTSTPSSSVV 329
>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 120/343 (34%), Positives = 184/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYRDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ C SSWAF+ GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAGHELTSLSEQMLVSCDTNDLGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
G + AF I+S G + E++YPY G+ AC + K I+ +V + +E
Sbjct: 189 RAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPACNKSGKVVGANIDDHVHILDNEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Q Y GV ++ ++ + L+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQRYTGGV------LTSCISKEVNSAALLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG+GWGE+GY R+ +G C + DYV SA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCRMKDYVSSAVV 339
>gi|113931178|ref|NP_001039033.1| cathepsin W [Xenopus (Silurana) tropicalis]
gi|89269052|emb|CAJ83515.1| cathepsin W [Xenopus (Silurana) tropicalis]
Length = 303
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 112/305 (36%), Positives = 168/305 (55%), Gaps = 22/305 (7%)
Query: 48 QHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK 107
Q+N++Y T E+ RL IFS NL++ LQ E G+ YG+ +FSDL+ EF +L
Sbjct: 3 QYNRSYKTREEFKYRLRIFSENLKEASRLQREELGTAQYGVTKFSDLTDEEFSIYHLPTN 62
Query: 108 LKPSYADRSVPAMIPN----ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
+ P+ P ++ + P + DWR + ++ K+Q C S WAF+ NIE +
Sbjct: 63 ILPT------PPILKQSEEVLPFPTSCDWRTQNVISKAKNQRTCHSCWAFAAVANIEAQW 116
Query: 164 AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
A + +SLSEQ++IDC+ +GC GG +AF T++ + GGL EK+YPY G C
Sbjct: 117 AI-LGQTISLSEQQVIDCNTCRNGCSGGYAWDAFMTVLQQ--GGLTSEKSYPYTGHVSNC 173
Query: 224 RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDG 283
R +A I+ + + ++ET MA ++ G + V IN L+ Y G+ ++ CD
Sbjct: 174 RKGFEAVGW-IHDFEMLKKNETAMASHVAHKGTLTVTINKAPLKHYQKGIVDTLRSNCDP 232
Query: 284 GNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
+ H VLIVGY +P WI+KNSWGE WGEKG+FR++R +CGI Y
Sbjct: 233 NY--VDHVVLIVGYR------GGGKLPQWILKNSWGEDWGEKGFFRMFRDKNACGITKYP 284
Query: 344 RSALV 348
+ +V
Sbjct: 285 VTCIV 289
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 181/348 (52%), Gaps = 29/348 (8%)
Query: 6 FFAGVALLS-LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
FA AL S L +S+ S+ +K + +L+ +L +H K Y L E R
Sbjct: 3 LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA--MIP 122
IF NLR I Q+ E+ + GLN F+DL+ E++A+YLG K+ P+ P+ P
Sbjct: 63 IFKDNLRFIDQ-QNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRYAP 121
Query: 123 NI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
+ TLP + DWR+ AV VKDQ CGS WAFS G +EG+ T L+SLSEQEL+D
Sbjct: 122 RVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELVD 181
Query: 181 CDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYV 238
CD + GC GG + AF+ I+ GG++ E+ YPY+G D C K A V I+GY
Sbjct: 182 CDTGYNMGCNGGLMDYAFEFIIKN--GGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYE 239
Query: 239 SVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
V+ + K V N P++VA+ Q Y +GV F L H V+ VG
Sbjct: 240 DVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGV------FTGRCGTALDHGVVAVG 293
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
YG D +WI++NSWG WGE+GY RL R G CGI
Sbjct: 294 YGTD------NGHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGI 335
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 126/306 (41%), Positives = 165/306 (53%), Gaps = 24/306 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F++Q++K Y+ E+ SR + F N+ I+L + S GLNEF+DLS EF+
Sbjct: 41 MFTAFMKQYSKAYSH-AEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFK 99
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
KY G+K RS P + DWR +AVT +KDQ CGS WAFS TG+IE
Sbjct: 100 GKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIE 159
Query: 161 GVYAAKTK-KLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
G + + K L SLSEQ+L+DC D GC GG + AF+ I++ G+ E YPY+
Sbjct: 160 GAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANK--GICAESAYPYK 217
Query: 218 GDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
G C+ K T+ V I+GY V S DE + + GP++VAI A QFY +GV
Sbjct: 218 GVGGLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV 275
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
F NL H VL VGYG T + YWI+KNSWG WGE GY R+ R
Sbjct: 276 ------FSGTCGHNLDHGVLAVGYG------TTGSQDYWIVKNSWGTSWGESGYIRMIRN 323
Query: 334 DGSCGI 339
CGI
Sbjct: 324 KNQCGI 329
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 142/349 (40%), Positives = 182/349 (52%), Gaps = 32/349 (9%)
Query: 10 VALLSLTVSV-----SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
VA+L L V S F +VG + H + LF +L +H K YA+ E R
Sbjct: 7 VAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHRFE 66
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI 124
+F NL+ I + + E S GLNEF+DL+ EF+ YLG P+ S N+
Sbjct: 67 VFKDNLKLIDEI-NREVTSYWLGLNEFADLTHDEFKTTYLGLSPPPARRSSSRSFRYENV 125
Query: 125 T---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
LP+A DWR+ AVT VK+Q CGS WAFST +EG+ A T L +LSEQELIDC
Sbjct: 126 AAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDC 185
Query: 182 DQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYV 238
+ + GC GG + AF I S GGL E+ YPY ++ +C KK+ V I+GY
Sbjct: 186 SVDGNSGCNGGMMDYAFSYIASS--GGLHTEEAYPYLMEEGSCGDGKKSESEAVSISGYE 243
Query: 239 SV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
V ++DE + K L P++VAI A QFY GV F L H V V
Sbjct: 244 DVPTKDEQALIKALAHQ-PVSVAIEASGRHFQFYSGGV------FDGPCGAQLDHGVAAV 296
Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
GYG D+ K Y I+KNSWG WGEKGY R+ RG +G CGIN
Sbjct: 297 GYGSDKG----KGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGIN 341
>gi|319976406|gb|ADV90878.1| cysteine proteinase B [Leishmania donovani]
Length = 332
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 117/272 (43%), Positives = 163/272 (59%), Gaps = 21/272 (7%)
Query: 86 YGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTG 140
+G+ +F DLS AEF A+YL F +A + +++ +P A DWRE AVT
Sbjct: 5 FGITKFFDLSEAEFAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP 64
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
VK+Q CGS WAFS GNIE +A LVSLSEQ+L+ CD +D+GC GG + AF+ +
Sbjct: 65 VKNQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWL 124
Query: 201 MSKLGGGLEEEKTYPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
+ + G + EK+YPY GD C +K +I+GYV + +ET MA +L ENGP
Sbjct: 125 LRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGP 184
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
+A+A++A + Y +GV C G + L+H VL+VGY ++T VPYW+IKN
Sbjct: 185 IAIAVDASSFMSYQSGV----LTSCAG--DALNHGVLLVGY--NKT----GGVPYWVIKN 232
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWGE WGEKGY R+ G +C +++Y SA V
Sbjct: 233 SWGEDWGEKGYVRVAMGLNACLLSEYPVSAHV 264
>gi|440907378|gb|ELR57532.1| Cathepsin W [Bos grunniens mutus]
Length = 382
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/336 (36%), Positives = 175/336 (52%), Gaps = 35/336 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q+N++Y EY RL IF+ NL K Q LQ+ + G+ +G+ +FSDL+ EF
Sbjct: 41 VFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFV 100
Query: 101 AKY----LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
Y G L S R V + + PR DWR+ ++ V+DQ C WA +
Sbjct: 101 QLYGSQVAGEALGVS---RKVGSEEWGESEPRTCDWRKVGPISLVRDQRNCNCCWAMAAA 157
Query: 157 GNIEGVYAAKTKKLVSLSEQ--------ELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
GNIE ++A K + V +S Q EL+DCD+ +GC GG + +AF T+++ GL
Sbjct: 158 GNIEALWAIKFRHFVEVSVQRMAGGRGWELLDCDRCGNGCRGGFVWDAFLTVLNN--SGL 215
Query: 209 EEEKTYPYRGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
EK YP+ G K R L KK +V I ++ + E MA++L GP+ V IN L
Sbjct: 216 ASEKDYPFDGSGKTHRCLAKKYKKVAWIQDFIILQACEQSMARHLATEGPITVTINMTLL 275
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT--------------KFTHKAVPYW 312
Q Y GV CD + HSVL+VG+G ++ +++ YW
Sbjct: 276 QQYQKGVIKATPTTCD--PTQVDHSVLLVGFGKTKSGEGRQGKAASFGSYARPRRSMAYW 333
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+KNSWG WGE+GYFRL+RG +CGI + +A V
Sbjct: 334 TLKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARV 369
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 130/341 (38%), Positives = 175/341 (51%), Gaps = 35/341 (10%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S +VG + H + LF F+ ++ K Y++L E R +F NL I ++
Sbjct: 30 SELSIVGYSEEDLASHERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHID--EEN 87
Query: 80 EHGSGVY-GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM----IPNITLPRAFDWRE 134
+ +G + GLNEF+DL+ EF+A YLG L P+ + + + +LP+ DWR+
Sbjct: 88 KKITGYWLGLNEFADLTHDEFKAAYLGLTLTPARRNSNDQLFRYEEVEAASLPKEVDWRK 147
Query: 135 YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSI 193
AVT VK+Q CGS WAFST +EG+ A T L LSEQELIDCD + ++GC GG +
Sbjct: 148 KGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLM 207
Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL--------NKKATQVKINGYVSVSRDET 245
AF I + GGL E++YPY ++ CR + A V I+GY V R+
Sbjct: 208 DYAFSYIAAN--GGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNE 265
Query: 246 DMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
+ + P++VAI A QFY GV F L H V VGYG
Sbjct: 266 QALLKALAHQPVSVAIEASGRNFQFYSGGV------FDGPCGTRLDHGVTAVGYGT---- 315
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
K Y I+KNSWG WGEKGY R+ RG DG CGIN
Sbjct: 316 -ASKGHDYIIVKNSWGSHWGEKGYIRMRRGTGKHDGLCGIN 355
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 127/307 (41%), Positives = 170/307 (55%), Gaps = 24/307 (7%)
Query: 50 NKTYATLVEYYSR-LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK- 107
N+ YA+ E Y R +I+ NLR + H S + ++DLS E+++K LG+
Sbjct: 58 NRAYASSAEVYERRFNIWLDNLRFAHEY-NARHTSHWLSMGVYADLSQDEYRSKALGYNA 116
Query: 108 -LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
L R+ P + P DW AVT VKDQ +CGS WAFSTTG +EG A
Sbjct: 117 HLHKKRPLRAAPFLYKGTVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIA 176
Query: 167 TKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL 225
T KLVSLSEQ L+DCD+E D GC GG + +AFD I++ GG++ E YPYR +D C+
Sbjct: 177 TGKLVSLSEQMLVDCDREYDTGCRGGFMDSAFDFIVNN--GGIDTEDDYPYRAEDGICQD 234
Query: 226 NKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCD 282
N+ V I+GY V ++ + V + P++VAI A A Q Y GV F +
Sbjct: 235 NRTRRHVVTIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGV-----FDAE 289
Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG------DGS 336
G L H+VL+VGYG + TH +PYW++KNSWG WGEKGY RL R +G
Sbjct: 290 CGTA-LDHAVLVVGYGT-ASNGTHN-LPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQ 346
Query: 337 CGINDYV 343
CG+ Y
Sbjct: 347 CGLAMYA 353
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 125/313 (39%), Positives = 164/313 (52%), Gaps = 26/313 (8%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T LF +L H K+Y L E R IF NLR I E GLN+F+DL+ E
Sbjct: 42 TTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEE 101
Query: 99 FQAKYLGFKLKPSYADRSVP----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
+++KY G K K S A + +LP + DWRE AV VKDQ CGS WAFS
Sbjct: 102 YRSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFS 161
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +EG+ T KL++LSEQEL+DCD+ ++GC GG + AF+ I++ GG++ +
Sbjct: 162 TISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINN--GGIDTDVD 219
Query: 214 YPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYV 270
YPY G D C + K A V I+ Y V + K N P++VAI A QFY
Sbjct: 220 YPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYD 279
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G+ F L H V++VGYG + K YWI++NSWG WGE GY R+
Sbjct: 280 SGI------FTGKCGIALDHGVVVVGYGTENGK------DYWIVRNSWGADWGENGYLRM 327
Query: 331 YRG----DGSCGI 339
RG G CGI
Sbjct: 328 ERGISSKTGICGI 340
>gi|297688135|ref|XP_002821545.1| PREDICTED: cathepsin W [Pongo abelii]
Length = 376
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 120/331 (36%), Positives = 175/331 (52%), Gaps = 33/331 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFANNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLKPSYADRSVPAMIPNI-------TLPRAFDWREY-DAVTGVKDQTMCGSSWAF 153
Y G++ A VP+M I ++P DWR+ A++ +KDQ C WA
Sbjct: 102 LY-GYR----RAAGGVPSMGREIRSEELEESVPFTCDWRKVAGAISPIKDQKNCNCCWAM 156
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+ GNIE ++ V +S QEL+DC + DGC GG + +AF T+++ GL EK
Sbjct: 157 AAAGNIETLWRINFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKD 214
Query: 214 YPYRGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YP++G +A R + K Q I ++ + +E +A+YL GP+ V IN LQ Y
Sbjct: 215 YPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKLLQLYRK 274
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNS 317
GV CD + + HSVL+VG+G +++ PYWI+KNS
Sbjct: 275 GVIKATPTTCD--PQLVDHSVLLVGFGNVKSEEGIWAETVLSQSQPQPPHPTPYWILKNS 332
Query: 318 WGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WG WGEKGYFRL+RG +CGI + +A V
Sbjct: 333 WGAQWGEKGYFRLHRGSNTCGITKFPLTARV 363
>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
Length = 443
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 169/324 (52%), Gaps = 34/324 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF+ F H + Y + E R IF+ N++K L + ++ +G NEF+D+S+ EFQ
Sbjct: 24 LFSDFKATHARNYVSPGEERKRFEIFAANMKKAAEL-NRKNPMATFGPNEFADMSSEEFQ 82
Query: 101 AKY-----------LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
++ K S+ + A + DWR AVT VK+Q CGS
Sbjct: 83 TRHNAARHYAAAKARRAKHTKSFTKEEIKA-----ADGQKIDWRLKGAVTSVKNQGSCGS 137
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
W+FSTTGNIEG A T LVSLSEQEL+ CD D+GC GG + NAF ++S GG +
Sbjct: 138 CWSFSTTGNIEGQNAIATGNLVSLSEQELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIA 197
Query: 210 EEKTYPY---RGDDKAC--RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
E +YPY G AC L+ K I+ + ++ E DMA ++ GP+++ ++A
Sbjct: 198 TEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAFVFNYGPLSIGVDAS 257
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
Q Y G I +C + + H VLIVGY D T T PYWIIKNSW WGE
Sbjct: 258 TWQSYAGG----IITYCP--DVQIDHGVLIVGY--DDTAPT----PYWIIKNSWTANWGE 305
Query: 325 KGYFRLYRGDGSCGINDYVRSALV 348
GY R+ +G CG+ S++V
Sbjct: 306 DGYIRVAKGSNMCGLTSTPSSSVV 329
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 123/305 (40%), Positives = 160/305 (52%), Gaps = 25/305 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ Q+ + Y VE RL+IF N+ I+ +NEF+DL+ EFQA
Sbjct: 7 WMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEEFQASRN 66
Query: 105 GFKLKPSYADRSV-PAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
G+K+ + S P N++ +P DWR+ AVT +KDQ CG WAFS EG+
Sbjct: 67 GYKMSAHLSSSSTKPFRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAFSAVAATEGI 126
Query: 163 YAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
T KL+SLSEQEL+DCD ED GC GG + +AFD I+ GL E YPY+G D
Sbjct: 127 TQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNK--GLTTEANYPYQGAD 184
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
AC K A KI GY V + V N P++VAI+A A QFY +GV
Sbjct: 185 GACNSGKAA--AKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSSGV----- 237
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
F D G + L H V VGYG+ + YW++KNSWG WGE GY R+ R +
Sbjct: 238 FTGDCGTD-LDHGVTAVGYGM-----SDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQE 291
Query: 335 GSCGI 339
G CGI
Sbjct: 292 GLCGI 296
>gi|1581747|prf||2117247C Cys protease:ISOTYPE=3
Length = 469
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 165/317 (52%), Gaps = 24/317 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL-LQDTEHGSGVYGLNEFSDLSTAEFQ 100
F F ++H K Y + E RL +F NL +L H S +G+ FSDL+ EF+
Sbjct: 38 FAAFKQRHGKVYGSAAEETFRLGVFKENLLFARLHAAANPHAS--FGVTPFSDLTREEFR 95
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITL------PRAFDWREYDAVTGVKDQTMCGSSWAFS 154
++Y + A + V + P A DWR AVT +KDQ C S WAFS
Sbjct: 96 SRYHNAAAHFAAAQKRVRVPVEVEVEVEVGGAPAAVDWRARGAVTAIKDQGNCSSCWAFS 155
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
T GNIEG + L LSEQ L+ CD D+GC+GG + +AFD I+ + G + E +Y
Sbjct: 156 TIGNIEGQWHLAGNPLTGLSEQMLVSCDNADNGCDGGLMDSAFDWIVEQNNGSVYTEASY 215
Query: 215 PY---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
Y GD + C ++ I+G+V + +DE MA +L NGP+A+A++A + Y
Sbjct: 216 SYVSGGGDSQTCDMSDHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTG 275
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV + ++ L H V++VGY PYWIIKNSWG WGE+GY R+
Sbjct: 276 GV------LTNCVSDQLDHGVVLVGYNDSSNP------PYWIIKNSWGADWGEEGYIRIQ 323
Query: 332 RGDGSCGINDYVRSALV 348
+G C + +Y SA+V
Sbjct: 324 KGTNQCLVKNYACSAVV 340
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 183/346 (52%), Gaps = 41/346 (11%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
LLS T S ++ M + + + + ++ +L +H K Y L E R +F NL
Sbjct: 11 LLSFTFSHATAMSIINYSENEVMD-----MYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP---------AMIP 122
IQ + ++ + GLN+F+D++ E++A YLG + A R V A
Sbjct: 66 FIQD-HNAQNNTYTLGLNKFADITNEEYRAMYLGTRTD---AKRRVMKTQNTGHRYAYNS 121
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
LP DWR AV +KDQ CGS WAFST +EG+ T + VSLSEQEL+DCD
Sbjct: 122 GDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCD 181
Query: 183 QE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
+E D+GC GG + AF I+ GG++ E+ YPY+G D C KK T+ V+I+GY V
Sbjct: 182 REYDEGCNGGLMDYAFQFIIQN--GGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDV 239
Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ + K V + P++VAI A ALQ Y +GV F L H V++VGYG
Sbjct: 240 PSNNENALKKAVSHQPVSVAIEASGRALQLYQSGV------FTGKCGTALDHGVVVVGYG 293
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
T V YW+++NSWG GWGE GYF++ R +G CGI
Sbjct: 294 ------TENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGI 333
>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 120/343 (34%), Positives = 183/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYRDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ C SSWAF+ GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAGHELTSLSEQMLVSCDTNDLGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
G + AF I+S G + E++YPY G+ AC + K I +V + +E
Sbjct: 189 RAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPACNKSGKVVGANIRDHVHILDNEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Q Y GV ++ ++ + L+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQRYTGGV------LTSCISKEVNSAALLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG+GWGE+GY R+ +G C + DYV SA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCRMKDYVSSAVV 339
>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
Length = 384
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/233 (45%), Positives = 139/233 (59%), Gaps = 20/233 (8%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
LP FDWRE+ AV VKDQ CGS W+FST+G +EG + T KL LSEQ+++DCD E
Sbjct: 148 LPDDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHEC 207
Query: 185 --------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
D GC GG ++ AF +M GGL+ EK YPY G + C+ +K ++
Sbjct: 208 DASESRACDSGCNGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDKSKIVAQVKN 265
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
+ +S +E +A LV++GP+A+AINA +Q Y+ GVS P F C +L H VL+VG
Sbjct: 266 FSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCP--FIC---GRHLDHGVLLVG 320
Query: 297 YG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGINDYVRS 345
YG K PYWIIKNSWGE WGEKGY+++ RG CG++ V S
Sbjct: 321 YGSAGYAPIRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSS 373
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/352 (36%), Positives = 189/352 (53%), Gaps = 43/352 (12%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
+A +AL+++ +VS V+ +E ++ F +H KTY E RL I
Sbjct: 4 LYALLALVAVAQAVSFADVIKEE-------------WHTFKLEHRKTYQDETEERFRLKI 50
Query: 66 FSGNLRKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
F+ N KI + + +G +N+++D+ EF+ GF R+
Sbjct: 51 FNENKHKI-AKHNQRYATGEVTFKMAVNKYADMLHHEFRETMNGFNYTLHKELRASDPSF 109
Query: 122 PNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
IT LP++ DWRE AVT VKDQ CGS WAFS+TG +EG + KT LVSL
Sbjct: 110 TGITFISPAHVKLPKSVDWREKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSL 169
Query: 174 SEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
SEQ L+DC + ++GC GG + NAF I K GG++ EK+YPY G D +C NK +
Sbjct: 170 SEQNLVDCSAKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEGIDDSCHFNKDSVG 227
Query: 232 VKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENL 288
G+ + + +E MA+ + GP++VAI+A + QFY G+ + + C+ ++NL
Sbjct: 228 ATDRGFADIPQGNEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPE--CN--SQNL 283
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
H VL+VGYG D + YW++KNSWG WG+KG+ ++ R D CGI
Sbjct: 284 DHGVLVVGYGTDES-----GKDYWLVKNSWGTTWGDKGFIKMARNEDNQCGI 330
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 113/308 (36%), Positives = 165/308 (53%), Gaps = 26/308 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQAKY 103
++ +H + YA E +R +F N+ I+ L + ++G + +N+F+DL+ EF++ Y
Sbjct: 40 WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 99
Query: 104 LGFKLKPSYADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
G+K + R+ P + + LP + DWR+ AVT +KDQ CGS WAFS
Sbjct: 100 TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 159
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEGV K KL+SLSEQEL+DCD DDGC GG +++AF+ M+ GGL E YPY+
Sbjct: 160 IEGVAQIKKGKLISLSEQELVDCDTNDDGCMGGYMNSAFNYTMTT--GGLTSESNYPYKS 217
Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
D C +NK K I G+ V ++ V + P+++ I QFY +GV
Sbjct: 218 TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV-- 275
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD- 334
F + +L H V +VGYG + YWI+KNSWG WGE+GY R+ +
Sbjct: 276 ----FSGECSTHLDHGVAVVGYGK-----SSNGSKYWILKNSWGPKWGERGYMRIKKDTK 326
Query: 335 ---GSCGI 339
G CG+
Sbjct: 327 AKHGQCGL 334
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 126/313 (40%), Positives = 164/313 (52%), Gaps = 27/313 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
+++ +L +H K Y L E R IF NLR I E + GLN F+DL+ E+
Sbjct: 77 SMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEY 136
Query: 100 QAKYLGFKLKPSYADRSVPA--MIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
+AKYLG K+ P+ P+ P + LP + DWR+ AV VKDQ CGS WAFS
Sbjct: 137 RAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSA 196
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG+ T +L+SLSEQEL+DCD ++GC GG + AF+ I++ GG++ E+ Y
Sbjct: 197 IGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFEFIINN--GGIDSEEDY 254
Query: 215 PYRGDDKACRL-NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVT 271
PYRG D C K A V I+ Y V + K V N P++VAI Q YV+
Sbjct: 255 PYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVS 314
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV F L H V+ VGYG T YWI++NSWG WGE GY RL
Sbjct: 315 GV------FTGRCGTALDHGVVAVGYG------TANGHDYWIVRNSWGPSWGEDGYIRLE 362
Query: 332 RG-----DGSCGI 339
R G CGI
Sbjct: 363 RNLANSRSGKCGI 375
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 131/347 (37%), Positives = 174/347 (50%), Gaps = 32/347 (9%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHT-----ALFNYFLEQHNKTYATLVEYYSRLH 64
LL L + SS + + H + T A++ +L H K Y + E R
Sbjct: 10 ACLLFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFE 69
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAM 120
IF NLR + + GS GLN F+DL+ E+++ +LG K + + A
Sbjct: 70 IFKDNLRFVDE-HNAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTKSDRYAF 128
Query: 121 IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
LP + DWRE AV+ VKDQ CGS WAFST +EG+ T +L+SLSEQEL+D
Sbjct: 129 RAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVD 188
Query: 181 CDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYV 238
CD+ + GC GG + F I++ GG++ E+ YPYR D C + K A V INGY
Sbjct: 189 CDKSYNMGCNGGLMDYGFQFIINN--GGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYE 246
Query: 239 SVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
V D+ + K V N P++VAI A A Q Y +GV F NL H V+ VG
Sbjct: 247 DVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGV------FTGHCGTNLDHGVVAVG 300
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YG T V YW ++NSWG WGE GY +L R G CGI
Sbjct: 301 YG------TENGVDYWTVRNSWGPKWGENGYIKLERNINATSGKCGI 341
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 130/353 (36%), Positives = 180/353 (50%), Gaps = 37/353 (10%)
Query: 6 FFAGVALLSLTVSVSSFMV-VGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSR 62
F+ + + S SV++ + + D+ L +L+N + H+ L E R
Sbjct: 3 LFSLILVASFLASVAATAIDIADKDLE-----TEDSLWNLYERWRSHHTVSRDLDEKQKR 57
Query: 63 LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS------ 116
++F N R I + LN+F+DL+ EF++ Y G ++ + R
Sbjct: 58 FNVFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRRGGA 117
Query: 117 ----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
+ + + +LP + DWR+ AVT VKDQ CGS WAFST +EG+ KTKKL+S
Sbjct: 118 TNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLS 177
Query: 173 LSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
LSEQELIDCD E++GC GG + AFD I K GG+ E YPY +D C KK+
Sbjct: 178 LSEQELIDCDTDENNGCNGGLMDYAFDFI--KKNGGISSEAEYPYAAEDSYCATEKKSHV 235
Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLS 289
V I+G+ V ++ D V N P+++AI A Y QFY GV F G E L
Sbjct: 236 VSIDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGV-----FTGRSGTE-LD 289
Query: 290 HSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS---CGI 339
H V IVGYG T + YWI++NSWG WGEKGY R+ S CG+
Sbjct: 290 HGVAIVGYGK-----TQQGTKYWIVRNSWGAEWGEKGYIRISAASDSKRLCGL 337
>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
Length = 444
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/343 (36%), Positives = 183/343 (53%), Gaps = 25/343 (7%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ CGS WAFS GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTNDFGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
EGG + +AF I+S G + E++YPY G+ C + K KI +V + DE
Sbjct: 189 EGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLPEDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Q Y GV +E+L H VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW +GWGE+GY L R + C + + SA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYSALRRHN-QCLMKNLPSSAVV 338
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 131/340 (38%), Positives = 175/340 (51%), Gaps = 28/340 (8%)
Query: 14 SLTVSVSSFMVVGDEKLHHLHHVKH-TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+L +S+ S+ +K L + +++ +L +H K Y L E R IF NLR
Sbjct: 30 ALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRF 89
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA--MIPNI--TLPR 128
I E + GLN F+DL+ E++AKYLG K+ P+ P+ P + LP
Sbjct: 90 IDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPD 149
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDG 187
+ DWR+ AV VKDQ CGS WAFS G +EG+ T +L+SLSEQEL+DCD + G
Sbjct: 150 SVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNQG 209
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQVKINGYVSVSRDETD 246
C GG + AF+ I++ GG++ ++ YPYRG D C K A V I+ Y V +
Sbjct: 210 CNGGLMDYAFEFIINN--GGIDSDEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDEL 267
Query: 247 MAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
K V N P++VAI Q YV+GV F L H V+ VGYG
Sbjct: 268 ALKKAVANQPVSVAIEGGGREFQLYVSGV------FTGRCGTALDHGVVAVGYG------ 315
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
T K YWI++NSWG WGE GY RL R G CGI
Sbjct: 316 TAKGHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGI 355
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/345 (35%), Positives = 185/345 (53%), Gaps = 37/345 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+A+L+ T +VS+ + A ++ ++ + Y + E RL +F N
Sbjct: 84 IAILACTCAVSALAA-----RDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKAN 138
Query: 70 LRKIQLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL- 126
+ I+L+ G+ + L N+F+D++ EF+A + G+K P+ R+ N++L
Sbjct: 139 VAFIELVN---AGNDKFSLEANQFADMTVDEFRAAHTGYKPVPANKGRTTQFKYANVSLD 195
Query: 127 --PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
P + DWR AVT +KDQ CG WAFST ++EG+ T KL+SLSEQEL+DCD +
Sbjct: 196 ALPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVD 255
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSV- 240
D GCEGG + NAF+ I+ GGL E YPY G D +C NK++ V I GY V
Sbjct: 256 GMDQGCEGGLMDNAFEFIIDN--GGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVP 313
Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
S DET + K + P+++A++ +FY GV + C G E L H + VGYG
Sbjct: 314 SNDETSLLKAVAAQ-PVSIAVDGGDNLFRFYKGGV---LSGAC--GTE-LDHGIAAVGYG 366
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ T +W++KNSWG WGEKG+ R+ R +G CG+
Sbjct: 367 I-----TSDGTKFWLMKNSWGTSWGEKGFIRMERDIADEEGLCGL 406
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 183/346 (52%), Gaps = 41/346 (11%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
LLS T S ++ M + + + + ++ +L +H K Y L E R +F NL
Sbjct: 11 LLSFTFSHATAMSIINYSENEVMD-----MYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP---------AMIP 122
IQ + ++ + GLN+F+D++ E++A YLG + A R V A
Sbjct: 66 FIQD-HNAQNNTYTLGLNKFADITNKEYRAMYLGTRTD---AKRRVMKTQNTGHRYAYNS 121
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
LP DWR AV +KDQ CGS WAFST +EG+ T + VSLSEQEL+DCD
Sbjct: 122 GDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCD 181
Query: 183 QE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
+E D+GC GG + AF I+ GG++ E+ YPY+G D C KK T+ V+I+GY V
Sbjct: 182 REYDEGCNGGLMDYAFQFIIQN--GGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDV 239
Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ + K V + P++VAI A ALQ Y +GV F L H V++VGYG
Sbjct: 240 PSNNENALKKAVSHQPVSVAIEASGRALQLYQSGV------FTGKCGTALDHGVVVVGYG 293
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
T V YW+++NSWG GWGE GYF++ R +G CGI
Sbjct: 294 ------TENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGI 333
>gi|410493601|ref|YP_006908539.1| V-CATH [Epinotia aporema granulovirus]
gi|354805035|gb|AER41457.1| V-CATH [Epinotia aporema granulovirus]
Length = 329
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 182/317 (57%), Gaps = 21/317 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF+ F+ ++NK YAT E ++ IF NL I ++++ + +Y +N SDL+ E
Sbjct: 26 ALFDDFVIKYNKVYATDEERAAKYEIFRNNLVVINE-KNSKTTNALYDINRLSDLNKNEL 84
Query: 100 QAKYLGF------KLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
+ GF L PS + + A P+ +LP +FDWR +AVT VK+Q CGS WA
Sbjct: 85 -LRSTGFSVNLKKNLNPSKECEYVLVADAPSRSLPASFDWRANNAVTPVKNQLDCGSCWA 143
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FST NIE +YA K V L+EQ L++CD ++ C GG + A + I+ GG+ EE+
Sbjct: 144 FSTIANIESLYAIKYGVEVDLAEQYLLNCDYTNNNCNGGLMHWALENILINDNGGVVEER 203
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
PY G+ AC + + ++ + T + + L+ENGP++VAI+ + + Y G
Sbjct: 204 HAPYVGEVTACDKEEYLFTITNCKRFNLVNEHT-LQQLLIENGPISVAIDVFDILDYKQG 262
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
+S + D G L+H+VL+VGYGV + +PYW+ KNSWG+ WGE+G+FR+ R
Sbjct: 263 ISDNCR--SDNG---LNHAVLLVGYGV-----SINGIPYWVFKNSWGDDWGEQGFFRVRR 312
Query: 333 GDGSCG-INDYVRSALV 348
SCG +N Y SA++
Sbjct: 313 DINSCGMMNAYAASAVL 329
>gi|343472974|emb|CCD15016.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 120/343 (34%), Positives = 182/343 (53%), Gaps = 24/343 (6%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGRPPM 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
DWR+ AVT VKDQ C SSWAFS TGNIEG + +L SLSEQ L+ CD +D GC
Sbjct: 129 TVDWRKKGAVTPVKDQGKCDSSWAFSATGNIEGQWKVAGHELTSLSEQMLVSCDTDDLGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
G AF+ I+S G + E++YPY G+ C + K KI +V ++RDE
Sbjct: 189 RDGFPDIAFNWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLARDED 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L GP A+ ++A + Q Y GV ++ ++ + L+VGY D +K
Sbjct: 249 MIAEWLARKGPAAITVDATSFQRYTGGV------LTSCISKEMNSAALLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSWG+GWGE+GY R+ +G C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCLVQEYARSAVV 339
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 113/308 (36%), Positives = 165/308 (53%), Gaps = 26/308 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQAKY 103
++ +H + YA E +R +F N+ I+ L + ++G + +N+F+DL+ EF++ Y
Sbjct: 34 WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 93
Query: 104 LGFKLKPSYADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
G+K + R+ P + + LP + DWR+ AVT +KDQ CGS WAFS
Sbjct: 94 TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 153
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEGV K KL+SLSEQEL+DCD DDGC GG +++AF+ M+ GGL E YPY+
Sbjct: 154 IEGVAQIKKGKLISLSEQELVDCDTNDDGCMGGYMNSAFNYTMTT--GGLTSESNYPYKS 211
Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
D C +NK K I G+ V ++ V + P+++ I QFY +GV
Sbjct: 212 TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV-- 269
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD- 334
F + +L H V +VGYG + YWI+KNSWG WGE+GY R+ +
Sbjct: 270 ----FSGECSTHLDHGVAVVGYGK-----SSNGSKYWILKNSWGPKWGERGYMRIKKDTK 320
Query: 335 ---GSCGI 339
G CG+
Sbjct: 321 AKHGQCGL 328
>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
Length = 467
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 113/312 (36%), Positives = 160/312 (51%), Gaps = 18/312 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F +++ + Y + E RL +F NL +L + +G+ FSDL+ EF++
Sbjct: 38 FADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKL-HAAANPHATFGVTPFSDLTREEFRS 96
Query: 102 KYL--GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
++ VP + P A DWR+ AVT VKDQ CGS WAFS GN+
Sbjct: 97 RHHSGAAHFAAGRKRARVPVDVGVGDAPAAVDWRDRGAVTPVKDQGQCGSCWAFSAIGNV 156
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
EG + L SLSEQ L+ CD D GC+GG +++AF+ I+ G + E++Y Y
Sbjct: 157 EGQWFLAGNALTSLSEQMLVSCDTMDSGCDGGLMNSAFEWIVEHHNGTVYTEESYRYASG 216
Query: 220 D---KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
D + CR + + I G+V + DE MA +L NGP+AVA++A + FY GV
Sbjct: 217 DGIAQPCRTSGRTVGAVITGHVKLPPDEAKMATWLAANGPLAVAVDASSWMFYTGGV--- 273
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ L H VL+VGY A PYWI+KNSWG WGE GY R+ +G
Sbjct: 274 ---LTSCVSNELDHGVLLVGYN------DSAAPPYWIVKNSWGTLWGEDGYVRIAKGTNQ 324
Query: 337 CGINDYVRSALV 348
C + + SA+V
Sbjct: 325 CLVKEEASSAVV 336
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 131/356 (36%), Positives = 187/356 (52%), Gaps = 38/356 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F G L+ L+ ++S ++ DE ++ F H K Y + +E R+ I
Sbjct: 4 FLLGAVLVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKFRMKI 50
Query: 66 FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
+ N K+ +L + S +N+F DL EF++ G++ K S A+ +
Sbjct: 51 YLENKHKVAKHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 110
Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
M P N+T+P + DWRE A+T VKDQ CGS WAFS+TG +EG KT KLVSLSEQ L
Sbjct: 111 MEPANVTVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNL 170
Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
IDC + ++GC GG + AF I K G++ E TYPY +D CR N + G
Sbjct: 171 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG 228
Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
+V + E D K V GP++VAI+A + QFY GV + + CD +++L H VL
Sbjct: 229 FVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYY--EPSCD--SDDLDHGVL 284
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
+VGYG D K YW++KNSW E WG++GY ++ R CG+ LV
Sbjct: 285 VVGYGSDNGK------DYWLVKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPLV 334
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 175/338 (51%), Gaps = 31/338 (9%)
Query: 18 SVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ 77
S F +VG + H + LF ++ ++ K YA+ E R +F NL I +
Sbjct: 27 SGGEFSIVGYSEEDLASHDRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDI- 85
Query: 78 DTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS-------VPAMIPNITLPRAF 130
+ + S GLNEF+DL+ EF+A YLG P+ ++ + N +P+
Sbjct: 86 NKKVTSYWLGLNEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEM 145
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCE 189
DWR+ +AVT VK+Q CGS WAFST +EG+ A T L SLSEQELIDC + ++GC
Sbjct: 146 DWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCN 205
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMA 248
GG + AF I S GGL E+ YPY ++ C K A V I+GY V + DE +
Sbjct: 206 GGLMDYAFSYIAST--GGLRTEEAYPYAMEEGDCDEGKGAAVVTISGYEDVPANDEQALV 263
Query: 249 KYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
K L P++VAI A QFY GV F E L H V VGYG T
Sbjct: 264 KALAHQ-PVSVAIEASGRHFQFYSGGV------FDGPCGEQLDHGVTAVGYG------TS 310
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGIN 340
K Y I+KNSWG WGEKGY R+ R G+G CGIN
Sbjct: 311 KGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGIN 348
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 180/348 (51%), Gaps = 29/348 (8%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
F F +A+ + + F +VG K T LF ++ +H K+Y + E R
Sbjct: 10 FLLFISMAVFAYSAFARDFSIVGYSPDDLTSMDKLTDLFESWMSKHGKSYRSFEEKLHRF 69
Query: 64 HIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAM 120
+F NL+ I +T Y GLNEF+DLS EF+ KYLG K++ P D
Sbjct: 70 EVFQDNLKHID---ETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIELPKRRDSPEEFS 126
Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
++ LP++ DWR+ AV VK+Q CGS WAFST +EG+ T L +LSEQELI
Sbjct: 127 YKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTALSEQELI 186
Query: 180 DCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGY 237
DCD+ ++GC GG + AF I+S GGL +E+ YPY ++ C K+ + V I+GY
Sbjct: 187 DCDKPFNNGCNGGLMDYAFAFIISN--GGLRKEEDYPYVMEEGTCGEKKEELEVVTISGY 244
Query: 238 VSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
V D + N P++VAI A + QFY G+ F G E L H V V
Sbjct: 245 HDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGI-----FNGHCGTE-LDHGVAAV 298
Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
GYG T K V Y +KNSWG WGEKGY R+ R +G CGI
Sbjct: 299 GYG------TSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGI 340
>gi|405953314|gb|EKC21001.1| Cathepsin F [Crassostrea gigas]
Length = 397
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 124/351 (35%), Positives = 179/351 (50%), Gaps = 50/351 (14%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
LF + +HNK Y + S+ +F NL+ I L G +GLN+ +DLS EF
Sbjct: 52 PLFQKWKSEHNKIYRNHMIERSKFKVFLENLKVINELNGQFQGKTTFGLNQLADLSQKEF 111
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
L K + ++ ++ + +LP +FDW VT VKDQ GS WAFS GNI
Sbjct: 112 SRIVLMPKRRAPVFEKERSSL--SGSLPDSFDWTNQSKVTAVKDQGAAGSCWAFSAIGNI 169
Query: 160 EGVYAAKTKKLVSLSEQELIDCD--------QEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
EG +A K L + S ++++DCD + D G GG A+ +M GGLE
Sbjct: 170 EGQWAMMGKPLTNFSVEQIVDCDGMEDVAKGEADCGVFGGWPFLAYQYVMR--AGGLETW 227
Query: 212 KTY------------------------------PYRGDDKAC----RLNKKATQVKINGY 237
+ Y PY ++C ++K +K+ +
Sbjct: 228 EDYWYCSGLGGAAGTCEVCPAPGYNTALCGPPIPYCNMTQSCVTKLDVSKFHPGLKVMSW 287
Query: 238 VSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
++ ++ET +A+ L++ GP++VA+NA LQFY G+ P F CD +NL H+VL+VGY
Sbjct: 288 KAIDQNETSIAEQLIKLGPLSVALNAELLQFYHHGIFDPPSFVCD--PKNLDHAVLLVGY 345
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
G +++ F K YW IKNSWG WGEKGYFR+ RG G CGIN V SA++
Sbjct: 346 GSEKSIFGTKD--YWKIKNSWGPKWGEKGYFRMLRGQGKCGINTAVTSAVL 394
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 167/322 (51%), Gaps = 28/322 (8%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
H + AL+ +L H K Y + E R IF NLR I + E + GL
Sbjct: 51 HQRPDEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDE-HNRESRTYKVGLTR 109
Query: 91 FSDLSTAEFQAKYLG--FKLKP--SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
F+DL+ E++A++LG F KP S A A LP DWR+ AV VKDQ
Sbjct: 110 FADLTNEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQ 169
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLG 205
CGS WAFS+ +EG+ T +L+ LSEQEL+DCD+ + GC GG + AF I+
Sbjct: 170 CGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGN-- 227
Query: 206 GGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA- 263
GG++ E+ YPY+G D AC N+K A V I+GY V ++ K V N P++VAI A
Sbjct: 228 GGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAG 287
Query: 264 -YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
A Q Y +GV F +L H V+ VGYG D YWI++NSWG+ W
Sbjct: 288 GRAFQLYQSGV------FTGRCGTDLDHGVVAVGYGTD------NGTDYWIVRNSWGKDW 335
Query: 323 GEKGYFRLYRG-----DGSCGI 339
GE GY RL R G CGI
Sbjct: 336 GESGYIRLERNVANITTGKCGI 357
>gi|146335576|gb|ABQ23397.1| cathepsin L [Trypanosoma carassii]
Length = 456
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 167/313 (53%), Gaps = 17/313 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A F F +H K+Y + E R+ +F ++ K + +G+ +FSDL+ EF
Sbjct: 34 AQFAAFKAEHGKSYTSAAEEGYRMRVFEESM-KAAQAHAAANPHAKFGVTKFSDLTHEEF 92
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ Y + A + + T P +DWR+ AVT VKDQ CGS W FSTTGN
Sbjct: 93 KTLYANGAAHFAAAAKRARRPVSVTGTAPDEWDWRKKGAVTPVKDQGHCGSCWTFSTTGN 152
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
IEG +A +L +LSEQ L+ CD D GC GG + NAF+ I+++ G + E++YPY
Sbjct: 153 IEGQWAVAGNELTNLSEQMLVSCDARDYGCSGGLMDNAFEWIVNQNDGFVFTEESYPYAS 212
Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
GD C + + I G+V + DE MA +L NGP+++A++A + + Y GV
Sbjct: 213 GSGDAPLCDVGGRKVGATIKGHVGLPNDEEKMAAWLAANGPISIAVDADSFKAYKGGV-- 270
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
C+ G L H VL+VGY K + PYWIIKNSWG WGE GY R+ G
Sbjct: 271 --LTGCEEG--QLDHGVLLVGY----NKVANP--PYWIIKNSWGPNWGEHGYIRVGFGTN 320
Query: 336 SCGINDYVRSALV 348
C +N Y SA+V
Sbjct: 321 QCNLNSYACSAIV 333
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 118/304 (38%), Positives = 162/304 (53%), Gaps = 22/304 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ QH + Y + E R IF N+ +I+ + G+N+F+DL+ EF+A +
Sbjct: 8 WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMHH 67
Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
G+K + S S +P + DWR+ AVT VKDQ CG WAFS IEG+
Sbjct: 68 GYKRQSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCWAFSAVAAIEGIIK 127
Query: 165 AKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
KT KL+SLSEQ+L+DCD + D GC GG + NAF I+ GGL E TYPY+G D
Sbjct: 128 LKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRN--GGLTSEATYPYQGVDGT 185
Query: 223 CRLNKKAT-QVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQF 279
C+ K A+ + KI GY V + + V P++VA+ Y QFY +GV F
Sbjct: 186 CKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFYKSGV-----F 240
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DG 335
D G L H+V +GYG + YW++KNSWG WGE GY R+ RG +G
Sbjct: 241 KGDCGTY-LDHAVTAIGYGTN-----SDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREG 294
Query: 336 SCGI 339
CG+
Sbjct: 295 LCGV 298
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 134/360 (37%), Positives = 184/360 (51%), Gaps = 39/360 (10%)
Query: 3 CFYFFAGVALLSLTVSVSSFMVVGDEKLHHLH------HVKHTALFNYFLEQHNKTYATL 56
C F +A + S S ++ ++ H L+ H + +L+ +L +H+K Y L
Sbjct: 15 CLVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLRTHDQLLSLYESWLVKHHKNYNAL 74
Query: 57 VEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPS----- 111
E +R IF N+ + + S GLN+F+DL+ E+++ YL K+
Sbjct: 75 GEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKMMKRERKNE 134
Query: 112 ---YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK 168
+DR V LP + DWR+ AV VKDQ CGS WAFST G +EG+ T
Sbjct: 135 DGFRSDRFV--FEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVTG 192
Query: 169 KLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK 227
+L+SLSEQEL+DCD + GC GG + AF+ I+ GG++ E YPY+G D C N+
Sbjct: 193 ELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKN--GGIDTEDDYPYKGVDGLCDQNR 250
Query: 228 K-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGG 284
K A V INGY V ++ K V + P++VAI A A Q Y +GV F G
Sbjct: 251 KNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGV-----FTGQCG 305
Query: 285 NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
E L H V+ VGYG + K YWI++NSWG WGE GY RL R G CGI
Sbjct: 306 TE-LDHGVVAVGYGSENGK------DYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGI 358
>gi|165969032|ref|YP_001650932.1| peptidase [Orgyia leucostigma NPV]
gi|164663528|gb|ABY65748.1| peptidase [Orgyia leucostigma NPV]
Length = 328
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 121/321 (37%), Positives = 175/321 (54%), Gaps = 24/321 (7%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + K Y +E R IF NL +I + ++ + + VY +N+FSDLS
Sbjct: 23 LKAPDYFESFVANYQKNYNDDLEKSKRYTIFKDNLEEINV-KNRLNDTAVYRINKFSDLS 81
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G PS ++ P P FDWR+ + VT +K+Q CG+ WA
Sbjct: 82 KTEIISKYTGLN-APSETTNFCKTIVLDQPPGKGPLNFDWRQQNKVTSIKNQGSCGACWA 140
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T +IE YA + + ++LSEQ+LIDCD D GC GG + AF+ ++ GG+++E
Sbjct: 141 FATLASIESQYAIRNDRHINLSEQQLIDCDYVDMGCYGGLLHTAFEQMIQM--GGVKQEH 198
Query: 213 TYPYRGDDKACRLNKKATQ---VKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
YPY G +K C LN V+I G Y V E + L GP+ +AI+A +
Sbjct: 199 EYPYAGVNKQCELNDITDDSFVVRIKGCYRYVVVREEKLKDLLRAVGPIPIAIDASGIVN 258
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + +C+ N L+H+VL+VGYGVD VPYW KN+WG WGE GYF
Sbjct: 259 YYKGVIN----YCE--NYGLNHAVLLVGYGVD------NGVPYWTFKNTWGVDWGENGYF 306
Query: 329 RLYRGDGSCGI-NDYVRSALV 348
RL + +CG+ N+ SA++
Sbjct: 307 RLRQNINACGMANELASSAVI 327
>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 125/306 (40%), Positives = 164/306 (53%), Gaps = 20/306 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
F H KTY +LVE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLD 85
Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K G PS A ++ A DWRE AVT VKDQ CGS WAFS G I
Sbjct: 86 LLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
EG + K LVSLS QEL+DC ED +GC+GG + AFD + + G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPY 202
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +C+ + + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 203 EGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL + +
Sbjct: 261 -RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313
Query: 337 CGINDY 342
CGI Y
Sbjct: 314 CGIGYY 319
>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 125/306 (40%), Positives = 164/306 (53%), Gaps = 20/306 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
F H KTY +LVE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLD 85
Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K G PS A ++ A DWRE AVT VKDQ CGS WAFS G I
Sbjct: 86 LLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
EG + K LVSLS QEL+DC ED +GC+GG + AFD + + G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPY 202
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +C+ + + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 203 EGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL + +
Sbjct: 261 -RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313
Query: 337 CGINDY 342
CGI Y
Sbjct: 314 CGIGYY 319
>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 124/309 (40%), Positives = 165/309 (53%), Gaps = 20/309 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAE 98
+ F H KTY +LVE R +F NL IQ E G + + +F+D++ E
Sbjct: 23 WQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEE 82
Query: 99 FQ--AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F K G PS A + ++ A DWRE AVT KDQ CGS WAFS
Sbjct: 83 FLDLLKLQGVPALPSNAVHFDNSEDIDMEEKDAVDWREEGAVTPAKDQANCGSCWAFSAV 142
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKT 213
G IEG + K LVSLS QEL+DC ED +GC+GG + AFD + + G++ E++
Sbjct: 143 GAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEES 199
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
YPY G +C+ + + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 200 YPYEGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGI 257
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+ C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL +
Sbjct: 258 VDE-RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKD 310
Query: 334 DGSCGINDY 342
+CGI Y
Sbjct: 311 VKACGIGYY 319
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 125/306 (40%), Positives = 165/306 (53%), Gaps = 24/306 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F++Q++K Y+ E+ SR + F N+ I+L + S GLNEF+DLS EF+
Sbjct: 41 MFTAFMKQYSKAYSH-AEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFK 99
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
KY G+K RS P + DWR +AVT +KDQ CGS WAFS TG+IE
Sbjct: 100 GKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIE 159
Query: 161 GVYAAKTK-KLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
G + + K L SLSEQ+L+DC + GC GG + AF+ I++ G+ E YPY+
Sbjct: 160 GAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANK--GICAESAYPYK 217
Query: 218 GDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
G C+ K T+ V I+GY V S DE + + GP++VAI A QFY +GV
Sbjct: 218 GVGGLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV 275
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
F NL H VL VGYG T + YWI+KNSWG WGE GY R+ R
Sbjct: 276 ------FSGTCGHNLDHGVLAVGYG------TTGSQDYWIVKNSWGTSWGESGYIRMIRN 323
Query: 334 DGSCGI 339
CGI
Sbjct: 324 KNQCGI 329
>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 125/306 (40%), Positives = 164/306 (53%), Gaps = 20/306 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
F H KTY +LVE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85
Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K G PS A ++ A DWRE AVT VKDQ CGS WAFS G I
Sbjct: 86 LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
EG + K LVSLS QEL+DC ED +GC+GG + AFD + + G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPY 202
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +C+ + + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 203 EGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL + +
Sbjct: 261 -RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313
Query: 337 CGINDY 342
CGI Y
Sbjct: 314 CGIGYY 319
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 172/312 (55%), Gaps = 28/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
F +H K Y E RL IF+ N KI + Q G + L N+++DL EF+
Sbjct: 32 FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 102 KYLGFKLKPSYADRS-------VPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF RS V + P ++TLP++ DWR AVT VKDQ CGS WAF
Sbjct: 92 LMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S+TG +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
K+YPY D +C NK A G+ + + DE MA+ + GP+AVAI+A + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASHESFQF 269
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + Q CD +NL H VL+VGYG D + YW++KNSWG WG+KG+
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGYGTDES-----GDDYWLVKNSWGTTWGDKGFI 320
Query: 329 RLYRG-DGSCGI 339
++ R D CGI
Sbjct: 321 KMLRNKDNQCGI 332
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 167/314 (53%), Gaps = 31/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ H +TY + E R +F NLR I + +GV+ GLN F+DL+
Sbjct: 40 MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA-HNAAADAGVHSFRLGLNRFADLTN 98
Query: 97 AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
E++A YLG + +P +R + A N LP + DWR AV VKDQ CGS WAF
Sbjct: 99 DEYRATYLGARTRPQ-RERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAF 157
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
ST +EG+ T L+SLSEQEL+DCD + GC GG + AF+ I++ GG++ EK
Sbjct: 158 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEK 215
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
YPY+G D C +N+K A V I+ Y V ++ + V N P++VAI A A Q Y
Sbjct: 216 DYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLY 275
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+G+ F L H V VGYG + K YWI+KNSWG WGE GY R
Sbjct: 276 SSGI------FTGSCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVR 323
Query: 330 LYRG----DGSCGI 339
+ R G CGI
Sbjct: 324 MERNIKASSGKCGI 337
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 182/349 (52%), Gaps = 35/349 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
F V++L+ + + F ++G E L +H V H LF +L +H+K Y +L E R
Sbjct: 13 FLVFVSVLACSALANEFSILGYAPEDLTSIHKVIH--LFESWLAKHSKIYESLDEKLHRF 70
Query: 64 HIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAM 120
IF NL+ I DT Y GLNEF+DL+ EF+ K+LG K + P D S+
Sbjct: 71 EIFMDNLKHID---DTNKKVSNYWLGLNEFADLTHEEFKNKFLGLKGELPERKDESIEEF 127
Query: 121 IPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
+ LP++ DWR+ AV VK+Q CGS WAFST +EG+ T L LSEQEL
Sbjct: 128 SYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQEL 187
Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKING 236
IDCD ++GC GG + AF +M GL +E+ YPY + C K ++ V I+G
Sbjct: 188 IDCDTTFNNGCNGGLMDYAFAYVMR---SGLHKEEEYPYIMSEGTCDEKKDVSETVTISG 244
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
Y V R+ D + N P++VAI A QFY GV F G E L H V
Sbjct: 245 YHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGV-----FDGHCGTE-LDHGVAA 298
Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
VGYG T K + Y I++NSWG WGEKGY R+ R G CG+
Sbjct: 299 VGYG------TTKGLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGL 341
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 167/314 (53%), Gaps = 31/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ H +TY + E R +F NLR I + +GV+ GLN F+DL+
Sbjct: 45 MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA-HNAAADAGVHSFRLGLNRFADLTN 103
Query: 97 AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
E++A YLG + +P +R + A N LP + DWR AV VKDQ CGS WAF
Sbjct: 104 DEYRATYLGARTRPQ-RERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAF 162
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
ST +EG+ T L+SLSEQEL+DCD + GC GG + AF+ I++ GG++ EK
Sbjct: 163 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEK 220
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
YPY+G D C +N+K A V I+ Y V ++ + V N P++VAI A A Q Y
Sbjct: 221 DYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLY 280
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+G+ F L H V VGYG + K YWI+KNSWG WGE GY R
Sbjct: 281 SSGI------FTGSCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVR 328
Query: 330 LYRG----DGSCGI 339
+ R G CGI
Sbjct: 329 MERNIKASSGKCGI 342
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 131/325 (40%), Positives = 173/325 (53%), Gaps = 29/325 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAE 98
+N + QH K Y + E RL I+ N KI + Q E G + L N+++DL E
Sbjct: 27 WNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLHEE 86
Query: 99 FQAKYLGFK--------LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGS 149
F GF LK D V + P N+ +P+ DWRE AVT VKDQ CGS
Sbjct: 87 FVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHCGS 146
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGG 207
W+FS TG +EG + KT KLVSLSEQ L+DC + ++GC GG + AF I K GG
Sbjct: 147 CWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYI--KDNGG 204
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY-- 264
++ EK YPY D C N KA G+V + + DE + K + GP++VAI+A
Sbjct: 205 IDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDASHE 264
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
+ QFY GV + Q CD +ENL H VL VGYG + + YW++KNSWG WG+
Sbjct: 265 SFQFYSEGVYYEPQ--CD--SENLDHGVLAVGYGT-----SEEGEDYWLVKNSWGTTWGD 315
Query: 325 KGYFRLYRG-DGSCGINDYVRSALV 348
+GY ++ R D CGI LV
Sbjct: 316 QGYVKMARNRDNHCGIATAASYPLV 340
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 171/313 (54%), Gaps = 30/313 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
+ +L +H + Y L E R IF NLR I+ ++ + + GLN+F+DL+ E++
Sbjct: 50 YEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRT 109
Query: 102 KYLGFK--LKPSYADRSVP----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
YLG K + + P A PN +P + DWR+ AV +K+Q CGS WAFST
Sbjct: 110 MYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFST 169
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
+EG+ T ++++LSEQEL+DCD+ ++ GC GG + AF+ I+S GG++ EK Y
Sbjct: 170 VAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN--GGMDTEKHY 227
Query: 215 PYRGDDKACR-LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
PYRG + C + K V I+GY V R+E + K V + P+ VAI A A Q Y +
Sbjct: 228 PYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERALQK-AVAHQPVCVAIEASGRAFQLYSS 286
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV F E + H V++VGYG + V YWI++NSWG WGE GY ++
Sbjct: 287 GV------FTGECGEEVDHGVVVVGYG------SEDGVDYWIVRNSWGTKWGENGYVKME 334
Query: 332 RGD-----GSCGI 339
R G CGI
Sbjct: 335 RNVKKSHLGKCGI 347
>gi|9635308|ref|NP_059206.1| ORF58 [Xestia c-nigrum granulovirus]
gi|13124001|sp|Q9PYY5.1|CATV_GVXN RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6175702|gb|AAF05172.1|AF162221_58 ORF58 [Xestia c-nigrum granulovirus]
Length = 346
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 121/337 (35%), Positives = 175/337 (51%), Gaps = 30/337 (8%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
VALL+L V S++ + + + + LFN F+ ++NK Y E +R IF N
Sbjct: 19 VALLTLNVCAVSYIA------YDMSNAQE--LFNEFVVKYNKVYKDDQEKEARFEIFKQN 70
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT---- 125
L I E S ++ +N +D+S+ E K G KL ++ P +
Sbjct: 71 LADINARNALED-SAMFEINSRADISSNELLQKLTGLKLSLMRGEKKNSFCTPTVISGDS 129
Query: 126 ---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
+P +FDWR+ ++VT VK Q CGS WAFS NIE +Y K + LSEQ+L+DCD
Sbjct: 130 SGKVPDSFDWRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCD 189
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR 242
+ ++GC GG +S AF+ I+ GG+ E YPY G D C+ + Q+ Y R
Sbjct: 190 KVNNGCNGGLMSWAFEGIIR--AGGISYEAPYPYTGVDGVCKNTTRYVQLS-GCYAYDLR 246
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
E + + L E GP++VAI+ L Y +GV+ + L+H VL+VGYG +
Sbjct: 247 SEKKLRQVLHEKGPVSVAIDVVDLTNYKSGVAKHCSV-----DHGLNHGVLLVGYGQEND 301
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
V YW +KNSWG WGE+G+FR+ R SCGI
Sbjct: 302 ------VKYWTLKNSWGSDWGEQGFFRIKRDVNSCGI 332
>gi|17384029|emb|CAD12392.1| cysteine proteinase [Leishmania infantum]
Length = 354
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 129/360 (35%), Positives = 179/360 (49%), Gaps = 41/360 (11%)
Query: 5 YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
+FFA V + V S ++ L + + +A + F ++H K + E R +
Sbjct: 7 FFFAIVVTIRFVVCYGSALIA-QTPLGVVDFIA-SAHYGRFKKRHGKPFGEDAEEGRRFN 64
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
F N++ L + +F+DL+ EF YL P+Y
Sbjct: 65 AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 120
Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
D SV + + ++ DWRE VT VK+Q MCGS WAF+TTGNIEG +A K LV
Sbjct: 121 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 174
Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
SLSEQ L+ CD DDGC GG + A I++ G + E +YPY G C N
Sbjct: 175 SLSEQVLVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 233
Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
KI GY+S+ DE ++A Y+ +NGP+AVA++A Q Y GV C G +L
Sbjct: 234 TVGAKIKGYMSLPHDEEEIAAYVGKNGPVAVAVDATTRQLYFGGVV----TLCFG--LSL 287
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+H VL+VG+ PYWI+KNSWG WGEKGY RL G C + +YV +A +
Sbjct: 288 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYVVTATI 341
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 137/352 (38%), Positives = 185/352 (52%), Gaps = 34/352 (9%)
Query: 2 SCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYS 61
S FFA +L +V F +VG H K LF ++ H K Y +L E
Sbjct: 9 SFLTFFA--SLFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNSLEEKLH 66
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
R +F NL+ I ++ E S GLNEF+DLS EF++K+LG F K S D S
Sbjct: 67 RFEVFKENLKHIDQ-RNKEVTSYWLGLNEFADLSHEEFKSKFLGLYPEFPRKKSSEDFSY 125
Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
++ LP++ DWR+ AVT VK+Q CGS WAFST +EG+ L SLSEQ+
Sbjct: 126 RDVV---DLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNLTSLSEQQ 182
Query: 178 LIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKIN 235
LIDCD ++GC GG + AF+ I++ GGL +E+ YPY ++ C ++ + V I+
Sbjct: 183 LIDCDTSFNNGCNGGLMDYAFEFIVNN--GGLHKEEDYPYLMEEGTCDEKREEMEVVTIS 240
Query: 236 GYVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSV 292
GY V R DE + K L P++VAI+A QFY GV F +L H V
Sbjct: 241 GYHDVPRNDEQSLLKALAHQ-PLSVAIDASGRDFQFYSGGV------FSGPCGTDLDHGV 293
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
VGYG + + Y I+KNSWG WGE+GY R+ R +G CGIN
Sbjct: 294 AAVGYG------SSSGIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLCGIN 339
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 121/314 (38%), Positives = 163/314 (51%), Gaps = 27/314 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A + ++ H+K Y L E R IF N+ +I+ E G+N+FSDL+ +F
Sbjct: 40 ARHDQWIAHHDKVYKDLNEKEMRFKIFKENVERIEAFNAGEDKGYKLGVNKFSDLTNEKF 99
Query: 100 QAKYLGFKLK-PSYADRSVPAM---IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
+ + G+K P S P N+T +P DWR+ AVT +KDQ CG WAFS
Sbjct: 100 RVLHTGYKRSHPKVMSSSKPKTHFRYANVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFS 159
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
EG++ KT KL+ LSEQEL+DCD ED+GC GG + AFD I+ GL E
Sbjct: 160 AVAATEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILK--NKGLTTEA 217
Query: 213 TYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFY 269
YPY+G+D C K A + KI GY V + V N P++VAI+ ++ QFY
Sbjct: 218 NYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFY 277
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV F + L+H+V VGYG T YWIIKNSWG WG+ GY R
Sbjct: 278 SSGV------FSGSCSTWLNHAVTAVGYGA-----TTDGTKYWIIKNSWGSKWGDSGYMR 326
Query: 330 LYRG----DGSCGI 339
+ R +G CG+
Sbjct: 327 IKRDVHEKEGLCGL 340
>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 291
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 118/289 (40%), Positives = 163/289 (56%), Gaps = 26/289 (8%)
Query: 77 QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL--KPSYADRSVPAMIPNIT---LPRAFD 131
Q + GS V+G+ +FSDL+ EF + +LG KL + A RS +P+ LP FD
Sbjct: 8 QAQDRGSAVHGVTQFSDLTPTEFASTFLGTKLANEDVAAIRSGMTTLPDYPAHDLPLEFD 67
Query: 132 WREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD----- 186
WRE AVT VK+Q CGS W FS TG +EG KT +LVSLSEQ+L+DCD D
Sbjct: 68 WRERGAVTPVKNQGACGSCWTFSATGAVEGANFLKTGELVSLSEQQLVDCDHTCDPSAPR 127
Query: 187 ----GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVS 241
GC GG NA + GL+ E YPY+G D C + ++ + VS
Sbjct: 128 NCDYGCNGGLPLNAMRYVQKH---GLDTESNYPYKGVDGKCASARHGPAAASVSSFNLVS 184
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
+ET +A L+++GP+++ I+A +Q YV GV+ P + C+ L H VLIVGYGV+
Sbjct: 185 TNETQIAAALLKHGPLSIGIDAAWMQTYVGGVACP--WICN--KAGLDHGVLIVGYGVNG 240
Query: 302 T---KFTHKAVPYWIIKNSWGEGWG-EKGYFRLYRGDGSCGINDYVRSA 346
T + H+ YWI+KNSWG WG E GY+ + + +CG+N V +A
Sbjct: 241 TAPARPWHRRQDYWIVKNSWGPNWGVEGGYYHICKDRAACGLNTMVVAA 289
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 129/309 (41%), Positives = 171/309 (55%), Gaps = 36/309 (11%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ +L E R ++F N++ I + S LN+F D+++ EF+ Y G +
Sbjct: 44 HHTVARSLEEKAKRFNVFKHNVKHIHETNKKDK-SYKLKLNKFGDMTSEEFRRTYAGSNI 102
Query: 109 K-------PSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
K A +S M N+ TLP + DWR+ AVT VK+Q CGS WAFST +E
Sbjct: 103 KHHRMFQGEKKATKSF--MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVE 160
Query: 161 GVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
G+ +TKKL SLSEQEL+DCD ++ GC GG + AF+ I K GGL E YPY+
Sbjct: 161 GINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEK--GGLTSELVYPYKAS 218
Query: 220 DKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
D+ C NK+ A V I+G+ V ++ D V N P++VAI+A QFY GV
Sbjct: 219 DETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGV--- 275
Query: 277 IQFFCDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
F G E L+H V +VGYG +D TK YWI+KNSWGE WGEKGY R+ RG
Sbjct: 276 --FTGRCGTE-LNHGVAVVGYGTTIDGTK-------YWIVKNSWGEEWGEKGYIRMQRGI 325
Query: 334 ---DGSCGI 339
+G CGI
Sbjct: 326 RHKEGLCGI 334
>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 173/305 (56%), Gaps = 19/305 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ---DTEHGSGVYGLNEFSDLSTAEFQA 101
F + H KTY +L+E +R IF NLRKI+ D S G+ F+DL+ EF+
Sbjct: 26 FKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKD 85
Query: 102 KYL-GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
K K KP+ + ++ + +P + DW + AV VK Q CGS WAFS TG +E
Sbjct: 86 KLRRQIKTKPN-VEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGALE 144
Query: 161 GVYAAKTKKLVSLSEQELIDCDQE--DDGCE-GGSISNAFDTIMSKLGGGLEEEKTYPYR 217
G A + LSEQ+L+DC + +D CE GG +S AFD ++ K G+E + +YPY+
Sbjct: 145 GQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDK---GIEADSSYPYK 201
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G D C+ + K T +KI GY +VS E ++ K + GP++VAI+A +Q Y G+ +
Sbjct: 202 GIDTPCQYDAKKTVLKIKGYRNVSISEEELKKAVGTVGPVSVAIDADPIQLYSGGILDGL 261
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGS 336
FC NL+H VL VGYG + F K +W +KNSWG+ WGE+GYFR+ R +
Sbjct: 262 --FC---THNLNHGVLAVGYGEEDHLFGKKK--FWKVKNSWGKDWGEQGYFRIKRDANNL 314
Query: 337 CGIND 341
CGI D
Sbjct: 315 CGIAD 319
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 181/339 (53%), Gaps = 30/339 (8%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
+LSLTV+ + VG H +H LF QHNKTY + R IF N++
Sbjct: 2 ILSLTVAC---IFVGVSPAAVDAHDEHWELFK---RQHNKTYLQKQDV-GRRAIFEANIK 54
Query: 72 KIQ---LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
KI LL D S GLN F+D++ EF+ KY G + + + A S N ++
Sbjct: 55 KINAHNLLYDLGRSSYRLGLNGFADMTPDEFE-KYRGTRFEANEARVSKLQHRDNRSMHV 113
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--E 184
P DWR VT VK+Q +CGS WAFSTTG +EG + ++ LVSLSEQ L+DC
Sbjct: 114 PDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAVYG 173
Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRD 243
+ GC GG + NAF I K GGLE EK+YPY G D C + + K+ G+V V SRD
Sbjct: 174 NAGCNGGLMDNAFRFI--KDAGGLETEKSYPYTGKDGTCHFDARGIGAKLTGFVDVPSRD 231
Query: 244 ETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
E + + GP++VAI+A QFY GV I C + +L H VL+VGYG
Sbjct: 232 EEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEIT--CS--STSLDHGVLVVGYGT-- 285
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
T YW++KNSWG WG+ GY ++ R + CGI
Sbjct: 286 ---TRDGKDYWLVKNSWGSSWGQSGYIQMSRNKENQCGI 321
>gi|42564159|gb|AAS20591.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 173/305 (56%), Gaps = 19/305 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ---DTEHGSGVYGLNEFSDLSTAEFQA 101
F + H KTY +L+E +R IF NLRKI+ D S G+ F+DL+ EF+
Sbjct: 26 FKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKD 85
Query: 102 KYL-GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
+ K KP+ + ++ + +P + DW + AV VK Q CGS WAFS TG +E
Sbjct: 86 ELRRQIKTKPN-VEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGALE 144
Query: 161 GVYAAKTKKLVSLSEQELIDCDQE--DDGCE-GGSISNAFDTIMSKLGGGLEEEKTYPYR 217
G A + LSEQ+L+DC + +D CE GG +S AFD ++ K G+E + +YPY+
Sbjct: 145 GQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDK---GIEADSSYPYK 201
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G D C+ + K T +KI GY +VS E ++ K + GP++VAI+A +Q Y G+ +
Sbjct: 202 GIDTPCQYDAKKTVLKIKGYKNVSNSEEELKKAVGTVGPVSVAIDADPIQLYFGGILDGL 261
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGS 336
FC NL+H VL VGYG + F K +W +KNSWG+ WGE+GYFR+ R +
Sbjct: 262 --FC---THNLNHGVLAVGYGEEDHLFGKKK--FWKVKNSWGKDWGEQGYFRIKRDANNL 314
Query: 337 CGIND 341
CGI D
Sbjct: 315 CGIAD 319
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 126/314 (40%), Positives = 166/314 (52%), Gaps = 26/314 (8%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
K LF ++ +H K Y T+ E R +F NL+ I + GLNEF+DLS
Sbjct: 42 KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVSNYWL-GLNEFADLSH 100
Query: 97 AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
EF+ KYLG K+ S S ++ LP++ DWR+ AVT VK+Q CGS WAF
Sbjct: 101 QEFKNKYLGLKVNLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAF 160
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
ST +EG+ T L SLSEQELIDCD ++GC GG + AF I+ GGL +E
Sbjct: 161 STVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQ--NGGLHKED 218
Query: 213 TYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFY 269
YPY ++ C + K+ TQ V INGY V ++ + N P++VAI A + QFY
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV F +L H V VGYG T K + Y I+KNSWG WGEKG+ R
Sbjct: 279 SGGV------FDGHCGSDLDHGVSAVGYG------TSKNLDYIIVKNSWGAKWGEKGFIR 326
Query: 330 LYRG----DGSCGI 339
+ R +G CG+
Sbjct: 327 MKRNIGKPEGICGL 340
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 174/316 (55%), Gaps = 22/316 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ + H +YAT+ E +R I+ NL I+ ++E S +N+F+DL+ EF A
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEK-HNSEGHSYKLAVNKFADLTYPEFAA 80
Query: 102 KYLGFKLKPSYADRSVPAM--IPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
KYLG + + A +S A +P ++LP + DWR VT +KDQ CGS W+FSTTG+
Sbjct: 81 KYLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGS 140
Query: 159 IEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG +A KT +LVSLSEQ L+DC Q + GC GG + AF I+S G++ E +YPY
Sbjct: 141 VEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISN--NGIDTESSYPY 198
Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAY--ALQFYVTGV 273
D C+ N + Y + S E+D+ + GP++VAI+A + QFY +GV
Sbjct: 199 TAQDGTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGV 258
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR- 332
+ + C + L H VL VGYG T + YW++KNSWG WG+ GY + R
Sbjct: 259 YN--EPACS--SSQLDHGVLAVGYG------TSGSSDYWLVKNSWGTSWGQSGYIWMTRN 308
Query: 333 GDGSCGINDYVRSALV 348
+ CGI LV
Sbjct: 309 SNNQCGIATAASYPLV 324
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 129/352 (36%), Positives = 185/352 (52%), Gaps = 43/352 (12%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
FA +AL+++ +VS V+ +E + F +H K Y E RL I
Sbjct: 4 LFALLALVAVAQAVSYADVIKEE-------------WQTFKLEHRKNYVDETEERFRLKI 50
Query: 66 FSGNLRKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQAKYLGFKLKPSYADR-SVPAM 120
F+ N KI + + SG +N+++D+ EF GF R S P+
Sbjct: 51 FNENKHKI-AKHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSF 109
Query: 121 I-------PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
+ ++ +P++ DWR AVT VKDQ CGS WAFS+TG +EG + K L+SL
Sbjct: 110 VGVTFISPEHVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISL 169
Query: 174 SEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
SEQ L+DC + ++GC GG + NAF I K GG++ EK+YPY G D +C NK
Sbjct: 170 SEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEGIDDSCHFNKATIG 227
Query: 232 VKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENL 288
G V + + DE MA+ + GP++VAI+A + QFY G+ + Q CD +NL
Sbjct: 228 ATDRGSVDIPQGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQ--CDP--QNL 283
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGI 339
H VL+VGYG D + YW++KNSWG WG+KG+ ++ R D CGI
Sbjct: 284 DHGVLVVGYGTDES-----GQDYWLVKNSWGTTWGDKGFIKMARNADNQCGI 330
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 166/312 (53%), Gaps = 24/312 (7%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
K LF ++ +H K Y ++ E R IF NL+ I + GLNEF+DLS
Sbjct: 42 KLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWL-GLNEFADLSH 100
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF+ KYLG K+ S S ++ LP++ DWR+ AV VK+Q CGS WAFST
Sbjct: 101 QEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFST 160
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
+EG+ T L SLSEQELIDCD+ ++GC GG + AF I+ GGL +E+ Y
Sbjct: 161 VAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVEN--GGLHKEEDY 218
Query: 215 PYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVT 271
PY ++ C + K+ T+ V I+GY V ++ + N P++VAI A QFY
Sbjct: 219 PYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 278
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV F +L H V VGYG T K V Y I+KNSWG WGEKGY R+
Sbjct: 279 GV------FDGHCGSDLDHGVAAVGYG------TAKGVDYIIVKNSWGSKWGEKGYIRMR 326
Query: 332 RG----DGSCGI 339
R +G CGI
Sbjct: 327 RNIGKPEGICGI 338
>gi|146084829|ref|XP_001465113.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
gi|134069209|emb|CAM67356.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
Length = 354
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 128/360 (35%), Positives = 178/360 (49%), Gaps = 41/360 (11%)
Query: 5 YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
+FFA V + V S ++ + + +A + F ++H K + E R +
Sbjct: 7 FFFAIVVTILFVVCYGSALIA--QTPLGVDDFIASAHYGRFKKRHGKPFGEDAEEGRRFN 64
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
F N++ L + +F+DL+ EF YL P+Y
Sbjct: 65 AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 120
Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
D SV + + ++ DWRE VT VK+Q MCGS WAF+TTGNIEG +A K LV
Sbjct: 121 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 174
Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
SLSEQ L+ CD DDGC GG + A I++ G + E +YPY G C N
Sbjct: 175 SLSEQVLVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 233
Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
KI GY+S+ DE ++A Y+ +NGP+AVA++A Q Y GV C G +L
Sbjct: 234 TVGAKIKGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVV----TLCFG--LSL 287
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+H VL+VG+ PYWI+KNSWG WGEKGY RL G C + +YV +A +
Sbjct: 288 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYVVTATI 341
>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
str. Neff]
Length = 330
Score = 197 bits (500), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 117/312 (37%), Positives = 171/312 (54%), Gaps = 18/312 (5%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q+ K+YA+ E+ RL IF NL +I L G+ YG+N+F+DL+ EF+A
Sbjct: 32 FRQFAAQYGKSYAS-EEFGERLRIFRDNLDRIDALNSANTGA-RYGVNKFADLTPKEFKA 89
Query: 102 KYL-GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
YL G + + + LP FDWR+ AVT KDQ CG WAFS T IE
Sbjct: 90 TYLKGARSAGQKKAAATAKLDMTGPLPSQFDWRDKGAVTPTKDQGQCG--WAFSVTEAIE 147
Query: 161 GVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+ +KLVSL+ Q+++DCDQ D GC+GG A++ ++ GGL+ E++YPY
Sbjct: 148 SQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYEYVIK--AGGLDTEESYPYTA 205
Query: 219 DDKACRLNKKATQVKING--YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
+D C A KI+ Y++ +++ET+M L GP+++ ++A + Q+Y+ GV
Sbjct: 206 EDGQCAFKPSAVGAKISNWTYITTTKNETEMQYGLASRGPLSICVDASSWQYYIGGV--- 262
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
I C+ ++L H V+I GY V + + W I+NSWGE WG GY + RG
Sbjct: 263 ITSLCE---DSLDHCVMITGYSV-QEGWDFMKYDVWNIRNSWGEDWGYGGYLYVQRGSNL 318
Query: 337 CGINDYVRSALV 348
CG+ D V LV
Sbjct: 319 CGVGDEVTIPLV 330
>gi|394331824|gb|AFN27131.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 118/318 (37%), Positives = 172/318 (54%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A +L +LSEQ+L+ CD +D GC + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDSGCRARLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY + Q+ +I+GY+++ ET MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSSTGYVPECSNSIQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GV C G L+H VL+VGY +RT VPYW+IKNSWGE WGE GY R+
Sbjct: 275 RGVVTS----CAG--MPLNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 159/294 (54%), Gaps = 31/294 (10%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADR-- 115
RL +F NLR I + E +G++ GL F+DL+ E++ + LGF+ + A R
Sbjct: 70 RLEVFRDNLRYIDA-HNAEADAGLHTFRLGLTPFADLTLEEYRGRALGFRARRGGASRVG 128
Query: 116 SVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
S + P LP A DWRE AVTGVK+Q CG WAFS IEG+ T LVS
Sbjct: 129 SGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQCGGCWAFSAVAAIEGINEIVTGNLVS 188
Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ- 231
LSEQE+IDCD +D GC GG + NAF +++ GG++ E YPY G D AC N+ +
Sbjct: 189 LSEQEIIDCDTQDGGCNGGEMQNAFQFVINN--GGIDTEADYPYLGTDAACDANRVNERV 246
Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLS 289
V I+G+VSV+ + + V N P++VAI+A +F Y +G+ F L
Sbjct: 247 VTIDGFVSVATENETALQEAVANQPVSVAIDASGRKFQHYTSGI------FNGPCGTQLD 300
Query: 290 HSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
H V VGYG + K YWI+KNSW WGE GY R+ R G CGI
Sbjct: 301 HGVTAVGYGSENGK------DYWIVKNSWSSSWGEAGYIRIRRNVAAATGKCGI 348
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 180/331 (54%), Gaps = 29/331 (8%)
Query: 24 VVGDEKLHHLHHVKHTA-LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
+ D + H LH +F+ +LE+H++ Y +L E R IF NL I E
Sbjct: 33 AIMDYEAHELHSDDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEK- 91
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAMI-PNITLPRAFDWREYDAVTG 140
S GLN+FSDL+ EF+A YLG + ++ R+ I ++ DWR+ AV+
Sbjct: 92 SYWLGLNKFSDLTHDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEMVDWRKKGAVSD 151
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDT 199
VKDQ CGS WAFS G++EGV A T +L+SLSEQEL+DCD+ ++ GC GG + AFD
Sbjct: 152 VKDQGSCGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDF 211
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYVSV-SRDETDMAKYLVENGP 256
I+ GG++ E+ YPY+ D C +K T V I+ Y V ++ E+ + K + +N P
Sbjct: 212 IIKN--GGIDTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKN-P 268
Query: 257 MAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
++VAI A Q Y GV F +L H VL VGYG D V YWI+
Sbjct: 269 VSVAIEAGGRDFQHYQGGV------FTGPCGTDLDHGVLAVGYGTD-----DDGVNYWIV 317
Query: 315 KNSWGEGWGEKGYFRLYR-----GDGSCGIN 340
KNSWG WGEKGY R+ R G CGIN
Sbjct: 318 KNSWGPSWGEKGYIRMERMGSNSTSGKCGIN 348
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 131/351 (37%), Positives = 188/351 (53%), Gaps = 40/351 (11%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
L++L +VS F + DE F F + H K Y +E R IF N +
Sbjct: 10 LIALGQAVSFFDLSADE-------------FTLFKKFHRKEYDNELEESYRKKIFLENKK 56
Query: 72 KIQLLQDTEHGSGVYG----LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA--MIP--N 123
+I+ ++ + G LN +D+ E+ YLGF + + + IP +
Sbjct: 57 RIEK-HNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNKSSKANNNKLQSYTFIPPAH 115
Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
+TL + DWR AVT VK+Q CGS WAFSTTG +EG KT KLVSLSEQ L+DC
Sbjct: 116 VTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSG 175
Query: 184 E--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
++GCEGG + NAF I K G++ EK+YPY G+D+ CR K + +G+V ++
Sbjct: 176 SYGNNGCEGGLMDNAFQYI--KENHGIDTEKSYPYEGEDETCRFRKTSIGATDSGFVDIT 233
Query: 242 R-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ DE + + + GP++VAI+A + QFY GV + + C +ENL H VL+VGYG
Sbjct: 234 QGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPE--C--SSENLDHGVLVVGYG 289
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
V+ + YW++KNSWG WG+ GY ++ R D +CGI LV
Sbjct: 290 VEDNQ------KYWLVKNSWGTQWGDGGYIKMARDQDNNCGIATQASYPLV 334
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/310 (38%), Positives = 172/310 (55%), Gaps = 26/310 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +H K + + VE R+ IF+ N KI L S GLN++SD+ EF+
Sbjct: 30 FKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDMLYHEFKE 89
Query: 102 KYLGFK--LKPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
G+ ++ + +I N+ +P++ DWR++ AVT VKDQ CGS WAFS+
Sbjct: 90 TMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCGSCWAFSS 149
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +EG + K LVSLSEQ L+DC + ++GC GG + NAF I K GG++ EK+
Sbjct: 150 TAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKS 207
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYV 270
YPY G D +C K G+V + + DE + K + GP++VAI+A + Q Y
Sbjct: 208 YPYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHESFQLYS 267
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GV + + CD +NL H VL+VGYG D+T + YW++KNSWG WG++GY ++
Sbjct: 268 EGVYNEPE--CDA--QNLDHGVLVVGYGTDKT-----GLDYWLVKNSWGTTWGDQGYIKM 318
Query: 331 YRG-DGSCGI 339
R D CGI
Sbjct: 319 ARNQDNQCGI 328
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 169/315 (53%), Gaps = 29/315 (9%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDL 94
K LF ++ +H K Y T+ E R +F NL+ I D Y GLNEF+DL
Sbjct: 42 KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHID---DRNKVVSNYWLGLNEFADL 98
Query: 95 STAEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
S EF+ KYLG K+ S S ++ LP++ DWR+ AVT VK+Q CGS WA
Sbjct: 99 SHQEFKNKYLGLKVDLSQRRESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWA 158
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FST +EG+ T L SLSEQELIDCD ++GC GG + AF I+ GGL +E
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKN--GGLHKE 216
Query: 212 KTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQF 268
+ YPY ++ C + K+ ++ V INGY V ++ + N P++VAI A QF
Sbjct: 217 EDYPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV F G+E L H V VGYG T K + Y I+KNSWG WGEKG+
Sbjct: 277 YSGGV-----FDGHCGSE-LDHGVSAVGYG------TSKGLDYIIVKNSWGAKWGEKGFI 324
Query: 329 RLYRG----DGSCGI 339
R+ R +G CG+
Sbjct: 325 RMKRNIGKSEGICGL 339
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 128/348 (36%), Positives = 188/348 (54%), Gaps = 28/348 (8%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHT-ALFNYFLEQHNKTYATLVEYYSRLH 64
F + L T + SF + D K+ L AL+ +L ++ K+Y +L E R+
Sbjct: 7 FISMSLLFFSTFLIFSFAI--DAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIE 64
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIP 122
IF NLR I + S GLN+F+DL+ E+++ YLGFK LK ++R +P +
Sbjct: 65 IFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSLKSKVSNRYMPQV-- 122
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
LP DWR AV VK+Q +C S WAF+T +E + T L+SLSEQEL+DC+
Sbjct: 123 GEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCN 182
Query: 183 QE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVS 239
+ ++GC+GG + +A++ I++ GG+ E+ YPY G D C KK V I+ Y
Sbjct: 183 RTPINEGCKGGFMDDAYEFIINN--GGINTEENYPYIGQDDQCDEPKKNQNYVTIDSYEQ 240
Query: 240 VSRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
V ++ K V P++VAI+AY L +FY +G+ F L+H+V I+GY
Sbjct: 241 VPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGI-----FTGGSCGTTLNHAVTIIGY 295
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR---GDGSCGINDY 342
G T + YWI+KNS+G WGE GY ++ R G+G CGI Y
Sbjct: 296 G------TENGIDYWIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASY 337
>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
Length = 377
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 160/315 (50%), Gaps = 21/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL-NEFSDLSTAEFQ 100
F F+ + KTY T+ E+ RL +F+ N KI L D + GL N+F+D + EF
Sbjct: 65 FMTFMTKFEKTYETVEEWAHRLTVFAQNA-KIVLEHDAKAEGFALGLDNQFADWTAEEF- 122
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
A Y +P + + + P A DWR V +K+Q CGS W FST +IE
Sbjct: 123 ASYQKLHSRPKPSQAGATHEVSDKAAPTAVDWRTEGVVADIKNQGSCGSCWTFSTVVSIE 182
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDD---------GCEGGSISNAFDTIMSKLGGGLEEE 211
G A KT KLV+LSEQ L+DC ++D GC GG + NAFD I+ GG++ E
Sbjct: 183 GAAARKTGKLVTLSEQNLVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIKNQDGGIDTE 242
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAY-ALQFY 269
+Y Y G D C +K I+ + V+ DE +A L GP+++A++A Q Y
Sbjct: 243 ASYGYTGKDGTCAFDKANVGATISNWTDVAVGDEVALADALANAGPVSIALDASKQWQLY 302
Query: 270 VTGVSHPIQFF-CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
G+ P C + H V IVGYG D V YW I+NSWG WGE GY
Sbjct: 303 SGGILKPRSILGCSSDPTHADHGVAIVGYGTD------DGVDYWWIRNSWGTTWGESGYM 356
Query: 329 RLYRGDGSCGINDYV 343
RL RG +CG+ ++
Sbjct: 357 RLERGVNACGVANFA 371
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/306 (37%), Positives = 170/306 (55%), Gaps = 21/306 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLSTAE 98
+ +F Q+ + Y E R +F N + ++ E+G + +N+F D++ E
Sbjct: 12 WEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEE 71
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
F A G+K K S + + + DWR AVT VKDQ CGS WAFS TG+
Sbjct: 72 FNAVMKGYK-KGSRGEPTTVFTAEGRPMAADVDWRTKGAVTPVKDQGQCGSCWAFSATGS 130
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG + K +LVSLSEQEL+DC E +DGC GG +++AFD I K GG++ E +YPY
Sbjct: 131 LEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYI--KDNGGIDTESSYPY 188
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVS 274
D++CR + + G+V V E + + + + GP++VAI+A ++ QFY +GV
Sbjct: 189 EAQDRSCRFDANSIGATCTGFVEVQHTEEALHEAVSDIGPISVAIDASHFSFQFYSSGVY 248
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
+ + C NL H VL VGYG + T+ YW++KNSWG GWG+ GY ++ R
Sbjct: 249 YEKK--CS--PTNLDHGVLAVGYGTESTE------DYWLVKNSWGSGWGDAGYIKMSRNR 298
Query: 334 DGSCGI 339
D +CGI
Sbjct: 299 DNNCGI 304
>gi|390470786|ref|XP_003734355.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin W [Callithrix jacchus]
Length = 373
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 176/324 (54%), Gaps = 22/324 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +F Q N++Y T E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKFFQIQFNRSYLTPEEHARRLDIFAHNLVQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTTG 157
Y + PS + R V + P ++P DWR+ A++ +++Q C WA + G
Sbjct: 102 LYGHQRAAGGVPSMS-RVVGSEEPEESVPHTCDWRKVAGAISFIRNQGNCLCCWAMAAAG 160
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
NIE +++ K V++S QEL+DC + DGC GG + +AF T++ G+ E YP++
Sbjct: 161 NIEALWSINFLKFVNVSVQELLDCGRCGDGCHGGYVWDAFSTVLKN--SGVVSESDYPFQ 218
Query: 218 GDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
+ R + K I ++ + D +A+YL GP+ V INA LQ Y GV
Sbjct: 219 ANFGPHRCHAKTYNKVAWIMDFIFLPDDXQRIAQYLTTYGPITVTINAKHLQLYQKGVIK 278
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFT-----------HKAVPYWIIKNSWGEGWGE 324
CD + + HSVL+VG+G ++++ ++ PYWI+KNSWG WGE
Sbjct: 279 ARPTTCD--PQFVDHSVLLVGFGSEKSEGMGAKTVSSQSRHPRSTPYWILKNSWGAQWGE 336
Query: 325 KGYFRLYRGDGSCGINDYVRSALV 348
+GYFRL+RG +CGI Y +A V
Sbjct: 337 EGYFRLHRGSNTCGITKYPVTARV 360
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 129/331 (38%), Positives = 177/331 (53%), Gaps = 28/331 (8%)
Query: 22 FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
F +VG H + K LF ++ +H+K Y ++ E R +F NL I ++ E
Sbjct: 31 FSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQ-RNNEI 89
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM---IPNIT-LPRAFDWREYDA 137
S GLNEF+DL+ EF+ +YLG KP ++ + P+ +IT LP++ DWR+ A
Sbjct: 90 NSYWLGLNEFADLTHEEFKGRYLGLA-KPQFSRKRQPSANFRYRDITDLPKSVDWRKKGA 148
Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNA 196
V VKDQ CGS WAFST +EG+ T L SLSEQELIDCD + GC GG + A
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENG 255
F I+S GGL +E YPY ++ C+ K+ +V I+GY V ++ + + +
Sbjct: 209 FQYIIST--GGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQ 266
Query: 256 PMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
P++VAI A QFY GV F +L H V VGYG + K Y I
Sbjct: 267 PVSVAIEASGRDFQFYKGGV------FNGKCGTDLDHGVAAVGYG------SSKGSDYVI 314
Query: 314 IKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
+KNSWG WGEKG+ R+ R +G CGIN
Sbjct: 315 VKNSWGPRWGEKGFIRMKRNTGKPEGLCGIN 345
>gi|308462787|ref|XP_003093674.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
gi|308249538|gb|EFO93490.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
Length = 392
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 191/346 (55%), Gaps = 23/346 (6%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
YFF + L++T+ + + +++ ++ F F ++ K + VE+ R
Sbjct: 58 LYFFTALFFLTVTLGL-----LYQKRVERQEFFENLQEFRDFNQKFQKIHKNSVEFKERF 112
Query: 64 HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL-KPSYADRSVPAM-I 121
IF GNL+K+++L+ + + +N+FSD+S E + L KL + ++ + ++ + +
Sbjct: 113 LIFRGNLKKLEILRSSNPDID-FSINQFSDMSENELKLILLDKKLLERNFQNSTLKSFDL 171
Query: 122 P-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
P N+T P DWR+ V VK+Q CGS WAF+T +E YA + L SLSEQEL+D
Sbjct: 172 PMNLTRPERIDWRDSGKVMSVKNQGACGSCWAFATVAAVESQYAIRKGTLWSLSEQELVD 231
Query: 181 CDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR-GDDKACRLNKKATQVKINGYVS 239
CD E GC GG + A + LG GLE E YPY C +N T+V ++ S
Sbjct: 232 CDGESYGCGGGFLDKALGWV---LGNGLETEDDYPYECTQHDQCYINGGKTRVTVDEGWS 288
Query: 240 VSRDETDMAKYLVENGPMAVAIN-AYALQFYVTGVSHPIQFFCDGGNENLS-HSVLIVGY 297
+ RDE +A ++ GP+A A++ + Y GV +P + C +E+L H++ ++GY
Sbjct: 289 LGRDEDSIADWVASVGPVAFAMSVPNSFTAYSNGVYNPSEHECR--DESLGYHAMTLIGY 346
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
G + + PYWI+KNSWG WG++GY RL RG+ +CG+ D+V
Sbjct: 347 GTEGNQ------PYWIVKNSWGSSWGDQGYMRLARGNNACGMRDFV 386
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 127/323 (39%), Positives = 169/323 (52%), Gaps = 26/323 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLST 96
A ++ F H K YA+ E Y RL I+ N KI + S V +NEF DL
Sbjct: 25 AEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLH 84
Query: 97 AEFQAKYLGFKLKPSYADRS-----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
EF + GFK + R P ++ LP+ DWR+ AVT VK+Q CGS W
Sbjct: 85 HEFVSTRNGFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQCGSCW 144
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
AFSTTG++EG + KT+KLVSLSEQ L+DC + ++GCEGG + NAF I S G++
Sbjct: 145 AFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSN--KGID 202
Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--AL 266
E +YPY D C N+ G+V + DE + K + GP++VAI+A +
Sbjct: 203 TEWSYPYNATDGVCHFNRSDVGATDTGFVDIPEGDENKLKKAVAAVGPVSVAIDASHESF 262
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY GV + C +E L H VL+VGYG T YW++KNSWG WG++G
Sbjct: 263 QFYSEGVYDEPE--C--SSEQLDHGVLVVGYG------TKDGQDYWLVKNSWGTTWGDEG 312
Query: 327 YFRLYRG-DGSCGINDYVRSALV 348
Y + R D CGI LV
Sbjct: 313 YIYMTRNKDNQCGIASSASYPLV 335
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 122/314 (38%), Positives = 165/314 (52%), Gaps = 31/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ H +TY + R +F NLR I + +GV+ GLN F+DL+
Sbjct: 43 MYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDA-HNAAADAGVHSFRLGLNRFADLTN 101
Query: 97 AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
E+ A YLG + +P DR + A N LP + DWR AV VKDQ CG+ WAF
Sbjct: 102 DEYPATYLGARTRPQR-DRKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAF 160
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
ST +EG+ T L+SLSEQEL+DCD + GC GG + AF+ I++ GG++ EK
Sbjct: 161 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEK 218
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
YPY+G D C +N+K A V I+ Y V ++ + V N P++VAI A A Q Y
Sbjct: 219 DYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLY 278
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+G+ F L H V VGYG + K YWI+KNSWG WGE GY R
Sbjct: 279 SSGI------FTGSCGTRLDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVR 326
Query: 330 LYRG----DGSCGI 339
+ R G CGI
Sbjct: 327 MERNIKASSGKCGI 340
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 114/308 (37%), Positives = 163/308 (52%), Gaps = 26/308 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQAKY 103
++ +H + YA E +R +F N+ +I+ L D + G + +N+F+DL+ EF++ Y
Sbjct: 41 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100
Query: 104 LGFKLKPSYADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
GFK + R+ P + + LP + DWR+ AVT +KDQ +CGS WAFS
Sbjct: 101 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 160
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEGV K KL+SLSEQEL+DCD D GC GG + AF+ ++ GGL E YPY+
Sbjct: 161 IEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITI--GGLTSESNYPYKS 218
Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
+ C NK K I G+ V ++ V + P+++ I QFY +GV
Sbjct: 219 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGV-- 276
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
F +L H V VGYG R+K + YWI+KNSWG WGE+GY R+ +
Sbjct: 277 ----FSGECTTHLDHGVTAVGYG--RSK---NGLKYWILKNSWGPKWGERGYMRIKKDIK 327
Query: 334 --DGSCGI 339
G CG+
Sbjct: 328 PKHGQCGL 335
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 121/274 (44%), Positives = 152/274 (55%), Gaps = 37/274 (13%)
Query: 88 LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWREYD 136
LN F D+S AEF+A + G ++ S R PA P++ LPR+ DWR+
Sbjct: 91 LNRFGDMSQAEFRATFAGSRV--SDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKG 148
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED-DGCEGGSISN 195
AVTGVK+Q CGS WAFST ++EG+ A +T KLVSLSEQELIDCD D DGCEGG + N
Sbjct: 149 AVTGVKNQGKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDN 208
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ----VKINGYVSVSRDETDMAKYL 251
AF+ I K GGL E YPYR + C+ K A V I+G+ V + +
Sbjct: 209 AFEYI--KKNGGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKA 266
Query: 252 VENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
V N P++V I+A A FY GV F + G E L H V +VGYGV
Sbjct: 267 VANQPVSVGIDASGKAFMFYSEGV-----FTGECGTE-LDHGVAVVGYGV-----AEDGK 315
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
YW +KNSWG WGEKGY R+ + G+ CGI
Sbjct: 316 AYWTVKNSWGPSWGEKGYIRVEKDSGAEGGLCGI 349
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 168/311 (54%), Gaps = 27/311 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F + +H K Y++L E+ R ++ NL IQ + S GL +F+D++ EF+
Sbjct: 46 FGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNR-SYWLGLTKFADITNDEFRR 104
Query: 102 KYLGFKLKPS-YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
+Y G ++ S + R + P + DWR+ AVT VKDQ CGS WAFS G++E
Sbjct: 105 QYTGTRIDRSKRSKRKTGFRYADSEAPESVDWRKKGAVTTVKDQGSCGSCWAFSAIGSVE 164
Query: 161 GVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
G+ A +T + VSLSEQEL+DCD E + GC GG + AFD I+ GG++ E YPY+G
Sbjct: 165 GINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILEN--GGIDTENDYPYKGL 222
Query: 220 DKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
D C NKK A V I+GY V ++ + K V P++VAI A Q Y GV
Sbjct: 223 DGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGV--- 279
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD-- 334
F +L H VL VGYG + ++ YWI+KNSWGE WGE GY R+ R
Sbjct: 280 ---FTGECGTDLDHGVLAVGYG------SEGSLDYWIVKNSWGEYWGESGYLRMQRNIKD 330
Query: 335 -----GSCGIN 340
G CGIN
Sbjct: 331 SNHQFGLCGIN 341
>gi|394331818|gb|AFN27128.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 172/318 (54%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A +L +LSE L+ C ++ GC GG + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEHHLVSCHDKNSGCTGGLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I+GY+++ ET MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV +L+H VL+VGY +RT VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGISLNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 114/308 (37%), Positives = 163/308 (52%), Gaps = 26/308 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQAKY 103
++ +H + YA E +R +F N+ +I+ L D + G + +N+F+DL+ EF++ Y
Sbjct: 35 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94
Query: 104 LGFKLKPSYADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
GFK + R+ P + + LP + DWR+ AVT +KDQ +CGS WAFS
Sbjct: 95 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 154
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
IEGV K KL+SLSEQEL+DCD D GC GG + AF+ ++ GGL E YPY+
Sbjct: 155 IEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITI--GGLTSESNYPYKS 212
Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
+ C NK K I G+ V ++ V + P+++ I QFY +GV
Sbjct: 213 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGV-- 270
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
F +L H V VGYG R+K + YWI+KNSWG WGE+GY R+ +
Sbjct: 271 ----FSGECTTHLDHGVTAVGYG--RSK---NGLKYWILKNSWGPKWGERGYMRIKKDIK 321
Query: 334 --DGSCGI 339
G CG+
Sbjct: 322 PKHGQCGL 329
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 116/307 (37%), Positives = 163/307 (53%), Gaps = 24/307 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F ++ ++ + Y E R IF N+ I+ + S G+N+F+D++ EF A
Sbjct: 37 FEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVA 96
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLP---RAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y G +P ++ ++ + ++ DWR+Y AVT VKDQ CGS WAFS
Sbjct: 97 QYTGGISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIAT 156
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG+Y T LVSLSEQE++DC +GC+GG + NA+D I+S G+ E YPY+
Sbjct: 157 VEGIYKIVTGYLVSLSEQEVLDC-AVSNGCDGGFVDNAYDFIISN--NGVASEADYPYQA 213
Query: 219 DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSH 275
C N I GY V S DE+ M KY V N P+A AI+A Q+Y GV
Sbjct: 214 YQGDCAANSWPNSAYITGYSYVRSNDESSM-KYAVWNQPIAAAIDASGDNFQYYNGGV-- 270
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
F +L+H++ I+GYG D + YWI+KNSWG WGE+GY R+ RG
Sbjct: 271 ----FSGPCGTSLNHAITIIGYGQDSS-----GTQYWIVKNSWGSSWGERGYIRMARGVS 321
Query: 334 -DGSCGI 339
G CGI
Sbjct: 322 SSGLCGI 328
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/356 (36%), Positives = 186/356 (52%), Gaps = 38/356 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F G L+ L+ ++S ++ DE ++ F H K Y + +E R+ I
Sbjct: 8 FLLGAVLVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKFRMKI 54
Query: 66 FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
+ N K+ +L + S +N+F DL EF++ G++ K S A+ +
Sbjct: 55 YLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 114
Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
M P N+ +P + DWRE A+T VKDQ CGS WAFS+TG +EG KT KL+SLSEQ L
Sbjct: 115 MEPANVEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNL 174
Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
IDC + ++GC GG + AF I K G++ E TYPY +D CR N + G
Sbjct: 175 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG 232
Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
+V + E D K V GP++VAI+A + QFY GV + + CD +++L H VL
Sbjct: 233 FVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYY--EPSCD--SDDLDHGVL 288
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
+VGYG D K YW++KNSW E WG++GY ++ R CG+ LV
Sbjct: 289 VVGYGSDNGK------DYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPLV 338
>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
Length = 318
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 113/246 (45%), Positives = 149/246 (60%), Gaps = 24/246 (9%)
Query: 110 PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
P++A ++ ++P LP+ FDWR+ AVT VKD CGS W+FSTTG +E + T +
Sbjct: 74 PAHAQKA--PILPTKDLPKDFDWRDKGAVTNVKDLGGCGSCWSFSTTGALEVSFYLATGE 131
Query: 170 LVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
LVSLSEQ+L+DCD D GC GG ++NAF+ + S GG+++EK PY G D
Sbjct: 132 LVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEILQS---GGVQKEKDIPYTGRD 188
Query: 221 KACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
C+ +K T+V + VS DE +A LV+NGP+AVAINA +Q YV GVS P +
Sbjct: 189 GTCKFDK--TKVAATDLIKRVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--Y 244
Query: 280 FCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEK-GYFRLYRGDGSC 337
C ++L H VL+VGYG R K PYWIIKNSWGE WGE GY + RG C
Sbjct: 245 IC---GKHLDHGVLLVGYGEGRYAPIRFKNKPYWIIKNSWGESWGENDGYDEICRGRNVC 301
Query: 338 GINDYV 343
G++ V
Sbjct: 302 GVDAMV 307
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 119/309 (38%), Positives = 168/309 (54%), Gaps = 31/309 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
++ +L +H+K Y+ LVEY R IF NL+ I ++E+ + GL ++DL+ EFQ
Sbjct: 44 IYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDE-HNSENHTYKMGLTPYTDLTNEEFQ 102
Query: 101 AKYLG------FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A YLG +LK + A LP DWR+ AVT VK+Q CGS WAFS
Sbjct: 103 AIYLGTRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFS 162
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
T +E + +T L+SLSEQ+L+DC++++ GC+GG+ A+ I+ GG++ E Y
Sbjct: 163 TVSTVESINQIRTGNLISLSEQQLVDCNKKNHGCKGGAFVYAYQYIID--NGGIDTEANY 220
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF--YVTG 272
PY+ CR KK V+I+GY V + K V + P VAI+A + QF Y +G
Sbjct: 221 PYKAVQGPCRAAKKV--VRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSG 278
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
+ F L+H V+IVGY D YWI++NSWG WGE+GY R+ R
Sbjct: 279 I------FSGPCGTKLNHGVVIVGYWKD----------YWIVRNSWGRYWGEQGYIRMKR 322
Query: 333 --GDGSCGI 339
G G CGI
Sbjct: 323 VGGCGLCGI 331
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 128/316 (40%), Positives = 167/316 (52%), Gaps = 30/316 (9%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDL 94
K LF ++ +H K Y T+ E R +F NL+ I D Y GLNEF+DL
Sbjct: 42 KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHID---DRNKIVSNYWLGLNEFADL 98
Query: 95 STAEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
S EF+ KYLG K+ S S ++ LP++ DWR+ AVT VK+Q CGS W
Sbjct: 99 SHQEFKNKYLGLKVDLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCW 158
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
AFST +EG+ T L SLSEQELIDCD ++GC GG + AF I GGL +
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQ--NGGLHK 216
Query: 211 EKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQ 267
E+ YPY ++ C + K+ TQ V INGY V ++ + N P++VAI A + Q
Sbjct: 217 EEDYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQ 276
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY GV F +L H V VGYG T K + Y I+KNSWG WGEKG+
Sbjct: 277 FYSGGV------FDGHCGSDLDHGVSAVGYG------TSKNLDYIIVKNSWGAKWGEKGF 324
Query: 328 FRLYRG----DGSCGI 339
R+ R +G CG+
Sbjct: 325 IRMKRDIGKPEGICGL 340
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 166/321 (51%), Gaps = 32/321 (9%)
Query: 35 HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
H +H ++ +L +H K Y L E R IF NLR I+ S GLN+F+DL
Sbjct: 43 HTRH--VYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADL 100
Query: 95 STAEFQAKYLGFKLKPSYADRSVPAMIPNI-------TLPRAFDWREYDAVTGVKDQTMC 147
+ E++A +LG + + +V A + LP DWRE AVT +KDQ C
Sbjct: 101 TNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQC 160
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGG 206
GS WAFST G +EG+ T L SLSEQEL+DCD+ + GC GG + AF+ I+ G
Sbjct: 161 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQN--G 218
Query: 207 GLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
G++ E+ YPY D C N+K A V I+GY V ++ V N P++VAI A
Sbjct: 219 GIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGG 278
Query: 266 LQF--YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
++F Y +GV F NL H V+ VGYG T YW+++NSWG WG
Sbjct: 279 MEFQLYQSGV------FTGRCGTNLDHGVVAVGYG------TENGTDYWLVRNSWGSAWG 326
Query: 324 EKGYFRLYRG-----DGSCGI 339
E GY +L R G CGI
Sbjct: 327 ENGYIKLERNVQNTETGKCGI 347
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/327 (39%), Positives = 169/327 (51%), Gaps = 32/327 (9%)
Query: 30 LHHLHHVKHTAL----FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
LH ++H L F + +H K Y + R ++ NL I+ + S
Sbjct: 38 LHMTTDLEHENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSETNRTYS-- 95
Query: 86 YGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
GL +F+DL+ EF+ Y G ++ S A R + P + DWR+ AVT VKDQ
Sbjct: 96 LGLTKFADLTNEEFRRMYTGTRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTSVKDQ 155
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSK 203
CGS WAFS G++EG+ A + + VSLSEQEL+DCD E + GC GG + AFD I+
Sbjct: 156 GSCGSCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQN 215
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
GG++ EK YPY+G D C +KK A V I+GY V ++ + K V P++VAI
Sbjct: 216 --GGIDTEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIE 273
Query: 263 AYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
A Q Y GV F +L H VL VGYG T V YWI+KNSWGE
Sbjct: 274 AGGRDFQLYAQGV------FSGECGTDLDHGVLAVGYG------TEDGVDYWIVKNSWGE 321
Query: 321 GWGEKGYFRLYR-------GDGSCGIN 340
WGE GY R+ R G G CGIN
Sbjct: 322 YWGESGYLRMKRNMKDSNDGPGLCGIN 348
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 169/317 (53%), Gaps = 34/317 (10%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A++ +L +H K Y L + R +F NL IQ + + + GLN+F+D++ E+
Sbjct: 36 AMYEEWLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEY 95
Query: 100 QAKYLGFKLKPSYADRSVP---------AMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
+A YLG K S A R + A LP DWR AV +KDQ CGS
Sbjct: 96 RAMYLGTK---SNAKRRLMKTKSTGHRYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSC 152
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLE 209
WAFST +E + T K VSLSEQEL+DCD+ ++GC GG + AF+ I+ GG++
Sbjct: 153 WAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQN--GGID 210
Query: 210 EEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YAL 266
+K YPYRG D C KK A V I+GY V + + K V + P++VAI A AL
Sbjct: 211 TDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYDENALKKAVAHQPVSVAIEASGRAL 270
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Q Y +GV F +L H V++VGYG + V YW+++NSWG GWGE G
Sbjct: 271 QLYQSGV------FTGKCGTSLDHGVVVVGYG------SENGVDYWLVRNSWGTGWGEDG 318
Query: 327 YFRLYRG----DGSCGI 339
YF++ R G CGI
Sbjct: 319 YFKMQRNVRTSTGKCGI 335
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/351 (36%), Positives = 185/351 (52%), Gaps = 40/351 (11%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
A++++TV+ SS ++ + + F H KTY + +E R IF+ N
Sbjct: 9 AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKTYQSHMEELLRFKIFTEN- 54
Query: 71 RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGF--KLKPSYADRSVPAMIPNI 124
I + ++ G+ G+N+F DL EF + G+ K + PA + +
Sbjct: 55 SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYHGSRKSGGSTFLPPANVNDS 114
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
+LP+A DWR+ AVT VKDQ CGS WAFSTTG++EG + K +LVSLSEQ L+DC Q
Sbjct: 115 SLPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQS 174
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR 242
++GCEGG + +AF I K G++ EK+YPY D CR K+ GYV +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232
Query: 243 D-ETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
E D+ K + GP++VAI+A + Q Y GV P + +E+L H VL+VGYG
Sbjct: 233 GCEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
V K YW++KNSW E WG++GY + R + CGI LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 139/359 (38%), Positives = 187/359 (52%), Gaps = 43/359 (11%)
Query: 3 CFYFFAGVALLSLTVSVS-SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYS 61
CF A LSL+V+ S + +VG H K LF ++ K Y T+ E
Sbjct: 11 CFPLALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKLL 70
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKL-------KPSY 112
R +F NL+ I +T Y GLNEF+DLS EF+ YLG K + SY
Sbjct: 71 RFEVFKDNLKHID---ETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSY 127
Query: 113 AD---RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
A+ R V A +P++ DWR+ AV VK+Q CGS WAFST +EG+ T
Sbjct: 128 AEFAYRDVEA------VPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGN 181
Query: 170 LVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK 228
L +LSEQELIDCD ++GC GG + AF+ I+ GGL +E+ YPY ++ C + K
Sbjct: 182 LTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK--NGGLRKEEDYPYSMEEGTCEMQKD 239
Query: 229 ATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQF-YVTGVSHPIQFFCDGGN 285
++ V I+G+ V + DE + K L P++VAI+A +F + +GVS F
Sbjct: 240 ESETVTIDGHQDVPTNDEKSLLKALAHQ-PLSVAIDASGREFQFYSGVS----VFDGRCG 294
Query: 286 ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
+L H V VGYG + K Y I+KNSWG WGEKGY RL R +G CGIN
Sbjct: 295 VDLDHGVAAVGYG------SSKGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGIN 347
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 131/355 (36%), Positives = 185/355 (52%), Gaps = 44/355 (12%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
FF + +LS+ +VS + +V +E + F +H K Y VE R+ I
Sbjct: 5 FFIALTVLSIN-AVSFYDLVMEE-------------WQLFKAEHKKNYNNDVEEKFRMKI 50
Query: 66 FSGNLRKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQAKYLGFK---LKPSYADRSVP 118
F N +KI +T++ G GLN++SD+ EF + GF + P +
Sbjct: 51 FMDNKQKI-TKHNTKYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGK 109
Query: 119 A------MIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
IP N+ LP+ DW + AVT VKDQ CGS WAFS TG +EG++ KTK L
Sbjct: 110 THLKGSFFIPPANVKLPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVL 169
Query: 171 VSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK 228
VSLSEQ LIDC E+ +GC GG + AF + ++ GG++ E++YPY G++ CR +
Sbjct: 170 VSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYV--RINGGIDTERSYPYEGNNDVCRYEPE 227
Query: 229 ATQVKINGYVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGN 285
+ GY V + D K V GP++VAI+A + Q Y +GV + C
Sbjct: 228 NSGAIDTGYTDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVY--FEPNCKNEP 285
Query: 286 ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGI 339
E+L H VL+VGYG D YW++KNSWG+ WGE GY ++ R D CGI
Sbjct: 286 ESLDHGVLVVGYGTDE----ETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGI 336
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 130/354 (36%), Positives = 179/354 (50%), Gaps = 30/354 (8%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
+S F A ++ +S+ S+ +K + A++ +L +H K Y L E
Sbjct: 8 LSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKE 67
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP-- 118
R IF NLR I ++++ + GLN F+DL+ E+++ YLG K + R V
Sbjct: 68 KRFGIFKDNLRFIDE-HNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRK 126
Query: 119 ----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
A LP DWR+ AV GVKDQ CGS WAFST +EG+ T L+SLS
Sbjct: 127 SDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLS 186
Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQV 232
EQEL+DCD ++GC GG + AF+ I++ GG++ E+ YPYR D+ C + K A V
Sbjct: 187 EQELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEEDYPYRAADQKCDQYRKNANVV 244
Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSH 290
I+GY V ++ K V P++VAI A A Q Y +GV F +L H
Sbjct: 245 SIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGV------FTGKCGTSLDH 298
Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
V VGYG T YWI+ NSWG+ WGE GY R+ R G CGI
Sbjct: 299 GVAAVGYG------TENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGI 346
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 174/308 (56%), Gaps = 34/308 (11%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ +L E R ++F N++ I E+ S LN+F D+++ EF+ Y G +
Sbjct: 44 HHTIARSLEEKAKRFNVFKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNI 102
Query: 109 KPS---YADRSVPA--MIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
K +R M N+ TLP + DWR+ AVT VK+Q CGS WAFST +EG+
Sbjct: 103 KHHRMFQGERQTTKSFMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGI 162
Query: 163 YAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
+TKKL SLSEQEL+DCD ++ GC GG + AF+ I K GGL E YPY+ D+
Sbjct: 163 NQIRTKKLTSLSEQELVDCDTNKNQGCNGGLMDLAFEFIKEK--GGLTSELVYPYKASDE 220
Query: 222 ACRLNKK-ATQVKINGYVSVSRD-ETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
C NK+ A V I+G+ V ++ E D+ K V + P++VAI+A QFY GV
Sbjct: 221 TCDTNKENAPVVSIDGHEDVPKNSEVDLMK-AVAHQPVSVAIDAGGSDFQFYSEGV---- 275
Query: 278 QFFCDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
F G E L+H V +VGYG +D TK YWI+KNSWGE WGEKGY R+ RG
Sbjct: 276 -FTGRCGTE-LNHGVAVVGYGTTIDGTK-------YWIVKNSWGEEWGEKGYIRMQRGIR 326
Query: 334 --DGSCGI 339
+G CGI
Sbjct: 327 HKEGLCGI 334
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 131/329 (39%), Positives = 170/329 (51%), Gaps = 28/329 (8%)
Query: 22 FMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
F +VG E L + K LF ++ +H K Y + E R IF NL+ I
Sbjct: 28 FSIVGYSSEDLKSMD--KLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKV 85
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAV 138
+ GLNEF+DLS EF KYLG K+ S S ++ LP++ DWR+ AV
Sbjct: 86 VSNYWL-GLNEFADLSHREFNNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAV 144
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAF 197
VK+Q CGS WAFST +EG+ T L SLSEQELIDCD+ ++GC GG + AF
Sbjct: 145 APVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAF 204
Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGP 256
I+ GGL +E+ YPY ++ C + K+ TQ V I+GY V ++ + N P
Sbjct: 205 SFIVEN--GGLHKEEDYPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQP 262
Query: 257 MAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
++VAI A QFY GV F +L H V VGYG T K V Y +
Sbjct: 263 LSVAIEASGRDFQFYSGGV------FDGHCGSDLDHGVAAVGYG------TAKGVDYITV 310
Query: 315 KNSWGEGWGEKGYFRLYRG----DGSCGI 339
KNSWG WGEKGY R+ R +G CGI
Sbjct: 311 KNSWGSKWGEKGYIRMRRNIGKPEGICGI 339
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 171/315 (54%), Gaps = 28/315 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAE 98
+N F QH K Y + E RL I+ N KI + Q + G Y L N+++DL E
Sbjct: 27 WNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEE 86
Query: 99 FQAKYLGFK-------LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSS 150
F GF LK + V + P N+ +P DWR+ AVT VKDQ CGS
Sbjct: 87 FVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSC 146
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGL 208
W+FS TG +EG + KT KLVSLSEQ L+DC + ++GC GG + AF I K GG+
Sbjct: 147 WSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI--KDNGGI 204
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--A 265
+ EK+YPY D C N KA GYV + + DE + K L GP+++AI+A +
Sbjct: 205 DTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHES 264
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
QFY GV + Q CD +ENL H VL VGYG + + YW++KNSWG WG++
Sbjct: 265 FQFYSEGVYYEPQ--CD--SENLDHGVLAVGYGT-----SEEGEDYWLVKNSWGTTWGDQ 315
Query: 326 GYFRLYRG-DGSCGI 339
GY ++ R D CG+
Sbjct: 316 GYVKMARNRDNHCGV 330
>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 123/306 (40%), Positives = 163/306 (53%), Gaps = 20/306 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
F H KTY ++VE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85
Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K G PS A ++ A DWRE AVT VKDQ CGS WAFS G I
Sbjct: 86 LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
EG + K LVSLS QEL+DC E+ +GC GG + AFD + + G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPY 202
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +C+ + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 203 EGRRSSCKKSGDYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL + +
Sbjct: 261 -KCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313
Query: 337 CGINDY 342
CGI+ Y
Sbjct: 314 CGIDYY 319
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 114/263 (43%), Positives = 153/263 (58%), Gaps = 22/263 (8%)
Query: 88 LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQT 145
LN F D+ AEF++ + G + + +S+P I + +P+A DWR+ AVTGVKDQ
Sbjct: 96 LNRFGDMDQAEFRSTFAGPLHRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGVKDQG 155
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSK 203
CGS WAFS ++EG+ A +T LVSLSEQELIDCD +D+GC+GG + +AF+ I +
Sbjct: 156 KCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEFI-AH 214
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
GGL E YPY + C N+ ++ V+I+G+ SV + V + P++VAI+
Sbjct: 215 SAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVSVAID 274
Query: 263 A--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
A A QFY GV F D G+E L H V +VGYGV YWI+KNSWG
Sbjct: 275 AGGQAFQFYSEGV-----FTGDCGSE-LDHGVAVVGYGVAE----EDGKEYWIVKNSWGP 324
Query: 321 GWGEKGYFRLYRGDGS----CGI 339
GWGE GY R+ R G CGI
Sbjct: 325 GWGEHGYVRMQRDSGVDGGLCGI 347
>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 124/306 (40%), Positives = 161/306 (52%), Gaps = 20/306 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
F H KTY +LVE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85
Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K G PS A ++ A DWRE AVT VKDQ CGS WAFS G I
Sbjct: 86 LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
EG + K LVSLS QEL+DC E+ +GC GG + AFD + + G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPY 202
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +C+ + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 203 EGRRSSCKKSGDYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL + +
Sbjct: 261 T-CRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313
Query: 337 CGINDY 342
CGI Y
Sbjct: 314 CGIGYY 319
>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 124/306 (40%), Positives = 161/306 (52%), Gaps = 20/306 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
F H KTY +LVE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85
Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K G PS A ++ A DWRE AVT VKDQ CGS WAFS G I
Sbjct: 86 LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
EG + K LVSLS QEL+DC E+ +GC GG + AFD + + G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPY 202
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +C+ + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 203 EGRRSSCKKSGDYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL + +
Sbjct: 261 T-CRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313
Query: 337 CGINDY 342
CGI Y
Sbjct: 314 CGIGYY 319
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 120/312 (38%), Positives = 167/312 (53%), Gaps = 35/312 (11%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F ++ ++ + Y E R IF N+ I+ + S G+N+F+D++ EF
Sbjct: 37 FEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVT 96
Query: 102 KYLG------FKLKP--SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
+Y G FK +P S+ D ++ A + ++ DWR+Y AVT VKDQ CGS WAF
Sbjct: 97 QYTGVSLPLNFKREPVVSFDDVNISA------VGQSIDWRDYGAVTEVKDQNPCGSCWAF 150
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S +EG+Y T LVSLSEQE++DC +GC+GG + NA+D I+S G+ E
Sbjct: 151 SAIATVEGIYKIVTGYLVSLSEQEVLDC-AVSNGCDGGFVDNAYDFIISN--NGVASEAD 207
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYV 270
YPY+ + C N I GY V S DE+ M KY V N P+A AI+A Q+Y
Sbjct: 208 YPYQAYEGDCTANSWPNSAYITGYSYVRSNDESSM-KYAVWNQPIAAAIDASGDNFQYYN 266
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GV F +L+H++ I+GYG D + YWI+KNSWG WGE+GY R+
Sbjct: 267 GGV------FSGPCGTSLNHAITIIGYGQDSS-----GTQYWIVKNSWGSSWGERGYVRM 315
Query: 331 YRG---DGSCGI 339
RG G CGI
Sbjct: 316 ARGVSSSGLCGI 327
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 126/331 (38%), Positives = 182/331 (54%), Gaps = 33/331 (9%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYA-TLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
DE+L ++ L++ + QH T + E+ R IF N++ I + + + G
Sbjct: 32 DEELESDESLR--GLYDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSV-NKKDGPYK 88
Query: 86 YGLNEFSDLSTAEFQAKYLGFKL---KPSYADRSVPA---MIPNIT-LPRAFDWREYDAV 138
GLN+F+DLS EF+A ++ K+ K DR V + M N LP + DWR+ AV
Sbjct: 89 LGLNKFADLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAV 148
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
T VK+Q CGS WAFST ++EG+ KT KLVSLSEQ+L+DC +E+ GC GG + NAF
Sbjct: 149 TPVKNQGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKENAGCNGGLMDNAFQ 208
Query: 199 TIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
I+ GG+ E YPY + C ++ K+ I+G+ V + K V +
Sbjct: 209 YIIDN--GGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQ 266
Query: 256 PMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
P+++AI A + QFY TGV F G E L H V++VGYG + + + YWI
Sbjct: 267 PVSIAIEASGHDFQFYSTGV-----FTGKCGTE-LDHGVVVVGYGK-----SPEGINYWI 315
Query: 314 IKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
++NSWG WGE+GY R+ RG +G CGI+
Sbjct: 316 VRNSWGPEWGEQGYIRMQRGIEATEGKCGIS 346
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 131/351 (37%), Positives = 177/351 (50%), Gaps = 38/351 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYS 61
+ LL L ++SS + H H K + A++ +L +H K Y L E
Sbjct: 2 LMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEK 61
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-----KLKPSYADRS 116
R IF NL I ++E+ + GLN F+DL+ EF++ YLG K P +DR
Sbjct: 62 RFEIFKDNLMFIDQ-HNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRY 120
Query: 117 VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
P + +LP + DWR+ AV VKDQ CGS WAFST +EG+ T L++LSEQ
Sbjct: 121 APRV--GDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQ 178
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQVKI 234
EL+DCD ++GC GG + AF+ I++ GG++ E YPY G D C K A V I
Sbjct: 179 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDTEDDYPYLGRDGRCDTYRKNAKVVSI 236
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+ Y V ++ K V N P++VAI Q Y +GV F +L H V
Sbjct: 237 DSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGV------FTGECGTSLDHGV 290
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
VGYG T K YWI++NSWG+ WGE GY R+ R G CGI
Sbjct: 291 AAVGYG------TEKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGI 335
>gi|334347644|ref|XP_001379528.2| PREDICTED: cathepsin W-like [Monodelphis domestica]
Length = 619
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 167/315 (53%), Gaps = 26/315 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q+NK+YA E R IF+ NL Q L + G +G+ +FSDL+ EF
Sbjct: 266 FKAFQIQYNKSYADPAEQERRFEIFADNLAWAQQLTEKHGGMAQFGVTQFSDLTEEEFHQ 325
Query: 102 KYLGFK---LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
Y + +PS R P + L R+ DWR+ +T V+ Q C S WA + GN
Sbjct: 326 HYQPAQSSYKEPSLKTRKHPRL--QRPLIRSCDWRKAGVLTPVRKQKKCRSCWAIAAVGN 383
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+E ++A ++ LS QE++DCD+ C+GG + +AF TI+ + GL E+ YPY+
Sbjct: 384 VEALWAIHYEQHFELSVQEVLDCDRCGKACKGGFVWDAFLTILRQR--GLARERDYPYQD 441
Query: 219 DDKACRLNKKATQVK------INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
+L++K Q K I ++ + ++E MA++L GP+ V IN L+ Y G
Sbjct: 442 -----QLSRKGCQKKQNRTGWIQDFLMLPKEENAMAEHLALKGPITVTINQALLKTYRKG 496
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V P D + HSVL+VG+G + K YWI+KNSWG WGE+GYFRL R
Sbjct: 497 VIRPKD---DCDPNQVDHSVLLVGFGQN-----TKDGAYWILKNSWGSDWGEEGYFRLRR 548
Query: 333 GDGSCGINDYVRSAL 347
G +CGI Y +AL
Sbjct: 549 GTNACGITKYPVTAL 563
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 118/313 (37%), Positives = 170/313 (54%), Gaps = 30/313 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
+ +L +H + Y L E R IF NLR I+ ++ + + GLN+F+DL+ E++
Sbjct: 50 YEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADLTNEEYRT 109
Query: 102 KYLGFK--LKPSYADRSVP----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
YLG K + + P A PN +P + DWR+ AV +K+Q CGS WAFST
Sbjct: 110 MYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFST 169
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
+ G+ T ++++LSEQEL+DCD+ ++ GC GG + AF+ I+S GG++ EK Y
Sbjct: 170 VAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN--GGMDTEKHY 227
Query: 215 PYRGDDKACR-LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
PYRG + C + K V I+GY V R+E + K V + P+ VAI A A Q Y +
Sbjct: 228 PYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERALQK-AVAHQPVCVAIEASGRAFQLYSS 286
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV F E + H V++VGYG + V YWI++NSWG WGE GY ++
Sbjct: 287 GV------FTGECGEEVDHGVVVVGYG------SEDGVDYWIVRNSWGTKWGENGYVKME 334
Query: 332 RGD-----GSCGI 339
R G CGI
Sbjct: 335 RNVKKSHLGKCGI 347
>gi|334265690|ref|YP_004376219.1| cathepsin [Clostera anachoreta granulovirus]
gi|315451014|gb|ADU24593.1| cathepsin [Clostera anachoreta granulovirus]
gi|327553705|gb|AEB00299.1| cathepsin [Clostera anachoreta granulovirus]
Length = 332
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 122/322 (37%), Positives = 172/322 (53%), Gaps = 29/322 (9%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
LF F+ NKTY++ E R IF NL I ++ E + +N +SDL +
Sbjct: 27 TLFEEFVTNFNKTYSSQDEKLIRYEIFKKNLALINN-KNMESKHATFDINIYSDLHKNDL 85
Query: 100 QAK----YLGFKLKPSYAD---RSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCG 148
+ +G K P + R + P+ LP FDWR + VT VKDQ CG
Sbjct: 86 LHRTTGLRIGLKKNPLFKAITFRECGVQVIGDEPHALLPETFDWRLRNGVTSVKDQLQCG 145
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
+ WAFS GNIE ++ K + LSEQ L++CD ++GC+GG + A + I+ + GGL
Sbjct: 146 ACWAFSALGNIESLHKIKYGVELDLSEQHLVNCDPLNNGCDGGLMHWALENILYE--GGL 203
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQ 267
E+ PY G D C+ K I+G V ++E + + LV NGP++VAI+ +
Sbjct: 204 VAERDEPYFGYDAVCK--PKRLSSTISGCTRFVLQNENRLRELLVVNGPVSVAIDVIDVI 261
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
Y G++ C N L+H+VL+VGYGVD VPYWI+KNSWGE WGE G+
Sbjct: 262 DYKEGIAD----MCHNKN-GLNHAVLLVGYGVDND------VPYWILKNSWGENWGENGF 310
Query: 328 FRLYRGDGSCGI-NDYVRSALV 348
FR+ R SCGI N+Y SA++
Sbjct: 311 FRVQRNVNSCGIMNEYASSAIL 332
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 120/307 (39%), Positives = 168/307 (54%), Gaps = 20/307 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+++++ + YA+ EY R ++ NLR + + H S + ++DLS E+++
Sbjct: 40 FDFWVQTLKRAYASAEEYERRFDVWLDNLRFVHEY-NAGHTSHWLSMGVYADLSQDEYRS 98
Query: 102 KYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K LG+ L R+ P + P+ DW AVT VK+Q +CGS WAFSTTG +
Sbjct: 99 KALGYNADLHEERPLRAAPFLYEGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAV 158
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
EG A T KL SLSEQ L+DCD+E D+GC GG + AF+ IM GG++ E YPY
Sbjct: 159 EGASAIATGKLASLSEQMLVDCDRERDNGCHGGLMDFAFEFIMKN--GGIDTEDDYPYTA 216
Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
++ C+ NK + V I+ Y V ++ V N P++VAI A A Q Y GV
Sbjct: 217 EEGMCQDNKMRRHVVTIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGV-- 274
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
F + G L H VL+VGYG H +PYW++KNSWG WG+KGY RL R
Sbjct: 275 ---FDAECGTA-LDHGVLVVGYGTASNGTHH--LPYWLVKNSWGAEWGDKGYIRLLRNLG 328
Query: 334 -DGSCGI 339
+G CG+
Sbjct: 329 EEGQCGV 335
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 171/315 (54%), Gaps = 28/315 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAE 98
+N F QH K Y + E RL I+ N KI + Q + G Y L N+++DL E
Sbjct: 27 WNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEE 86
Query: 99 FQAKYLGFK-------LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSS 150
F GF LK + V + P N+ +P DWR+ AVT VKDQ CGS
Sbjct: 87 FVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSC 146
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGL 208
W+FS TG +EG + KT KLVSLSEQ L+DC + ++GC GG + AF I K GG+
Sbjct: 147 WSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI--KDNGGI 204
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--A 265
+ EK+YPY D C N KA GYV + + DE + K L GP+++AI+A +
Sbjct: 205 DTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHES 264
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
QFY GV + Q CD +ENL H VL VGYG + + YW++KNSWG WG++
Sbjct: 265 FQFYSEGVYYEPQ--CD--SENLDHGVLAVGYGT-----SEEGEDYWLVKNSWGTTWGDQ 315
Query: 326 GYFRLYRG-DGSCGI 339
GY ++ R D CG+
Sbjct: 316 GYVKMARNHDNHCGV 330
>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 124/306 (40%), Positives = 163/306 (53%), Gaps = 20/306 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
F H KTY +LVE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLD 85
Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K G PS A ++ A DWRE AVT VKDQ CGS WAFS G I
Sbjct: 86 LLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
EG + K LVSLS QEL+DC ED +GC+GG + AFD + + G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPY 202
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +C+ + + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 203 EGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ C E+L+ VL+VGYG + V YWI+KNSWG WGEKGYFRL + +
Sbjct: 261 -RCRCSNKREDLNPGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313
Query: 337 CGINDY 342
CGI Y
Sbjct: 314 CGIGYY 319
>gi|14349349|gb|AAC38833.2| cysteine protease [Leishmania chagasi]
Length = 353
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 127/360 (35%), Positives = 177/360 (49%), Gaps = 41/360 (11%)
Query: 5 YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
+FFA V + V S ++ + + +A + F ++H K + E R +
Sbjct: 6 FFFAIVVTILFVVCYGSALIA--QTPLGVDDFIASAHYGRFKKRHGKPFGEDAEEGRRFN 63
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
F N++ L + +F+DL+ EF YL P+Y
Sbjct: 64 AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 119
Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
D SV + + ++ DWRE VT VK+Q MCGS WAF+TTGNIEG +A K LV
Sbjct: 120 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 173
Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
SLSEQ L+ CD DDGC GG + A I++ G + E +YPY G C N
Sbjct: 174 SLSEQVLVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 232
Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
KI GY+S+ DE ++A Y+ +NGP+AVA++A Q Y GV C G +L
Sbjct: 233 TVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVV----TLCFG--LSL 286
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+H VL+VG+ PYWI+KNSWG WGEKGY RL G C + +Y +A +
Sbjct: 287 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYAVTATI 340
>gi|398014254|ref|XP_003860318.1| cysteine peptidase A (CBA) [Leishmania donovani]
gi|13518086|gb|AAK27384.1| cysteine proteinase-like protein [Leishmania donovani]
gi|322498538|emb|CBZ33611.1| cysteine peptidase A (CBA) [Leishmania donovani]
Length = 354
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 127/360 (35%), Positives = 177/360 (49%), Gaps = 41/360 (11%)
Query: 5 YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
+FFA V + V S ++ + + +A + F ++H K + E R +
Sbjct: 7 FFFAIVVTILFVVCYGSALIA--QTPLGVDDFIASAHYGRFKKRHGKPFGEDAEEGRRFN 64
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
F N++ L + +F+DL+ EF YL P+Y
Sbjct: 65 AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 120
Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
D SV + + ++ DWRE VT VK+Q MCGS WAF+TTGNIEG +A K LV
Sbjct: 121 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 174
Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
SLSEQ L+ CD DDGC GG + A I++ G + E +YPY G C N
Sbjct: 175 SLSEQVLVSCDNIDDGCNGGLMEQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 233
Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
KI GY+S+ DE ++A Y+ +NGP+AVA++A Q Y GV C G +L
Sbjct: 234 TVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVV----TLCFG--LSL 287
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+H VL+VG+ PYWI+KNSWG WGEKGY RL G C + +Y +A +
Sbjct: 288 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYAVTATI 341
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 160/313 (51%), Gaps = 28/313 (8%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
++ +L +H + Y L E R IF NL+ I + S GLN+F+DLS E++
Sbjct: 24 IYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYR 83
Query: 101 AKYLGFKLKPSYADRSVPA-----MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
+ YLG ++ P LP DWRE AV VKDQ CGS WAFST
Sbjct: 84 SVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFST 143
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG+ T L SLSEQEL+DCD+ + GC GG + AFD I+ GG++ E+ Y
Sbjct: 144 VGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIEN--GGIDTEEDY 201
Query: 215 PYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
PY+ D C N+K A V I+GY V +++ K V N P++VAI A Q Y +
Sbjct: 202 PYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQS 261
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV F L H V+ VGYG T V YWI++NSWG WGE GY R+
Sbjct: 262 GV------FTGSCGTQLDHGVVTVGYG------TEHGVDYWIVRNSWGPAWGENGYIRME 309
Query: 332 RG-----DGSCGI 339
R G CGI
Sbjct: 310 RDVASTETGKCGI 322
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 131/329 (39%), Positives = 171/329 (51%), Gaps = 28/329 (8%)
Query: 22 FMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
F +VG E L + K LF ++ +H K Y + E R IF NL+ I
Sbjct: 28 FSIVGYSSEDLKSMD--KLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKV 85
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAV 138
+ GL+EF+DLS EF KYLG K+ S S ++ LP++ DWR+ AV
Sbjct: 86 VSNYWL-GLSEFADLSHREFNNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAV 144
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAF 197
VK+Q CGS WAFST +EG+ T L SLSEQELIDCD+ ++GC GG + AF
Sbjct: 145 APVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAF 204
Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGP 256
I+ GGL +E+ YPY ++ AC + K+ TQ V I+GY V ++ + N P
Sbjct: 205 SFIVEN--GGLHKEEDYPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQP 262
Query: 257 MAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
++VAI A QFY GV F +L H V VGYG T K V Y +
Sbjct: 263 LSVAIEASGRDFQFYSGGV------FDGHCGSDLDHGVAAVGYG------TAKGVDYITV 310
Query: 315 KNSWGEGWGEKGYFRLYRG----DGSCGI 339
KNSWG WGEKGY R+ R +G CGI
Sbjct: 311 KNSWGSKWGEKGYIRMRRNIGKPEGICGI 339
>gi|15824704|gb|AAL09448.1| cysteine protease [Leishmania donovani]
Length = 353
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 127/360 (35%), Positives = 177/360 (49%), Gaps = 41/360 (11%)
Query: 5 YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
+FFA V + V S ++ + + +A + F ++H K + E R +
Sbjct: 6 FFFAIVVTILFVVCYGSALIA--QTPLGVDDFIASAHYGRFKKRHGKPFGEDAEEGRRFN 63
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
F N++ L + +F+DL+ EF YL P+Y
Sbjct: 64 AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 119
Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
D SV + + ++ DWRE VT VK+Q MCGS WAF+TTGNIEG +A K LV
Sbjct: 120 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 173
Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
SLSEQ L+ CD DDGC GG + A I++ G + E +YPY G C N
Sbjct: 174 SLSEQVLVSCDNIDDGCNGGLMEQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 232
Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
KI GY+S+ DE ++A Y+ +NGP+AVA++A Q Y GV C G +L
Sbjct: 233 TVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVV----TLCFG--LSL 286
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+H VL+VG+ PYWI+KNSWG WGEKGY RL G C + +Y +A +
Sbjct: 287 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYAVTATI 340
>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 382
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 119/336 (35%), Positives = 177/336 (52%), Gaps = 24/336 (7%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ L + F+ V LH ++ F F ++++++Y E R +F N+ +
Sbjct: 14 VGLHAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYRDATEEAFRFRVFKQNMER 71
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
+ + + +G+ FSD+S EF+A Y G + + R P + N++ P
Sbjct: 72 AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPP 128
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
A DWR+ AVT VKDQ C SSWAFS GNIEG + +L SLSEQ L+ CD D GC
Sbjct: 129 AIDWRKKGAVTPVKDQGQCDSSWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTNDFGC 188
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
GG AF I+S G + E++YPY G+ C + K KI V + RDE
Sbjct: 189 GGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDEN 248
Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+A++L +NGP+A+A++A + Q Y GV ++ ++ +VL+VGY D +K
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISKEMNSAVLLVGYD-DTSK-- 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
PYWIIKNSW +GWGEKGY R+ +G C + +
Sbjct: 300 ---PPYWIIKNSWSKGWGEKGYIRIEKGTNQCLVKN 332
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 164/311 (52%), Gaps = 30/311 (9%)
Query: 47 EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYGLNEFSDLSTAEFQAKYLG 105
+ H++ + E R F N+R I + S LN F D+ EF++ +
Sbjct: 50 QTHHRVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGDMGPEEFRSTFAD 109
Query: 106 FKL-------KPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
++ + S A +VP + + +PR+ DWR++ AVT VK+Q CGS WAFST
Sbjct: 110 SRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKNQGRCGSCWAFSTV 169
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG+ A +T LVSLSEQEL+DCD ++GC+GG + NAFD I S GG+ E YPY
Sbjct: 170 VAVEGINAIRTGSLVSLSEQELVDCDTAENGCQGGLMENAFDFIKSY--GGITTESAYPY 227
Query: 217 RGDDKAC---RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
R + C R + V I+G+ V D V P++VAI+A A QFY
Sbjct: 228 RASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAIDAGGQAFQFYSE 287
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV F D G + L H V +VGYGV T PYWI+KNSWG WGE GY R+
Sbjct: 288 GV-----FTGDCGTD-LDHGVAVVGYGVSDVDGT----PYWIVKNSWGPSWGEGGYIRMQ 337
Query: 332 RGDGS---CGI 339
RG G+ CGI
Sbjct: 338 RGAGNGGLCGI 348
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 109/294 (37%), Positives = 157/294 (53%), Gaps = 17/294 (5%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
G + SY S + + +P DWRE AVT VK+Q CG WAFS G++EG Y
Sbjct: 102 GLNIPNSYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYK 161
Query: 165 AKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y G CR
Sbjct: 162 IATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYLGQQYTCR 219
Query: 225 LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDG 283
+K V+I+ Y V ET + + + + P+++ I A LQFY G DG
Sbjct: 220 SQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-------DG 271
Query: 284 GNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
N ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G+
Sbjct: 272 SCANRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGEDGFMKIIRDSGN 320
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 109/294 (37%), Positives = 157/294 (53%), Gaps = 17/294 (5%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
G + SY S + + +P DWRE AVT VK+Q CG WAFS G++EG Y
Sbjct: 102 GLNIPNSYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYK 161
Query: 165 AKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y G CR
Sbjct: 162 IATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYLGQQYTCR 219
Query: 225 LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDG 283
+K V+I+ Y V ET + + + + P+++ I A LQFY G DG
Sbjct: 220 SQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-------DG 271
Query: 284 GNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
N ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G+
Sbjct: 272 SCANRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGEDGFMKIIRDSGN 320
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 125/344 (36%), Positives = 184/344 (53%), Gaps = 30/344 (8%)
Query: 21 SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
+ VVG + + V+ + F H K Y + E R+ IF N K+ +
Sbjct: 8 ALCVVGSQAVSFFDLVQEQ--WGAFKVTHKKQYESETEERFRMKIFMENAHKV-AKHNKL 64
Query: 81 HGSGV----YGLNEFSDLSTAEFQAKYLGFK-----LKPSYADRSVPAMIP-NITLPRAF 130
+ G+ G+N++SD+ EF G+ L+ D S+ + P N+ LP+
Sbjct: 65 YAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPPANVELPKQI 124
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGC 188
DWR+ AVT VKDQ CGS W+FSTTG++EG + K+KKLVSLSEQ LIDC ++ ++GC
Sbjct: 125 DWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSEKYGNNGC 184
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDM 247
GG + NAF I K GG++ E++YPY+ +D+ C + G+V + S DE +
Sbjct: 185 NGGLMDNAFRYI--KDNGGIDTEQSYPYKAEDEKCHYKPRNKGATDRGFVDIESGDEEKL 242
Query: 248 AKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
+ GP++VAI+A Q Y GV + + C +E L H VL+VGYG D
Sbjct: 243 KAAVATVGPISVAIDASHPTFQQYSEGVYYEPE--C--SSEQLDHGVLVVGYGTDED--- 295
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
YW++KNSWG+ WG++GY ++ R D +CGI LV
Sbjct: 296 --GNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPLV 337
>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
Length = 331
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 169/311 (54%), Gaps = 19/311 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F FL +NK Y E R IF L +I ++ + S VY +N+F+DLS E +
Sbjct: 31 FETFLANYNKMYNDTSEKERRFSIFQQTLEEINY-KNRLNDSAVYQINKFADLSKNEIIS 89
Query: 102 KYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
KY G + + +I P P FDWR+ + VT +K+Q CG+ WAF+T +I
Sbjct: 90 KYTGLNMPVQTTNFCKTIVIDQPPGKGPLNFDWRQQNKVTSIKNQKACGACWAFATLASI 149
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
E YA K + LSEQ++IDCD D GC+GG + AF+ ++ G L +E YPY G
Sbjct: 150 ESQYAIKNNVHIDLSEQQMIDCDYVDMGCDGGLLHTAFEQMIQM--GELVQEHEYPYAGV 207
Query: 220 DKACRLNKKAT-QVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
+K C L T VK+ G Y V E + L GP+ +AI+A + Y G+ H
Sbjct: 208 NKPCELRGDETGVVKVKGCYRYVVFREEKLKDLLRAVGPIPMAIDASGIVNYHHGIIH-- 265
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+C+ N L+H+VL+VGYGV+ VP+W KN+WG+ WGE+GYFR+ + +C
Sbjct: 266 --YCE--NYGLNHAVLLVGYGVENN------VPFWTFKNTWGKDWGEEGYFRVRQNVDAC 315
Query: 338 GINDYVRSALV 348
G+ + + S+ V
Sbjct: 316 GMTNELASSAV 326
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 167/317 (52%), Gaps = 26/317 (8%)
Query: 34 HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
+ + +F +L +++K Y L E R IF NL+ +Q + S GL F+D
Sbjct: 29 RNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFAD 88
Query: 94 LSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSW 151
L+ EF+A YL K++ + + N+ LP DWR AV VKDQ CGS W
Sbjct: 89 LTNEEFRAIYLRSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCW 148
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
AFS G +EG+ KT +LVSLSEQEL+DCD ++GC GG + AF I+S GG++
Sbjct: 149 AFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFIISN--GGIDT 206
Query: 211 EKTYPYRG-DDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YAL 266
E+ YPY DD C +KK T+ V I+GY V +E + K L N P++VAI A
Sbjct: 207 EEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPENENSLKKALA-NQPISVAIEAGGRGF 265
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Q Y +GV F L H V+ VGYG T + YWII+NSWG WGE G
Sbjct: 266 QLYKSGV------FTGTCGTALDHGVVAVGYG------TSEGQDYWIIRNSWGSNWGESG 313
Query: 327 YFRLYRG----DGSCGI 339
Y +L R G CG+
Sbjct: 314 YIKLQRNIKDSSGKCGV 330
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 132/326 (40%), Positives = 174/326 (53%), Gaps = 37/326 (11%)
Query: 38 HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG--------LN 89
+ ALF+ + +H K YAT E +RL +F+ N + + +G G LN
Sbjct: 37 YEALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALN 96
Query: 90 EFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI------TLPRAFDWREYDAVTGVKD 143
F+DL+ EF+A LG + A RS A + +P A DWRE AVT VKD
Sbjct: 97 AFADLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKD 156
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMS 202
Q CG+ W+FS TG +EG+ KT LVSLSEQELIDCD+ + GC GG + A+ ++
Sbjct: 157 QGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVK 216
Query: 203 KLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
GG++ E+ YPYR D C NK K V I+GY V ++ D+ V P++V I
Sbjct: 217 N--GGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGI 274
Query: 262 --NAYALQFYVTGVSHPIQFFCDGGNE-NLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
+A A Q Y Q DG +L H+VLIVGYG + K YWI+KNSW
Sbjct: 275 CGSARAFQLYSQ------QGIFDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNSW 322
Query: 319 GEGWGEKGYFRLYR--GD--GSCGIN 340
GE WG KGY ++R GD G CGIN
Sbjct: 323 GESWGMKGYMHMHRNTGDSKGVCGIN 348
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 157/305 (51%), Gaps = 27/305 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ QH + Y E R IF N+ +I+ H + G+N+F+DL+ EF+ +
Sbjct: 44 WMAQHGRVYKNAAEKAHRFEIFRANVERIESFNAENHKFKL-GVNQFADLTNEEFKTRNT 102
Query: 105 GFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
LKPS + N+T +P DWR AVT +KDQ CGS WAFS EG+
Sbjct: 103 ---LKPSKMASTKSFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGIT 159
Query: 164 AAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
T KL+SLSEQE++DCD +D GC GG + +AF+ I+ G+ E YPY+ D
Sbjct: 160 KLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKN--KGITTEANYPYKAADG 217
Query: 222 ACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
C K A+ I GY V+ + N P+AVAI+A +A Q Y +GV
Sbjct: 218 TCNTKKAASHAASITGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGV----- 272
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
F D G + L H V +VGYG T YW++KNSWG WGE GY R+ R +
Sbjct: 273 FTGDCGTD-LDHGVTLVGYGA-----TSDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKE 326
Query: 335 GSCGI 339
G CGI
Sbjct: 327 GLCGI 331
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 123/320 (38%), Positives = 159/320 (49%), Gaps = 25/320 (7%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
LH ++ ++ + Y E R IF N+ I+ + +NE
Sbjct: 27 RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINE 86
Query: 91 FSDLSTAEFQAKYLGFKLKPSYA-DRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCG 148
F+DL+ EF+A G+K + N+T +P + DWR+ AVT +KDQ CG
Sbjct: 87 FADLTNEEFKASRNGYKRSSNVGLSEKSSFRYGNVTAVPTSMDWRQKGAVTPIKDQGQCG 146
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGG 206
WAFS +EG+ T KL+SLSEQEL+DCD ED GCEGG + +AF+ I K G
Sbjct: 147 CCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFI--KQNG 204
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA-- 263
GL E YPY+G D C NK KI GY V + D V + P++VAI+A
Sbjct: 205 GLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASG 264
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
A QFY GV F D G E L H V VGYG T YW++KNSWG WG
Sbjct: 265 SAFQFYSGGV-----FTGDCGTE-LDHGVTAVGYG------TSDGTKYWLVKNSWGTSWG 312
Query: 324 EKGYFRLYRG----DGSCGI 339
E GY R+ R +G CGI
Sbjct: 313 EDGYIRMERDIEAKEGLCGI 332
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 169/317 (53%), Gaps = 34/317 (10%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
+L+ +L +H K Y L E R IF NLR I + ++ + GLN F+DL+ E+
Sbjct: 2 SLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDD-HNADNRTYKLGLNRFADLTNEEY 60
Query: 100 QAKYLGFKLKP--------SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
+A+YLG ++ P + ++R P + N LP + DWR AV VKDQ CGS W
Sbjct: 61 RARYLGTRIDPNRRFVKTKTQSNRYAPRVGDN--LPESVDWRNESAVLPVKDQGNCGSCW 118
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
AFST G +EG+ T L+SLSEQEL+DCD + GC GG + A++ I++ GG++
Sbjct: 119 AFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINN--GGIDS 176
Query: 211 EKTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQ 267
E+ YPYR D C + K A V I+ Y V ++ K V N P++VAI Q
Sbjct: 177 EEDYPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQ 236
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
YV+GV F L H V+ VGYG + K YWI++NSWG WGE+GY
Sbjct: 237 LYVSGV------FTGRCGTALDHGVVAVGYG------SVKGHDYWIVRNSWGASWGEEGY 284
Query: 328 FRLYRG-----DGSCGI 339
RL R G CGI
Sbjct: 285 VRLERNLAKSRSGKCGI 301
>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 123/306 (40%), Positives = 162/306 (52%), Gaps = 20/306 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
F H KTY ++VE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85
Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
K G PS A ++ A DWRE AVT VKDQ CGS WAFS G I
Sbjct: 86 LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
EG + K LVSLS QEL+DC E+ +GC GG + AFD + + G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPY 202
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
G +C+ + K+ YV DE +MA+ + GP+AVAI A L FY G+
Sbjct: 203 EGRRSSCKKSGDYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
+ C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL + +
Sbjct: 261 -KCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313
Query: 337 CGINDY 342
CGI Y
Sbjct: 314 CGIGYY 319
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 121/321 (37%), Positives = 164/321 (51%), Gaps = 41/321 (12%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
A + ++ H K Y L E R IF N+ +I+ E G N+FSDL+ EF
Sbjct: 40 ARHDQWIVHHEKVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEF 99
Query: 100 QAKYLGFKLKPSYADRSVPAMIP-----------NIT-LPRAFDWREYDAVTGVKDQTMC 147
+ + G+K RS P ++ N+T +P DWR+ AVT +KDQ C
Sbjct: 100 RVLHTGYK-------RSHPKVMTSSKGKTHFRYTNVTDIPPTMDWRKKGAVTPIKDQKEC 152
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLG 205
G WAFS +EG++ KT +L+ LSEQEL+DCD ED+GC GG + AFD I+
Sbjct: 153 GCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKN-- 210
Query: 206 GGLEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN-- 262
GL E YPY+G+D C K A + KI GY V + V N P++VAI+
Sbjct: 211 KGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGS 270
Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
++ QFY +GV F + L+H+V VGYG T YWIIKNSWG W
Sbjct: 271 SFDFQFYSSGV------FSGSCSTWLNHAVTAVGYGA-----TTDGTKYWIIKNSWGSKW 319
Query: 323 GEKGYFRLYRG----DGSCGI 339
G+ GY R+ R +G CG+
Sbjct: 320 GDSGYMRIKRDVHEKEGLCGL 340
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 120/307 (39%), Positives = 168/307 (54%), Gaps = 22/307 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H KTY T E R I++ NL ++ + E+ S +N F+DL+ EF+ +++G++
Sbjct: 34 HGKTY-TGEEEDLRRAIWNDNLEIVKK-HNAENHSYKLDMNHFADLTVTEFKQRFMGYRA 91
Query: 109 KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK 168
+ S + N+ LP DWR+ VT VK+Q CGS WAFS+TG++EG + KT
Sbjct: 92 ASNSTGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTG 151
Query: 169 KLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN 226
KLVSLSEQ L+DC ++ ++GCEGG + AF I K G++ E++YPY D C
Sbjct: 152 KLVSLSEQNLVDCSKKYGNNGCEGGLMDYAFKYI--KNNDGIDTEQSYPYTARDGQCHFK 209
Query: 227 KKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCD 282
+ + GY V R E D+ + GP++VAI+A + Q Y TGV S P D
Sbjct: 210 PGSVGATVTGYTDVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEP-----D 264
Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIND 341
+ L H VL VGYG + K YW++KNSWGEGWG GY ++ R D CGI
Sbjct: 265 CSSTQLDHGVLAVGYGAEDGK------DYWLVKNSWGEGWGMNGYIKMSRNKDNQCGIAT 318
Query: 342 YVRSALV 348
LV
Sbjct: 319 QASYPLV 325
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 132/358 (36%), Positives = 181/358 (50%), Gaps = 38/358 (10%)
Query: 1 MSCFYFFAGVALLSL-TVSVSSFMVVGDEKLHHLHHVKHT-----ALFNYFLEQHNKTYA 54
M+ F F LL L + S ++G ++ H T A++ +L +H K+Y
Sbjct: 10 MAVFLFL----LLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYN 65
Query: 55 TLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKP 110
L E R IF NLR I + E+ + GLN F+DL+ E+++ YLG K +
Sbjct: 66 ALGEKERRFQIFKDNLRFIDE-HNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRS 124
Query: 111 SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
S A +LP + DWR+ AV VKDQ CGS WAFST +EG+ T L
Sbjct: 125 SNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGL 184
Query: 171 VSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKK 228
+SLSEQEL+DCD ++GC GG + AF+ I++ GG++ E+ YPY+ D C + K
Sbjct: 185 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEEDYPYKASDGRCDQYRKN 242
Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNE 286
A V I+GY V ++ + V N P++VAI A Q Y +G+ F
Sbjct: 243 AXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGI------FTGRCGT 296
Query: 287 NLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-----GDGSCGI 339
L H V VGYG T V YWI+KNSWG WGE+GY R+ R G CGI
Sbjct: 297 ALDHGVTAVGYG------TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGI 348
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 114/302 (37%), Positives = 159/302 (52%), Gaps = 20/302 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ QH + Y + E R IF N+ +I+ + G+N+F+DL+ EF+A Y
Sbjct: 8 WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYH 67
Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
G+K + S S +P + DWR AVT VKDQ CG WAFST IEG+
Sbjct: 68 GYKRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIK 127
Query: 165 AKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
+T L+SLSEQ+L+DC + GC+GG + AF I+ GGL E YPY+G D C
Sbjct: 128 LQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRN--GGLTSEDNYPYQGVDGTCS 185
Query: 225 LNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFC 281
K A T+ +I GY V ++ + V P++VA++ +FY +GV F
Sbjct: 186 SEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYKSGV-----FEG 240
Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSC 337
D G NL+H V +GYG D YW++KNSWG WGE GY R+ RG +G C
Sbjct: 241 DCGT-NLNHGVTAIGYGTD-----SDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLC 294
Query: 338 GI 339
G+
Sbjct: 295 GV 296
>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
Length = 336
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 129/333 (38%), Positives = 176/333 (52%), Gaps = 34/333 (10%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
+ H + + QH K Y T E YSR F N KI EH S
Sbjct: 18 LPHNKEWEMWKLQHGKQYETEAEEYSRRFTFEKNTIKI-----AEHNIRASLGMHSYTLA 72
Query: 88 LNEFSDLSTAEFQAKYLGFKLKPSYADR-----SVPAMIPNITLPRAFDWREYDAVTGVK 142
+N+F D+ EF + +G LK ++ V N TLP++ DWR V+ VK
Sbjct: 73 MNKFGDMHHEEFHQRIMGGCLKIVKVNKPLLGSEVGDNDDNGTLPKSVDWRNSAMVSEVK 132
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTI 200
DQ CGS WAFSTTG++EG +A KT KLV LSEQ+L+DC ++ + GC GG + AF I
Sbjct: 133 DQGECGSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 192
Query: 201 MSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMA 258
K GGL+ E++YPY DDK C+ + + + GY V S +E + + + GP++
Sbjct: 193 --KANGGLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPIS 250
Query: 259 VAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
VAI+A + QFY +GV Q C +E L H VL+VGYG +H+A +WI+KN
Sbjct: 251 VAIDAGHESFQFYSSGVYDEPQ--CS--SEQLDHGVLVVGYGA-MNDNSHQA--FWIVKN 303
Query: 317 SWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
SWG WG++GY + R D CGI LV
Sbjct: 304 SWGPNWGDQGYIMMSRNKDNQCGIATSASYPLV 336
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 131/354 (37%), Positives = 183/354 (51%), Gaps = 30/354 (8%)
Query: 1 MSCFYFFAGVALLSLTVSV--SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
++ F ++ +L S F +VG K LF ++ +H+K Y ++ E
Sbjct: 8 LTKFSLLVAISASALLCSALARDFSIVGYTPEQLTSTEKLLELFESWMSEHSKVYKSVEE 67
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
R +F NL I ++ E S GLNEF+DL+ EF+ +YLG KP ++ + P
Sbjct: 68 KVHRFEVFRENLMHIDQ-RNNEINSYWLGLNEFADLTHEEFKGRYLGLA-KPQFSRKRQP 125
Query: 119 AM---IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
+ +IT LP++ DWR+ AV VKDQ CGS WAFST +EG+ T L SLS
Sbjct: 126 SANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLS 185
Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQV 232
EQELIDCD + GC GG + AF I+S GGL +E YPY ++ C+ K+ +V
Sbjct: 186 EQELIDCDTTFNSGCNGGLMDYAFQYIIST--GGLHKEDDYPYLMEEGICQEQKEDVERV 243
Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSH 290
I+GY V ++ + + + P++VAI A QFY GV F +L H
Sbjct: 244 TISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGV------FNGQCGTDLDH 297
Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
V VGYG + K Y I+KNSWG WGEKG+ R+ R +G CGIN
Sbjct: 298 GVAAVGYG------SSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGIN 345
>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 331
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 130/337 (38%), Positives = 177/337 (52%), Gaps = 31/337 (9%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
LL L+ SV S M DE H + + H K Y T+ E R I+ NLR
Sbjct: 8 LLLLSASVMSQM---DETTLDAH-------WEEWKMTHTKEYITVEEEGIRRAIWEKNLR 57
Query: 72 KIQLL-QDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN--ITL 126
I+ Q+ G Y G+N+F D++ E + G ++ P + VP I L
Sbjct: 58 MIEAHNQEAALGMHTYTLGMNQFGDMTQEEVVERMTGLQM-PLNPEPRVPMETDGSLIKL 116
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P++ D+R+ VT VK+Q CGS WAFS+ G +EG A KT LV LS Q L+DC E+D
Sbjct: 117 PKSVDYRKKGMVTSVKNQGSCGSCWAFSSVGALEGQLAKKTGNLVDLSPQNLVDCVTEND 176
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DET 245
GC GG ++NAF + GG++ E YPY G+D+ CR N +I GY V DE
Sbjct: 177 GCGGGYMTNAFKYVQEN--GGIDSEAAYPYMGEDQPCRYNVSGLAAQIKGYKEVPEGDEH 234
Query: 246 DMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
+A L + GP++V I+A F Y G I F + E+++H+VL VGYGV+
Sbjct: 235 ALAVALFKAGPVSVGIDASQNSFLYYQKG----IYFDRNCNKEDINHAVLAVGYGVNA-- 288
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS-CGI 339
K +WI+KNSWGE WG KGY + R G+ CGI
Sbjct: 289 ---KGKKFWIVKNSWGETWGNKGYVLMARNRGNVCGI 322
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
F +H K Y E RL IF+ N KI + Q G + L N+++DL EF+
Sbjct: 32 FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF + AD S + ++TLP++ DWR AVT VKDQ CGS WAF
Sbjct: 92 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S+TG +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
K+YPY D +C NK G+ + + DE MA+ + GP+AVAI+A + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASHESFQF 269
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + Q CD +NL H VL+VG+G D + YW++KNSWG WG+KG+
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 320
Query: 329 RLYRG-DGSCGI 339
++ R + CGI
Sbjct: 321 KMLRNKENQCGI 332
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
F +H K Y E RL IF+ N KI + Q G + L N+++DL EF+
Sbjct: 62 FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 121
Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF + AD S + ++TLP++ DWR AVT VKDQ CGS WAF
Sbjct: 122 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 181
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S+TG +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ E
Sbjct: 182 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 239
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
K+YPY D +C NK G+ + + DE MA+ + GP++VAI+A + QF
Sbjct: 240 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 299
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + Q CD +NL H VL+VG+G D + YW++KNSWG WG+KG+
Sbjct: 300 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 350
Query: 329 RLYRG-DGSCGI 339
++ R + CGI
Sbjct: 351 KMLRNKENQCGI 362
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 137/354 (38%), Positives = 185/354 (52%), Gaps = 38/354 (10%)
Query: 7 FAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYAT-LVEYYSRLHI 65
F +AL L + ++ + L V+ F + QH +TY+ EY RL +
Sbjct: 5 FLALALAGLVGLSCAHALLSSADMLALAQVEPERAFGLWATQHARTYSEGSPEYTRRLGV 64
Query: 66 FSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFK-----LKPSYADRSVPA 119
F+ N+R I + +G+ LNE++D + EF AK LG K LK A S +
Sbjct: 65 FADNVRAIA--EQNRRNTGITLALNEYADETWEEFAAKRLGLKISQEQLKAREARSSSSS 122
Query: 120 MI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSE 175
+ P A DWR +AVT VK+Q CGS WAFS G+IEG A T +LV+LSE
Sbjct: 123 SSSWRYAQVQTPAAVDWRAKNAVTQVKNQGQCGSCWAFSAVGSIEGANALATGQLVALSE 182
Query: 176 QELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQ 231
Q+L+DCD + GC GG + +AF ++ GG++ E+ Y Y G C K+ +
Sbjct: 183 QQLVDCDTASNMGCSGGLMDDAFKYVLDN--GGIDTEEDYSYWSGYGFGFWCNKRKQTDR 240
Query: 232 --VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGGNENL 288
V I+GY V E + K V P+AVAI A A +QFY +GV I C+G L
Sbjct: 241 PAVSIDGYEDVPTSEPALLK-AVAGQPVAVAICASANMQFYSSGV---INSCCEG----L 292
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS---CGI 339
+H VL VGY + KA PYWI+KNSWG WGE+GYFRL G+G CGI
Sbjct: 293 NHGVLAVGYDT-----SDKAQPYWIVKNSWGGSWGEQGYFRLKMGEGPKGLCGI 341
>gi|1581745|prf||2117247A Cys protease:ISOTYPE=1
Length = 467
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 114/315 (36%), Positives = 163/315 (51%), Gaps = 22/315 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL-LQDTEHGSGVYGLNEFSDLSTAEFQ 100
F F ++H K Y + E RL +F NL +L H S + + FSDL+ EF+
Sbjct: 38 FAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHAS--FAVTPFSDLTREEFR 95
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPR----AFDWREYDAVTGVKDQTMCGSSWAFSTT 156
++Y + A + V + A DWR AVT +KDQ C S WAFST
Sbjct: 96 SRYHNAAAHFAAAQKRVRVPVEVEVEVGGPPAAVDWRARGAVTAIKDQGNCSSCWAFSTI 155
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIEG + L LSEQ L+ CD D+GC+GG + +AFD I+ + G + E +Y Y
Sbjct: 156 GNIEGQWHLAGNPLTGLSEQMLVSCDNADNGCDGGLMDSAFDWIVEQNNGSVYTEASYSY 215
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
GD + C ++ I+G+V + +DE MA +L NGP+A+A++A + Y GV
Sbjct: 216 VSGGGDSQTCDMSDHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTGGV 275
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+ ++ L H V++VGY PYWIIKNSWG WGE+GY R+ +G
Sbjct: 276 ------LTNCVSDQLDHGVVLVGYNDSSNP------PYWIIKNSWGADWGEEGYIRIQKG 323
Query: 334 DGSCGINDYVRSALV 348
C + +Y SA+V
Sbjct: 324 TNQCLVKNYACSAVV 338
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 191/352 (54%), Gaps = 33/352 (9%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
+ +A +A LS ++ + F + G+E V+ LF+ + E+H + Y E R
Sbjct: 12 LFIWASLACLSSSLP-TEFYITGEE-FASEERVRE--LFHLWKERHKRVYKHAEETAKRF 67
Query: 64 HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPS-----YADRSVP 118
IF NL+ + + ++++ G+N+F+D+S EF+ KYL KP Y RS+
Sbjct: 68 EIFKENLKYV-IERNSKGHRHTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQ 126
Query: 119 AM--IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ P + DWR+ VTG+KDQ CGS WAFS+TG +EG+ A T L+SLSEQ
Sbjct: 127 QKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQ 186
Query: 177 ELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKIN 235
EL+DCD + GCEGG + AF+ ++S GG++ E YPY G D C K+ T+ V I+
Sbjct: 187 ELVDCDTTNYGCEGGYMDYAFEWVISN--GGIDSESDYPYTGTDGTCNTTKEDTKVVSID 244
Query: 236 GYVSVSRDETDMAKYLVE-NGPMAVAINAYAL--QFYVTGVSHPIQFFCDGGNENLSHSV 292
GY V DE+D A N P++V ++ AL Q Y +G+ +++ H+V
Sbjct: 245 GYKDV--DESDSALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSD---DPDDIDHAV 299
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD----GSCGIN 340
LIVGYG + ++ YWI KNSWG WG +GYF + R G C IN
Sbjct: 300 LIVGYGSEDSE------DYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAIN 345
>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
Length = 467
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 112/314 (35%), Positives = 161/314 (51%), Gaps = 18/314 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 35 TSQFAEFKQKHGRVYKSAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93
Query: 99 FQAKYLGFKLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y + A P + + +P A DWR AVT VKDQ CGS WAFS
Sbjct: 94 FRSRYHNGAAHFAAAQERARVPVNVEVVGVPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG +++AF+ I+ + G + E++YPY
Sbjct: 154 GNVESQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNDAFEWIVQENDGAVYTEESYPY 213
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAANGPVAVAVDATSWMTYTGGV 273
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY VPYWIIKNSW WGE GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAPVPYWIIKNSWTTLWGEDGYIRIAKG 321
Query: 334 DGSCGINDYVRSAL 347
C + + SA+
Sbjct: 322 SNQCLVKEEASSAV 335
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 168/321 (52%), Gaps = 28/321 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYGL--NEFSDLSTAEFQA 101
F QH K Y + E R+ IF N K+ E G Y L N+++D+ EF
Sbjct: 30 FKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVH 89
Query: 102 KYLGFK-------LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF L S ++ + P N+ P DWRE+ AVT VKDQ CGS W+F
Sbjct: 90 TVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVDWREHGAVTXVKDQGHCGSCWSF 149
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S TG +EG + KT KLVSLSEQ L+DC + +DGC GG + NAF + K G++ E
Sbjct: 150 SATGALEGQHFRKTNKLVSLSEQNLVDCSTKFGNDGCNGGLMDNAFKYV--KYNHGIDTE 207
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAY--ALQF 268
+YPY DD+ C N K + G+V + + DE + + GP++VAI+A + Q
Sbjct: 208 ASYPYHADDEKCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPVSVAIDASHESFQL 267
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + + C +E L H VL+VGYG D YWI+KNSWGE WGE+GY
Sbjct: 268 YSEGVYYDPE--C--SSEELDHGVLVVGYGTDEN-----GQDYWIVKNSWGESWGEQGYI 318
Query: 329 RLYRG-DGSCGINDYVRSALV 348
++ R D +CGI LV
Sbjct: 319 KMARNRDNNCGIATQASYPLV 339
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 128/351 (36%), Positives = 185/351 (52%), Gaps = 40/351 (11%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
A++++TV+ SS ++ + + F H KTY + +E R IF+ N
Sbjct: 9 AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKTYQSHMEELLRFKIFTEN- 54
Query: 71 RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNI 124
I + ++ G+ G+N+F DL EF + G + K + PA + +
Sbjct: 55 SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHRGTRKTGGSTFLPPANVNDS 114
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
+LP+A DWR+ AVT VKDQ CGS WAFS TG++EG + K +LVSLSEQ L+DC Q
Sbjct: 115 SLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQS 174
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
++GCEGG + +AF I K G++ EK+YPY D CR K+ GYV + +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
E D+ K + GP++VAI+A + Q Y GV P + +E+L H VL+VGYG
Sbjct: 233 GSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
V K YW++KNSW E WG++GY + R + CGI LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 161/303 (53%), Gaps = 24/303 (7%)
Query: 46 LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG 105
+ ++ + Y E R IF N+ I+ + S G+N+F+D++ EF A+Y G
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 106 FKLKPSYADRSVPAMIPNITLP---RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
+P ++ ++ + ++ DWR+Y AVT VKDQ CGS WAFS +EG+
Sbjct: 61 GISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGI 120
Query: 163 YAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
Y T LVSLSEQE++DC +GC+GG + NA+D I+S G+ E YPY+
Sbjct: 121 YKIVTGYLVSLSEQEVLDC-AVSNGCDGGFVDNAYDFIISN--NGVASEADYPYQAYQGD 177
Query: 223 CRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQF 279
C N I GY V S DE+ M KY V N P+A AI+A Q+Y GV
Sbjct: 178 CAANSWPNSAYITGYSYVRSNDESSM-KYAVWNQPIAAAIDASGDNFQYYNGGV------ 230
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGS 336
F +L+H++ I+GYG D + YWI+KNSWG WGE+GY R+ RG G
Sbjct: 231 FSGPCGTSLNHAITIIGYGQDSS-----GTQYWIVKNSWGSSWGERGYIRMARGVSSSGL 285
Query: 337 CGI 339
CGI
Sbjct: 286 CGI 288
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 172/315 (54%), Gaps = 28/315 (8%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
+F + E+H K Y E R F GNL+ I L ++ + + + GLN+F+D+S
Sbjct: 48 IFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYI-LERNAKRKANKWEHHVGLNKFADMSN 106
Query: 97 AEFQAKYLGFKLKPSYA----DRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF+ YL KP R++ + + P + DWR Y VT VKDQ CGS WA
Sbjct: 107 EEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGSCGSCWA 166
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS+TG +EG+ A T L+SLSEQEL++CD + GCEGG + AF+ +++ GG++ E
Sbjct: 167 FSSTGAMEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINN--GGIDSES 224
Query: 213 TYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFY 269
YPY G D C K+ T+ V I+GY V + ++ + + + P++V I+ A+ Q Y
Sbjct: 225 DYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSDSALLCAVAQQ-PVSVGIDGSAIDFQLY 283
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
G+ C +++ H+VLIVGYG + ++ YWI+KNSWG WG GYF
Sbjct: 284 TGGI---YDGSCSDDPDDIDHAVLIVGYGSEDSE------EYWIVKNSWGTSWGIDGYFY 334
Query: 330 LYRGD----GSCGIN 340
L R G C +N
Sbjct: 335 LKRDTDLPYGVCAVN 349
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 164/313 (52%), Gaps = 29/313 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ H +TY + E R +F NLR + + +GV+ GLN F+DL+
Sbjct: 45 MYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDA-HNAAADAGVHSFRLGLNRFADLTN 103
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
E++A YLG + +P R + N LP + DWR AV VKDQ CGS WAFS
Sbjct: 104 DEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFS 163
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +EG+ T ++SLSEQEL+DCD + GC GG + AF+ I++ GG++ E+
Sbjct: 164 TIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEED 221
Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
YPY+G D C +N+K A V I+ Y V + + V N P++VAI A A Q Y
Sbjct: 222 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYN 281
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G+ F L H V VGYG + K YWI+KNSWG WGE GY R+
Sbjct: 282 SGI------FTGTCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVRM 329
Query: 331 YRG----DGSCGI 339
R G CGI
Sbjct: 330 ERNIKASSGKCGI 342
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 165/312 (52%), Gaps = 26/312 (8%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
HN+ YA+ E R I+ NL I S G+NEF DL+ EF AKYLG +
Sbjct: 28 HNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKYLGVRF 87
Query: 109 KPSYADRS------VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
A +S +P M+ +LP + DWR VT VK+Q CGS W+FSTTG++EG
Sbjct: 88 NGVNATKSFASSTYLPRMV---SLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGSVEGQ 144
Query: 163 YAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
+A KT LVSLSEQ L+DC ++ +GC GG + +AF+ I+ GG++ E +YPY
Sbjct: 145 HARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKN--GGIDTEASYPYTATT 202
Query: 221 KACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVSHPI 277
C+ N + Y ++ E+D+ + GP++VAI+A + QFY TGV +
Sbjct: 203 GTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVYNEK 262
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGS 336
+ C L H VL VGYG + + YW++KNSWG WG+ GY + R D
Sbjct: 263 K--CS--TTQLDHGVLAVGYGT-----STEGKDYWLVKNSWGATWGKAGYIWMSRNADNQ 313
Query: 337 CGINDYVRSALV 348
CGI LV
Sbjct: 314 CGIATSASYPLV 325
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
F +H K Y E RL IF+ N KI + Q G + L N+++DL EF+
Sbjct: 32 FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF + AD S + ++TLP++ DWR AVT VKDQ CGS WAF
Sbjct: 92 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S+TG +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
K+YPY D +C NK G+ + + DE MA+ + GP++VAI+A + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 269
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + Q CD +NL H VL+VG+G D + YW++KNSWG WG+KG+
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 320
Query: 329 RLYRG-DGSCGI 339
++ R + CGI
Sbjct: 321 KMLRNKENQCGI 332
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
F +H K Y E RL IF+ N KI + Q G + L N+++DL EF+
Sbjct: 66 FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 125
Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF + AD S + ++TLP++ DWR AVT VKDQ CGS WAF
Sbjct: 126 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 185
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S+TG +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ E
Sbjct: 186 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 243
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
K+YPY D +C NK G+ + + DE MA+ + GP++VAI+A + QF
Sbjct: 244 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 303
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + Q CD +NL H VL+VG+G D + YW++KNSWG WG+KG+
Sbjct: 304 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 354
Query: 329 RLYRG-DGSCGI 339
++ R + CGI
Sbjct: 355 KMLRNKENQCGI 366
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 172/323 (53%), Gaps = 31/323 (9%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH-GSGVY--GLNEFSDLSTAEFQA 101
F +H+K Y + E R+ IF+ N +KI H GS Y G+N++ D+ EF
Sbjct: 32 FKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDMLHHEFVN 91
Query: 102 KYLGFKLKPS----YADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
GF+ S A+R ++ +P++ DWRE AVT VKDQ CGS WA
Sbjct: 92 MMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQGSCGSCWA 151
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEE 210
FS TG +EG + +T LVSLSEQ L+DC + ++GC GG + NAF I K+ GG++
Sbjct: 152 FSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYI--KVNGGIDT 209
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQ 267
EK+YPY +D+ CR N G+V V +E + K + GP++VAI+A + Q
Sbjct: 210 EKSYPYEAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVAIDASQDSFQ 269
Query: 268 FYVTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
FY GV S P D ENL H VL VGYG T YW++KNSW + WG++G
Sbjct: 270 FYQHGVYSDP-----DCSAENLDHGVLAVGYGT-----TEDGQDYWLVKNSWSKSWGDQG 319
Query: 327 YFRLYRGDGS-CGINDYVRSALV 348
Y ++ R + CGI LV
Sbjct: 320 YIKIARNQNNMCGIASAASYPLV 342
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 172/312 (55%), Gaps = 22/312 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
H+K Y E + R+ I+ NL+KI++ + EH G++ G+N F D++ EF+
Sbjct: 36 HSKKYHATEEGWRRV-IWEKNLKKIEM-HNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMN 93
Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
GFK K R M PN I +P DWRE VT VKDQ CGS WAFSTTG +EG
Sbjct: 94 GFKHKKDRRFRGSLFMEPNFIEVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQM 153
Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
KT KLVSLSEQ L+DC + + +GC GG + AF + + GL+ E++YPY G DD
Sbjct: 154 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQ--NGLDSEESYPYLGTDD 211
Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
+ C + K + G+V + S E + K + GP++VAI+A + QFY +G+ +
Sbjct: 212 QPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEK 271
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
+ C +E L H VL VGYG + K YWI+KNSW E WG+KGY + +
Sbjct: 272 E--C--SSEELDHGVLAVGYGFEGEDVDGKK--YWIVKNSWSENWGDKGYIYMAKDRHNH 325
Query: 337 CGINDYVRSALV 348
CGI LV
Sbjct: 326 CGIATAASYPLV 337
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 165/312 (52%), Gaps = 24/312 (7%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
K LF ++ +H K Y ++ E R IF NL+ I + GLNEF+DLS
Sbjct: 43 KLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYWL-GLNEFADLSH 101
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF+ KYLG K+ S S ++ LP++ DWR+ AVT VK+Q CGS WAFST
Sbjct: 102 QEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVTQVKNQGSCGSCWAFST 161
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
+EG+ T L SLSEQELIDCD+ ++GC GG + AF I+ GL +E+ Y
Sbjct: 162 VAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVEN--DGLHKEEDY 219
Query: 215 PYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVT 271
PY ++ C + K+ T+ V I+GY V ++ + N P++VAI A QFY
Sbjct: 220 PYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 279
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV F +L H V VGYG T K V Y +KNSWG WGEKGY R+
Sbjct: 280 GV------FDGHCGSDLDHGVAAVGYG------TAKGVDYITVKNSWGSKWGEKGYIRMR 327
Query: 332 RG----DGSCGI 339
R +G CGI
Sbjct: 328 RNIGKPEGICGI 339
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 129/356 (36%), Positives = 185/356 (51%), Gaps = 38/356 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F G L+ L+ ++S ++ DE ++ F H K Y + +E R+ I
Sbjct: 8 FLLGAVLVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKFRMKI 54
Query: 66 FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
+ N K+ +L + S +N+F DL EF++ G++ K S A+ +
Sbjct: 55 YLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 114
Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
M P N+ +P + DWR A+T VKDQ CGS WAFS+TG +EG KT KL+SLSEQ L
Sbjct: 115 MEPANVEVPESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNL 174
Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
IDC + ++GC GG + AF I K G++ E TYPY +D CR N + G
Sbjct: 175 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDNVCRYNPRNRGAIDRG 232
Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
+V + E D K V GP++VAI+A + QFY GV + + CD +++L H VL
Sbjct: 233 FVHIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYY--EPSCD--SDDLDHGVL 288
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
+VGYG D K YW++KNSW E WG++GY ++ R CGI LV
Sbjct: 289 VVGYGSDNGK------DYWLVKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPLV 338
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 124/325 (38%), Positives = 172/325 (52%), Gaps = 30/325 (9%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLST 96
A ++ F +H K+Y + E RL I+ N KI + G Y +NEF D+
Sbjct: 25 AEWSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLH 84
Query: 97 AEFQAKYLGFKLKPSYADRSV-------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
EF + GFK +Y D+ P I + +LP+ DWR AVT VK+Q CGS
Sbjct: 85 HEFVSTRNGFKR--NYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGS 142
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGG 207
WAFS TG++EG + K+ +VSLSEQ L+DC + ++GCEGG + NAF I + G
Sbjct: 143 CWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYI--RANKG 200
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY-- 264
++ EK+YPY G D C K +G+V + ET + K + GP++VAI+A
Sbjct: 201 IDTEKSYPYNGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHE 260
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
+ QFY GV + CD +E+L H VL+VGYG T YW++KNSWG WG+
Sbjct: 261 SFQFYSDGVYDEPE--CD--SESLDHGVLVVGYG------TLNGTDYWLVKNSWGTTWGD 310
Query: 325 KGYFRLYRG-DGSCGINDYVRSALV 348
+GY R+ R CGI LV
Sbjct: 311 EGYIRMSRNKKNQCGIASSASYPLV 335
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 194 bits (492), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 130/351 (37%), Positives = 181/351 (51%), Gaps = 37/351 (10%)
Query: 7 FAGVALLSLT-VSVSSFMVVGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRL 63
F +AL++L+ +S++ + ++ L +L+N + H+ L E R
Sbjct: 6 FIALALVALSFLSIAQSIPFTEKDL-----ASEDSLWNLYEKWRTHHTVARDLDEKNRRF 60
Query: 64 HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA---- 119
++F N++ I + LN+F D++ EF++KY G K++ + R +
Sbjct: 61 NVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGS 120
Query: 120 -MIPNI-TLPRA-FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
M N+ +LP A DWR AVTGVKDQ CGS WAFST ++EG+ KT +LVSLSEQ
Sbjct: 121 FMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQ 180
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN-KKATQVKI 234
EL+DCD ++GC GG + AF+ I G+ E +YPY D C N + V I
Sbjct: 181 ELVDCDTSYNEGCNGGLMDYAFEFIQKN---GITTEDSYPYAEQDGTCASNLLNSPVVSI 237
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+G+ V + + V N P++V+I A Y QFY GV F G E L H V
Sbjct: 238 DGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGV-----FTGRCGTE-LDHGV 291
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
IVGYG T YWI+KNSWGE WGE GY R+ RG G CGI
Sbjct: 292 AIVGYGA-----TRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGI 337
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 118/313 (37%), Positives = 164/313 (52%), Gaps = 29/313 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ H +TY + E R +F NLR + + +GV+ GLN F+DL+
Sbjct: 45 MYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDA-HNAAADAGVHSFRLGLNRFADLTN 103
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
E++A YLG + +P R + N LP + DWR AV +KDQ CGS WAFS
Sbjct: 104 DEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQGSCGSCWAFS 163
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +EG+ T ++SLSEQEL+DCD + GC GG + AF+ I++ GG++ E+
Sbjct: 164 TIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEED 221
Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
YPY+G D C +N+K A V I+ Y V + + V N P++VAI A A Q Y
Sbjct: 222 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYN 281
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G+ F L H V VGYG + K YWI+KNSWG WGE GY R+
Sbjct: 282 SGI------FTGTCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVRM 329
Query: 331 YRG----DGSCGI 339
R G CGI
Sbjct: 330 ERNIKASSGKCGI 342
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 124/356 (34%), Positives = 180/356 (50%), Gaps = 41/356 (11%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
M F F A + ++++S + +E + H++ ++ +H + YA + E
Sbjct: 6 MQIFLFVAIFSSFCFSITLSR--PLDNELIMQKRHIE-------WMTKHGRVYADVKEEN 56
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
+R +F N+ +I+ L G +N+F+DL+ EF++ Y GFK + + +S
Sbjct: 57 NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTK 116
Query: 120 MIP----NIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
M P N++ LP + DWR+ AVT +K+Q CG WAFS IEG K KL+S
Sbjct: 117 MSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176
Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKA 229
LSEQ+L+DCD D GCEGG + AF+ I K GGL E YPY+G+D C + N KA
Sbjct: 177 LSEQQLVDCDTNDFGCEGGLMDTAFEHI--KATGGLTTESNYPYKGEDATCNSKKTNPKA 234
Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNEN 287
T I GY V ++ V + P++V I + QFY +GV F
Sbjct: 235 TS--ITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGV------FTGECTTY 286
Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
L H+V +GYG + YWIIKNSWG WGE GY R+ + G CG+
Sbjct: 287 LDHAVTAIGYGE-----STNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGL 337
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 169/316 (53%), Gaps = 32/316 (10%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
++ +L +H K Y + E R IF NL+ + ++E+ S GLN F+DL+ E+
Sbjct: 45 GIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDE-HNSENRSYKVGLNRFADLTNEEY 103
Query: 100 QAKYLGFK-------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
++ +LG K +K A R A+ + LP + DWRE AV +KDQ CGS WA
Sbjct: 104 RSMFLGTKTDSKRRFMKSKSASRRY-AVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWA 162
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FST +EGV T +++ LSEQEL+DCD+ D GC GG + AF+ I++ GG++ E
Sbjct: 163 FSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIINN--GGIDTE 220
Query: 212 KTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
+ YPYRG D C +K T+ V IN Y V + K V + P++VAI A A Q
Sbjct: 221 EDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAFQL 280
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y++GV F L H V++VGYG D +WI++NSWG WGE GY
Sbjct: 281 YLSGV------FTGECGRALDHGVVVVGYGTD------NGADHWIVRNSWGTSWGENGYI 328
Query: 329 RLYRG-----DGSCGI 339
R+ R G CGI
Sbjct: 329 RMERNVVDNFGGKCGI 344
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 125/325 (38%), Positives = 162/325 (49%), Gaps = 40/325 (12%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L+ T + ++ ++ + Y T E R IF NL+ IQ + G+NEF+
Sbjct: 30 LNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQTFNKANNKPYKLGVNEFA 89
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI-------TLPRAFDWREYDAVTGVKDQT 145
DL+ EF FK V A + N+ +P DWR+ AVT +K+Q
Sbjct: 90 DLTNEEFTTSRNKFK-------SHVCATVTNVFRYENVTAVPATMDWRKKGAVTPIKNQG 142
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSK 203
CG WAFS +EG+ KT KL+SLSEQEL+DCD ED GCEGG + AFD I
Sbjct: 143 QCGCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQGCEGGLMDYAFDFIQQN 202
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
GL E YPY G D C NK+A I G+ V + V N P++VAI+
Sbjct: 203 H--GLSTETNYPYSGTDGTCNANKEANHAATITGHEDVPANSESALLKAVANQPISVAID 260
Query: 263 AYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV--DRTKFTHKAVPYWIIKNSW 318
A QFY +GV F + G E L H V VGYG D TK YW++KNSW
Sbjct: 261 ASGSDFQFYSSGV-----FTGECGTE-LDHGVTAVGYGTAADGTK-------YWLVKNSW 307
Query: 319 GEGWGEKGYFRLYRG----DGSCGI 339
G WGE+GY ++ RG +G CGI
Sbjct: 308 GTSWGEEGYIQMQRGVAAAEGLCGI 332
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 167/320 (52%), Gaps = 25/320 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLR---KIQLLQDTEHGSGVYGLNEFSDLSTAE 98
+N + +H K Y + E SR I+ NL K L D H + G+N+F+DL E
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEE 87
Query: 99 FQAKYLGFKLKPSYADRSVPAMIP--NI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
F A GF++ + +P NI LP+ DWR VT VKDQ CGS WAFST
Sbjct: 88 FVAMMTGFRVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFST 147
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
TG++EG + T KLVSLSEQ L+DC + ++GC+GG + AF I+ GG++ E++
Sbjct: 148 TGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIK--AGGIDTEES 205
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRD-ETDMAKYLVENGPMAVAINA--YALQFYV 270
YPY+ D C K + GY V+ D ET + K + GP++VAI+A + Q Y
Sbjct: 206 YPYKAVDGECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLYK 265
Query: 271 TGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV + P D + L H VL VGYG T YWI+KNSW E WG GY
Sbjct: 266 SGVYNEP-----DCSSTLLDHGVLAVGYGT-----TSDGTDYWIVKNSWAETWGMNGYLW 315
Query: 330 LYRG-DGSCGINDYVRSALV 348
+ R D CGI LV
Sbjct: 316 MSRNKDNQCGIATQASYPLV 335
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 124/342 (36%), Positives = 183/342 (53%), Gaps = 42/342 (12%)
Query: 23 MVVGDEKLHHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
M + + +HL H + + +++ ++L++H K Y L E R IF NLR I
Sbjct: 1 MSIFNHDDNHLSHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFID 60
Query: 75 LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN--------ITL 126
++++ + GL +F+DL+ E++A +LG + P R + + P+ L
Sbjct: 61 E-HNSQNRTYKVGLTKFADLTNQEYRAMFLGTRSDPKR--RLMKSKNPSERYAYKAGDKL 117
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-ED 185
P + DWR AV +KDQ CGS WAFST +EG+ T +L+SLSEQEL+DCD+ +
Sbjct: 118 PESVDWRGKGAVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYN 177
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDE 244
GC GG + AF I++ GGL+ EK YPY G+D C +K T+ V I+G+ V +
Sbjct: 178 AGCNGGLMDYAFQFIINN--GGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFD 235
Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
+ V + P++VAI A ALQFY +GV F L H V++VGYG
Sbjct: 236 EKALQKAVAHQPVSVAIEASGMALQFYQSGV------FTGECGTALDHGVVVVGYG---- 285
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
T K + YW+++NSWG WGE GY ++ R G CGI
Sbjct: 286 --TEKGLDYWLVRNSWGTEWGEHGYIKMQRNVRDTYTGRCGI 325
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 111/301 (36%), Positives = 168/301 (55%), Gaps = 21/301 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ ++ +TY E R IF NL I+ + + S GLN +SDL++ EF A +
Sbjct: 36 WMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSDLTSEEFIASHT 95
Query: 105 GFKLKPSYADRSVPAM-IP---NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
GFK+ +D + ++ IP N +P FDWRE VT VK+Q CG WAF+ +E
Sbjct: 96 GFKVSDQLSDSKMRSVAIPFNLNDDVPTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVE 155
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G+ K L+SLSEQ+L+DCD++ GC GG AFD+I+ G+ +E YPY+ +D
Sbjct: 156 GIVKIKNGNLISLSEQQLVDCDRQSSGCGGGDFVLAFDSIIKSR--GIVKEDDYPYKAND 213
Query: 221 -KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAIN-AYALQFYVTGVSHPI 277
+ C+L + +INGY V + DE + + +++ P++VAI+ +Y Y+ GV
Sbjct: 214 VQTCQLGQIPGAAQINGYFKVPANDEQQLLRAVLQQ-PVSVAISTSYDFHHYMGGV---- 268
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+ L+H+V I+GYGV + YW+IKNSWGE WGEKGY ++ R +
Sbjct: 269 --YEGSCGPKLNHAVTIIGYGV-----SEAGKKYWLIKNSWGETWGEKGYMKVLRESSAT 321
Query: 338 G 338
G
Sbjct: 322 G 322
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 160/306 (52%), Gaps = 22/306 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F ++ ++ + Y E R IF N+ I+ S G+N+F+D++ +EF A
Sbjct: 37 FEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSEFVA 96
Query: 102 KYLGFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y G +P +R ++ + P++ DWR+Y AV VK+Q CGS WAF+
Sbjct: 97 QYTGGISRPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIAT 156
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG+Y KT LVSLSEQE++DC GC+GG ++ A+D I+S G+ E+ YPY+
Sbjct: 157 VEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISN--NGVTTEENYPYQA 213
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
C N I GY V R++ Y V N P+A I+A Q+Y GV
Sbjct: 214 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGV---- 269
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F +L+H++ I+GYG D + YWI++NSWG WGE GY R+ RG
Sbjct: 270 --FSGPCGTSLNHAITIIGYGQDSS-----GTKYWIVRNSWGSSWGEGGYVRMARGVSSS 322
Query: 334 DGSCGI 339
G+CGI
Sbjct: 323 SGACGI 328
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 174/323 (53%), Gaps = 35/323 (10%)
Query: 36 VKHTALFNYFLE-------QHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYG 87
+ HTAL +YF E Q K+Y E R++++ N RKI + + E+G Y
Sbjct: 13 ISHTALHDYFPEEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYK 72
Query: 88 L--NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKD 143
L N F DL EF+A KLK S ++ + LP DWR+ AVT VKD
Sbjct: 73 LKMNHFGDLMQHEFKALN---KLKRSAKQQNSGEVFRATGGKLPAKVDWRQKGAVTPVKD 129
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIM 201
CGS WAFS+TG++ G K KKLVSLSEQ+L+DC +DGC+GG + AF I
Sbjct: 130 PGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYI- 188
Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVA 260
K GG++ E +YPY +D CR K+ GYV +++ DE + + + E GP++VA
Sbjct: 189 -KGNGGIDTEGSYPYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVA 247
Query: 261 INA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNS 317
I+A + QFY G+ P FC N L H VL+VGYG T YW++KNS
Sbjct: 248 IDAGNLSFQFYSEGIYDEP---FCS--NTELDHGVLVVGYG------TENGQDYWLVKNS 296
Query: 318 WGEGWGEKGYFRLYRG-DGSCGI 339
WG WGE GY ++ R + CGI
Sbjct: 297 WGPSWGENGYIKIARNHNNHCGI 319
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 187/344 (54%), Gaps = 27/344 (7%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
+FA + + ++ VS S+F + + ++ + +L QH + Y E+ I
Sbjct: 7 YFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKR--YERWLVQHGRRYKNRDEWQRHFGI 64
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL-KPSYADRSVPAMIPNI 124
+ N+R I + + ++ S N+F+D++ E++A Y+G + S ++S +
Sbjct: 65 YQSNVRFINYI-NAQNFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSSFKRERSK 123
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
LP + DWR+ AVT V++Q CGS WAFST +EG+ +T KLVSLSEQEL+DCD +
Sbjct: 124 VLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDID 183
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVS 241
++GC GG + NAF I K GG+ + YPY G+ C +K A VKI+GY +V
Sbjct: 184 SGNEGCNGGYMVNAFKFI--KQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETVP 241
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ + + V P++VAI+A Y Q Y G+ FC + L+H+V ++GYG
Sbjct: 242 PNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI---FNGFC---GKQLNHAVTVIGYGE 295
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
D K YW++KNSWG GWGE GY R+ R +G CGI
Sbjct: 296 DNGK------KYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGI 333
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/309 (38%), Positives = 172/309 (55%), Gaps = 28/309 (9%)
Query: 48 QHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
+H K Y E RL IF+ N KI L + S +N+++D+ EF+
Sbjct: 111 EHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMN 170
Query: 105 GFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
GF + AD S + ++TLP++ DWR+ AVTGVKDQ CGS WAFS+T
Sbjct: 171 GFNYTLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSST 230
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ EK+Y
Sbjct: 231 GALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSY 288
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVT 271
PY D +C NK G+V + + +E +A+ + GP++VAI+A + QFY
Sbjct: 289 PYEALDDSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSE 348
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV ++ CD +NL H VL+VG+G D + YW++KNSWG WG+KG+ ++
Sbjct: 349 GVY--VEPACDA--QNLDHGVLVVGFGTDES-----GQDYWLVKNSWGTTWGDKGFIKML 399
Query: 332 RG-DGSCGI 339
R D CGI
Sbjct: 400 RNKDNQCGI 408
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/308 (38%), Positives = 169/308 (54%), Gaps = 23/308 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEF 99
F +F E KTY E+ R IF NL I+ + S Y G+ +F+D+STAEF
Sbjct: 166 FEHFKEHFGKTYEG-DEHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEF 224
Query: 100 QAKYLGFKLKPSYADR----SVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
+ YLG ++ S + + + LP A DWR+ AV+ VKDQ CGS WAFST
Sbjct: 225 RQTYLGLRMNASTIAKLRKLQREVVADDRDLPEAVDWRDKGAVSPVKDQGQCGSCWAFST 284
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
+G IEG + K +L+SLSEQ+++DC D GC GG A + + + GGLE E YP
Sbjct: 285 SGAIEGQHFLKNGELLSLSEQQMVDCSWLDFGCNGGQPMLAMEYV--RFNGGLELETAYP 342
Query: 216 YRGDDKACRLNKKATQVKINGY-VSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTG 272
Y+G +C +KK+ KI G+ ++ E+ + K + + GP++V ++A Q Y +G
Sbjct: 343 YKGVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVGMDASGEDFQHYKSG 402
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
+ +P G L H+VL VGYG T YW++KNSW WGEKGYF+L R
Sbjct: 403 IYNPESCSSIG----LDHAVLAVGYG------TSDDGDYWLVKNSWNTSWGEKGYFKLPR 452
Query: 333 GDGS-CGI 339
G+ CGI
Sbjct: 453 NKGNKCGI 460
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 110/306 (35%), Positives = 161/306 (52%), Gaps = 23/306 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F ++ ++ + Y E R IF N++ I+ S G+N+F+D++ +EF A
Sbjct: 10 FEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKSEFVA 69
Query: 102 KYLGFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y G L P +R ++ + P++ DWR+Y AV VK+Q CGS WAF+
Sbjct: 70 QYTGVSL-PLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIAT 128
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG+Y KT LVSLSEQE++DC GC+GG ++ A+D I+S G+ E+ YPY+
Sbjct: 129 VEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISN--NGVTTEENYPYQA 185
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
C N I GY V R++ Y V N P+A I+A Q+Y GV
Sbjct: 186 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGV---- 241
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F +L+H++ I+GYG D + YWI++NSWG WGE GY R+ RG
Sbjct: 242 --FSGPCGTSLNHAITIIGYGQDSS-----GTKYWIVRNSWGSSWGEGGYVRMARGVSSS 294
Query: 334 DGSCGI 339
G+CGI
Sbjct: 295 SGACGI 300
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 174/312 (55%), Gaps = 28/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
F +H K Y E RL IF+ N KI + Q G + L N+++DL EF+
Sbjct: 32 FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 102 KYLGF------KLKPSYAD-RSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF +L+ + + V + P ++TLP++ DWR AVT VKDQ CGS WAF
Sbjct: 92 LMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGHCGSCWAF 151
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S+TG +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
K+YPY D +C NK G+ + + DE MA+ + GP++VAI+A + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 269
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + Q CD +NL H VL+VG+G D + YW++KNSWG WG+KG+
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GDDYWLVKNSWGTTWGDKGFI 320
Query: 329 RLYRG-DGSCGI 339
++ R D CGI
Sbjct: 321 KMLRNKDNQCGI 332
>gi|157868354|ref|XP_001682730.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
gi|68126185|emb|CAJ07238.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
Length = 354
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 169/321 (52%), Gaps = 29/321 (9%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
+A + F E+H K++ + R + F N++ L +T + Y ++ +F+DL+
Sbjct: 39 SAHYGRFKERHGKSFGEDADEGHRFNAFKQNMQTAYFL-NTHNPHAHYDVSGKFADLTPQ 97
Query: 98 EFQAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAF--DWREYDAVTGVKDQTMCGSS 150
EF YL P Y D + + L A DWRE AVT VK+Q MCGS
Sbjct: 98 EFAKLYL----NPDYYAHRGKDYKEHVHVDDSVLSGAMSVDWREKGAVTPVKNQGMCGSC 153
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS GNIE +A K LVSLSEQ L+ CD DDGC GG + A + I+ G +
Sbjct: 154 WAFSAIGNIESQWALKNHSLVSLSEQMLVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPT 213
Query: 211 EKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
EK+YPY G C +K +I+GY+S+ DE +A Y+ + GP+AVA++A Q
Sbjct: 214 EKSYPYASAGGTSPPCH-DKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQ 272
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
Y GV C G +L+H VL+VG+ R K PYWI+KNSWG WGEKGY
Sbjct: 273 LYFGGVV----TLCFG--LSLNHGVLVVGFN-KRAK-----PPYWIVKNSWGTSWGEKGY 320
Query: 328 FRLYRGDGSCGINDYVRSALV 348
RL G C + +Y +A V
Sbjct: 321 IRLAMGSNQCLLKNYPVTATV 341
>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 479
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 171/352 (48%), Gaps = 27/352 (7%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
FA VA + + S ++ LH + +A F +F +QH K++ R +
Sbjct: 8 LFAMVATVLFALCYCSTVIA--RTLHGIDDEVASAHFMHFKKQHGKSFGEEAVEGHRFNA 65
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT 125
F N++ L + +F+ L+ EF +YL P Y R + A
Sbjct: 66 FKENMQTAVYLNAQNPHAHYDVSGKFAALTPQEFAKQYL----NPDYYTRQLKAHKERAH 121
Query: 126 L-------PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
+ A DWRE AVT VKDQ +CGS WAFS GNIEG +A LVSLSEQ L
Sbjct: 122 VYEGVRGGLSAVDWREKGAVTEVKDQGLCGSCWAFSAIGNIEGQWALSGNTLVSLSEQML 181
Query: 179 IDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD--KACRLNKKATQVKING 236
+ CD D GC GG + A+ I+ G + E +YPY D A L+ +I+G
Sbjct: 182 VSCDTVDMGCNGGLMDQAWAWIIKNHSGAVYTEVSYPYTSGDGSTASCLSTGKVGARISG 241
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
VS+ +DE + +L +NGP+++A++A Q Y GV + NL+H VL+VG
Sbjct: 242 QVSLPQDEDAIEAWLEKNGPISIAVDATTWQLYFGGVVSNCFAY------NLNHGVLLVG 295
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
Y PYWI+KNSWG WGE GY RL +G C + DY SA V
Sbjct: 296 YNNSANP------PYWIVKNSWGTSWGEHGYIRLAKGSNQCMMKDYAMSATV 341
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 187/344 (54%), Gaps = 27/344 (7%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
+FA + + ++ VS S+F + + ++ + +L QH + Y E+ I
Sbjct: 11 YFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKR--YERWLVQHGRRYKNRDEWQRHFGI 68
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL-KPSYADRSVPAMIPNI 124
+ N+R I + + ++ S N+F+D++ E++A Y+G + S ++S +
Sbjct: 69 YQSNVRFINYI-NAQNFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSSFKRERSK 127
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
LP + DWR+ AVT V++Q CGS WAFST +EG+ +T KLVSLSEQEL+DCD +
Sbjct: 128 VLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDID 187
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVS 241
++GC GG + NAF I K GG+ + YPY G+ C +K A VKI+GY +V
Sbjct: 188 SGNEGCNGGYMVNAFKFI--KQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETVP 245
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ + + V P++VAI+A Y Q Y G+ FC + L+H+V ++GYG
Sbjct: 246 PNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI---FNGFC---GKQLNHAVTVIGYGE 299
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
D K YW++KNSWG GWGE GY R+ R +G CGI
Sbjct: 300 DNGK------KYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGI 337
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 174/331 (52%), Gaps = 32/331 (9%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
+ H + + QH K Y T E YSR IF N KI EH S
Sbjct: 18 LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72
Query: 88 LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+N+F D+ EF + +G LK V N TLP++ DWR V+ VKDQ
Sbjct: 73 MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++ + GC GG + AF I
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190
Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
K GGL+ E++YPY DDK C+ + + + GY V S +E + + + GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVSVA 250
Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A + QFY +GV Q C E L H VL+VGYG +H+A +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLVVGYGA-MNDNSHQA--FWIVKNSW 303
Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
G WG++GY + R + CGI LV
Sbjct: 304 GPNWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
+N + + H+K Y E + R+ ++ NL+KI+L + EH G + G+N F D++
Sbjct: 28 WNLWKDWHSKKYHEKEEGWRRM-VWEKNLKKIEL-HNLEHSMGKHTYSLGMNHFGDMTHE 85
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
EF+ G+KLK R M PN + PR+ DWR+ VT VKDQ CGS WAFSTT
Sbjct: 86 EFRQIMNGYKLKSQRKLRGSLFMEPNFLEAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTT 145
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG + KT LVSLSEQ L+DC + + +GC GG + AF I K GGL+ E++Y
Sbjct: 146 GAMEGQHFRKTGTLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI--KDNGGLDSEESY 203
Query: 215 PYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYV 270
PY G D+ C + G+V V S E + K + GP++VAI+A + QFY
Sbjct: 204 PYLGTDEGPCHYDPSYNSANDTGFVDVPSGSERALMKAVASVGPVSVAIDAGHESFQFYH 263
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G I + + +E L H VL+VGYG + K YWI+KNSW E WG+KGY +
Sbjct: 264 SG----IYYDKECSSEELDHGVLVVGYGFEGKDVDGKK--YWIVKNSWSENWGDKGYIYM 317
Query: 331 YRGDGS-CGINDYVRSALV 348
+ + CGI LV
Sbjct: 318 AKDKKNHCGIATAASYPLV 336
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 170/344 (49%), Gaps = 27/344 (7%)
Query: 5 YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
Y + +ALL + + +S K +LH ++ Q+ + Y E R
Sbjct: 7 YRYICLALLFVLAAWASHA-----KARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYK 61
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI 124
IF N+ +I+ + S +NEF+DL+ EF+A FK + +
Sbjct: 62 IFKDNVARIESFNKAMNKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYEHVX 121
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ- 183
+P DWR+ AVT +KDQ CGS WAFS +EG+ T KL+SLSEQEL+DCD
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 184 -EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVS 241
ED GC GG + +AF I + GL E YPY G D C K A KINGY V
Sbjct: 182 GEDQGCSGGLMDDAFKFI--EQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVP 239
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ + V + P+AVAI+A + QFY +GV F G E L H V VGYG
Sbjct: 240 ANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGV-----FTGQCGTE-LDHGVSAVGYGT 293
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ + YW++KNSWG GWGE+GY R+ R +G CGI
Sbjct: 294 -----SDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTEKEGLCGI 332
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 134/335 (40%), Positives = 177/335 (52%), Gaps = 29/335 (8%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S F +VG + + + LF +L +H K YA+ E R +F NL+ I + +
Sbjct: 128 SDFSIVGYSEEDLSSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKV-NR 186
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM----IPNITLPRAFDWREY 135
E S GLNEF+DL+ EF+A YLG P+ A S + + LP++ DWR
Sbjct: 187 EVTSYWLGLNEFADLTHEEFKATYLGLA-PPAPARESRGSFKYEDVSADDLPKSVDWRTK 245
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSIS 194
AVT VK+Q CGS WAFST +EG+ A T L +LSEQELIDC + ++GC GG +
Sbjct: 246 GAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMD 305
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYVSV-SRDETDMAKYL 251
AF I S GGL E+ YPY ++ +C KK+ V I+GY V + +E + K L
Sbjct: 306 YAFSYIASS--GGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKAL 363
Query: 252 VENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
P++VAI A QFY GV F L H V VGYG D+ K
Sbjct: 364 AHQ-PVSVAIEASGRHFQFYSGGV------FDGPCGTQLDHGVAAVGYGSDKG----KGH 412
Query: 310 PYWIIKNSWGEGWGEKGYFRLYR----GDGSCGIN 340
Y I++NSWG WGEKGY R+ R G+G CGIN
Sbjct: 413 DYIIVRNSWGAKWGEKGYIRMKRGTGKGEGLCGIN 447
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 129/327 (39%), Positives = 171/327 (52%), Gaps = 41/327 (12%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-------------SGVY 86
A F+ + +H K YAT E +RL +F+ N + S
Sbjct: 34 AQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTL 93
Query: 87 GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-----PNITLPRAFDWREYDAVTGV 141
LN F+DL+ EF+A LG ++ P A RS A + +P A DWR+ AVT V
Sbjct: 94 ALNAFADLTHEEFRAARLG-RIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKV 152
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTI 200
KDQ CG+ W+FS TG +EG+ KT LVSLSEQELIDCD+ + GC GG + A+ +
Sbjct: 153 KDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFV 212
Query: 201 MSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
+ GG++ E+ YPYR D C NK K V I+GY V ++ D+ V P++V
Sbjct: 213 IKN--GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSV 270
Query: 260 AI--NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNS 317
I +A A Q Y G+ F +L H+VLIVGYG + K YWI+KNS
Sbjct: 271 GICGSARAFQLYYQGI------FDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNS 318
Query: 318 WGEGWGEKGYFRLYR--GD--GSCGIN 340
WGE WG KGY ++R GD G CGIN
Sbjct: 319 WGESWGMKGYMHMHRNTGDSKGVCGIN 345
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 136/371 (36%), Positives = 175/371 (47%), Gaps = 53/371 (14%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
AG + L+L F +VG + H LF +L +H + YA+L E R +
Sbjct: 23 LLAGSSCLALARPSGDFSIVGYSEEDLSSHESLAELFERWLSRHRRAYASLEEKLRRFQV 82
Query: 66 FSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFK-----------LKPSY 112
F NL I +T Y GLNEF+DL+ EF+A YLG +
Sbjct: 83 FKDNLHHID---ETNRKVSSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSGIDDDDEP 139
Query: 113 ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
+ + +LP++ DWR AVTGVK+Q CGS WAFST +EG+ T L +
Sbjct: 140 EEEEGYEGVDGASLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTA 199
Query: 173 LSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR------- 224
LSEQELIDCD + ++GC GG + AF I GGL E+ YPY ++ C+
Sbjct: 200 LSEQELIDCDTDGNNGCNGGLMDYAFSYIAHN--GGLHTEEAYPYLMEEGTCQRSSSSEK 257
Query: 225 --------LNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGV 273
N A V I+GY V R +E + K L + P++VAI A QFY GV
Sbjct: 258 KWPGSSEDANDDAAVVTISGYEDVPRNNEQALLKALAQQ-PVSVAIEASGRNFQFYSGGV 316
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
F L H V VGYG K Y I+KNSWG WGEKGY R+ RG
Sbjct: 317 ------FDGPCGTQLDHGVAAVGYGT-----AAKGHDYIIVKNSWGPSWGEKGYIRMRRG 365
Query: 334 DGS----CGIN 340
G CGIN
Sbjct: 366 TGKRQGLCGIN 376
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 119/309 (38%), Positives = 170/309 (55%), Gaps = 30/309 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL--NEFSDLSTAEF 99
+ +L+++ + Y E+ R I+ N++ I+ + Y L N F+D++ EF
Sbjct: 39 YETWLKRYGRHYRDREEWEVRFDIYQSNVQYIEFYNSQNYS---YKLIDNRFADITNEEF 95
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
++ YLG+ P + ++ + LP++ DWR+ AVT VKDQ CGS WAFS +
Sbjct: 96 KSTYLGYL--PRFRVQTEFRYHKHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAV 153
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
EG+ KT+ LVSLSEQ+LIDCD + ++GCEGG + AF+ I K GG+ K YPY+
Sbjct: 154 EGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYIAFNYI--KKHGGIATAKEYPYK 211
Query: 218 GDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVS 274
G D C +K K V I+GY SV M K V + P+++A +A YA QFY G+
Sbjct: 212 GRDGNCNKSKAKNNAVTISGYESVPARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGI- 270
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
F +NL+H + IVGYG + YWI+KNSW WGE GY R+ R
Sbjct: 271 -----FSGSCGKNLNHGMTIVGYGEE------NGDKYWIVKNSWANDWGESGYVRMKRDT 319
Query: 334 ---DGSCGI 339
DG+CGI
Sbjct: 320 KDKDGTCGI 328
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 132/357 (36%), Positives = 180/357 (50%), Gaps = 38/357 (10%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHT-----ALFNYFLEQHNKTYAT 55
M+ F F LL L S ++G ++ H T A++ +L +H K+Y
Sbjct: 10 MAVFLFL----LLGL-ASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNA 64
Query: 56 LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPS 111
L E R IF NLR I + E+ + GLN F+DL+ E+++ YLG K + S
Sbjct: 65 LGEKERRFQIFKDNLRFIDE-HNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRSS 123
Query: 112 YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
A +LP + DWR+ AV VKDQ CGS WAFST +EG+ T L+
Sbjct: 124 NKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLI 183
Query: 172 SLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKA 229
SLSEQEL+DCD ++GC GG + AF+ I++ GG++ E+ YPY+ D C + K A
Sbjct: 184 SLSEQELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEEDYPYKASDGRCDQYRKNA 241
Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNEN 287
V I+GY V ++ + V N P++VAI A Q Y +G+ F
Sbjct: 242 KVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGI------FTGRCGTA 295
Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-----GDGSCGI 339
L H V VGYG T V YWI+KNSWG WGE+GY R+ R G CGI
Sbjct: 296 LDHGVTAVGYG------TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGI 346
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
F +H K Y E RL IF+ N KI + Q G + L N+++DL EF+
Sbjct: 32 FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF + AD S + ++TLP++ DWR AVT VKDQ CGS WAF
Sbjct: 92 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S+TG +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
K+YPY D +C NK G+ + + DE MA+ + GP++VAI+A + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 269
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + Q CD +NL H VL+VG+G D + YW++KNSWG WG+KG+
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GDDYWLVKNSWGTTWGDKGFI 320
Query: 329 RLYRG-DGSCGI 339
++ R + CGI
Sbjct: 321 KMLRNKENQCGI 332
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 135/323 (41%), Positives = 173/323 (53%), Gaps = 39/323 (12%)
Query: 40 ALFNYFLEQHNKTYATLV--------EYYSRLHIFSGNLRKIQLLQDTEHGSGVY-GLNE 90
ALF+ ++ QH K+YA E +R IF NLR I + E G + GLN
Sbjct: 55 ALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIH--GENEKNQGYFLGLNA 112
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAM----IPNITLPRAFDWREYDAVTGVKDQTM 146
F+DL+ EF+A+ G + S S + LP + DWRE AV GVKDQ
Sbjct: 113 FADLTNEEFRAQRHGGRFDRSRERTSYEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQGS 172
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLG 205
CGS WAFS IEGV T +LVSLSEQEL+DCD+ ED+GC GG + AF ++
Sbjct: 173 CGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKN-- 230
Query: 206 GGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINA 263
GGL+ E YPY+G C +K A V I+GY V DET + K V + P++VAI+A
Sbjct: 231 GGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLK-AVAHQPVSVAIDA 289
Query: 264 --YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
++QFY +G+ F +L H V VGYG + K YWIIKNSWG
Sbjct: 290 GGSSMQFYRSGI------FTGRCGTDLDHGVTNVGYGKEDGK------AYWIIKNSWGSN 337
Query: 322 WGEKGYFRLYRGD----GSCGIN 340
WGEKGY ++ R G CGIN
Sbjct: 338 WGEKGYIKMARNTGLAAGLCGIN 360
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/353 (35%), Positives = 173/353 (49%), Gaps = 36/353 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
VA+++L+ + SF + +E ++ F H KTY E R+ I
Sbjct: 4 LLVAVAIIALSYAHPSFDIYPEE-------------WHVFKAMHGKTYKNQFEEMFRMKI 50
Query: 66 FSGNLRKIQLLQDT-EHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP 122
F N +KI+ E G Y + N F DL EF+A GFK+ P
Sbjct: 51 FMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGFKMSPDTKRNGELYFPS 110
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
N LP+ DWR+ AVT VKDQ CGS W+FS TG++EG KT KLVSLSEQ L+DC
Sbjct: 111 NSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCS 170
Query: 183 QE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV 240
++GCEGG + AF + G++ E +YPY + CR K G+V +
Sbjct: 171 TSYGNNGCEGGLMDQAFQYVSDN--KGIDTEASYPYEARENTCRFKKNKVGGTDKGHVDI 228
Query: 241 -SRDETDMAKYLVENGPMAVAINAY--ALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVG 296
+ DE + L GP++VAI+A + QFY GV + P + + +L H VL VG
Sbjct: 229 PAGDEKALQNALATVGPISVAIDANHGSFQFYSKGVYNEP-----NCSSYDLDHGVLAVG 283
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS-CGINDYVRSALV 348
YG T YW++KNSWG WGE GY ++ R + CGI LV
Sbjct: 284 YG------TENGQDYWLVKNSWGPSWGENGYIKIARNHSNHCGIASMASYPLV 330
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 113/302 (37%), Positives = 158/302 (52%), Gaps = 20/302 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ QH + Y + E R IF N+ +I+ + G+N+F+DL+ EF+A Y
Sbjct: 43 WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYH 102
Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
G+K + S S +P + DWR AVT VKDQ CG WAFST IEG+
Sbjct: 103 GYKRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIK 162
Query: 165 AKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
+T L+SLSEQ+L+DC + GC+GG + AF I+ GGL E YPY+G D C
Sbjct: 163 LQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRN--GGLTSEDNYPYQGVDGTCS 220
Query: 225 LNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFC 281
K A T+ +I GY V ++ + V P++V ++ QFY +GV F
Sbjct: 221 SEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGV-----FNG 275
Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----C 337
D G + +H+V +GYG D YW++KNSWG WGE GY R+ RG GS C
Sbjct: 276 DCGTQQ-NHAVTAIGYGTD-----IDGTDYWLVKNSWGTSWGENGYMRMRRGIGSSEGLC 329
Query: 338 GI 339
G+
Sbjct: 330 GV 331
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/311 (38%), Positives = 166/311 (53%), Gaps = 31/311 (9%)
Query: 45 FLEQHNKTYATLVEYYS--RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAK 102
++ QH + YA E + R ++F N+ +I+ D + + +N+F+DL+ EF+A
Sbjct: 40 WMSQHGRVYADEQEDHKNKRFNVFKENVERIEEFNDGK--TFKLAINQFADLTNEEFRAS 97
Query: 103 YLGFK---LKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
Y GFK + S + P N++ LP + DWR+ AVT VK+Q CG WAFS
Sbjct: 98 YNGFKGPMVLSSQITKPTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVA 157
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
IEG+ T KL+SLSEQEL+DCD + D GCEGG + AF+ I++ GGL E YP
Sbjct: 158 AIEGITQISTGKLISLSEQELVDCDTKGIDHGCEGGLMDTAFEFIINN--GGLTTESNYP 215
Query: 216 YRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTG 272
Y+G+D C NK V I GY V ++ V + P++VAI A QFY +G
Sbjct: 216 YKGEDGTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSG 275
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V F + G E L H+V VGYG + YWI+KNSWG WGE GY + +
Sbjct: 276 V-----FTGECGTE-LDHAVTAVGYGE-----SEDGSKYWIVKNSWGTKWGESGYIEMQK 324
Query: 333 G----DGSCGI 339
G CGI
Sbjct: 325 DIKVKQGLCGI 335
>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 118/346 (34%), Positives = 184/346 (53%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
V LL++ + F+ V LH ++ F F ++++++Y E R +F N
Sbjct: 14 VGLLAV---AACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQN 68
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL-- 126
+ + + + + +G+ FSD+S EF+A Y G + + R P + N++
Sbjct: 69 MERAKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGK 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWR+ AVT VKDQ C SSWAF+ GNIEG + +L SLSEQ L+ CD D
Sbjct: 126 APEAVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAGHELTSLSEQMLVSCDTND 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSR 242
GC G + AF I+S G + E++YPY G+ C + K I+ +V +
Sbjct: 186 LGCRAGFMDTAFKWIVSSNNGNVFTEQSYPYASGGGNVPTCNKSGKVVGANIDDHVHILD 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
+E +A++L + GP+A+A++A + Q Y GV ++ ++ + L+VGY D +
Sbjct: 246 NENAIAEWLAKKGPVAIAVDATSFQSYTGGV------LTSCISKEVNSAALLVGYD-DTS 298
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
K PYWIIKNSW +GWGE+GY R+ +G C + +YV SA+V
Sbjct: 299 K-----PPYWIIKNSWSKGWGEEGYIRIEKGTNQCRMKEYVSSAVV 339
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 167/315 (53%), Gaps = 33/315 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
L+ + +H K+Y + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 40 LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 98
Query: 97 AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E++ YLG + KP +DR + A N LP + DWR AV +KDQ CGS WA
Sbjct: 99 EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 156
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FS +EG+ T L+SLSEQEL+DCD ++GC GG + AFD I++ GG++ E
Sbjct: 157 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 214
Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
YPY+G D+ C +N+K A V I+ Y V+ + + V N P++VAI A A Q
Sbjct: 215 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 274
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F L H V VGYG + K YWI++NSWG+ WGE GY
Sbjct: 275 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 322
Query: 329 RLYRG----DGSCGI 339
R+ R G CGI
Sbjct: 323 RMERNIKASSGKCGI 337
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 158/306 (51%), Gaps = 26/306 (8%)
Query: 47 EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
+ H++ + E R F N R I LN F D+ EF++ +
Sbjct: 46 QTHHRVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDMGREEFRSGFADS 105
Query: 107 KLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
++ + + +P LPR+ DWR+ AVT VK+Q CGS WAFST +E
Sbjct: 106 RINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRCGSCWAFSTVVAVE 165
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G+ A +T LVSLSEQELIDCD +++GC+GG + NAF+ I S GG+ E YPY +
Sbjct: 166 GINAIRTGSLVSLSEQELIDCDTDENGCQGGLMENAFEFIKSH--GGITTESAYPYHASN 223
Query: 221 KAC--RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
C ++ V I+G+ +V D V + P++VAI+A ALQFY GV
Sbjct: 224 GTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQALQFYSEGV--- 280
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
F D G + L H V VGYGV + PYWI+KNSWG WGE GY R+ RG G+
Sbjct: 281 --FTGDCGTD-LDHGVAAVGYGV-----SDDGTPYWIVKNSWGPSWGEGGYIRMQRGTGN 332
Query: 337 ---CGI 339
CGI
Sbjct: 333 GGLCGI 338
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/314 (38%), Positives = 166/314 (52%), Gaps = 31/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ H +TY + E R +F NLR I + +GV+ GLN F+DL+
Sbjct: 43 MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA-HNAAADAGVHSFRLGLNRFADLTN 101
Query: 97 AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
E++A YLG + +P +R + A N LP + DWR AV VKDQ GS WAF
Sbjct: 102 DEYRATYLGARTRPQ-RERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWAF 160
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
ST +EG+ T L+SLSEQEL+DCD + GC GG + AF+ I++ GG++ EK
Sbjct: 161 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEK 218
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF--Y 269
YPY+G D C +N+K A V I+ Y V ++ + V N P++VAI A QF Y
Sbjct: 219 DYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLY 278
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+G+ F L H V VGYG + K YWI+KNSWG WGE GY R
Sbjct: 279 SSGI------FTGSCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVR 326
Query: 330 LYRG----DGSCGI 339
+ R G CGI
Sbjct: 327 MERNIKASSGKCGI 340
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 164/313 (52%), Gaps = 28/313 (8%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
++ +L +H K Y L E R +F NL IQ + ++ + GLN+F+D++ E++
Sbjct: 39 MYEEWLVKHQKVYNGLGEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYR 98
Query: 101 AKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
Y G K L + + A LP DWR AV +KDQ CGS WAFS
Sbjct: 99 VMYFGTKSDAKRRLMKTKSTGHRYAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFS 158
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +E + T K VSLSEQEL+DCD+ + GC GG + AF+ I+ GG++ +K
Sbjct: 159 TVATVEAINKIVTGKFVSLSEQELVDCDRAYNQGCNGGLMDYAFEFIIQN--GGIDTDKD 216
Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
YPYRG D C KK A V I+GY V + + K V P+++AI A ALQ Y
Sbjct: 217 YPYRGFDGICDPTKKNAKAVNIDGYEDVPPYDENALKKAVARQPVSIAIEASGRALQLYQ 276
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV F +L H V++VGYG + V YW+++NSWG GWGE GYF++
Sbjct: 277 SGV------FTGECGTSLDHGVVVVGYG------SENGVDYWLVRNSWGTGWGEDGYFKM 324
Query: 331 YRG----DGSCGI 339
R G CGI
Sbjct: 325 QRNVRTPTGKCGI 337
>gi|6649595|gb|AAF21471.1|U85984_1 cysteine proteinase [Clonorchis sinensis]
Length = 217
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 103/219 (47%), Positives = 136/219 (62%), Gaps = 10/219 (4%)
Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCE 189
FDWR + AV V DQ CGS WAFS GNIEG + KT L+ LSEQ+L+DCD D+GC
Sbjct: 8 FDWRNHGAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDGVDEGCN 67
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
GG+ AF I+ GGL+ + YPY G + CR+ +V ING + DE A+
Sbjct: 68 GGTPQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQ 125
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
L E GP++ A+NA LQFY G+ HP+ CD ++L+H+VL VGYG +
Sbjct: 126 MLKETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYG------KEGRL 177
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYW +KNSW +GE GYFR+YRGDG+CGIN V ++++
Sbjct: 178 PYWTVKNSWSTMFGENGYFRIYRGDGTCGINTLVSTSII 216
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 169/317 (53%), Gaps = 27/317 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
F H KTY + +E R IF+ N I + ++ G+ G+N+F DL EF
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTEN-SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFA 88
Query: 101 AKYLGFKLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ G S PA + + +LP+ DWR+ AVT VKDQ CGS WAFS TG+
Sbjct: 89 RIFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGS 148
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG + K +LVSLSEQ L+DC Q ++GCEGG + +AF I K G++ EK+YPY
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPY 206
Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
+ D CR K+ GYV + + E D+ K + GP++VAI+A + Q Y GV
Sbjct: 207 KAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGV 266
Query: 274 -SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
P + +E+L H VL+VGYGV K YW++KNSW E WG++GY + R
Sbjct: 267 YDEP-----ECSSEDLDHGVLVVGYGVKGGK------KYWLVKNSWAESWGDQGYILMSR 315
Query: 333 -GDGSCGINDYVRSALV 348
+ CGI LV
Sbjct: 316 DNNNQCGIASQASYPLV 332
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 167/314 (53%), Gaps = 30/314 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F + +H K Y+ E R ++ NL IQ ++ S GL +F+DL+ EF+
Sbjct: 45 FAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSEKNLSYWLGLTKFADLTNEEFRR 103
Query: 102 KYLGFKLKPSY---ADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
+Y G ++ S R+ N P++ DWRE AVT VKDQ CGS WAFS
Sbjct: 104 QYTGTRIDRSRRLKKGRNATGSFRYANSEAPKSIDWREKGAVTSVKDQGSCGSCWAFSAV 163
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
G++EG+ A +T +SLS QEL+DCD++ + GC GG + AFD ++ GG++ EK YP
Sbjct: 164 GSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQN--GGIDTEKDYP 221
Query: 216 YRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTG 272
Y+G D C +NK A V I+ Y V ++ + K V P++VAI A Q Y G
Sbjct: 222 YQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGG 281
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V F +L H VL VGYG + K + YWI+KNSWGE WGE GY R+ R
Sbjct: 282 V------FTGRCGTDLDHGVLAVGYG------SEKGLDYWIVKNSWGEYWGESGYLRMQR 329
Query: 333 ------GDGSCGIN 340
G G CGIN
Sbjct: 330 NLKDDNGYGLCGIN 343
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 170/323 (52%), Gaps = 33/323 (10%)
Query: 34 HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
V++T + +L +H KTY L E SR IF+ NL+ I + + S GLN+F+D
Sbjct: 30 EEVRNT--YELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFAD 87
Query: 94 LSTAEFQAKYLGFKLKPSYADRSVP--------AMIPNITLPRAFDWREYDAVTGVKDQT 145
L+ E+++ YLG K+ P + A+ N P DWRE AV+ VK+Q
Sbjct: 88 LTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVSPVKNQG 147
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKL 204
CGS WAFST ++EG+ T L+SLSEQEL+DCD + + GC GGS+ AF I+S
Sbjct: 148 GCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQFIVSN- 206
Query: 205 GGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
GG++ E YPY+G C + KA V I+GY V V + P++V I A
Sbjct: 207 -GGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSVGIEA 265
Query: 264 --YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
A Q Y +GV + C NL H V++VGYG + K YWI++NSWG
Sbjct: 266 SGRAFQLYTSGV---LTGSC---GTNLDHGVVVVGYGSENGK------DYWIVRNSWGPE 313
Query: 322 WGEKGYFRLYRGD-----GSCGI 339
WGE GY R+ R G CGI
Sbjct: 314 WGEDGYIRMERNMVDTPVGMCGI 336
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 167/298 (56%), Gaps = 28/298 (9%)
Query: 56 LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SY 112
L E R ++F N + + + + + LN+F+D++ EF++ Y G K+K
Sbjct: 53 LEEKNKRFNVFKENTKHVHKVNQMDKPYKLK-LNKFADMTNHEFRSSYGGSKVKHYRMLR 111
Query: 113 ADRSVPA--MIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
DR M T LP + DWR+ AVTG+KDQ CGS WAFST +EG+ KTK+
Sbjct: 112 GDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKE 171
Query: 170 LVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK- 227
L+SLSEQ+LIDCD+ DD GC GG + +AF+ I K GG+ E YPY+ D+ C + K
Sbjct: 172 LLSLSEQQLIDCDRSDDHGCNGGLMESAFEFI--KKNGGITTENNYPYKAKDERCDMLKM 229
Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGN 285
A V I+G+ SV ++ V + P++VAI+A LQFY GV F + G
Sbjct: 230 NAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGV-----FDGECGT 284
Query: 286 ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
E L H V IVGYG T YWI+KNSWG WGEKGY R+ RG +G CGI
Sbjct: 285 E-LDHGVAIVGYGT-----TLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGI 336
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 186/348 (53%), Gaps = 43/348 (12%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+AL+++ +VS V+ +E ++ F +H K Y E RL IF+ N
Sbjct: 10 LALVAVAQAVSYAEVIQEE-------------WHTFKLEHRKNYQDETEERFRLKIFNEN 56
Query: 70 LRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL----KPSYADRSVPAMI- 121
KI L T S +N+++D+ EF + GF + AD S +
Sbjct: 57 KHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTF 116
Query: 122 ---PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
++TLP+ DWR AVT VKDQ CGS WAFS+TG +EG + K+ LVSLSEQ L
Sbjct: 117 ISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNL 176
Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
+DC + ++GC GG + NAF I K GG++ EK+YPY D +C NK + G
Sbjct: 177 VDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDRG 234
Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGV-SHPIQFFCDGGNENLSHSV 292
+V + + +E MA+ + GP+AVAI+A + QFY GV + P CD +NL H V
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPA---CDA--QNLDHGV 289
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
L+VG+G D + YW++KNSWG WG+KG+ ++ R + CGI
Sbjct: 290 LVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGI 332
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 126/330 (38%), Positives = 168/330 (50%), Gaps = 38/330 (11%)
Query: 31 HHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
H H K + A++ +L +H K Y L E R IF NL I ++E+
Sbjct: 32 HQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQ-HNSENR 90
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGF-----KLKPSYADRSVPAMIPNITLPRAFDWREYDA 137
+ GLN F+DL+ EF++ YLG K P +DR P + +LP + DWR+ A
Sbjct: 91 TYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRYAPRV--GDSLPDSVDWRKEGA 148
Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNA 196
V VKDQ CGS WAFST +EG+ T L++LSEQEL+DCD ++GC GG + A
Sbjct: 149 VAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYA 208
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQVKINGYVSVSRDETDMAKYLVENG 255
F+ I++ GG++ E YPY G D C K A V I+ Y V ++ K V N
Sbjct: 209 FEFIINN--GGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQ 266
Query: 256 PMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
P++VAI Q Y +GV F +L H V VGYG T K YWI
Sbjct: 267 PVSVAIEGGGRNFQLYNSGV------FTGECGTSLDHGVAAVGYG------TEKGKDYWI 314
Query: 314 IKNSWGEGWGEKGYFRLYRG----DGSCGI 339
++NSWG+ WGE GY R+ R G CGI
Sbjct: 315 VRNSWGKSWGESGYIRMERNIASPTGKCGI 344
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 168/312 (53%), Gaps = 39/312 (12%)
Query: 46 LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-----YGLNEFSDLSTAEFQ 100
L +H+K Y L R IF NLR I EH GV GLN+F+DLS E++
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFID-----EHNKGVNQSFKLGLNKFADLSNEEYK 65
Query: 101 AKYLGFKL----KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
+ +LG ++ K +DR + LP++ DWRE AV VKDQ CGS WAFST
Sbjct: 66 SMFLGGRMVRDRKGFESDRFKYGV--GDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTV 123
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
+EG+ T L+SLSEQEL+DCD+ + GC GG + AF+ I+ GG++ E YP
Sbjct: 124 AAVEGINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKN--GGIDTEDDYP 181
Query: 216 YRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y+G D C N+K A V ING+ V +++ K V + P++VAI A A Q Y +G
Sbjct: 182 YKGVDGQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESG 241
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
+ + + +L H V+ VGYG + K YWI++NSWG WGE GY RL R
Sbjct: 242 IFNGL------CGTDLDHGVVAVGYGTEDGK------DYWIVRNSWGPNWGENGYIRLER 289
Query: 333 -----GDGSCGI 339
G CGI
Sbjct: 290 NVASTNTGKCGI 301
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/320 (37%), Positives = 158/320 (49%), Gaps = 24/320 (7%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
LH ++ ++ + Y E R IF N+ I+ + +NE
Sbjct: 27 RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDINE 86
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAM-IPNIT-LPRAFDWREYDAVTGVKDQTMCG 148
F+DL+ EF+ G+K + N+T +P + DWR+ AVT +KDQ CG
Sbjct: 87 FADLTNEEFKVSKNGYKRSSGVGLTEKSSFRYANVTAVPTSMDWRQNGAVTPIKDQGQCG 146
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGG 206
WAFS +EG+ T KL+SLSEQEL+DCD ED GCEGG + +AF+ I K G
Sbjct: 147 CCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFI--KQNG 204
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA-- 263
GL E YPY+G D C NK KI GY V + D V + P++VAI+A
Sbjct: 205 GLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASG 264
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
A QFY GV F D G E L H V VGYG + YW++KNSWG WG
Sbjct: 265 SAFQFYSGGV-----FTGDCGTE-LDHGVTAVGYGT-----SDDGTKYWLVKNSWGTSWG 313
Query: 324 EKGYFRLYRG----DGSCGI 339
E GY R+ R +G CGI
Sbjct: 314 EDGYIRMERDIEAKEGLCGI 333
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/317 (38%), Positives = 170/317 (53%), Gaps = 27/317 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
F H KTY + +E R IF+ N I + ++ G+ G+N+F DL EF
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTEN-SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFA 88
Query: 101 AKYLGF--KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ G K + PA + + +LP+A DWR+ AVT VKDQ CGS WAFS TG+
Sbjct: 89 RIFNGHHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGS 148
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG + K +LVSLSEQ L+DC Q ++GCEGG + +AF I K G++ EK+YPY
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPY 206
Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
D CR K+ GYV + + E D+ K + GP++VAI+A + Q Y GV
Sbjct: 207 EAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGV 266
Query: 274 -SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
P + +E+L H VL+VGYGV K YW++KNSW E WG++GY + R
Sbjct: 267 YDEP-----ECSSEDLDHGVLVVGYGVKGGK------KYWLVKNSWAESWGDQGYILMSR 315
Query: 333 -GDGSCGINDYVRSALV 348
+ CGI LV
Sbjct: 316 DNNNQCGIASQASYPLV 332
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 184/348 (52%), Gaps = 34/348 (9%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKH---TALFNYFLEQHNKTYATLVEYYSRLHIF 66
+ L+ T+S +S M + H+HH +AL+ +L +H K+Y L E R IF
Sbjct: 14 LMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEKDKRFQIF 73
Query: 67 SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-------LKPSYADRSVPA 119
NL+ I + S GL +F+DL+ E+++ YLG K L + +DR +P
Sbjct: 74 KDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNKSDRYLPK 133
Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
+ +LP + DWR+ + GVKDQ CGS WAFS +E + A T L+SLSEQEL+
Sbjct: 134 V--GDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELV 191
Query: 180 DCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGY 237
DCD+ ++GC+GG + AF+ +++ GG++ E+ YPY+ + C + K A VKI+ Y
Sbjct: 192 DCDKSYNEGCDGGLMDYAFEFVINN--GGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSY 249
Query: 238 VSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
V + + V + P+++AI A LQ Y +G+ F + H V+
Sbjct: 250 EDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGI------FTGKCGTAVDHGVVAA 303
Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
GYG + + YWI++NSWG WGEKGY R+ R G CG+
Sbjct: 304 GYG------SENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGL 345
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 167/315 (53%), Gaps = 33/315 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
L+ + +H K+Y + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 39 LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97
Query: 97 AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E++ YLG + KP +DR + A N LP + DWR AV +KDQ CGS WA
Sbjct: 98 EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FS +EG+ T L+SLSEQEL+DCD ++GC GG + AFD I++ GG++ E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213
Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
YPY+G D+ C +N+K A V I+ Y V+ + + V N P++VAI A A Q
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 273
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F L H V VGYG + K YWI++NSWG+ WGE GY
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321
Query: 329 RLYRG----DGSCGI 339
R+ R G CGI
Sbjct: 322 RMERNIKASSGKCGI 336
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 135/339 (39%), Positives = 175/339 (51%), Gaps = 35/339 (10%)
Query: 31 HHLHHVKHTALFNY---------FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
LH +K A NY F H+KTY L E R IF N++KI+ H
Sbjct: 36 EQLHILKAKAGINYQPYEQAWKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYH 95
Query: 82 -GSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYD 136
G Y G+N+FSDL EF KY G K K S D + + N+ P + DWR+
Sbjct: 96 LGKKSYYLGVNQFSDLKHEEF-VKYNGLK-KTSLKDGGCSSYLAANNLVEPDSVDWRKKG 153
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSIS 194
VT VK+Q CGS W+FSTTG++EG + K+ KLVSLSE +L+DC Q ++GC GG +
Sbjct: 154 YVTDVKNQGQCGSCWSFSTTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMD 213
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVE 253
NAF I S GGLE E+ YPY+ C+ + G V V S E+ + K + E
Sbjct: 214 NAFKYIKSV--GGLESEEDYPYKPKQGTCKFDDTKVAATDTGCVDVESGSESALKKAVSE 271
Query: 254 NGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVP 310
GP++VAI+A + Q Y GV P + +E L H VL VGYG D +
Sbjct: 272 VGPVSVAIDASHSSFQSYAGGVYDEP-----ECSSEQLDHGVLCVGYGTD-----DQGQD 321
Query: 311 YWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
YWI+KNSWG WGE GY ++ R CGI LV
Sbjct: 322 YWIVKNSWGAEWGEDGYVKMSRNKKNQCGIATQASYPLV 360
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 162/301 (53%), Gaps = 45/301 (14%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPS---------- 111
R ++F N+R I + + LN+F+D++T EF+ Y G +++
Sbjct: 62 RFNVFKENVRYIHEANKKDRPFRL-ALNKFADMTTDEFRRTYAGSRVRHHRSLSGGRRQG 120
Query: 112 -----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
YAD LP A DWR+ AVT +KDQ CGS WAFST +EG+ +
Sbjct: 121 GGSFMYADAE--------NLPAAVDWRQKGAVTPIKDQGQCGSCWAFSTIVAVEGINKIR 172
Query: 167 TKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL 225
T +LVSLSEQEL+DC+ E+DGC GG + AF I GG+ E +YPY+G+ +C
Sbjct: 173 TGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQQN--GGITTEASYPYQGEQNSCDQ 230
Query: 226 NKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCD 282
+K+ + V I+GY V ++ + V N P++VAI+A QFY GV F D
Sbjct: 231 SKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAIDASGNDFQFYSEGV-----FTTD 285
Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCG 338
GG + L H V VGYG T YWI+KNSWGE WGEKGY R+ RG +G CG
Sbjct: 286 GGTD-LDHGVAAVGYGT-----TRDGTKYWIVKNSWGEDWGEKGYIRMQRGVKQAEGLCG 339
Query: 339 I 339
I
Sbjct: 340 I 340
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/356 (35%), Positives = 182/356 (51%), Gaps = 32/356 (8%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
F +A + L + S F +VG + + LF + E+H K Y E +
Sbjct: 14 FLVWASLTSLISSSLPSEFSIVG-RPGESIAEERVVELFKKWTEKHGKVYKHGQEVEKKF 72
Query: 64 HIFSGNLRKIQLLQDTEHGSG--VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
F NLR + SG + GLN+F+D+S EF+ Y+ KP+ ++
Sbjct: 73 QNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRR 132
Query: 122 PNITL----------PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
P + DWR+Y VTGVKDQ CGS WAFS+TG IEG+ A L+
Sbjct: 133 QGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLI 192
Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
SLSEQEL+DCD +DGCEGG + AF+ +MS GG++ E YPY G+D C K+ T+
Sbjct: 193 SLSEQELVDCDSTNDGCEGGYMDYAFEWVMSN--GGIDTETDYPYTGEDGTCNTTKEETK 250
Query: 232 -VKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVSHPIQFFCDGGNENL 288
V I+GY V+ +E+ + +++ P++V I+ A+ Q Y G+ +++
Sbjct: 251 AVSIDGYEDVAEEESALFCAVLKQ-PISVGIDGGAIDFQLYTGGIYDGDCSD---DPDDI 306
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD----GSCGIN 340
H+VL+VGYG + + YWIIKNSWG WG KGY + R G C IN
Sbjct: 307 DHAVLVVGYGAESGE------EYWIIKNSWGTDWGMKGYAYIKRNTSKDYGVCAIN 356
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 174/331 (52%), Gaps = 32/331 (9%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
+ H + + QH K Y T E YSR IF N KI EH S
Sbjct: 18 LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72
Query: 88 LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+N+F D+ EF + +G LK V N TLP++ DWR V+ VKDQ
Sbjct: 73 MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
CGS WAFSTTG++EG +++KT KLV LSEQ+L+DC ++ + GC GG + AF I
Sbjct: 133 GECGSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190
Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
K GGL+ E++YPY DDK C+ + + + GY V S +E + + + GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250
Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A + QFY +GV Q C E L H VL VGYG +H+A +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303
Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
G WG++GY + R + CGI LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 170/319 (53%), Gaps = 29/319 (9%)
Query: 34 HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
HH + + F F HNK YAT E R IF NL I + + S V +N+F D
Sbjct: 83 HHFQ--SQFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHN-HNMQGYSYVLKMNKFGD 139
Query: 94 LSTAEFQAKYLGFK-----LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
L+ EF+ +YLG+K P D ++ ++ N +P DWR+ VT VKDQ CG
Sbjct: 140 LTLEEFRQRYLGYKKPDLRTPPREVDTTLESVEDN-DIPTHVDWRQRGCVTSVKDQGDCG 198
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGG 206
S WAFS TG +EGVY AKT KLV+LS+Q+L+DC + + GC+GG + AF+ ++ G
Sbjct: 199 SCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVEN--G 256
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAI--NA 263
G+ + YPY D C+ ++ + I GY SV R E M L P++VAI N
Sbjct: 257 GICSGENYPYMRKDGVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQ 316
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
A QFY G+ F NL H VL+VGY + T YWI+KNSWG WG
Sbjct: 317 AAFQFYYDGI------FDAPCGTNLDHGVLLVGYSAE----TAGQGDYWIMKNSWGAAWG 366
Query: 324 EKGY--FRLYRGD-GSCGI 339
+ GY +++G G CG+
Sbjct: 367 KGGYMLMAMHKGPAGQCGV 385
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 171/346 (49%), Gaps = 38/346 (10%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
+ ++ + S + F ++ + L + L+ +L +H + Y L E R +F N
Sbjct: 13 SAMAGSASRADFSIISSKDLREDDAIME--LYELWLAEHKRAYNGLDEKQKRFSVFKDNF 70
Query: 71 RKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT- 125
I EH G GLN+F+DLS EF+A YLG KL P+ +
Sbjct: 71 LYIH-----EHNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSRRYQYSD 125
Query: 126 ---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
LP + DWRE AVT VKDQ CGS WAFST +EG+ T L+SLSEQEL+DCD
Sbjct: 126 GEDLPESIDWREKGAVTSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCD 185
Query: 183 QE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSV 240
+ GC GG + AF+ I++ GGL+ E+ YPY D +C K A V I+ Y V
Sbjct: 186 TSYNQGCNGGLMDYAFEFIINN--GGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDV 243
Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
++ K N P++VAI A QFY +GV F L H V +VGYG
Sbjct: 244 PENDEKSLKKAAANQPISVAIEASGREFQFYDSGV------FTSTCGTQLDHGVTLVGYG 297
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
+ YW +KNSWG+ WGE+G+ RL R G CGI
Sbjct: 298 ------SESGTDYWTVKNSWGKSWGEEGFIRLQRNIEVASTGMCGI 337
>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 164/316 (51%), Gaps = 39/316 (12%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYGLNE 90
T F F +++ K Y E R IF NL KI+ EH S GLN+
Sbjct: 21 TETFVTFQQKYGKVYQNDSELSVREEIFKENLAKIE-----EHNKQFQQNLVSYELGLNQ 75
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAM------IPNITLPRAFDWREYDAVTGVKDQ 144
FSDL+ AEFQA + P D+ M T P + +W E VT VK+Q
Sbjct: 76 FSDLTEAEFQAL---LTMSP-LTDQLTKQMEKYNSEFDIKTAPVSVNWAEKGVVTPVKNQ 131
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKL 204
CGS W F+TTG IE A KT LVSLSEQ+L+DC++ + GC+GG +S A + S
Sbjct: 132 GNCGSCWTFTTTGTIESRLALKTGSLVSLSEQQLLDCNRVNAGCDGGVLSYALQYVES-- 189
Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA 263
GL E YPY+ + C K GY + +R E+D+ K + E GP+AVA+NA
Sbjct: 190 -AGLTTEDEYPYKAWNGTCNSTHKPVAAYTKGYTLIYTRSESDLMKAVAE-GPVAVALNA 247
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
LQ+Y G+ +P + ++H L+VGY + T +PYWIIKNSWG WG
Sbjct: 248 DLLQYYSKGIFNP-----SACSSTVNHGGLVVGYEENAT------LPYWIIKNSWGATWG 296
Query: 324 EKGYFRLYRGDGSCGI 339
E GYFR+ +G CGI
Sbjct: 297 ENGYFRMAKGYNLCGI 312
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 167/298 (56%), Gaps = 28/298 (9%)
Query: 56 LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SY 112
L E R ++F N + + + + + LN+F+D++ EF++ Y G K+K
Sbjct: 51 LEEKNKRFNVFKENTKHVHKVNQMDKPYKLK-LNKFADMTNHEFRSSYGGSKVKHYRMLR 109
Query: 113 ADRSVPA--MIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
DR M T LP + DWR+ AVTG+KDQ CGS WAFST +EG+ KTK+
Sbjct: 110 GDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKE 169
Query: 170 LVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK- 227
L+SLSEQ+LIDCD+ DD GC GG + +AF+ I K GG+ E YPY+ D+ C + K
Sbjct: 170 LLSLSEQQLIDCDRSDDHGCNGGLMESAFEFI--KKNGGITTENNYPYKAKDERCDMLKM 227
Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGN 285
A V I+G+ SV ++ V + P++VAI+A LQFY GV F + G
Sbjct: 228 NAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGV-----FDGECGT 282
Query: 286 ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
E L H V IVGYG T YWI+KNSWG WGEKGY R+ RG +G CGI
Sbjct: 283 E-LDHGVAIVGYGT-----TLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGI 334
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 185/348 (53%), Gaps = 43/348 (12%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+AL+++ +VS V+ +E ++ F +H K Y E RL IF+ N
Sbjct: 10 LALVAVAQAVSYAEVIQEE-------------WHTFKLEHRKNYQDETEERFRLKIFNEN 56
Query: 70 LRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL----KPSYADRSVPAMI- 121
KI L T S +N+++D+ EF + GF + AD S +
Sbjct: 57 KHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTF 116
Query: 122 ---PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
++TLP+ DWR AVT VKDQ CGS WAFS+TG +EG + K+ LVSLSEQ L
Sbjct: 117 ISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNL 176
Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
+DC + ++GC GG + NAF I K GG++ EK+YPY D +C NK G
Sbjct: 177 VDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRG 234
Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGV-SHPIQFFCDGGNENLSHSV 292
+V + + +E MA+ + GP+AVAI+A + QFY GV + P CD +NL H V
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPA---CDA--QNLDHGV 289
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
L+VG+G D + YW++KNSWG WG+KG+ ++ R + CGI
Sbjct: 290 LVVGFGTDES-----GQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGI 332
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 125/313 (39%), Positives = 163/313 (52%), Gaps = 30/313 (9%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H K Y E R I+ NL+KI + +H S +N D+++ E LG KL
Sbjct: 36 HGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKH-SFKLAMNHLGDMTSLEISQTLLGLKL 94
Query: 109 K------PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
K P A PA N+ + + DWR VT VK+Q CGS WAFSTTG +EG
Sbjct: 95 KKHAESQPKGATFLPPA---NVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQ 151
Query: 163 YAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
+ KT KLVSLSEQ L+DC + ++GCEGG + NAF I K GG++ EK+YPY D
Sbjct: 152 HFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYI--KENGGIDTEKSYPYLAKD 209
Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHP 276
C NK A K G+V + + DE + + L GP+++AI+A FY GV P
Sbjct: 210 GVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYDDP 269
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD-G 335
D + L H VL VGYG D K YW++KNSWG WGE+GY ++ R D
Sbjct: 270 -----DCSSTRLDHGVLAVGYGTDDGK------DYWLVKNSWGPSWGEEGYIKIARNDHD 318
Query: 336 SCGINDYVRSALV 348
CG+ LV
Sbjct: 319 KCGVASKASYPLV 331
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 167/316 (52%), Gaps = 32/316 (10%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
L+ ++ QH K Y + E R IF NLR I + + GLN+F+DL+ E+
Sbjct: 44 GLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEY 103
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNI--------TLPRAFDWREYDAVTGVKDQTMCGSSW 151
+AK+LG + P R + + IP+ LP + +WR++ AV+ VKDQ CGS W
Sbjct: 104 RAKFLGTRTDPRR--RLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSCW 161
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
AFS +EG+ + +L+SLSEQEL+DCD+ D GC GG + AF I+ GG++
Sbjct: 162 AFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDN--GGIDT 219
Query: 211 EKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQ 267
EK YPY G + C KK A V I+GY V +E + K V + P+++AI A A Q
Sbjct: 220 EKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVSIAIEAGGRAFQ 278
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
Y +GV F L H V+ VGYG D YWI++NSWG WGE GY
Sbjct: 279 LYESGV------FNGECGLALDHGVVAVGYGSD-----DNGQDYWIVRNSWGGNWGENGY 327
Query: 328 FRLYR----GDGSCGI 339
R+ R G CGI
Sbjct: 328 IRMERNINANTGKCGI 343
>gi|34811401|pdb|1M6D|A Chain A, Crystal Structure Of Human Cathepsin F
gi|34811402|pdb|1M6D|B Chain B, Crystal Structure Of Human Cathepsin F
Length = 214
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 106/222 (47%), Positives = 142/222 (63%), Gaps = 10/222 (4%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P +DWR AVT VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D
Sbjct: 2 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK 61
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
C GG SNA+ I K GGLE E Y Y+G ++C+ + + +V I V +S++E
Sbjct: 62 ACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCQFSAEKAKVYIQDSVELSQNEQK 119
Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
+A +L + GP++VAINA+ +QFY G+S P++ C + H+VL+VGYG
Sbjct: 120 LAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCS--PWLIDHAVLLVGYG------QR 171
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
VP+W IKNSWG WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 172 SDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 213
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 182/351 (51%), Gaps = 40/351 (11%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
A++++TV+ SS ++ + + F H KTY + +E R IF+ N
Sbjct: 9 AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKTYQSHMEELLRFKIFTEN- 54
Query: 71 RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV--PAMIPNI 124
I + ++ G+ G+N+F DL EF + G S PA + +
Sbjct: 55 SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSSFLPPANVNDS 114
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
+LP+ DWR+ AVT VKDQ CGS WAFS TG++EG + K +LVSLSEQ L+DC Q
Sbjct: 115 SLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQS 174
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
++GCEGG + +AF I K G++ EK+YPY D CR K+ GYV + +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
E D+ K + GP++VAI+A + Q Y GV P + +E+L H VL+VGYG
Sbjct: 233 GSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
V K YW++KNSW E WG++GY + R + CGI LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 123/343 (35%), Positives = 167/343 (48%), Gaps = 31/343 (9%)
Query: 7 FAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIF 66
FA V L L S + D + H ++ ++ + Y L E R IF
Sbjct: 12 FALVLCLGLWAFQVSSRTLQDASMQERHE--------QWMARYGRVYKDLQEKEKRFSIF 63
Query: 67 SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYA-DRSVPAMIPNIT 125
N+ I+ + G+N+F+DL+ EF A FK S + R+ N+T
Sbjct: 64 KENVNYIEASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTTFKYENVT 123
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
P DWR+ AVT VK+Q CG WAFS EG++ T LVSLSEQEL+DCD
Sbjct: 124 APSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSG 183
Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSVSR 242
D GC+GG + +AF I+ GGL E YPY+G D C N++AT V I GY V
Sbjct: 184 ADQGCQGGLMDDAFKFIIQN--GGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPS 241
Query: 243 DETDMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
+ + V N P+++AI+A F Y +GV F L H V +VGYGV
Sbjct: 242 NNEQALQQAVANQPISIAIDASGSDFQNYQSGV------FTGSCGTQLDHGVAVVGYGV- 294
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ YW++KNSWG WGE+GY R+ R +G CG+
Sbjct: 295 ----SDDGTKYWLVKNSWGADWGEEGYIRMQRDVDAPEGLCGL 333
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 172/313 (54%), Gaps = 31/313 (9%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ---DTEHGSGVYGL--NEFSDLSTAEF 99
F +H K Y E R+ I+ N K+Q+ Q D E Y L N++ D+ EF
Sbjct: 31 FKMEHKKCYKHEAEERLRMKIYMKN--KLQIAQHNCDYELKKVTYRLKINKYGDMLNHEF 88
Query: 100 QAKYLGFKLKPSYADRS--VP---AMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
+ G+ ++ R+ +P A I N+ LP+ DWR+ AVT VKDQ CGS WA
Sbjct: 89 KNMLNGYNRTINHTLRNERLPVGAAFIEPCNVELPKMVDWRKCGAVTEVKDQGHCGSCWA 148
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEE 210
FS TG++EG + +T LVSLSEQ LIDC ++GC GG + AF I K GL+
Sbjct: 149 FSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSYI--KDNKGLDT 206
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINA--YALQ 267
EKTYPY G+D CR +K+++ G+V + DE + + GP++VAI+A + Q
Sbjct: 207 EKTYPYEGEDDKCRYDKRSSGASDVGFVDIPVGDEQKLKAAVATVGPVSVAIDASHQSFQ 266
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY G I F + + NL H VL+VGYG D + YWI+KNSWGE WGEKGY
Sbjct: 267 FYSDG----IYFEPECSSTNLDHGVLVVGYGTDE-----EGRDYWIVKNSWGESWGEKGY 317
Query: 328 FRLYRG-DGSCGI 339
++ R D CGI
Sbjct: 318 IKMARNIDNHCGI 330
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 126/323 (39%), Positives = 166/323 (51%), Gaps = 26/323 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLST 96
A ++ F H K Y + E Y RL I+ N KI + S V +NEF D+
Sbjct: 21 AEWSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDMLH 80
Query: 97 AEFQAKYLGFKLKPSYADRS-----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
EF + GFK R P + + LP+ DWR+ AVT VK+Q CGS W
Sbjct: 81 HEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQGQCGSCW 140
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
+FSTTG++EG + K KLVSLSEQ LIDC + ++GCEGG + AF I K G++
Sbjct: 141 SFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYI--KANKGID 198
Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--AL 266
E++YPY D C NK A G+V + DE + K + GP++VAI+A +
Sbjct: 199 TEQSYPYNATDGVCHFNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVAIDASHESF 258
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY GV + CD +E L H VL+VGYG T YW++KNSWG WG+ G
Sbjct: 259 QFYSEGVYDEPE--CD--SEQLDHGVLVVGYG------TKDGQDYWLVKNSWGTTWGDGG 308
Query: 327 YFRLYRG-DGSCGINDYVRSALV 348
Y + R D CGI LV
Sbjct: 309 YIYMSRNKDNQCGIASAASYPLV 331
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 184/352 (52%), Gaps = 39/352 (11%)
Query: 8 AGVALLSLTVSVSSFM---VVGDEKLHHL----HHVKHTALFNYFLEQHNKTYATLVEYY 60
A V L + VSS M ++ +K HH V+ + L+ ++ +H K +L E
Sbjct: 1 ATVILFLAMIVVSSAMDMSIISYDKNHHTVSSRSDVEVSRLYEEWVVKHGKAQNSLTEKD 60
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
R IF NLR I D +G + GL +F+DL+ E+++ YLG +LK S+
Sbjct: 61 RRFEIFKDNLRFI----DEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKTSL 116
Query: 118 --PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSE 175
A + + +P + DWR+ AV VKDQ CGS WAFST G +EG+ T L+SLSE
Sbjct: 117 RYEARVGD-AIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSE 175
Query: 176 QELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVK 233
QEL+DCD ++GC GG + AF+ I+ GG++ E+ YPY+G D C + K A V
Sbjct: 176 QELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVT 233
Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHS 291
I+ Y V + + K + + P++VAI A Q Y +G+ I +L H
Sbjct: 234 IDSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGI------CGTDLDHG 287
Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
V+ VGYG + K YWI+KNSWG WGE GY R+ R G CGI
Sbjct: 288 VVAVGYGTENGK------DYWIVKNSWGTSWGESGYIRMERNIASSAGKCGI 333
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 135/323 (41%), Positives = 173/323 (53%), Gaps = 39/323 (12%)
Query: 40 ALFNYFLEQHNKTYATLV--------EYYSRLHIFSGNLRKIQLLQDTEHGSGVY-GLNE 90
ALF+ ++ QH K+YA E +R IF NLR I + E G + GLN
Sbjct: 55 ALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIH--GENEKNQGYFLGLNA 112
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAM----IPNITLPRAFDWREYDAVTGVKDQTM 146
F+DL+ EF+A+ G + S S + LP + DWRE AV GVKDQ
Sbjct: 113 FADLTNEEFRAQRHGGRFDRSRERTSHEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQGS 172
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLG 205
CGS WAFS IEGV T +LVSLSEQEL+DCD+ ED+GC GG + AF ++
Sbjct: 173 CGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKN-- 230
Query: 206 GGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINA 263
GGL+ E YPY+G C +K A V I+GY V DET + K V + P++VAI+A
Sbjct: 231 GGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLK-AVAHQPVSVAIDA 289
Query: 264 --YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
++QFY +G+ F +L H V VGYG + K YWIIKNSWG
Sbjct: 290 GGSSMQFYRSGI------FTGRCGTDLDHGVTNVGYGKEDGK------AYWIIKNSWGSN 337
Query: 322 WGEKGYFRLYRGD----GSCGIN 340
WGEKGY ++ R G CGIN
Sbjct: 338 WGEKGYVKMARNTGLAAGLCGIN 360
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 123/356 (34%), Positives = 178/356 (50%), Gaps = 41/356 (11%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
M F F A + ++S+S + +E + H++ ++ +H + YA + E
Sbjct: 6 MQIFLFVAIFSSFYFSISLSR--PLDNELIMQKRHIE-------WMTKHGRVYADVKEKS 56
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
+R +F N+ +I+ L + G +N+F+DL+ EF++ Y GFK S + +S
Sbjct: 57 NRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTK 116
Query: 120 M-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
+ + LP + DWR AVT +K+Q CG WAFS IEG K KL+S
Sbjct: 117 TTSFRYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176
Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKA 229
LSEQ+L+DCD D GCEGG + AF+ IM+ GGL E YPY+G+D C + N KA
Sbjct: 177 LSEQQLVDCDTNDFGCEGGLMDTAFEHIMAT--GGLTTESNYPYKGEDATCNSKKTNPKA 234
Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNEN 287
T I GY V ++ V + P++V I + QFY +GV F
Sbjct: 235 TS--ITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGV------FTGECTTY 286
Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
L H+V +GYG + YWIIKNSWG WGE GY R+ + G CG+
Sbjct: 287 LDHAVTAIGYGQ-----STNGSKYWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGL 337
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 166/315 (52%), Gaps = 33/315 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
L+ + +H K Y + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 39 LYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97
Query: 97 AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E++ YLG + KP +DR + A N LP + DWR AV +KDQ CGS WA
Sbjct: 98 EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FS +EG+ T L+SLSEQEL+DCD ++GC GG + AFD I++ GG++ E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213
Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
YPY+G D+ C +N+K A V I+ Y V+ + + V N P++VAI A A Q
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 273
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F L H V VGYG + K YWI++NSWG+ WGE GY
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321
Query: 329 RLYRG----DGSCGI 339
R+ R G CGI
Sbjct: 322 RMERNIKASSGKCGI 336
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 135/357 (37%), Positives = 184/357 (51%), Gaps = 35/357 (9%)
Query: 1 MSCFYFFAGVALLSLTVSVS-----SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYAT 55
M+ F +LS T+ ++ F +VG H K LF ++ +H+KTY +
Sbjct: 1 MALSTFSKATLILSATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRS 60
Query: 56 LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYA 113
+ E R IF NL+ I +T Y GLNEF+DLS EF++KYLG +++
Sbjct: 61 IEEKLHRFEIFLDNLKHID---ETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRK 117
Query: 114 DRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
S ++ LP + DWR AVT VK+Q CGS WAFST +EG+ T L S
Sbjct: 118 RSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 177
Query: 173 LSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKAT 230
LSEQELIDCD+ ++GC GG + AF IMS GL +E+ YPY ++ C R ++
Sbjct: 178 LSEQELIDCDRSFNNGCYGGLMDYAFQYIMSN--SGLRKEEDYPYLMEEGRCIREKEQFE 235
Query: 231 QVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNEN 287
V I+GY V + DE + K L P++VAI A + QFY G+ F
Sbjct: 236 VVTISGYEDVPANDEQSLLKALSHQ-PVSVAIEASSRNFQFYKGGI------FTGRCGTQ 288
Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
+ H V VGYG + + Y I+KNSWG WGE GY R+ R +G CGIN
Sbjct: 289 MDHGVTAVGYG------SSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGIN 339
>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
Length = 396
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 123/342 (35%), Positives = 182/342 (53%), Gaps = 28/342 (8%)
Query: 15 LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
+T+ ++S + EKL + FN ++ +KT L EY R IF NLR I+
Sbjct: 64 MTILMASIFRIRAEKLKFFGLQQQFKDFNAKFQREHKT---LEEYKMRFEIFQKNLRDIE 120
Query: 75 LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK----------LKPSYADRSVPAMIPNI 124
L + ++ S YG+N+FSD + +E + + K LK + R+ +I N+
Sbjct: 121 EL-NLKNPSVQYGINKFSDKTESELKNLLMDKKFLDSSLSNSTLKTLSSYRNPRNIIKNV 179
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
P DWR V VKDQ CGS WAF+T +E YA + L SLSEQEL+DCD
Sbjct: 180 QRPDYIDWRNDGKVMSVKDQGQCGSCWAFATVAAVESQYAIRKGTLWSLSEQELVDCDGA 239
Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRD 243
GC GG +++A I LG GLE E YPY C +N T+V I+ ++
Sbjct: 240 SYGCGGGFLTSALGFI---LGNGLETEDDYPYSATRHDQCWINGDKTRVWIDEGYQLTMS 296
Query: 244 ETDMAKYLVENGPMAVAIN-AYALQFYVTGVSHPIQFFCDGGNENLS-HSVLIVGYGVDR 301
E D+A+++ GP++ A++ + +Y G+ P + C +E+L H++ I+GYG +
Sbjct: 297 EDDVAEWVANVGPVSFAMSVPKSFPYYHDGIYSPSEHECK--DESLGYHAMAIIGYGQEG 354
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
+ YWI+KNSWG WG++GY RL RG +CG+NDYV
Sbjct: 355 GQ------NYWIVKNSWGGSWGDQGYMRLARGVNACGMNDYV 390
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 119/320 (37%), Positives = 163/320 (50%), Gaps = 26/320 (8%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
LH ++ Q+ + Y VE R IF N+ I+ + + G+N F+
Sbjct: 29 LHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGINAFT 88
Query: 93 DLSTAEFQAKYLGFKLKPSYAD---RSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCG 148
DL+ EF+A + G+ + S R+ N+T +P + DWR AVT +KDQ CG
Sbjct: 89 DLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENVTAVPPSLDWRTKGAVTHIKDQGQCG 148
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGG 206
WAFS +EG+ T L+SLSEQEL+DCD D GCEGG + +AF+ I+
Sbjct: 149 CCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIEN--N 206
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA-- 263
GL E YPY G D +C K A KI GY +V + + + V N P++VAI+A
Sbjct: 207 GLTTEANYPYEGVDGSCNTRKAANHAAKITGYENVPAYDEEALRKAVANQPVSVAIDAGE 266
Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
A Q Y +G+ F D G E L H V +VGYG + YW++KNSWG WG
Sbjct: 267 SAFQHYSSGI-----FTGDCGTE-LDHGVTVVGYGT-----SDDGTKYWLVKNSWGTSWG 315
Query: 324 EKGYFRLYRG----DGSCGI 339
E GY R+ R +G CGI
Sbjct: 316 EDGYIRMERDIDAKEGLCGI 335
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 125/340 (36%), Positives = 181/340 (53%), Gaps = 32/340 (9%)
Query: 15 LTVSVSSFMV-VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYS-RLHIFSGNLRK 72
L+ S S F DE L ++ +L++ + QH + + E ++ R IF N++
Sbjct: 20 LSASASDFTPGFTDEDLESEKSLR--SLYDNWALQHRSSRSLDSEEHAERFEIFKENVKY 77
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA---MIPNI-TLPR 128
I + + + GLN+F+DLS EF+A Y+G K+ DR V + M N LP
Sbjct: 78 IDSVNKKDSPYKL-GLNKFADLSNEEFKAIYMGTKMDLR-GDREVQSGSFMYQNSEPLPA 135
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
+ DWR+ AV VK+Q CGS WAFST ++EG+ T LVSLSEQ+L+DC E+ GC
Sbjct: 136 SIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTENSGC 195
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKATQVKINGYVSVSRDET 245
GG + AF I++ GG+ E YPY + C ++N + T+V I+G+ V +
Sbjct: 196 NGGLMDTAFQYIINN--GGIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNE 253
Query: 246 DMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
K V + P++VAI A QFY TGV F L H V+ VGYG
Sbjct: 254 QALKEAVAHQPVSVAIEASGQDFQFYSTGV------FTGKCGTALDHGVVAVGYGT---- 303
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ + + YWI++NSWG WGE+GY R+ +G +G CGI
Sbjct: 304 -SPEGINYWIVRNSWGPKWGEEGYIRMQQGIEAAEGKCGI 342
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 129/345 (37%), Positives = 178/345 (51%), Gaps = 31/345 (8%)
Query: 12 LLSLTVSVSSFMVVGDE-KLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
LL + +S++ +VV + H +L++ + H+ L E R ++F
Sbjct: 6 LLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVSRNLNEKQKRFNVFKS 65
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPN 123
N+ + + + LN+F+D++ EF+ Y G K+ R P M N
Sbjct: 66 NVMHVHNTNKMDKPYKL-KLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFMYEN 124
Query: 124 IT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
T P + DWR+ AVT VKDQ CGS WAFST +EG+ KT +LV LSEQELIDCD
Sbjct: 125 FTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCD 184
Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSV 240
QE+ GC GG + AF+ I K GG+ E YPY +D +C K+ V I+G+ +V
Sbjct: 185 NQENQGCNGGLMEYAFEYIKQK--GGITTESYYPYTANDGSCDATKENVPAVSIDGHETV 242
Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
++ D V N P++VAI+A QFY GV F D G E L+H V IVGYG
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCGKE-LNHGVAIVGYG 296
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
T YWI++NSWG WGE+GY R+ R +G CGI
Sbjct: 297 T-----TVDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGI 336
>gi|340375899|ref|XP_003386471.1| PREDICTED: probable cysteine proteinase A494-like [Amphimedon
queenslandica]
Length = 373
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 178/352 (50%), Gaps = 57/352 (16%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQ 100
F + + H+K+Y T+ E R ++ N +Q L + V + LN F+DLS EF+
Sbjct: 33 FTDWCKLHSKSYRTITEAKERESVYKSNADLVQQLNNEYRERNVTFSLNHFADLSIEEFK 92
Query: 101 AKYLGFKLKPS------YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
L KP Y S+P PN FDWR+ VT VK+Q G+ WAFS
Sbjct: 93 KLVLMSPQKPQPLPKQRYHSFSLPQDPPN-----TFDWRDKHVVTSVKNQGSAGTCWAFS 147
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE--------DDGCEGGSISNAFDTIMSKLGG 206
T GN+EG +A L SLS ++L+DCD D G GG A++ I ++ G
Sbjct: 148 TVGNVEGQWALGGHNLTSLSTEQLVDCDDTYDHNNLHMDCGVFGGWPYLAYEYIKNE--G 205
Query: 207 GLEEEKTYPYRGDDKAC----------------------------RLNK-KATQ-VKING 236
G+E E+ YPY C +L+K K Q + I
Sbjct: 206 GIEREEDYPYCSGQGTCFPCVPSGWNKTRCGPPPLYCNDTFSCTHKLDKSKFVQGLSIKS 265
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
++++ +DE +M L++ GP++V INA LQFY +GV PI C+ + L H+VL+VG
Sbjct: 266 WIAIQKDEVEMQAALIKQGPLSVLINALLLQFYRSGVWDPI-LKCNP--QELDHAVLLVG 322
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
YG ++ K PYW+IKNSWG WG GYF++ RG G CG++ V SA++
Sbjct: 323 YGTEKGLLEDK--PYWLIKNSWGIKWGMDGYFKMIRGKGKCGVDQQVTSAVL 372
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 122/306 (39%), Positives = 156/306 (50%), Gaps = 30/306 (9%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK- 107
H+ L + R ++F N++ I + + LN+F D++ EF+AKY G K
Sbjct: 44 HHAVSRDLDQKQKRFNVFKENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKV 103
Query: 108 -----LKPSYADRSVPA--MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
+K S A M N P + DWRE AV VK+Q CGS WAFS +E
Sbjct: 104 HHHRTMKGSRHGSGSGAKFMYENAVAPPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVE 163
Query: 161 GVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
G+ TK+LV LSEQELIDCD ++ GC GG + AF+ I K GG+ E YPY+ +
Sbjct: 164 GINQIVTKELVPLSEQELIDCDTDQNQGCSGGLMDYAFEFI--KNNGGITTEDVYPYQAE 221
Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
D C+ N A V I+GY V ++ D V N P+AVAI A Y QFY GV
Sbjct: 222 DATCKKNSPA--VVIDGYEDVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGV---- 275
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F G E L H V +VGYG T YW ++NSWG WGE GY R+ RG
Sbjct: 276 -FTGRCGTE-LDHGVAVVGYGT-----TQDGTKYWTVRNSWGADWGESGYVRMQRGIKAT 328
Query: 334 DGSCGI 339
G CGI
Sbjct: 329 HGLCGI 334
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 140/350 (40%), Positives = 182/350 (52%), Gaps = 37/350 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
F V++L+ + F ++G E L +H V H LF +L +H+K Y +L E R
Sbjct: 13 LFLFVSILACSALAHEFSILGYAPEDLTSIHKVIH--LFESWLVKHSKFYESLDEKLHRF 70
Query: 64 HIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAM 120
IF NL+ I +T Y GLNEF+DL+ EF+ K+LGFK + D S
Sbjct: 71 EIFMDNLKHID---ETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDESSKEF 127
Query: 121 --IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
+ LP++ DWR+ AV VK+Q CGS WAFST +EG+ T L LSEQEL
Sbjct: 128 GYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQEL 187
Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKING 236
IDCD ++GC GG + AF +M GL +E+ YPY + C K ++ V I+G
Sbjct: 188 IDCDTTFNNGCNGGLMDYAFAYVMR---SGLHKEEEYPYIMSEGTCDEKKDVSEKVTISG 244
Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVL 293
Y V R DE K L N P++VAI A QFY GV F G E L H V
Sbjct: 245 YHDVPRNDEASFLKALA-NQPISVAIEASGRDFQFYSGGV-----FDGHCGTE-LDHGVA 297
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
VGYG T K + Y I++NSWG WGEKGY R+ RG G CG+
Sbjct: 298 AVGYG------TTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGL 341
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 170/311 (54%), Gaps = 22/311 (7%)
Query: 50 NKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLG 105
+K Y E + R+ ++ NL+KI++ + EH G + G+N F D++ EF+ G
Sbjct: 1 SKKYHEKEEGWRRM-VWEKNLKKIEM-HNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNG 58
Query: 106 FKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
+K KP M PN + PRA DWR+ VT VKDQ CGS WAFSTTG +EG +
Sbjct: 59 YKRKPQRKFTGSLFMEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHF 118
Query: 165 AKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDK 221
KT KLVSLSEQ L+DC + + +GC GG + AF I K GL+ E +YPY G DD+
Sbjct: 119 RKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI--KDNQGLDSEDSYPYLGTDDQ 176
Query: 222 ACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
C + K G+V + S E + K + GP++VAI+A + QFY +G I
Sbjct: 177 PCHYDPKYNSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG----IY 232
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSC 337
+ D +E L H VL+VGYG + K YWI+KNSW E WG+KGY + + C
Sbjct: 233 YEKDCSSEELDHGVLVVGYGFEGEDVDGKK--YWIVKNSWSEKWGDKGYIYMAKDRKNHC 290
Query: 338 GINDYVRSALV 348
GI LV
Sbjct: 291 GIATAASYPLV 301
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 124/356 (34%), Positives = 179/356 (50%), Gaps = 41/356 (11%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
M F F A + ++++S + +E + H++ ++ +H + YA + E
Sbjct: 6 MQIFLFVAIFSSFCFSITLSR--PLDNELIMQKRHIE-------WMTKHGRVYADVKEEN 56
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
+R +F N+ +I+ L G +N+F+DL+ EF + Y GFK + + +S
Sbjct: 57 NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTK 116
Query: 120 MIP----NIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
M P N++ LP + DWR+ AVT +K+Q CG WAFS IEG K KL+S
Sbjct: 117 MSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176
Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKA 229
LSEQ+L+DCD D GCEGG + AF+ I K GGL E YPY+G+D C + N KA
Sbjct: 177 LSEQQLVDCDTNDFGCEGGLMDTAFEHI--KATGGLTTESDYPYKGEDATCNSKKTNPKA 234
Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNEN 287
T I GY V ++ V + P++V I + QFY +GV F
Sbjct: 235 TS--ITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGV------FTGECTTY 286
Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
L H+V +GYG + YWIIKNSWG WGE GY R+ + G CG+
Sbjct: 287 LDHAVTAIGYGE-----STNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGL 337
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 114/299 (38%), Positives = 162/299 (54%), Gaps = 19/299 (6%)
Query: 48 QHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG-VYGLNEFSDLSTAEFQAKYLGF 106
+HNK Y+ +E +R I+ GN + I++ G G+N+F DL + EF + G+
Sbjct: 28 EHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFAEMFNGY 87
Query: 107 KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
++ V PN DWR AVTGVK+Q CGS WAFSTTG++EG + K
Sbjct: 88 MMQARSNSTKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWAFSTTGSLEGQHFLK 147
Query: 167 TKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
T KLVSLSEQ L+DC + ++GC GG + AF+ I K GG++ E +YPY+ D+ CR
Sbjct: 148 TGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYI--KKNGGIDTEASYPYQAHDERCR 205
Query: 225 LNKKATQVKINGYVSVSRDETDMAKYLVEN-GPMAVAINA--YALQFYVTGVSHPIQFFC 281
GYV + R++ + VE GP++VAI+A + Q Y +GV + + C
Sbjct: 206 FKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQLYRSGVYYERE--C 263
Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
L H VL +GYG T YW++KNSWG WG +GY + R + +CGI
Sbjct: 264 S--QTALDHGVLAIGYG------TEGGSDYWLVKNSWGTDWGMEGYIMMSRNRNNNCGI 314
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 173/331 (52%), Gaps = 32/331 (9%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
+ H + + QH K Y T E YSR IF N KI EH S
Sbjct: 18 LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72
Query: 88 LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+N+F D+ EF + +G LK V N TLP++ DWR V+ VKDQ
Sbjct: 73 MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++ + GC GG + AF I
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190
Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
K GGL+ E++YPY DDK C+ + + + GY V S +E + + + GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250
Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A + QFY +GV Q C E L H VL VGYG +H+A +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303
Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
G WG++GY + R + CGI LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 168/317 (52%), Gaps = 27/317 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
F H KTY + +E R IF+ N I + ++ G+ G+N+F DL EF
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTEN-SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFA 88
Query: 101 AKYLGFKLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ G S PA + + +LP+ DWR+ AVT VKDQ CGS WAFS TG+
Sbjct: 89 RIFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGS 148
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG + K +LVSLSEQ L+DC Q ++GCEGG + +AF I K G++ EK+YPY
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPY 206
Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
D CR K+ GYV + + E D+ K + GP++VAI+A + Q Y GV
Sbjct: 207 EAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGV 266
Query: 274 -SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
P + +E+L H VL+VGYGV K YW++KNSW E WG++GY + R
Sbjct: 267 YDEP-----ECSSEDLDHGVLVVGYGVKGGK------KYWLVKNSWAESWGDQGYILMSR 315
Query: 333 -GDGSCGINDYVRSALV 348
+ CGI LV
Sbjct: 316 DNNNQCGIASQASYPLV 332
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 113/307 (36%), Positives = 166/307 (54%), Gaps = 21/307 (6%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H K+Y+ + E +R+ I+ NL KI+ +H S +N DL+ EF+ YLG +
Sbjct: 34 HGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDH-SYKMAMNHLGDLTEDEFRYFYLGVRA 92
Query: 109 KPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
+ R +P N+ +P + DW + VTGVK+Q CGS WAFSTTG++EG + K
Sbjct: 93 HHNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRK 152
Query: 167 TKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
T LVSLSEQ LIDC ++GC+GG + NAF I S GG++ E +YPY G +C
Sbjct: 153 TGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESN--GGIDTESSYPYLGQQGSCH 210
Query: 225 LNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFYVTGV-SHPIQFFCD 282
+ ++ GY + + E + + GP++VA++A QFY +GV +P +C
Sbjct: 211 FSSSHVGARVTGYQDIPQGSEQALQSAVATVGPVSVAVDASQWQFYSSGVYDNP---YCS 267
Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIND 341
+ L H VL++GYG + YW++KNSWG WG +GY + R + CGI
Sbjct: 268 --STQLDHGVLVIGYG------NYNGQDYWLVKNSWGYSWGVEGYIMMSRNKNNQCGIAS 319
Query: 342 YVRSALV 348
LV
Sbjct: 320 SASYPLV 326
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 166/318 (52%), Gaps = 28/318 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
F QHNK Y++ VE R IF+ N + + ++ G+ +N+F DL EF
Sbjct: 30 FKSQHNKAYSSHVEELLRFKIFTENTLLV-AKHNAKYAKGLVSYKLAMNKFGDLLPHEFA 88
Query: 101 AKYLGFKLKPSYADRSV---PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G++ K + R PA + + +LP DWR+ AVT VK+Q CGS WAFSTTG
Sbjct: 89 KMVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTG 148
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
++EG + KT KLVSLSEQ L+DC + + GC GG + N F I K GG++ E+++P
Sbjct: 149 SLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYI--KANGGIDTEESHP 206
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTG 272
Y D C+ K G+V + + E D+ K + GP++VAI+A + Q Y G
Sbjct: 207 YTAQDGDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASHGSFQLYSQG 266
Query: 273 V-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
V P D + L H VL VGYGV K YW++KNSWG WG+ GY +
Sbjct: 267 VYDEP-----DCSSSQLDHGVLTVGYGVKNGK------KYWLVKNSWGGDWGDNGYILMS 315
Query: 332 RG-DGSCGINDYVRSALV 348
R D CGI LV
Sbjct: 316 RDKDNQCGIASSASYPLV 333
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 129/351 (36%), Positives = 182/351 (51%), Gaps = 37/351 (10%)
Query: 8 AGVALLSLTVSVSSFM---VVGDEKLHHLHHVKHTA----LFNYFLEQHNKTYATLVEYY 60
A V L + VSS M ++ +K HH + A L+ +L +H K +L E
Sbjct: 1 ATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKD 60
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
R IF NLR I D +G + GL +F+DL+ E+++ YLG +LK S+
Sbjct: 61 RRFEIFKDNLRFI----DEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKSSL 116
Query: 118 PAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ +P + DWR+ AV VKDQ CGS WAFST G +EG+ T L++LSEQ
Sbjct: 117 RYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQ 176
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
EL+DCD ++GC GG + AF+ I++ GG++ E+ YPY+G D C + K A V I
Sbjct: 177 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 234
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+ Y V + + K + + P++VAI A Q Y +G+ I +L H V
Sbjct: 235 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGI------CGTDLDHGV 288
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ VGYG + K YWI+KNSWG WGE GY R+ R G CGI
Sbjct: 289 VAVGYGTENGK------DYWIVKNSWGTSWGESGYIRMERNIASSAGKCGI 333
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 138/353 (39%), Positives = 183/353 (51%), Gaps = 39/353 (11%)
Query: 4 FYFFAGVALLSLTVS-VSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSR 62
F FF V+L L S + +VG + K LF ++ + + Y + E R
Sbjct: 8 FLFFLAVSLSFLAYSGFARDSIVGYAPEDLTSNDKLIDLFESWISRFGRVYESAEEKLER 67
Query: 63 LHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
IF NL I DT Y GLNEF+DLS EF+ KYLG LKP + R A
Sbjct: 68 FEIFKDNLFHID---DTNKKVRNYWLGLNEFADLSHEEFKNKYLG--LKPDLSKR---AQ 119
Query: 121 IP------NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
P ++ +P++ DWR+ AVT VK+Q CGS WAFST +EG+ T L SLS
Sbjct: 120 CPEEFTYKDVAIPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 179
Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-V 232
EQELIDCD ++GC GG + AF I++ GGL +E+ YPY ++ C + K+ + V
Sbjct: 180 EQELIDCDTTYNNGCNGGLMDYAFAYIVAN--GGLHKEEDYPYIMEEGTCDMRKEESDAV 237
Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSH 290
I+GY V ++ + + N P+++AI A QFY GV F G E L H
Sbjct: 238 TISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSGGV-----FDGHCGTE-LDH 291
Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
V VGYG T K + Y I+KNSWG WGEKGY R+ R +G CGI
Sbjct: 292 GVAAVGYG------TSKGLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEGICGI 338
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 129/351 (36%), Positives = 182/351 (51%), Gaps = 37/351 (10%)
Query: 8 AGVALLSLTVSVSSFM---VVGDEKLHHLHHVKHTA----LFNYFLEQHNKTYATLVEYY 60
A V L + VSS M ++ +K HH + A L+ +L +H K +L E
Sbjct: 7 ATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKD 66
Query: 61 SRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
R IF NLR I D +G + GL +F+DL+ E+++ YLG +LK S+
Sbjct: 67 RRFEIFKDNLRFI----DEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKSSL 122
Query: 118 PAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ +P + DWR+ AV VKDQ CGS WAFST G +EG+ T L++LSEQ
Sbjct: 123 RYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQ 182
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
EL+DCD ++GC GG + AF+ I++ GG++ E+ YPY+G D C + K A V I
Sbjct: 183 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 240
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+ Y V + + K + + P++VAI A Q Y +G+ I +L H V
Sbjct: 241 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGI------CGTDLDHGV 294
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ VGYG + K YWI+KNSWG WGE GY R+ R G CGI
Sbjct: 295 VAVGYGTENGK------DYWIVKNSWGTSWGESGYIRMERNIASSAGKCGI 339
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 119/315 (37%), Positives = 162/315 (51%), Gaps = 29/315 (9%)
Query: 35 HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
H KH F + Y+ E R IF N+++I+ S G+N+F+DL
Sbjct: 36 HEKHEEWMTRF----KRVYSDAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADL 91
Query: 95 STAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
+ EF+ FK + ++ P NIT +P + DWR+ AVT +KDQ CGS WAF
Sbjct: 92 TNEEFKTSRNRFKGHMC-SSQAGPFRYENITAVPSSMDWRKEGAVTAIKDQGQCGSCWAF 150
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEE 211
S +EG+ T KL+SLSEQEL+DCD ED GC+GG + +AF I + GL E
Sbjct: 151 SAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFI--EQNQGLTTE 208
Query: 212 KTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
YPY G D C ++A KING+ V + V P++VAI+A + QF
Sbjct: 209 ANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFEFQF 268
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F D G E L H V VGYG + YW++KNSWG WGE+GY
Sbjct: 269 YSSGI-----FTGDCGTE-LDHGVAAVGYG------ESNGMNYWLVKNSWGTQWGEEGYI 316
Query: 329 RLYRG----DGSCGI 339
R+ + +G CGI
Sbjct: 317 RMQKDIDAKEGLCGI 331
>gi|343472975|emb|CCD15017.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 293
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 110/271 (40%), Positives = 156/271 (57%), Gaps = 21/271 (7%)
Query: 85 VYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTG 140
+G+ FSD+S EF+A Y G + + R P + N++ P DWR+ AVT
Sbjct: 15 TFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGRPPMTVDWRKKGAVTP 72
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
VKD+ C S WAFS GNIEG + +L SLS Q L+ CD+++ GCEGG + AF I
Sbjct: 73 VKDEGKCDSFWAFSAIGNIEGQWKIAGHELTSLSGQMLVSCDKKNYGCEGGLMDRAFQWI 132
Query: 201 MSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPM 257
+S G + E++YPY GD AC ++ K KI+ YV + +DE +A++L +NGP+
Sbjct: 133 VSSNKGNVFTEQSYPYDSSWGDVPACNMSGKVVGAKISSYVDLPQDENAIAEWLAKNGPV 192
Query: 258 AVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNS 317
A+A++A + + Y GV + L H VL+VGY D +K PYWIIKNS
Sbjct: 193 AIAVDATSFRSYTGGV------LTSCISRRLDHGVLLVGYD-DTSK-----PPYWIIKNS 240
Query: 318 WGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WG+GWGE GY R+ +G C + +Y SA+V
Sbjct: 241 WGKGWGEWGYIRIEKGTNQCLVQEYASSAVV 271
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 159/307 (51%), Gaps = 29/307 (9%)
Query: 47 EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
++H+ E + R F N+R I + + G LN F D+ EF+A + G
Sbjct: 50 QEHHHVPRHHGEKHRRFGAFKDNVRYIH--EHNKRAPGYAPLNRFGDMGREEFRATFAGS 107
Query: 107 KLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
D +P LPRA DWR AVTGVKDQ CGS WAFST ++E
Sbjct: 108 HANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCWAFSTVVSVE 167
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
G+ A +T +LVSLSEQELIDCD D+ GC+GG + NAF+ I K GG+ E YPYR
Sbjct: 168 GINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYRAA 225
Query: 220 DKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
+ C + + V I+G+ +V + V N P++VAI+A + QFY GV
Sbjct: 226 NGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV--- 282
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
F D G + L H V +VGYG T+ YWI+KNSWG WGE GY R+ R G
Sbjct: 283 --FAGDCGTD-LDHGVAVVGYGE-----TNDGTEYWIVKNSWGTAWGEGGYIRMQRDSGY 334
Query: 337 ----CGI 339
CGI
Sbjct: 335 DGGLCGI 341
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 121/306 (39%), Positives = 162/306 (52%), Gaps = 20/306 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEF 99
+N F H K+Y E R IF NL I+ + G+NEF+D++ EF
Sbjct: 28 WNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEF 87
Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
LG + A SV LP DW + VT VK+Q CGS WAFSTTG++
Sbjct: 88 SNMLLGLGGRNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSL 147
Query: 160 EGVYAAKTKKLVSLSEQELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
EG KT KLVSLSEQ L+DC + + GC GG + AF I K GG++ E YPY
Sbjct: 148 EGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYI--KKNGGIDTEAAYPYT 205
Query: 218 GDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVS 274
G D CR + ++G+V V S DE + + + GP++VAI+A ++ QFY GV
Sbjct: 206 GSDGTCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVY 265
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
+P +FC + L H VL+VGYG + K YW++KNSWG WG KGY ++ R
Sbjct: 266 NP--WFCS--STELDHGVLVVGYGTEGGK------DYWLVKNSWGSSWGLKGYIKMVRNK 315
Query: 335 GS-CGI 339
+ CGI
Sbjct: 316 KNRCGI 321
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 173/331 (52%), Gaps = 32/331 (9%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
+ H + + QH K Y T E YSR IF N KI EH S
Sbjct: 18 LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72
Query: 88 LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+N+F D+ EF + +G LK V N TLP++ DWR V+ VKDQ
Sbjct: 73 MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++ + GC GG + AF I
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190
Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
K GGL+ E++YPY DDK C+ + + + GY V S +E + + + GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250
Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A + QFY +GV Q C E L H VL VGYG +H+A +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303
Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
G WG++GY + R + CGI LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 135/352 (38%), Positives = 180/352 (51%), Gaps = 33/352 (9%)
Query: 4 FYFFAGVALLSLTVSV--SSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEY 59
FYFF + + V+ F +VG E L + + LF ++ H K Y T+ E
Sbjct: 8 FYFFLAMCMSFFVVTSFGKDFSIVGYWPEDLTSMDRL--IELFEEWISNHGKIYETIEEK 65
Query: 60 YSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
+ R +F NL+ I + + S G+NEF+DL+ EF+ YLG K++ S +S
Sbjct: 66 WHRFEVFKDNLKHIDE-TNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRTRQSPEE 124
Query: 120 MIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
+ LP++ DWR+ AVT VK+Q CGS WAFST +EG+ L SLSEQE
Sbjct: 125 FTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLSEQE 184
Query: 178 LIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKIN 235
LIDCD+ ++GC GG + AF I+S GGL +E+ YPY + C K + V I+
Sbjct: 185 LIDCDRPYNNGCHGGLMDYAFSFIVSS--GGLHKEEDYPYLEVESTCDNKKGELEVVTIS 242
Query: 236 GYVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSV 292
GY V +E + K L P++VAI A QFY GV F L H V
Sbjct: 243 GYKDVPENNEASLIKALAHQ-PLSVAIEASGRDFQFYSGGV------FDGPCGTQLDHGV 295
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGIN 340
VGYG + K V Y I+KNSWG WGEKGY R+ R G CGIN
Sbjct: 296 TAVGYG------SSKGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCGIN 341
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 127/356 (35%), Positives = 182/356 (51%), Gaps = 38/356 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F G + L+ ++S ++ DE ++ F H K Y + +E R+ I
Sbjct: 4 FLLGAVFVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKFRMKI 50
Query: 66 FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
+ N K+ +L + S +N+F DL EF++ G++ K S A+ +
Sbjct: 51 YLENKHKVAKHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 110
Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
M P N+ +P + DWRE A+T VKDQ CG WAFS+TG +EG KT KLVSL EQ L
Sbjct: 111 MEPANVEVPESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNL 170
Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
IDC + ++GC GG + AF I K G++ E TYPY +D CR N + G
Sbjct: 171 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG 228
Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
+V + E D K V GP++VAI+A + QFY GV + CD +++L H VL
Sbjct: 229 FVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPS--CD--SDDLDHGVL 284
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
+VGYG D K YW++KNSW E WG++GY ++ R CG+ LV
Sbjct: 285 VVGYGSDNGK------DYWLVKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPLV 334
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 191/359 (53%), Gaps = 43/359 (11%)
Query: 12 LLSLTVSVSSFMVVG---DEKLHHLH-HVKHTALFNY--------FLEQHNKTYATLVEY 59
++ +T+ + S ++G E++ + H ++ L N+ F +H K+Y T E
Sbjct: 1 MIRITLLLHSIFLLGFVNSEQISQIQEHPRNNLLINHPYYPVWTNFKLKHAKSYKTKDEE 60
Query: 60 YSRLHIFSGNLRKIQLLQDTEHGSGVYG----LNEFSDLSTAEFQAKYLGFKL------- 108
R +F+ N + I+ + E+ +G + LN+F+D++ AEF+ + GFKL
Sbjct: 61 LLRFQVFASNHKVIEQ-HNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPAKRKLA 119
Query: 109 --KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
+P D + M N+T+P + DWR+ VT VKDQ CGS WAFS TG++EG + +
Sbjct: 120 KSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQHYKQ 179
Query: 167 TKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
T KLVSLSEQ L+DCD +D+GC GG + AF + + G++ E +YPY+G D CR
Sbjct: 180 TGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYV--ETNKGIDTEASYPYKGRDGRCR 237
Query: 225 LNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFC 281
+ G+V + +ET + + GP++VAI+A + QFY SH + +
Sbjct: 238 FKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQFY----SHGVYYDR 293
Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL-YRGDGSCGI 339
E L H VL VGY T Y+I+KNSW E WG+ GY + R + +CGI
Sbjct: 294 SCSPEYLDHGVLAVGYNS-----TKDGKQYYIVKNSWSEDWGDDGYILMSRRKNNNCGI 347
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 160/310 (51%), Gaps = 25/310 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F + K+YAT E R IF NL I + + S +N F DLS EF+
Sbjct: 117 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHT-HNQQGYSYSLKMNHFGDLSRDEFRR 175
Query: 102 KYLGFKLKPSYADR--SVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
KYLGFK + V + N+ LP DWR VT VKDQ CGS WAFSTT
Sbjct: 176 KYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTT 235
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG + AKT KLVSLSEQEL+DC + + C GG +++AF ++ GG+ E Y
Sbjct: 236 GALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLD--SGGICSEDAY 293
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
PY D+ CR VKI G+ V R K + P+++AI A QFY G
Sbjct: 294 PYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEG 353
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG--YFRL 330
V F +L H VL+VGYG D+ +WI+KNSWG GWG G Y +
Sbjct: 354 V------FDASCGTDLDHGVLLVGYGTDK----ESKKDFWIMKNSWGTGWGRDGYMYMAM 403
Query: 331 YRG-DGSCGI 339
++G +G CG+
Sbjct: 404 HKGEEGQCGL 413
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 159/307 (51%), Gaps = 29/307 (9%)
Query: 47 EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
++H+ E + R F N+R I + + G LN F D+ EF+A + G
Sbjct: 50 QEHHHVPRHHGEKHRRFGAFKDNVRYIH--EHNKRAPGYPPLNRFGDMGREEFRATFAGS 107
Query: 107 KLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
D +P LPRA DWR AVTGVKDQ CGS WAFST ++E
Sbjct: 108 HANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCWAFSTVVSVE 167
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
G+ A +T +LVSLSEQELIDCD D+ GC+GG + NAF+ I K GG+ E YPYR
Sbjct: 168 GINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYRAA 225
Query: 220 DKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
+ C + + V I+G+ +V + V N P++VAI+A + QFY GV
Sbjct: 226 NGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV--- 282
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
F D G + L H V +VGYG T+ YWI+KNSWG WGE GY R+ R G
Sbjct: 283 --FAGDCGTD-LDHGVAVVGYGE-----TNDGTEYWIVKNSWGTAWGEGGYIRMQRDSGY 334
Query: 337 ----CGI 339
CGI
Sbjct: 335 DGGLCGI 341
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 127/356 (35%), Positives = 184/356 (51%), Gaps = 38/356 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F L+ L+ ++S ++ DE ++ F H K Y + +E R+ I
Sbjct: 8 FLLAAVLVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKLRMKI 54
Query: 66 FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
+ N K+ +L + S +N+F DL EF++ G++ K S A+ +
Sbjct: 55 YLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 114
Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
M P N+ +P + DWRE A+T VKDQ CGS WAFS+TG +EG KT KLVSLSEQ L
Sbjct: 115 MEPANVEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNL 174
Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
IDC + ++GC GG + AF I K G++ E TYPY +D CR N + G
Sbjct: 175 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDGVCRYNPRNRGAVDRG 232
Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
+V + E D K V GP++VAI+A + QFY G + + CD +++L H VL
Sbjct: 233 FVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYY--EPSCD--SDDLDHGVL 288
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
+VGYG D + YW++KNSW E WG++GY ++ R CG+ LV
Sbjct: 289 VVGYGSDNGE------DYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPLV 338
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 172/312 (55%), Gaps = 22/312 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
H+K Y E + RL ++ NLRKI+L + EH G + G+N F D++ EF+
Sbjct: 35 HSKNYHEKEEGWRRL-VWEKNLRKIEL-HNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMN 92
Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
G+K + M PN + PRA DWR+ VT VKDQ CGS WAFSTTG +EG
Sbjct: 93 GYKRREQRKYSGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQ 152
Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
KT KLVSLSEQ L+DC + + +GC GG + AF + K GL+ E YPY+G DD
Sbjct: 153 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV--KDNQGLDSEDFYPYKGTDD 210
Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
+ C+ N + + V G+V + S E + K + GP++VAI+A + QFY +G I
Sbjct: 211 QPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHESFQFYQSG----I 266
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
F + ++ L H VL+VGYG + K YWI+KNSW E WG+KG+ + +
Sbjct: 267 YFEKECSSDELDHGVLVVGYGFEGEDVDGKK--YWIVKNSWSEKWGDKGFIYMAKDRHNH 324
Query: 337 CGINDYVRSALV 348
CGI LV
Sbjct: 325 CGIATAASYPLV 336
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 165/314 (52%), Gaps = 28/314 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
++ +L +H K Y L E R +F NL IQ + ++ + GLN+F+D++ E+
Sbjct: 38 TMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEY 97
Query: 100 QAKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
+ Y G K L + + A LP DWR AV +KDQ CGS WAF
Sbjct: 98 RVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAF 157
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
ST +E + T K VSLSEQEL+DCD+ ++GC GG + AF+ I+ GG++ +K
Sbjct: 158 STVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQN--GGIDTDK 215
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFY 269
YPYRG D C KK A V I+G+ V + + K V + P+++AI A LQ Y
Sbjct: 216 DYPYRGFDGICDPTKKNAKVVNIDGFEDVPPYDENALKKAVAHQPVSIAIEASGRDLQLY 275
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV F +L H V++VGYG + V YW+++NSWG GWGE GYF+
Sbjct: 276 QSGV------FTGKCGTSLDHGVVVVGYG------SENGVDYWLVRNSWGTGWGEDGYFK 323
Query: 330 LYRG----DGSCGI 339
+ R G CGI
Sbjct: 324 MQRNVRTPTGKCGI 337
>gi|5881566|dbj|BAA84280.1| Cysteine proteinase [Clonorchis sinensis]
Length = 232
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 103/219 (47%), Positives = 130/219 (59%), Gaps = 12/219 (5%)
Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCE 189
FDWRE+ AV V DQ CGS WAFS GN+ G + KT L++LSEQ+L+DCD DDGC+
Sbjct: 25 FDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCD 84
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
GG + I GGLE YPY G C ++K ING + E A+
Sbjct: 85 GGYPPQTYTAIQKM--GGLELASDYPYTGVGGICHMDKSKFVAYINGSTILPLSEKVQAQ 142
Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
L GP++ A+NA LQ Y G+ P +CD N H+VL VGYGV K
Sbjct: 143 KLRAIGPLSSALNADTLQLYKGGIMRPK--WCDPAGVN--HAVLTVGYGVQNGK------ 192
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWI+KNSWGE +GE+GYFR+YRGDG+CGIN V +A++
Sbjct: 193 PYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAII 231
>gi|394331828|gb|AFN27133.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 116/318 (36%), Positives = 172/318 (54%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ AVT VKDQ CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+IE +A +L +LS+ L+ C +D+G G + AF+ ++ + G + E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSDHHLVSCHDKDNGRPAGLMLQAFEWLLRNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I+GYV++ ET MA +L +NGP+++A++A + Y
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYVTIESSETVMAAWLAKNGPISIALDASSFMSYQ 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV C G L+H VL+VGY +RT VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGVV----TSCAG--MPLNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 169/318 (53%), Gaps = 28/318 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEF- 99
F H KTY + VE R IF+ N I + ++ G+ G+N+F+DL EF
Sbjct: 30 FKSTHKKTYKSNVEELLRFKIFTENSLFIAK-HNVKYAKGLVSYKLGINQFADLLPHEFV 88
Query: 100 --QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
Y G +L + PA + + +LP+ DWR+ AVT VKDQ CGS WAFS+TG
Sbjct: 89 KMMNGYQGKRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTG 148
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
++EG + KT KLVSLSEQ L+DC + GC GG + N+F+ I K GG++ E +YP
Sbjct: 149 SLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYI--KANGGIDTEDSYP 206
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y +D CR K+ G+V + E D+ K + GP++VAI+A + Q Y G
Sbjct: 207 YEAEDGDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSEG 266
Query: 273 V-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
V P + +E+L H VL VGYGV K YW++KNSW E WG+ GY +
Sbjct: 267 VYDEP-----NCSSESLDHGVLAVGYGVKNGK------KYWLVKNSWAETWGQDGYILMS 315
Query: 332 RG-DGSCGINDYVRSALV 348
R + CGI LV
Sbjct: 316 RDKNNQCGIASSASYPLV 333
>gi|237651947|gb|ACR08662.1| cathepsin F, partial [Drosophila silvestris]
Length = 186
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 125/189 (66%), Gaps = 5/189 (2%)
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G+YA +T +L SEQEL+DCD D C GG + NA+ I K GGLE E YPY
Sbjct: 1 GLYAIRTGELQEFSEQELLDCDSTDSACNGGLMDNAYKAI--KDIGGLEYESEYPYAAKK 58
Query: 221 KACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
C N+ + V+I+G+V + + +ET M ++L+ NGP+++ +NA A+QFY GVSHP
Sbjct: 59 MQCHFNRTLSHVQISGFVDLPKGNETAMQEWLLSNGPISIGLNANAMQFYRGGVSHPWAP 118
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
C +NL H VLIVGYGV HK +PYWI+KNSWG+ WGE+GY+R+YRGD +CG+
Sbjct: 119 LCS--KKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGQRWGEQGYYRIYRGDNTCGV 176
Query: 340 NDYVRSALV 348
++ SA++
Sbjct: 177 SEMATSAVL 185
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 168/315 (53%), Gaps = 27/315 (8%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
K A F ++ +H K Y ++ E R +F NL I ++ E S GLNEF+DLS
Sbjct: 399 KLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDE-RNKEVSSYWLGLNEFADLSH 457
Query: 97 AEFQAKYLGFKLK-PSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
EF++KYLG + + P D S ++ LP + DWR+ AVT VK+Q CGS WAFS
Sbjct: 458 EEFKSKYLGLRAEFPRSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWAFS 517
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +EG+ T L +LSEQELIDCD + GC GG + AF I S GGL +E
Sbjct: 518 TVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASN--GGLHKEDD 575
Query: 214 YPYRGDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFY 269
YPY ++ C K+ V I+GY V +DE + K L P++VAI A QFY
Sbjct: 576 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQ-PLSVAIEASGRDFQFY 634
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV F G E L H V VGYG + K + Y I+KNSWG WGEKGY R
Sbjct: 635 SGGV-----FNGPCGTE-LDHGVAAVGYG------SSKGLDYIIVKNSWGPKWGEKGYIR 682
Query: 330 LYRG----DGSCGIN 340
+ R +G CGIN
Sbjct: 683 MKRNTGKTEGLCGIN 697
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 159/304 (52%), Gaps = 30/304 (9%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAMIPNIT----------LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
G + SY P+ +P+ +P DWRE AVT VK+Q CG WAFS
Sbjct: 102 GLNIPNSYLS---PSPMPSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFS 158
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y
Sbjct: 159 AVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDY 216
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA-YALQFYVTGV 273
Y G CR K V+I+ Y V ET + + + + P+++ I A + LQFY G
Sbjct: 217 EYLGQQYTCRSQGKTAAVQISNYQVVPEGETSLLQAVTKQ-PVSIGIAASHDLQFYAGGT 275
Query: 274 SHPIQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
DG N ++H+V +GYG D K YW++KNSWG WGE G+ ++ R
Sbjct: 276 Y-------DGSCANRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 333 GDGS 336
G+
Sbjct: 324 DSGN 327
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 123/308 (39%), Positives = 162/308 (52%), Gaps = 21/308 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLR---KIQLLQDTEHGSGVYGLNEFSDLSTAE 98
+N + +H K Y + E SR I+ NL K L D H + G+N+F+DL E
Sbjct: 28 WNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEE 87
Query: 99 FQAKYLGFKLK-PSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
F A GF++ S A + + PN LP+ DWR VT VKDQ CGS WAFST
Sbjct: 88 FVAMMTGFRVSGTSKAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFST 147
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
TG++EG + T KLVSLSEQ L+DC D GC+GG + AF I+ GG++ E +YP
Sbjct: 148 TGSVEGQHFKATGKLVSLSEQNLVDCSGRDAGCDGGFMDRAFQYIID--AGGIDTEASYP 205
Query: 216 YRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y+ D C K + GY V S E + K + GP++VAI+A + Q Y +G
Sbjct: 206 YKAVDGKCHFKKANVGATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHYKSG 265
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V + + CD + L H VL VGYG + YWI+KNSW E WG GY + R
Sbjct: 266 VYN--EPGCD--STVLDHGVLAVGYGT-----SSDGTDYWIVKNSWAETWGMNGYVWMSR 316
Query: 333 G-DGSCGI 339
D CGI
Sbjct: 317 NKDNQCGI 324
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 159/306 (51%), Gaps = 24/306 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ Q+ K Y E +R IF+ N+ ++ + S G+N+F+DL+ EF A
Sbjct: 42 WMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLGINQFADLTNEEFVASRN 101
Query: 105 GFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
FK S R+ N++ +P DWR+ AVT VK+Q CG WAFS EG+
Sbjct: 102 KFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGI 161
Query: 163 YAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
+ T KL+SLSEQEL+DCD + D GCEGG + +AF I+ GL E YPY G D
Sbjct: 162 HKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH--GLSTEAQYPYEGVD 219
Query: 221 KACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
C NK + Q V I GY V + + V N P++VAI+A QFY +GV
Sbjct: 220 GTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGV---- 275
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F L H V VGYGV ++ YW++KNSWG WGE+GY + RG
Sbjct: 276 --FTGSCGTELDHGVTAVGYGV-----SNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAA 328
Query: 334 DGSCGI 339
+G CGI
Sbjct: 329 EGLCGI 334
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 159/306 (51%), Gaps = 23/306 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F ++ ++ + Y E R IF N++ I+ S G+N+F+D++ +EF A
Sbjct: 37 FEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVA 96
Query: 102 KYLGFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y G L P +R ++ + P++ DWR+Y AV VK+Q CGS W+F+
Sbjct: 97 QYTGVSL-PLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIAT 155
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG+Y KT LVSLSEQE++DC GC+GG ++ A+D I+S G+ E+ YPY
Sbjct: 156 VEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISN--NGVTTEENYPYLA 212
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
C N I GY V R++ Y V N P+A I+A Q+Y GV
Sbjct: 213 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGV---- 268
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F +L+H++ I+GYG D + YWI++NSWG WGE GY R+ RG
Sbjct: 269 --FSGPCGTSLNHAITIIGYGQDSS-----GTKYWIVRNSWGSSWGEGGYVRMARGVSSS 321
Query: 334 DGSCGI 339
G CGI
Sbjct: 322 SGVCGI 327
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 161/306 (52%), Gaps = 24/306 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ Q+ + Y E R IF N++ I+ S +NEF+D + EFQA
Sbjct: 60 WMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQASRN 119
Query: 105 GFKLK-PSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
G+K+ S ++ N+T +P + DWR+ AVT VKDQ CGS WAFST EG+
Sbjct: 120 GYKMAVSSRPSQTTLFRYENVTAVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGI 179
Query: 163 YAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
KT KL+SLSEQEL+DCD+ ED GCEGG + + F+ I+ G L E +YPY D
Sbjct: 180 TKLKTGKLISLSEQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIAL--EASYPYTAAD 237
Query: 221 KACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
C ++A++ KI+GY V + V N P++V+I+A A QFY +GV
Sbjct: 238 GTCNSKEEASRAAKISGYEKVPANSETALLKAVANQPVSVSIDASGVAFQFYSSGV---- 293
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F +L H V VGYG T YW++KNSWG WG+ GY + RG
Sbjct: 294 --FTGECGTDLDHGVTAVGYGK-----TSDGTKYWLVKNSWGASWGDSGYIMMQRGVAAK 346
Query: 334 DGSCGI 339
G CGI
Sbjct: 347 GGLCGI 352
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 181/348 (52%), Gaps = 34/348 (9%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKH---TALFNYFLEQHNKTYATLVEYYSRLHIF 66
+ L+ T+S +S M + H+H +AL+ +L +H K+Y L E R IF
Sbjct: 14 LMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNALGEKDKRFQIF 73
Query: 67 SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-------LKPSYADRSVPA 119
NLR I + S GL +F+DL+ E+++ YLG K L + +DR +P
Sbjct: 74 KDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSDRYLPK 133
Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
+ +LP + DWRE + GVKDQ CGS WAFS +E + A T L+SLSEQEL+
Sbjct: 134 V--GDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELV 191
Query: 180 DCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGY 237
DCD+ ++GC+GG + AF+ ++ GG++ E+ YPY+ + C + K A VKI+ Y
Sbjct: 192 DCDRSYNEGCDGGLMDYAFEFVIKN--GGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSY 249
Query: 238 VSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
V + + V + P+++A+ A Q Y +G+ F + H V+I
Sbjct: 250 EDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGI------FTGKCGTAVDHGVVIA 303
Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
GYG T + YWI++NSWG WGE GY R+ R G CG+
Sbjct: 304 GYG------TENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGL 345
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 167/312 (53%), Gaps = 22/312 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
H K Y E + R+ I+ NLRKIQ + EH G++ G+N F D++ EF+
Sbjct: 36 HGKNYHEKEEGWRRM-IWEKNLRKIQF-HNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMN 93
Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
G+K K + M PN + +P DWRE VT VKDQ CGS WAFSTTG +EG
Sbjct: 94 GYKHKTERKFKGSLFMEPNFLEVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQM 153
Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
K KLVSLSEQ L+DC + + +GC GG + AF I K GL+ E+ YPY G DD
Sbjct: 154 FRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI--KDNNGLDSEEAYPYLGTDD 211
Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
+ C + K G+V + S E + K + GP++VAI+A + QFY +G I
Sbjct: 212 QPCHYDPKYNAANDTGFVDIPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSG----I 267
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
F + +E L H VL+VGYG + K YWI+KNSW E WG+KGY + +
Sbjct: 268 YFEKECSSEELDHGVLVVGYGFEGEDVDGKK--YWIVKNSWSESWGDKGYIYMAKDRKNH 325
Query: 337 CGINDYVRSALV 348
CGI LV
Sbjct: 326 CGIATAASYPLV 337
>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
cysteine proteinase A-1; Flags: Precursor
gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 354
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 168/319 (52%), Gaps = 25/319 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
+A + F ++H K + E R + F N++ L +T++ Y ++ +F+DL+
Sbjct: 39 SAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97
Query: 98 EFQAKYL-----GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF YL LK D V P+ + + DWR+ AVT VK+Q +CGS WA
Sbjct: 98 EFAKLYLNPDYYARHLKDHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS GNIEG +AA LVSLSEQ L+ CD D+GC GG + A + IM G + E
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215
Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
+YPY G C ++ KI G++S+ DE +A+++ + GP+AVA++A Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLY 274
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV C +L+H VLIVG+ + PYWI+KNSWG WGEKGY R
Sbjct: 275 FGGVVS----LCLA--WSLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322
Query: 330 LYRGDGSCGINDYVRSALV 348
L G C + +Y SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341
>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
Length = 354
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 168/319 (52%), Gaps = 25/319 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
+A + F ++H K + E R + F N++ L +T++ Y ++ +F+DL+
Sbjct: 39 SAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97
Query: 98 EFQAKYL-----GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF YL LK D V P+ + + DWR+ AVT VK+Q +CGS WA
Sbjct: 98 EFAKLYLNPDYYARHLKNHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS GNIEG +AA LVSLSEQ L+ CD D+GC GG + A + IM G + E
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215
Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
+YPY G C ++ KI G++S+ DE +A+++ + GP+AVA++A Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLY 274
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV C +L+H VLIVG+ + PYWI+KNSWG WGEKGY R
Sbjct: 275 FGGVVS----LCLA--WSLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322
Query: 330 LYRGDGSCGINDYVRSALV 348
L G C + +Y SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 160/310 (51%), Gaps = 25/310 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F+ F + K+YAT E R IF NL I + + S +N F DLS EF+
Sbjct: 116 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHT-HNQQGYSYSLKMNHFGDLSRDEFRR 174
Query: 102 KYLGFKLKPSYADR--SVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
KYLGFK + V + N+ LP DWR VT VKDQ CGS WAFSTT
Sbjct: 175 KYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTT 234
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG + AKT KLVSLSEQEL+DC + + C GG +++AF ++ GG+ E Y
Sbjct: 235 GALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLD--SGGICSEDAY 292
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
PY D+ CR VKI G+ V R K + P+++AI A QFY G
Sbjct: 293 PYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEG 352
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG--YFRL 330
V F +L H VL+VGYG D+ +WI+KNSWG GWG G Y +
Sbjct: 353 V------FDASCGTDLDHGVLLVGYGTDK----ESKKDFWIMKNSWGTGWGRDGYMYMAM 402
Query: 331 YRG-DGSCGI 339
++G +G CG+
Sbjct: 403 HKGEEGQCGL 412
>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
Length = 354
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 118/319 (36%), Positives = 168/319 (52%), Gaps = 25/319 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
+A + F ++H+K + E R + F N++ L +T++ Y ++ +F+DL+
Sbjct: 39 SAHYGSFKKRHSKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97
Query: 98 EFQAKYLG-----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF YL LK D V P+ + + DWR+ AVT VK+Q +CGS WA
Sbjct: 98 EFAKLYLNPDYYTSHLKDHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS GNIEG +AA LVSLSEQ L+ CD D+GC GG + A + IM G + E
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNVDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215
Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
+YPY G C ++ KI G++S+ DE +A ++ + GP+AVA++A Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIADWVEKRGPVAVAVDATTWQLY 274
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV + +L+H VLIVG+ + PYWI+KNSWG WGEKGY R
Sbjct: 275 FGGVVSLCLAW------SLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322
Query: 330 LYRGDGSCGINDYVRSALV 348
L G C + +Y SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341
>gi|71084306|gb|AAZ23598.1| cysteine protease [Leishmania major]
Length = 327
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 123/321 (38%), Positives = 169/321 (52%), Gaps = 29/321 (9%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
+A + F E+H K++ + R + F N++ L +T + Y ++ +F+DL+
Sbjct: 12 SAHYGRFKERHGKSFGEDADEGHRFNAFKQNMQTAYFL-NTHNPHAHYDVSGKFADLTPQ 70
Query: 98 EFQAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAF--DWREYDAVTGVKDQTMCGSS 150
EF YL P Y D + + L A DWRE AVT VK+Q MCGS
Sbjct: 71 EFAKLYL----NPDYYARRGKDYKEHVHVDDSVLSGAMSVDWREKVAVTPVKNQGMCGSC 126
Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
WAFS GNIE +A K LVSLSEQ L+ CD DDGC GG + A + I+ G +
Sbjct: 127 WAFSAIGNIESQWALKNHSLVSLSEQMLVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPT 186
Query: 211 EKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
E++YPY G C +K +I+GY+S+ DE +A Y+ + GP+AVA++A Q
Sbjct: 187 EESYPYASAGGTSPPCH-DKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQ 245
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
Y GV C G +L+H VL+VG+ R K PYWI+KNSWG WGEKGY
Sbjct: 246 LYFGGVV----TLCFGW--SLNHGVLVVGFN-KRAK-----PPYWIVKNSWGTSWGEKGY 293
Query: 328 FRLYRGDGSCGINDYVRSALV 348
RL G C + +Y +A V
Sbjct: 294 IRLAMGSNQCLLKNYPVTATV 314
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 125/346 (36%), Positives = 182/346 (52%), Gaps = 35/346 (10%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVK-HTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
LL +++S++ V + + H ++ +L+N + H+ L E ++R ++F
Sbjct: 6 LLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVTRNLDEKHNRFNVFKA 65
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPN 123
N+ + + + LN+F D++ EF+ Y K+ R + M N
Sbjct: 66 NVMHVHNTNKLDKPYKL-KLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENGTFMYEN 124
Query: 124 -ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
+ +P + DWR AVTGVKDQ CGS WAFST +EG+ KT+KLVSLSEQ+L+DCD
Sbjct: 125 AVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCD 184
Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
+E++GC GG + AF+ I G+ E YPY D C + K+ V I+G+ +V
Sbjct: 185 TEENEGCNGGLMEYAFEFIKQ---NGITTESNYPYAAKDGTCDVEKEDKAVSIDGHENVP 241
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ P++VAI+A Y QFY GV F + +L+H V IVGYGV
Sbjct: 242 INNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGV------FTGHCDTDLNHGVAIVGYGV 295
Query: 300 --DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
DRTK YWI+KNSWG WGE+GY R+ RG +G CGI
Sbjct: 296 TQDRTK-------YWIMKNSWGSEWGEQGYIRMQRGISSREGLCGI 334
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 121/315 (38%), Positives = 167/315 (53%), Gaps = 33/315 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
L+ + +H K+Y + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 39 LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97
Query: 97 AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E++ YLG + KP +DR + A N LP + DWR AV +KDQ + GS WA
Sbjct: 98 EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQEVAGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FS +EG+ T L+SLSEQEL+DCD ++GC GG + AFD I++ GG++ E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213
Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
YPY+G D+ C +N+K A V I+ Y V+ + + V N P++VAI A A Q
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 273
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F L H V VGYG + K YWI++NSWG+ WGE GY
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321
Query: 329 RLYRG----DGSCGI 339
R+ R G CGI
Sbjct: 322 RMERNIKASSGKCGI 336
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 123/337 (36%), Positives = 172/337 (51%), Gaps = 28/337 (8%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
L+L + S+++ L L V+H ++ Q+ + Y VE R +IF N+
Sbjct: 12 LALVFATSAYLATSRTLLDSLMAVRH----EQWMAQYGRVYKNEVEKTKRYNIFKENVEY 67
Query: 73 IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFD 131
I+ G+N F+DL+ EF A G+ L P + P N++ +P D
Sbjct: 68 IESFNKAGTKPYKLGINAFADLTNKEFIASRNGYIL-PHECSSNTPFRYENVSAVPTTVD 126
Query: 132 WREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCE 189
WR+ AVT VKDQ CG WAFS +EG+ T L+SLSEQEL+DCD + D GCE
Sbjct: 127 WRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCE 186
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMA 248
GG + +AF I++ GL E YPY+G D +C + + KI+GY V +
Sbjct: 187 GGLMDDAFTFIINN--KGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESAL 244
Query: 249 KYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
+ V N P++VAI+A QFY +GV F + G E L H V VGYG+
Sbjct: 245 EKAVANQPVSVAIDAGGSDFQFYSSGV-----FTGECGTE-LDHGVTAVGYGI-----AE 293
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YW++KNSWG WGEKGY R+ + +G CGI
Sbjct: 294 DGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGI 330
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 159/319 (49%), Gaps = 24/319 (7%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
+LH ++ Q+ + Y E R IF N+ +I+ S +NE
Sbjct: 28 RNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINE 87
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGS 149
F+DL+ EF FK + + N+T +P DWR+ AVT +KDQ CGS
Sbjct: 88 FADLTNEEFGTSRNRFKAHIC-STEATSFKYENVTAVPSTIDWRKKGAVTPIKDQGQCGS 146
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGG 207
WAFS +EG+ T KL+SLSEQEL+DCD ED GC GG + +AF I K G
Sbjct: 147 CWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFI--KQNHG 204
Query: 208 LEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--Y 264
L E YPY G D C K A KINGY V + + V + P+AVAI+A +
Sbjct: 205 LTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGF 264
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
QFY +GV F G E L H V VGYG + + YW++KNSWG GWGE
Sbjct: 265 EFQFYSSGV-----FTGQCGTE-LDHGVAAVGYGT-----SDDGMKYWLVKNSWGTGWGE 313
Query: 325 KGYFRLYRG----DGSCGI 339
+GY R+ R +G CGI
Sbjct: 314 EGYIRMQRDVTAKEGLCGI 332
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 170/317 (53%), Gaps = 27/317 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
F H KTY + +E R IF+ + I + ++ G+ G+N+F DL EF
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTES-SLIIARHNAKYAKGLVSYKLGMNQFGDLLAHEFA 88
Query: 101 AKYLGF--KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ G K + PA + + +LP+A DWR+ AVT VKDQ CGS WAFS TG+
Sbjct: 89 RIFNGHHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGS 148
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG + K +LVSLSEQ L+DC Q ++GCEGG + +AF I K G++ EK+YPY
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPY 206
Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
D CR K+ GYV + + E D+ K + GP++VAI+A + Q Y GV
Sbjct: 207 EAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGV 266
Query: 274 -SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
P + +E+L H VL+VGYGV K YW++KNSW E WG++GY + R
Sbjct: 267 YDEP-----ECSSEDLDHGVLVVGYGVKGGK------KYWLVKNSWAESWGDQGYILMSR 315
Query: 333 -GDGSCGINDYVRSALV 348
+ CGI LV
Sbjct: 316 DNNNQCGIASQASYPLV 332
>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 323
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 118/305 (38%), Positives = 167/305 (54%), Gaps = 21/305 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQA 101
F H KTY ++VE R +F NL IQ E G + + +F+D++ EF
Sbjct: 26 FKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEF-L 84
Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
L + P+ +V +I A DWR+ AVT VK+Q CGS WAFS G IEG
Sbjct: 85 DLLKLQGVPALPSDAVYFEETDIEEKDAVDWRKEGAVTPVKNQGHCGSCWAFSAVGAIEG 144
Query: 162 VYAAKTKKLVSLSEQELIDCDQE---DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+ K LVSLS QEL+DC E ++GC GG + AFD + + G++ E++YPY+
Sbjct: 145 QFFKKNGTLVSLSAQELVDCATEYYGNEGCNGGLMGQAFDFVEDE---GIQTEESYPYKA 201
Query: 219 DDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
C++N + T+VK + +E ++A+ + GP+AVAI+A L FY G+
Sbjct: 202 KRSICQMNGEYVTKVKT---YHLLLNEQEIARAVSAKGPVAVAIDASQLSFYDQGIVDE- 257
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+ C E+L+H VL+VGYG + V YWI+KNSWG WGEKGYFRL + +C
Sbjct: 258 KCKCSKKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKAC 311
Query: 338 GINDY 342
GI +Y
Sbjct: 312 GIGNY 316
>gi|438000427|ref|YP_007250532.1| v-cath protein [Thysanoplusia orichalcea NPV]
gi|429842964|gb|AGA16276.1| v-cath protein [Thysanoplusia orichalcea NPV]
Length = 323
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 172/314 (54%), Gaps = 21/314 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ E R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSETEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ +I P P FDWR + VT VK+Q CG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVIILDQPPGKGPLDFDWRRLNKVTNVKNQGTCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E YA K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLASLESQYAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY ++ CR+N V++ + Y V+ E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEANNNNCRMNGNKFAVRVKDCYRYVTVYEEKLKDLLRVAGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV +C N L+H+VL+VGYGV+ +P+WI KN+WG WGE GYFR+
Sbjct: 257 GVIR----YC--FNSGLNHAVLLVGYGVENN------IPFWIFKNTWGTDWGEDGYFRVQ 304
Query: 332 RGDGSCGINDYVRS 345
+ +CG+ + + S
Sbjct: 305 QNINACGMRNELAS 318
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 154/303 (50%), Gaps = 37/303 (12%)
Query: 58 EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
E R ++F N R I LN+F+D++T EF+ Y G + + RS+
Sbjct: 66 EARRRFNVFVENARYIHEANRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHH---RSL 122
Query: 118 PAMIPNI------------TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAA 165
LP A DWRE AVTG+KDQ CGS WAFST +EGV
Sbjct: 123 SGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKI 182
Query: 166 KTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
KT +LV+LSEQEL+DCD D+ GC+GG + AF I K GG+ E YPYR + C
Sbjct: 183 KTGRLVTLSEQELVDCDTGDNQGCDGGLMDYAFQFI--KRNGGITTESNYPYRAEQGRCN 240
Query: 225 LNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFC 281
K ++ V I+GY V ++ + V N P+AVA+ A QFY GV F
Sbjct: 241 KAKASSHDVTIDGYEDVPANDESALQKAVANQPVAVAVEASGQDFQFYSEGV------FT 294
Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGS 336
+L H V VGYG+ T YWI+KNSWGE WGE+GY R+ RG +G
Sbjct: 295 GECGTDLDHGVAAVGYGI-----TRDGTKYWIVKNSWGEDWGERGYIRMQRGVSSDSNGL 349
Query: 337 CGI 339
CGI
Sbjct: 350 CGI 352
>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
Length = 313
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 160/311 (51%), Gaps = 28/311 (9%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+++ FN F ++ K Y E R +F+ N+ Q + +H V G F+D++
Sbjct: 17 LRYENTFNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTV-GATPFADMT 75
Query: 96 TAEFQ-AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
EF +K G LKP + P M P A DWRE AVT VK+Q CGS WAFS
Sbjct: 76 NTEFAVSKLCGCMLKPKMTKPATPIMEP---AAEAVDWREKGAVTPVKNQASCGSCWAFS 132
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
TG +EG +L+SLSEQ+L+DCD + GC GG ++ AF+ K G+ +E+ Y
Sbjct: 133 ATGAMEGRNFVANGELISLSEQQLVDCDHQSSGCGGGLMTYAFEYAKKK---GMCKEEDY 189
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFYVTG 272
PY D+ C+ +K V GY V R + K V GP++VA+ A ++ Q Y G
Sbjct: 190 PYHAVDEDCKDDKCTPVVFPKGYEEVPRFDGAALKQAVSQGPVSVAVEADSIVFQMYTGG 249
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY- 331
V +L+H VL VGYG D YWI+KNSWGE WG+KGY ++
Sbjct: 250 VIDS-----SACGTSLNHGVLAVGYGAD----------YWIVKNSWGESWGDKGYLKIKY 294
Query: 332 --RGDGSCGIN 340
G G CGIN
Sbjct: 295 TESGAGICGIN 305
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 159/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T KL+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGKLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAEGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|401430127|ref|XP_003886478.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491231|emb|CBZ41048.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 375
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 157/272 (57%), Gaps = 21/272 (7%)
Query: 86 YGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTG 140
+G+ +F DLS AEF A+YL F +A + +++ +P A DWRE AVT
Sbjct: 13 FGITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP 72
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
VKDQ CGS WAFS GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD +
Sbjct: 73 VKDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWL 132
Query: 201 MSKLGGGLEEEKTYPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
+ G L E +YPY + + ++ +I+G+V + E MA +L +NGP
Sbjct: 133 LQNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGP 192
Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
+A+A++A + Y +GV C G + L+H VL+VGY D T VPYW+IKN
Sbjct: 193 IAIALDASSFMSYKSGV----LTACIG--KQLNHGVLLVGY--DMT----GEVPYWVIKN 240
Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
SWG WGE+GY R+ G +C +++Y SA V
Sbjct: 241 SWGGDWGEQGYVRVVMGVNACLLSEYPVSAHV 272
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 122/341 (35%), Positives = 169/341 (49%), Gaps = 35/341 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+A L+ V+ S D ++ H ++ ++ K Y E R IF N
Sbjct: 36 MAFLAFQVTCRSLQ---DASMYERHE--------QWMTRYGKVYKDPQEREKRFRIFKEN 84
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPNIT-LP 127
+ I+ + + +N+F+DL+ EF A FK S R+ N+T +P
Sbjct: 85 VNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTTFKYENVTAVP 144
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--D 185
DWR+ AVT +KDQ CG WAFS EG++A + KL+SLSEQEL+DCD + D
Sbjct: 145 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVD 204
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDE 244
GCEGG + +AF ++ GL E YPY+G D C N+ A V I GY V +
Sbjct: 205 QGCEGGLMDDAFKFVIQN--HGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANN 262
Query: 245 TDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
+ V N P++VAI+A QFY +GV F L H V VGYGV
Sbjct: 263 EKALQKAVANQPVSVAIDASGSDFQFYKSGV------FTGSCGTELDHGVTAVGYGV--- 313
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
++ YW++KNSWG WGE+GY R+ RG +G CGI
Sbjct: 314 --SNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGI 352
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 118/308 (38%), Positives = 168/308 (54%), Gaps = 23/308 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ---LLQDTEHGSGVYGLNEFSDLSTAE 98
+ +L+ H K Y E R+ I+ GNL I+ L D S G+NE+ D++ E
Sbjct: 27 WQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEE 85
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
F++ G+K++ + S+ NI LP DWR VT +K+Q CGS W+FS TG
Sbjct: 86 FRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATG 145
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
++EG KT KL SLSEQ L+DC Q+ + GC+GG + +AF I K G++ E +YP
Sbjct: 146 SLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYI--KDNNGIDTESSYP 203
Query: 216 YRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y + CR N +G+ + S+ E+D+ + GP+AVAI+A + Q Y +G
Sbjct: 204 YEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSG 263
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V H +FFC L H VL VGYG + K YW++KNSWGE WG+KGY + R
Sbjct: 264 VYH--EFFCS--ETRLDHGVLAVGYGTESGK------DYWLVKNSWGESWGQKGYIMMSR 313
Query: 333 GD-GSCGI 339
+CGI
Sbjct: 314 NKRNNCGI 321
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 121/311 (38%), Positives = 160/311 (51%), Gaps = 29/311 (9%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++E+H K Y E R IF NL I+ +N+F D + EF+A YL
Sbjct: 38 WMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYL 97
Query: 105 GFKLKP-------SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
K KP + + SV +P DWRE AVT +K Q +CGS WAF+T
Sbjct: 98 NGKKKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVTPIKHQHLCGSCWAFATVA 157
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
IEG++ T +LVSLSEQEL+DC + + DGC GG + +A D I+ K GG+ E YP
Sbjct: 158 AIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVKK--GGITSETNYP 215
Query: 216 YRGDDKACRLNKKATQV-KINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y D C + K V KI GY V + V N P+AV I A A QFY +G
Sbjct: 216 YTRVDGKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIAVYIAATKRAFQFYSSG 275
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
+ ++ C +L H+V IVGYG + V YW++KNSWG WGEKGY ++ R
Sbjct: 276 I---LKGKC---GIDLDHTVTIVGYGT-----SDDGVKYWLVKNSWGTKWGEKGYIKIKR 324
Query: 333 G----DGSCGI 339
+GSCGI
Sbjct: 325 DVHAKEGSCGI 335
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 123/336 (36%), Positives = 174/336 (51%), Gaps = 27/336 (8%)
Query: 15 LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
L +S+S V E + + ++ +L ++ K Y L E R IF NL+ ++
Sbjct: 18 LLISLSLGSVTATETTRNEAEARR--MYERWLVENRKNYNGLGEKERRFEIFKDNLKFVE 75
Query: 75 LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDW 132
+ + GL F+DL+ EF+A YL K++ + + + +LP A DW
Sbjct: 76 EHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEKYLYKVGDSLPDAIDW 135
Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGG 191
R AV VKDQ CGS WAFS G +EG+ KT +L+SLSEQEL+DCD +DGC GG
Sbjct: 136 RAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGG 195
Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQ-VKINGYVSVSRDETDMAK 249
+ AF I+ GG++ E+ YPY D C +KK T+ V I+GY V +++ K
Sbjct: 196 LMDYAFKFIIEN--GGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLK 253
Query: 250 YLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHK 307
+ N P++VAI A A Q Y +GV F +L H V+ VGYG +
Sbjct: 254 KALANQPISVAIEAGGRAFQLYTSGV------FTGTCGTSLDHGVVAVGYG------SEG 301
Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
YWI++NSWG WGE GYF+L R G CG+
Sbjct: 302 GQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGV 337
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 127/307 (41%), Positives = 162/307 (52%), Gaps = 31/307 (10%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
+ K YA+ E R +F NL I + + + S GLNEF+DL+ EF+A YLG
Sbjct: 36 YRKAYASFEEKVRRFEVFKDNLNHIDDI-NKKVTSYWLGLNEFADLTHDEFKATYLGLTP 94
Query: 109 KPSYADRS-------VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
P+ ++ + N +P+ DWR+ +AVT VK+Q CGS WAFST +EG
Sbjct: 95 PPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEG 154
Query: 162 VYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
+ A T L SLSEQELIDC + ++GC GG + AF I S GGL E+ YPY ++
Sbjct: 155 INAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIAST--GGLRTEEAYPYAMEE 212
Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
C K A V I+GY V + DE + K L P++VAI A QFY GV
Sbjct: 213 GDCDEGKGAAVVTISGYEDVPANDEQALVKALAHQ-PVSVAIEASGRHFQFYSGGV---- 267
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----G 333
F E L H V VGYG T K Y I+KNSWG WGEKGY R+ R G
Sbjct: 268 --FDGPCGEQLDHGVTAVGYG------TSKGQDYIIVKNSWGPHWGEKGYIRMKRGTGKG 319
Query: 334 DGSCGIN 340
+G CGIN
Sbjct: 320 EGLCGIN 326
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 120/310 (38%), Positives = 157/310 (50%), Gaps = 27/310 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG----LNEFSDLSTAEFQ 100
++ +H KTY E RL +F N + I G G N F+DL+ EF+
Sbjct: 45 WMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFR 104
Query: 101 AKYLGFKLKPSYADRSVPAMI-PNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
A G++ P+ + + N +L P++ DWR AVTGVKDQ CG WAFS
Sbjct: 105 AARTGYQRPPAAVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFSAV 164
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
+EG+ +T +LVSLSEQEL+DCD ED GCEGG + AF I + GGL E +Y
Sbjct: 165 AAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARR--GGLAAESSY 222
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTG 272
PYRG D ACR I G+ V ++ V P++VAIN Y +FY G
Sbjct: 223 PYRGVDGACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRG 282
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V G L+H+V VGYG YW++KNSWG WGE GY R+ R
Sbjct: 283 V-----LGGAGCGTELNHAVTAVGYGT-----ASDGTGYWLMKNSWGASWGEGGYVRIRR 332
Query: 333 G---DGSCGI 339
G +G+CGI
Sbjct: 333 GVGREGACGI 342
>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
Length = 260
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 104/257 (40%), Positives = 146/257 (56%), Gaps = 7/257 (2%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
L+ F + K YA + R IF NL + Q LQ + G+ YG+ +FSDL+ EF
Sbjct: 5 LYEQFKRXYGKVYAN-EDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFA 63
Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
AKYL + R P + P DWR AVT V++Q CGS WAFST GN+E
Sbjct: 64 AKYLSAPVNNDQVKRVRPTGLK--AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVE 121
Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
G + KT +LVSLS+Q+L+DCD+ DGC GG ++++ IM GGLE + YPY G
Sbjct: 122 GQWFIKTGQLVSLSKQQLVDCDRAADGCNGGWPASSYLEIMHM--GGLESQDDYPYAGVK 179
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ C + K+ KI+ +++ E D A YL E+GP++ +NA LQ+Y +G+ HP
Sbjct: 180 EQCFMEKERLLAKIDDSIALXPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYXX 239
Query: 281 CDGGNENLSHSVLIVGY 297
C +L+H+VL VGY
Sbjct: 240 C--SPVDLNHAVLTVGY 254
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 163/314 (51%), Gaps = 31/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ +H TY + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 42 MYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQ-HNAAADAGVHSFRLGLNRFADLTN 100
Query: 97 AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
E+++ YLG + KP +R + A N LP + DWR+ AV VKDQ CGS WAF
Sbjct: 101 EEYRSTYLGARTKPDR-ERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAF 159
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
S +EG+ T ++ LSEQEL+DCD + GC GG + AF+ I++ GG++ E+
Sbjct: 160 SAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDSEE 217
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
YPY+ D C NKK A V I+GY V + + V N P++VAI A A Q Y
Sbjct: 218 DYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLY 277
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+G+ F L H V VGYG + K YW+++NSWG WGE GY R
Sbjct: 278 KSGI------FTGTCGTALDHGVAAVGYGTENGK------DYWLVRNSWGSVWGEDGYIR 325
Query: 330 LYR----GDGSCGI 339
+ R G CGI
Sbjct: 326 MERNIKASSGKCGI 339
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 134/357 (37%), Positives = 183/357 (51%), Gaps = 35/357 (9%)
Query: 1 MSCFYFFAGVALLSLTVSVS-----SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYAT 55
M+ F +LS T+ ++ F +VG H K LF ++ +H+K Y +
Sbjct: 1 MALSTFSKATLILSATLFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRS 60
Query: 56 LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYA 113
+ E R IF NL+ I +T Y GLNEF+DLS EF++KYLG +++
Sbjct: 61 IEEKLHRFEIFLDNLKHID---ETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRK 117
Query: 114 DRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
S ++ LP + DWR AVT VK+Q CGS WAFST +EG+ T L S
Sbjct: 118 RSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 177
Query: 173 LSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKAT 230
LSEQELIDCD+ ++GC GG + AF IMS GL +E+ YPY ++ C R ++
Sbjct: 178 LSEQELIDCDRSFNNGCYGGLMDYAFQYIMSN--SGLRKEEDYPYLMEEGRCIREKEQFE 235
Query: 231 QVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNEN 287
V I+GY V + DE + K L P++VAI A + QFY G+ F
Sbjct: 236 VVTISGYEDVPANDEQSLLKALSHQ-PVSVAIEASSRNFQFYKGGI------FTGRCGTQ 288
Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
+ H V VGYG + + Y I+KNSWG WGE GY R+ R +G CGIN
Sbjct: 289 MDHGVTAVGYG------SSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGIN 339
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 129/350 (36%), Positives = 183/350 (52%), Gaps = 45/350 (12%)
Query: 10 VALLS---LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIF 66
+ALLS L++S S+ D ++ + ++ +L +H K Y + E R IF
Sbjct: 8 LALLSFFFLSISASALSRRSDGEVREI--------YDLWLAKHGKAYNGIDEREKRFQIF 59
Query: 67 SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA------- 119
NL+ I ++E+ + GLN F+DL+ E++A YLG + P A R + A
Sbjct: 60 KENLKFIDD-HNSENRTYKVGLNMFADLTNEEYRALYLGTRSPP--ARRVMKAKTASRRY 116
Query: 120 MIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
+ N+ LP + DWR AV VK+Q CGS WAFST +EG+ T +L+SLSEQEL
Sbjct: 117 AVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQEL 176
Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKING 236
+ CD++ + GC GG + AF I+ GGL+ E+ YPY D C +K A V I+
Sbjct: 177 VSCDKKYNSGCNGGLMDYAFQFIIDN--GGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDA 234
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
Y V ++ + K V + P++VAI A ALQ Y +GV F L H V+
Sbjct: 235 YEDVPANDEESLKKAVAHQPVSVAIEASGLALQLYQSGV------FTGKCGSALDHGVVA 288
Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
VGYG V YW+++NSWG WGE GYF+L R +G CGI
Sbjct: 289 VGYG------KENGVDYWLVRNSWGTSWGEDGYFKLERNVKHITEGKCGI 332
>gi|11359985|pir||T46294 hypothetical protein DKFZp434F0610.1 - human (fragment)
gi|6808322|emb|CAB70900.1| hypothetical protein [Homo sapiens]
Length = 308
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/279 (39%), Positives = 163/279 (58%), Gaps = 4/279 (1%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 5 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 64
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 65 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 124
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 125 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 184
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 185 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 242
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
AINA+ +QFY G+S P++ C + H+VL+VGYG
Sbjct: 243 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG 279
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 121/315 (38%), Positives = 166/315 (52%), Gaps = 33/315 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
L+ + +H K+Y + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 39 LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97
Query: 97 AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E++ YLG + KP +DR + A N LP + DWR AV +KDQ CGS WA
Sbjct: 98 EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FS +E + T L+SLSEQEL+DCD ++GC GG + AFD I++ GG++ E
Sbjct: 156 FSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213
Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
YPY+G D+ C +N+K A V I+ Y V+ + + V N P++VAI A A Q
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQL 273
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F L H V VGYG + K YWI++NSWG+ WGE GY
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321
Query: 329 RLYRG----DGSCGI 339
R+ R G CGI
Sbjct: 322 RMERNIKASSGKCGI 336
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 126/351 (35%), Positives = 183/351 (52%), Gaps = 40/351 (11%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
A++++TV+ SS ++ + + F H K+Y + +E R IF+ N
Sbjct: 9 AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKSYQSHMEELLRFKIFTEN- 54
Query: 71 RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGF--KLKPSYADRSVPAMIPNI 124
I + ++ G+ G+N+F DL EF + G K + PA + +
Sbjct: 55 SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSTFLPPANVNDS 114
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
+LP+ DWR+ AVT VKDQ CGS WAFS TG++EG + K +LVSLSEQ L+DC Q
Sbjct: 115 SLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQS 174
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
++GCEGG + +AF I K G++ EK+YPY D CR K+ GYV + +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
E D+ K + GP++VAI+A + Q Y GV P + +E+L H VL+VGYG
Sbjct: 233 GSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
V K YW++KNSW E WG++GY + R + CGI LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 124/345 (35%), Positives = 174/345 (50%), Gaps = 35/345 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F + LL++ V+ + D+ + H ++ + K Y E RL I
Sbjct: 14 LFFCLGLLAIQVTSRTLQ---DDSIFERHE--------QWMTHYGKVYKNPQEREKRLRI 62
Query: 66 FSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPN 123
F+ NL+ I+ + + G+N+F+DL+ EF A FK S R+ N
Sbjct: 63 FTENLKYIEASNNAGNNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTTFKYEN 122
Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
++P DWR+ AVT VK+Q CG WAFS EG++ T KLVSLSEQEL+DCD
Sbjct: 123 TSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDT 182
Query: 184 E--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
D GCEGG + +AF I+ G+ E YPY+G D C+ N+ +T I GY V
Sbjct: 183 NGVDQGCEGGLMDDAFKFIIQN--NGISTEAGYPYQGVDGTCKANEASTSAATITGYEDV 240
Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ + + V N P++VAI+A QFY +GV F G E L H V VGYG
Sbjct: 241 PANNENALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHGVTAVGYG 294
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
+ ++ YW++KNSWG WGE+GY R+ R +G CGI
Sbjct: 295 I-----SNDGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGI 334
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK+Q CG WAFS G
Sbjct: 102 GLNIPNSYVSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG N ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCANRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGEDGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 118/323 (36%), Positives = 161/323 (49%), Gaps = 32/323 (9%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
D +H H ++ Q+ K Y E R IF N+++I+ + + S
Sbjct: 32 DASMHERHE--------QWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFNNAGNKSYKL 83
Query: 87 GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQT 145
G+N+F+DL+ EF+A+ S + R+ ++T +P + DWR+ AVT +KDQ
Sbjct: 84 GINQFADLTNEEFKARNRFKGHMCSNSTRTPTFKYEHVTSVPASLDWRQKGAVTPIKDQG 143
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSK 203
CG WAFS EG+ T KL+SLSEQEL+DCD + D GCEGG + +AF IM
Sbjct: 144 QCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQN 203
Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
GL E YPY+G D C N +A I G+ V + V N P++VAI+
Sbjct: 204 --KGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAID 261
Query: 263 AYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
A QFY +GV F L H V VGYG D YW++KNSWGE
Sbjct: 262 ASGSEFQFYSSGV------FTGSCGTELDHGVTAVGYGSD------GGTKYWLVKNSWGE 309
Query: 321 GWGEKGYFRLYRG----DGSCGI 339
WGE+GY R+ R +G CG
Sbjct: 310 QWGEQGYIRMQRDVAAEEGLCGF 332
>gi|426252044|ref|XP_004019728.1| PREDICTED: cathepsin W [Ovis aries]
Length = 375
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/327 (33%), Positives = 169/327 (51%), Gaps = 24/327 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q+N++Y E+ RL IF+ NL K Q LQ+ + G+ +G+ +FSDL+ EF
Sbjct: 41 VFRLFQMQYNRSYPNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFV 100
Query: 101 AKYLGFKLKPSY--ADRSVPAMIPNITLPRAFDWR-EYDAVTGVKDQTMCGSSWAFSTTG 157
Y G ++ R V + + P DWR + + ++ V++Q C WA + G
Sbjct: 101 QLY-GSRVAGEALGVSRKVGSEEWGESQPPTCDWRNKPNTISPVRNQRHCNCCWAMAAAG 159
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
NIE ++A K + V EL+DCD+ +GC+GG + +AF T++ GL E YP+
Sbjct: 160 NIEALWAIKFNRSVEERGGELLDCDRCGNGCKGGFVWDAFLTVLKNR--GLASETDYPFD 217
Query: 218 GDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
G K C K I ++ + E +A++L GP+ V IN LQ Y GV
Sbjct: 218 GSGKTHRCLAEKHKKVAWIQDFIMLQACEQSIARHLATQGPITVTINVKLLQQYQKGVIK 277
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRT--------------KFTHKAVPYWIIKNSWGEG 321
CD ++ HSVL+VG+G ++ +++ YW +KNSWG
Sbjct: 278 ATPTTCD--PRHVDHSVLLVGFGKTKSVEGRQGKAASFRSYTRPRRSMAYWTLKNSWGPH 335
Query: 322 WGEKGYFRLYRGDGSCGINDYVRSALV 348
WGE+GYFRL+RG +CGI Y +A+V
Sbjct: 336 WGEEGYFRLHRGSNTCGITKYPVTAIV 362
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 123/340 (36%), Positives = 175/340 (51%), Gaps = 27/340 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+A+L V ++S + D ++ H F +L+ H+K Y E+ R I+ N
Sbjct: 12 LAVLICFVLIASKLCSVDSSVYDPHKTLKQR-FEKWLKTHSKLYGGRDEWMLRFGIYQSN 70
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP-SYADRSVPAMIPNITLPR 128
++ I + ++ H N F+D++ +EF+A +LG + P P +P
Sbjct: 71 VQLIDYI-NSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPVCDPAGNVPD 129
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDD 186
A DWR AVT +++Q CG WAFS IEG+ KT LVSLSEQ+LIDCD +
Sbjct: 130 AVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNK 189
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDET 245
GC GG + AF+ I K GGL E YPY G + C K K V I GY V+++E
Sbjct: 190 GCSGGLMETAFEFI--KTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVAQNEA 247
Query: 246 DMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
+ + P++V I+A + Q Y +GV F + NL+H V +VGYGV+ +
Sbjct: 248 SL-QIAAAQQPVSVGIDAGGFIFQLYSSGV------FTNYCGTNLNHGVTVVGYGVEGDQ 300
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YWI+KNSWG GWGE+GY R+ RG G CGI
Sbjct: 301 ------KYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGI 334
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 183/351 (52%), Gaps = 40/351 (11%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
A++++TV+ SS ++ + + F H KTY + +E R IF+ N
Sbjct: 9 AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKTYQSHMEELLRFKIFTEN- 54
Query: 71 RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGF--KLKPSYADRSVPAMIPNI 124
I + ++ G+ G+N+F DL EF + G K + PA + +
Sbjct: 55 SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSTFLPPANVNDS 114
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
+LP+ DWR+ AVT VKDQ CGS WAFS TG++EG + K +LVSLSEQ L+DC Q
Sbjct: 115 SLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQS 174
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
++GCEGG + +AF I K G++ EK+YPY D CR K+ GYV + +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KENDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
E D+ K + GP++VAI+A + Q Y GV P + +E+L H VL+VGYG
Sbjct: 233 GSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
V K YW++KNSW E WG++GY + R + CGI LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 131/351 (37%), Positives = 187/351 (53%), Gaps = 30/351 (8%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
VA+L++ +S + D +L +H L+ + H K Y E + R+ ++ N
Sbjct: 4 VAVLAVCLSAALSAPSLDPQLD-----EHWDLWKSW---HTKKYHEKEEGWRRM-VWEKN 54
Query: 70 LRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN-I 124
L+KI+L + EH G + G+N F D++ EF+ G+K K + M PN +
Sbjct: 55 LKKIEL-HNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRKSERKFKGSLFMEPNFL 113
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
PR+ DWR+ VT VKDQ CGS WAFSTTG +EG + KT KLVSLSEQ L+DC +
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173
Query: 185 D--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV- 240
+ +GC GG + AF I K GL+ E +YPY G DD+ C + K G++ +
Sbjct: 174 EGNEGCNGGLMDQAFQYI--KDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIP 231
Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
S E + K + GP++VAI+A + QFY +G I + + +E L H VL+VGYG
Sbjct: 232 SGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG----IYYEKECSSEELDHGVLVVGYG 287
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
+ K YWI+KNSW E WG+KGY + + CGI LV
Sbjct: 288 FEGEDVDGKK--YWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 118/304 (38%), Positives = 154/304 (50%), Gaps = 22/304 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ Q+ + Y E R IF N+ +I+ S +NEF+DL+ EF+A
Sbjct: 42 WMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRN 101
Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
FK + + +P DWR+ AVT +KDQ CGS WAFS +EG+
Sbjct: 102 RFKAHICSTEATSFKYEHVAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161
Query: 165 AKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
T KL+SLSEQEL+DCD ED GC GG + +AF I + GL E YPY G D
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFI--EQNHGLATEANYPYAGTDGT 219
Query: 223 CRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQF 279
C K A KINGY V + + V + P+AVAI+A + QFY +GV F
Sbjct: 220 CNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGV-----F 274
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DG 335
G E L H V VGYG + + YW++KNSWG GWGE GY R+ R +G
Sbjct: 275 TGQCGTE-LDHGVAAVGYGT-----SDDGMKYWLVKNSWGTGWGEVGYIRMQRDVTAKEG 328
Query: 336 SCGI 339
CGI
Sbjct: 329 LCGI 332
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 116/306 (37%), Positives = 158/306 (51%), Gaps = 24/306 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ ++ K Y E R IF N+ I+ + + +N+F+DL+ EF A
Sbjct: 589 WMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRN 648
Query: 105 GFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
FK S R+ N+T +P DWR+ AVT +KDQ CG WAFS EG+
Sbjct: 649 RFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGI 708
Query: 163 YAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
+A + KL+SLSEQEL+DCD + D GCEGG + +AF ++ GL E YPY+G D
Sbjct: 709 HALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQ--NHGLNTEANYPYKGVD 766
Query: 221 KACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
C N+ A V I GY V + + V N P++VAI+A QFY +GV
Sbjct: 767 GKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGV---- 822
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F L H V VGYGV ++ YW++KNSWG WGE+GY R+ RG
Sbjct: 823 --FTGSCGTELDHGVTAVGYGV-----SNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSE 875
Query: 334 DGSCGI 339
+G CGI
Sbjct: 876 EGLCGI 881
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 170/343 (49%), Gaps = 33/343 (9%)
Query: 7 FAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIF 66
A + LL VS + + D +H H ++ + + Y E R IF
Sbjct: 12 LALIFLLGALVSQAMARTLQDASMHEKHE--------EWMSRFGRVYNDGNEKEIRYKIF 63
Query: 67 SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL 126
N+++I+ S G+N+F+DL+ EF+ FK + ++ P N+T
Sbjct: 64 KENVQRIESFNKASGKSYKLGINQFADLTNEEFKTSRNRFKGHMC-SSQAGPFRYENLTA 122
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-- 183
P + DWR+ AVT +KDQ CGS WAFS +EG+ T KL+SLSEQEL+DCD
Sbjct: 123 APSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKG 182
Query: 184 EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSR 242
ED GC+GG + +AF I GL E YPY G D C ++A KING+ V
Sbjct: 183 EDQGCQGGLMDDAFKFIEQNQ--GLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPA 240
Query: 243 DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
+ V P++VAI+A + QFY +G+ F D G E L H V VGYG
Sbjct: 241 NNEGALMKAVAKQPVSVAIDAGGFGFQFYSSGI-----FTGDCGTE-LDHGVAAVGYG-- 292
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ YW++KNSWG WGE+GY R+ + +G CGI
Sbjct: 293 ----ESNGMNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGLCGI 331
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/313 (37%), Positives = 162/313 (51%), Gaps = 29/313 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTA 97
++ ++ +H +TY + E R +F NLR I D S GLN F+DL+
Sbjct: 40 MYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHSFRLGLNRFADLTNE 99
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIP---NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
E+++ YLG + KP +R + A N LP DWR+ AV +KDQ CGS WAFS
Sbjct: 100 EYRSTYLGARTKPDR-ERKLSARYQADDNEELPETVDWRKKGAVAAIKDQGGCGSCWAFS 158
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+EG+ T ++ LSEQEL+DCD ++GC GG + AF+ I++ GG++ E+
Sbjct: 159 AIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEED 216
Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
YPY+ D C NKK A V I+GY V + + V N P++VAI A A Q Y
Sbjct: 217 YPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYK 276
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G+ F L H V VGYG + K YW+++NSWG WGE GY R+
Sbjct: 277 SGI------FTGTCGTALDHGVAAVGYGTENGK------DYWLVRNSWGTVWGEDGYIRM 324
Query: 331 YRG----DGSCGI 339
R G CGI
Sbjct: 325 ERNIKASSGKCGI 337
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 124/345 (35%), Positives = 174/345 (50%), Gaps = 35/345 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F + LL++ V+ + D+ + H ++ + K Y E RL I
Sbjct: 14 LFFCLGLLAIQVTSRTLQ---DDSIFERHE--------QWMTHYGKVYKNPQEREKRLRI 62
Query: 66 FSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPN 123
F+ NL+ I+ + + G+N+F+DL+ EF A FK S R+ N
Sbjct: 63 FTENLKYIEASNNAGNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTTFKYEN 122
Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
++P DWR+ AVT VK+Q CG WAFS EG++ T KLVSLSEQEL+DCD
Sbjct: 123 TSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDT 182
Query: 184 E--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
D GCEGG + +AF I+ G+ E YPY+G D C+ N+ +T I GY V
Sbjct: 183 NGVDQGCEGGLMDDAFKFIIQN--NGISTEAGYPYQGVDGTCKANEASTSAATITGYEDV 240
Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ + + V N P++VAI+A QFY +GV F G E L H V VGYG
Sbjct: 241 PANNENALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHGVTAVGYG 294
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
+ ++ YW++KNSWG WGE+GY R+ R +G CGI
Sbjct: 295 I-----SNDGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGI 334
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 126/314 (40%), Positives = 169/314 (53%), Gaps = 33/314 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAE 98
LF ++ +H K Y ++ E + R IF NL I +T Y GLNEFSDLS E
Sbjct: 32 LFESWISKHGKIYESIEEKWLRFEIFKDNLFHID---ETNKKVVNYWLGLNEFSDLSHEE 88
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPN----ITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
F+ KYLG K+ S +R + N +++P++ DWR+ AVT VK+Q CGS WAFS
Sbjct: 89 FKNKYLGLKVDMS--ERRECSQEFNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCWAFS 146
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKT 213
T +EG+ T L SLSEQEL+DCD ++ GC GG + AF I+S GGL +E
Sbjct: 147 TVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISN--GGLHKEVD 204
Query: 214 YPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYV 270
YPY ++ C + K+ ++ V I+GY V ++ + + N P++VAI A QFY
Sbjct: 205 YPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQFYS 264
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GV F L H V VGYG + + Y I+KNSWG WGEKGY R+
Sbjct: 265 GGV------FDGHCGTQLDHGVAAVGYG------STNGLDYIIVKNSWGSKWGEKGYIRM 312
Query: 331 YRGDGS----CGIN 340
R G CGIN
Sbjct: 313 KRNTGKPAGLCGIN 326
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 169/306 (55%), Gaps = 31/306 (10%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ L E R ++F N+ I + + + LN F+D++ EF+ ++ K+
Sbjct: 46 HHTVSRDLSEKRKRFNVFKANVHHIHKVNQKDKPYKLK-LNSFADMTNHEFR-EFYSSKV 103
Query: 109 KP---SYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
K + R+ + T LP + DWR+ AVTGVK+Q CGS WAFST +EG+
Sbjct: 104 KHYRMLHGSRANTGFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGIN 163
Query: 164 AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
KT +LVSLSEQEL+DC+ +++GC GG + NA++ I K GG+ E+ YPY+ D +C
Sbjct: 164 KIKTGQLVSLSEQELVDCETDNEGCNGGLMENAYEFI--KKSGGITTERLYPYKARDGSC 221
Query: 224 RLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFF 280
+K A V I+G+ V ++ + V N P++VAI+A +QFY GV +
Sbjct: 222 DSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGV-----YA 276
Query: 281 CDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----- 333
D L H V +VGYG +D TK YWI+KNSWG GWGE+GY R+ RG
Sbjct: 277 GDSCGNELDHGVAVVGYGTALDGTK-------YWIVKNSWGTGWGEQGYIRMQRGVDAAE 329
Query: 334 DGSCGI 339
G CGI
Sbjct: 330 GGVCGI 335
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/315 (38%), Positives = 169/315 (53%), Gaps = 23/315 (7%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L +H F F ++ K+Y + E R IFS +L +++ + + S G+N +S
Sbjct: 53 LGRSRHALRFARFAVRYGKSYESAAEVQRRFRIFSESLEEVRST-NQKGLSYRLGINRYS 111
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
D+S EFQA LG S R M LP DWRE V+ VKDQ+ CGS W
Sbjct: 112 DMSWEEFQASRLGAAQTCSATLRGNHRMQDANALPETKDWREDGIVSPVKDQSHCGSCWT 171
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEE 210
FSTTG +E Y T K +SLSEQ+L+DC + GC GG S AF+ I K GGL+
Sbjct: 172 FSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI--KYNGGLDT 229
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVS---RDETDMAKYLVENGPMAVA---INAY 264
E++YPY+G + C + V++ V+++ DE A LV P++VA IN +
Sbjct: 230 EESYPYKGVNGVCHYKPENAAVQVLDSVNITLNAEDELQNAVGLVR--PVSVAFEVINGF 287
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
+ Y +GV C ++++H+VL VGYGV+ PYW+IKNSWGE WG+
Sbjct: 288 --RQYKSGVY--TSDHCGTTPDDVNHAVLAVGYGVE------NGTPYWLIKNSWGESWGD 337
Query: 325 KGYFRLYRGDGSCGI 339
KGYF++ RG C +
Sbjct: 338 KGYFKMERGKNMCAV 352
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 124/338 (36%), Positives = 177/338 (52%), Gaps = 31/338 (9%)
Query: 15 LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
L +S+S V + + + ++ +L ++ K Y L E +R IF+ NL+ I+
Sbjct: 18 LLISLSLGSVTAADTTRNEAEARR--MYEQWLVENRKNYNGLGEKETRFEIFTDNLKYIE 75
Query: 75 LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK----PSYADRSVPAMIPNITLPRAF 130
+ + GL F+DL+ EF+A YL K++ P +R + + TLP
Sbjct: 76 EHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGERYLYKV--GDTLPDQI 133
Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCE 189
DWR AV VKDQ CGS WAFS G +EG+ KT +L+SLSEQEL+DCD + GC
Sbjct: 134 DWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNGGCG 193
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQ-VKINGYVSVSRDETDM 247
GG + AF I+ GG++ E+ YPY DD C +KK ++ V I+GY V +++
Sbjct: 194 GGLMDYAFKFIIEN--GGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVPQNDEKS 251
Query: 248 AKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
K + N P++VAI A A Q Y +GV F +L H V+ VGYG +
Sbjct: 252 LKKALANQPISVAIEAGGRAFQLYKSGV------FTGTCGTSLDHGVVAVGYG------S 299
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YWI++NSWG WGE GYF+L R G CG+
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGV 337
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/317 (38%), Positives = 166/317 (52%), Gaps = 28/317 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT----EHGSGVYGL--NEFSDLSTAE 98
F +H + YA++ E RL +F N Q + D E+G + L N+F D+++ E
Sbjct: 27 FKAEHGRRYASVQEERYRLSVFEQNQ---QFIDDHNARFENGEVTFTLQMNQFGDMTSEE 83
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
F A GF PS ++ P+ TLP+ DWR AVT VKDQ CGS WAFSTTG+
Sbjct: 84 FTATMNGFLNVPSRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQKQCGSCWAFSTTGS 143
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG + K KLVSLSEQ L+DC + + GC GG + AF I K G++ E +YPY
Sbjct: 144 LEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYI--KANKGIDTEDSYPY 201
Query: 217 RGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGV 273
D CR + GYV V E+ + K + GP++VAI+A + QFY GV
Sbjct: 202 EAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSFQFYHDGV 261
Query: 274 SHPIQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
++ +G + L H VL VGYG T K YW++KNSW WG KGY ++ R
Sbjct: 262 -----YYEEGCSSTMLDHGVLAVGYGE-----TEKGEAYWLVKNSWNTSWGNKGYIQMSR 311
Query: 333 G-DGSCGINDYVRSALV 348
+CGI LV
Sbjct: 312 DKKNNCGIASQASYPLV 328
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 119/331 (35%), Positives = 171/331 (51%), Gaps = 28/331 (8%)
Query: 22 FMVVGDEKL--HHLHHVKHTALFNY--FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ 77
+ VG ++ LH + + + + ++ +++K Y E R IF N+ I+
Sbjct: 17 LLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFN 76
Query: 78 DTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYD 136
+ G+N +DL+ EF+A G K Y + N+T +P + DWR+
Sbjct: 77 AAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTSFKYENVTAIPASVDWRKKG 136
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSIS 194
AVT +KDQ CGS WAFST EG++ T KLVSLSEQEL+DCD++ D GCEGG +
Sbjct: 137 AVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEGGYME 196
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
+ F+ I+ GG+ E YPY+ D +C+ N A +I GY V + V N
Sbjct: 197 DGFEFIIKN--GGITTEANYPYKAVDGSCK-NATAPAAQIKGYEKVPVNSEKALLKAVAN 253
Query: 255 GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
P++V+I+A + FY +G+ F + G E L H V VGYG YW
Sbjct: 254 QPVSVSIDAADGSFMFYSSGI-----FTGECGTE-LDHGVTAVGYG------RANGTDYW 301
Query: 313 IIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
I+KNSWG WGE+GY R+ RG +G CGI
Sbjct: 302 IVKNSWGTVWGEQGYIRMQRGIAAKEGLCGI 332
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 126/350 (36%), Positives = 180/350 (51%), Gaps = 37/350 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATL--VEYYSRLHIFS 67
A +L +S+ S+ +K + ++ + +H K + E R IF
Sbjct: 21 TATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNIDGSEKDKRFEIFK 80
Query: 68 GNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---------SYADRSVP 118
NL+ I + E+ + GLN F+DLS E++++YLG K+ P + ++R P
Sbjct: 81 DNLKFIDE-HNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMARTKTRSNRYAP 139
Query: 119 AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
++ LP++ DWR AV VKDQ CGS WAFST +EG+ T +LVSLSEQEL
Sbjct: 140 SV--GDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTGELVSLSEQEL 197
Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKING 236
+DCD+ + GC+GG + AF+ I++ GG++ ++ YPYRG D C + K A V I+
Sbjct: 198 VDCDRTVNAGCDGGLMEYAFEFIINN--GGIDSDEDYPYRGVDGKCDQYKKNARVVSIDD 255
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
Y V + K V N P++VAI A Q YV+G+ F L H V
Sbjct: 256 YEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGI------FTGKCGTALDHGVTA 309
Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
VGYG T V YWI++NSWG+ WGE GY R+ R G CGI
Sbjct: 310 VGYG------TENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGI 353
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 160/319 (50%), Gaps = 24/319 (7%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
+LH ++ Q+ + Y E R IF N+ +I+ S +NE
Sbjct: 28 RNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINE 87
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGS 149
F+DL+ EF+A FK + + N+T +P DWR+ AVT +KDQ CGS
Sbjct: 88 FADLTNEEFRASRNRFKAHIC-STEATSFKYENVTAVPSTVDWRKKGAVTPIKDQGQCGS 146
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGG 207
WAFS +EG+ T KL+SLSEQEL+DCD ED GC GG + +AF I + G
Sbjct: 147 CWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFI--EQNHG 204
Query: 208 LEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--Y 264
L E YPY G D C K A KINGY V + + V + P+AVAI+A
Sbjct: 205 LTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGS 264
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
QFY +GV F G E L H V VGYG + + YW++KNSWG GWGE
Sbjct: 265 EFQFYSSGV-----FTGQCGTE-LDHGVSAVGYGT-----SDDGMKYWLVKNSWGTGWGE 313
Query: 325 KGYFRLYRG----DGSCGI 339
+GY R+ R +G CGI
Sbjct: 314 EGYIRMQRDVTAKEGLCGI 332
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 167/318 (52%), Gaps = 24/318 (7%)
Query: 30 LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--G 87
L + +H F F ++ K Y ++ E R FS NL L++ T Y G
Sbjct: 50 LQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNL---DLIRSTNCKGLSYRLG 106
Query: 88 LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
LN+F+D S EFQ LG S + + ++ LP DWRE V+ VKDQ C
Sbjct: 107 LNKFADWSWEEFQRHRLGAAQNCSATTKGNHKLTADV-LPETKDWRESGIVSPVKDQGHC 165
Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLG 205
GS W FSTTG++E Y K +SLSEQ+L+DC Q + GC GG S AF+ I K
Sbjct: 166 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI--KYN 223
Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS---RDETDMAKYLVENGPMAVAIN 262
GGL+ E+ YPY G D C+ + + V++ V+++ DE A LV P++VA
Sbjct: 224 GGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR--PVSVAFE 281
Query: 263 AY-ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
+FY +GV + C +++H+V+ VGYGV+ VPYW+IKNSWGE
Sbjct: 282 VVDGFRFYKSGVYSSTK--CGNTPMDVNHAVVAVGYGVE------DGVPYWLIKNSWGEN 333
Query: 322 WGEKGYFRLYRGDGSCGI 339
WG+ GYF++ G CGI
Sbjct: 334 WGDHGYFKIKMGKNMCGI 351
>gi|426345827|ref|XP_004040600.1| PREDICTED: cathepsin O [Gorilla gorilla gorilla]
Length = 321
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
R + L +E+ + YG+N+FS L EF+A YL + KPS R V IPN++LP
Sbjct: 52 RYLNSLFPSENSTAFYGINQFSHLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 109
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
FDWR+ VT V++Q MCG WAFS G +E YA K K L LS Q++IDC + G
Sbjct: 110 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 169
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
C GGS NA + ++K+ L ++ YP++ + C + + I GY + S E
Sbjct: 170 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAHDFSNQE 228
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+MAK L+ GP+ V ++A + Q Y+ G+ IQ C G N H+VLI G+ D+T
Sbjct: 229 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 281
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T PYWI++NSWG WG GY + G CGI D V S V
Sbjct: 282 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 139/350 (39%), Positives = 182/350 (52%), Gaps = 37/350 (10%)
Query: 6 FFAGVALLSLTVSVSSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
F V++L+ + F ++G E L +H V H LF +L +H+K Y +L E R
Sbjct: 13 LFLFVSILACSPLAHEFSILGYAPEDLTSIHKVIH--LFESWLVKHSKFYESLDEKLHRF 70
Query: 64 HIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAM 120
IF NL+ I +T Y GLNEF+DL+ EF+ K+LGFK + D S
Sbjct: 71 EIFMDNLKHID---ETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDESSKEF 127
Query: 121 --IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
+ LP++ DWR+ AV VK+Q CG+ WAFST +EG+ T L LSEQEL
Sbjct: 128 GYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLTMLSEQEL 187
Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKING 236
IDCD ++GC GG + AF +M GL +E+ YPY + C K ++ V I+G
Sbjct: 188 IDCDTTFNNGCNGGLMDYAFAYVMR---SGLHKEEEYPYIMSEGTCDEKKDVSEKVTISG 244
Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVL 293
Y V R DE K L N P++VAI A QFY GV F G E L H V
Sbjct: 245 YHDVPRNDEASFLKALA-NQPISVAIEASGRDFQFYSGGV-----FDGHCGTE-LDHGVA 297
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
VGYG T K + Y I++NSWG WGEKGY R+ RG G CG+
Sbjct: 298 AVGYG------TTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGL 341
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 121/341 (35%), Positives = 169/341 (49%), Gaps = 35/341 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+A L+ V+ S D ++ H ++ ++ K Y E R IF N
Sbjct: 18 MAFLAFQVTCRSLQ---DASMYERHE--------QWMTRYGKVYKDPQEREKRFRIFKEN 66
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPNIT-LP 127
+ I+ + + +N+F+DL+ EF A FK S R+ N+T +P
Sbjct: 67 VNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTTFKYENVTAVP 126
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--D 185
DWR+ AVT +KDQ CG WAFS EG++A + KL+SLSEQEL+DCD + D
Sbjct: 127 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVD 186
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVK-INGYVSVSRDE 244
GCEGG + +AF ++ GL E YPY+G D C +N+ A I GY V +
Sbjct: 187 QGCEGGLMDDAFKFVIQN--HGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDVPANN 244
Query: 245 TDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
+ V N P++VAI+A QFY +GV F L H V VGYGV
Sbjct: 245 EKALQKAVANQPVSVAIDASGSDFQFYKSGV------FTGSCGTELDHGVTAVGYGV--- 295
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
++ YW++KNSWG WGE+GY R+ RG +G CGI
Sbjct: 296 --SNDGTEYWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCGI 334
>gi|344295866|ref|XP_003419631.1| PREDICTED: cathepsin W-like [Loxodonta africana]
Length = 376
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 117/336 (34%), Positives = 176/336 (52%), Gaps = 41/336 (12%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F Q+N++Y+ E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF+
Sbjct: 41 VFALFQLQYNRSYSNPAEHARRLDIFARNLAQAQQLQEEDLGTAKFGVTPFSDLTEEEFR 100
Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWREY-DAVTGVKDQTMCG 148
Y + P PN++ +P DWR+ + + V++Q C
Sbjct: 101 Q---------VYGQQKAPGRAPNVSRKAGPKEWGRPVPATCDWRKMANVIKPVRNQKNCK 151
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
WA + GNIE ++ K + V +S QEL+DC + DGC GG + +AF T+++ GL
Sbjct: 152 CCWAMAVAGNIEALWGIKYSQSVEVSVQELLDCGRCGDGCGGGFVWDAFITVLN--NSGL 209
Query: 209 EEEKTYPYRGDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
EK YP++G+ KA C+ K I ++ + DE +A YL GP+ V IN L
Sbjct: 210 ASEKDYPFQGNVKAHKCQAKKHTNVAWIQDFIMLQDDEQIIAGYLATQGPITVTINMKLL 269
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT--------------KFTHKAVPYW 312
Q Y GV CD ++HSVL+VG+G ++ +++PYW
Sbjct: 270 QHYQKGVIRAKSNDCDP--HRVNHSVLLVGFGKGKSVARMPAETPQGGAPAHPSRSIPYW 327
Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
I+KNSWG WGE+GYFRL+RG +CGI Y +A V
Sbjct: 328 ILKNSWGSNWGEEGYFRLHRGSNTCGITKYPLTARV 363
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 114/312 (36%), Positives = 171/312 (54%), Gaps = 30/312 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
F + K+Y+ VE +R ++ N ++L D +G+G++ G+N F+DL+
Sbjct: 30 FEAWKRTFGKSYSDAVEEINRRAVWEAN----KMLVDAHNGAGIHSYTLGMNIFADLTHE 85
Query: 98 EFQAKYLGFKL---KPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
EF+ YLG K+ +P S N+ LP + DWR VT VKDQ CGS W+F
Sbjct: 86 EFKRFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSF 145
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
STTG++EG +A KT +LVSLSEQ L+DC Q + GC GG + +AF I++ G++ E
Sbjct: 146 STTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNK--GIDTE 203
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
+YPY D C+ N ++ + ++R E+D+ + GP++VAI+A + Q
Sbjct: 204 ASYPYTAKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQL 263
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +GV + + C + +L H VL GYG T PYW++KNSWG WG+ GY
Sbjct: 264 YTSGVYNEKK--CS--STSLDHGVLAAGYG------TSNGTPYWLVKNSWGSSWGQAGYI 313
Query: 329 RLYR-GDGSCGI 339
+ R + CGI
Sbjct: 314 WMSRNANNQCGI 325
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 108/302 (35%), Positives = 159/302 (52%), Gaps = 25/302 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM--------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
G + SY S + + + +P DWRE AVT VK Q CG WAFS
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAV 161
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
G++EG Y T KL+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 GSLEGAYKIATGKLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEY 219
Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSH 275
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 LGEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY- 277
Query: 276 PIQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R
Sbjct: 278 ------DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDS 326
Query: 335 GS 336
G+
Sbjct: 327 GN 328
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 112/310 (36%), Positives = 160/310 (51%), Gaps = 28/310 (9%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKY 103
++ +H + YA + E +R +F N+ +I+ L + G +N+F+DL+ EF++ Y
Sbjct: 42 WMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMY 101
Query: 104 LGFK----LKPSYADRSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
G+K L ++ N++ LP + DWR+ AVT +K+Q CG WAFS
Sbjct: 102 TGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAV 161
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
IEG K KL+SLSEQ+L+DCD D GC GG + AF+ IM+ GGL E YPY
Sbjct: 162 AAIEGATKIKKGKLISLSEQQLVDCDTNDFGCSGGLMDTAFEHIMAT--GGLTTESNYPY 219
Query: 217 RGDDKACRL-NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGV 273
+G D C++ N K T I GY V ++ V + P+++ I + QFY +GV
Sbjct: 220 KGKDATCKIKNTKPTATSITGYEDVPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGV 279
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
F L H+V VGYG + YWIIKNSWG WGE GY R+ +
Sbjct: 280 ------FTGECTTYLDHAVTAVGYGQ-----SSNGSKYWIIKNSWGTKWGESGYMRIKKD 328
Query: 334 ----DGSCGI 339
G CG+
Sbjct: 329 VKDKKGLCGL 338
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 130/326 (39%), Positives = 173/326 (53%), Gaps = 43/326 (13%)
Query: 35 HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY-GLNEFSD 93
H K LF ++ K Y T+ E + R +F NL+ I + + G + GLNEF+D
Sbjct: 44 HDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHID--ETNKKGKSYWLGLNEFAD 101
Query: 94 LSTAEFQAKYLGFKL-------KPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKD 143
LS EF+ YLG K + SYA+ R V A +P++ DWR+ AV VK+
Sbjct: 102 LSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA------VPKSVDWRKKGAVAEVKN 155
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMS 202
Q CGS WAFST +EG+ T L +LSEQELIDCD ++GC GG + AF+ I+
Sbjct: 156 QGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK 215
Query: 203 KLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVA 260
GGL +E+ YPY ++ C + K ++ V ING+ V + DE + K L P++VA
Sbjct: 216 N--GGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQ-PLSVA 272
Query: 261 INAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A QFY GV F +L H V VGYG + K Y I+KNSW
Sbjct: 273 IDASGREFQFYSGGV------FDGRCGVDLDHGVAAVGYG------SSKGSDYIIVKNSW 320
Query: 319 GEGWGEKGYFRLYRG----DGSCGIN 340
G WGEKGY RL R +G CGIN
Sbjct: 321 GPKWGEKGYIRLKRNTGKPEGLCGIN 346
>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 127/331 (38%), Positives = 172/331 (51%), Gaps = 32/331 (9%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
+ H + + QH K Y T E YSR I N KI EH S
Sbjct: 18 LPHNKEWEMWKLQHGKQYETEAEEYSRRFILEKNTVKI-----AEHNIRASLGMHSYTLA 72
Query: 88 LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+N+F D+ EF + +G LK V N TLP++ DWR V+ VKDQ
Sbjct: 73 MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++ + GC GG + AF I
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190
Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
K GGL+ E++YPY DDK C+ + + + GY V S +E + + + GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250
Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A + QFY +GV Q C E L H VL VGYG +H+A +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303
Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
G WG++GY + R + CGI LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 177/354 (50%), Gaps = 33/354 (9%)
Query: 1 MSCFYFFAGVALLSLTVSVSSF--MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
M+ +F + ++ L S+ S +V L L ++ ++ H + Y +E
Sbjct: 1 MASNFFLKNITVVLLLFSILSLYPFIVTSRNLKELSMLER---HENWMVHHGRVYKDDIE 57
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADRS 116
R F N+ I+ ++G+ Y L N+++DL+T EF ++G S
Sbjct: 58 KEHRFKTFKENVEFIESFN--KNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQES 115
Query: 117 VPAMIP----NIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
++T +P + DWR+ +VTGVKDQ +CG WAFS IEG Y +L+
Sbjct: 116 TATTTSFKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELI 175
Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
SLSEQ+L+DC ++ GCEGG ++ A+D ++ GGG+ E YPY C+ + A
Sbjct: 176 SLSEQQLLDCSTQNKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQPAA- 234
Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINAY-ALQFYVTGVSHPIQFFCDGG-NENLS 289
V INGY V DE+ + K +V N P++V I A Y +G+ DG N L+
Sbjct: 235 VTINGYEVVPSDESSLLKAVV-NQPISVGIAANDEFHMYGSGIY-------DGSCNSRLN 286
Query: 290 HSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
H+V ++GYG YWI+KNSWG WGE+GY R+ R G CGI
Sbjct: 287 HAVTVIGYGTSE----EDGTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHCGI 336
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 131/351 (37%), Positives = 187/351 (53%), Gaps = 30/351 (8%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
VA+L++ +S + D +L +H L+ + H K Y E + R+ ++ N
Sbjct: 4 VAVLAVCLSAALSAPSLDPQLD-----EHWDLWKSW---HTKKYHEKEEGWRRM-VWEKN 54
Query: 70 LRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN-I 124
L+KI+L + EH G + G+N F D++ EF+ G+K K + M PN +
Sbjct: 55 LKKIEL-HNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRKSERKFKGSLFMEPNFL 113
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
PR+ DWR+ VT VKDQ CGS WAFSTTG +EG + KT KLVSLSEQ L+DC +
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173
Query: 185 D--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV- 240
+ +GC GG + AF I K GL+ E +YPY G DD+ C + K G++ +
Sbjct: 174 EGNEGCNGGLMDQAFQYI--KDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIP 231
Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
S E + K + GP++VAI+A + QFY +G I + + +E L H VL+VGYG
Sbjct: 232 SGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG----IYYEKECSSEELDHGVLVVGYG 287
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
+ K YWI+KNSW E WG+KGY + + CGI LV
Sbjct: 288 FEGEDVDGKK--YWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 124/310 (40%), Positives = 161/310 (51%), Gaps = 32/310 (10%)
Query: 47 EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYL 104
++H+ E + R F N+R I + + G Y L N F D+ EF+A +
Sbjct: 50 QEHHHVPRHHGEKHRRFGAFKDNVRYIH--EHNKRGGRGYRLRLNRFGDMGREEFRATFA 107
Query: 105 GFKLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
G D +P LPRA DWR AVTGVKDQ CGS WAFST +
Sbjct: 108 GSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCWAFSTVVS 167
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
+EG+ A +T +LVSLSEQELIDCD D+ GC+GG + NAF+ I K GG+ E YPYR
Sbjct: 168 VEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYR 225
Query: 218 GDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
+ C ++A V I+G+ +V + V N P++VAI+A + QFY GV
Sbjct: 226 AANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV 285
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
F D G + L H V +VGYG T+ YWI+KNSWG WGE GY R+ R
Sbjct: 286 -----FAGDCGTD-LDHGVAVVGYGE-----TNDGTEYWIVKNSWGTAWGEGGYIRMQRD 334
Query: 334 DGS----CGI 339
G CGI
Sbjct: 335 SGYDGGLCGI 344
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 117/308 (37%), Positives = 168/308 (54%), Gaps = 23/308 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ---LLQDTEHGSGVYGLNEFSDLSTAE 98
+ +L+ H K Y E R+ I+ GNL I+ L D S G+NE+ D++ E
Sbjct: 27 WQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEE 85
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
F++ G+K++ + S+ NI LP DWR VT +K+Q CGS W+FS TG
Sbjct: 86 FRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATG 145
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
++EG KT KL SLSEQ L+DC Q+ + GC+GG + +AF I K G++ E +YP
Sbjct: 146 SLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYI--KDNSGIDTESSYP 203
Query: 216 YRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y + CR N +G+ + S+ E+D+ + GP++VAI+A + Q Y +G
Sbjct: 204 YEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMSFQLYRSG 263
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V H +FFC L H VL VGYG + K YW++KNSWGE WG+KGY + R
Sbjct: 264 VYH--EFFCS--ETRLDHGVLAVGYGTESGK------DYWLVKNSWGESWGQKGYIMMSR 313
Query: 333 GD-GSCGI 339
+CGI
Sbjct: 314 NKRNNCGI 321
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/304 (36%), Positives = 150/304 (49%), Gaps = 21/304 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAK 102
++ +H +TY E R +F N + Y LNEF+D++ EF A
Sbjct: 54 WMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAM 113
Query: 103 YLGFKLKPSYADRSVPAMIPNITLPRA------FDWREYDAVTGVKDQTMCGSSWAFSTT 156
Y G + P+ A + N+TL A DWR+ AVTG+K+Q CG WAF+
Sbjct: 114 YTGLRPVPAGAKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAV 173
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
+EG++ T LVSLSEQ+++DCD E ++GC GG I NAF I GGL E YP
Sbjct: 174 AAVEGIHQITTGNLVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGN--GGLATEDAYP 231
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y C+ + I+GY V + V N P++VAI+A+ Q Y GV
Sbjct: 232 YTAAQAMCQSVQPV--AAISGYQDVPSGDEAALAAAVANQPVSVAIDAHNFQLYGGGVMT 289
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
NL+H+V VGYG PYW++KN WG+ WGE GY RL RG
Sbjct: 290 AASCSTP---PNLNHAVTAVGYGT-----AEDGTPYWLLKNQWGQNWGEGGYLRLERGAN 341
Query: 336 SCGI 339
+CG+
Sbjct: 342 ACGV 345
>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
Length = 1118
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 171/311 (54%), Gaps = 25/311 (8%)
Query: 32 HLHHVKHT-ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
HL+ ++ LF F++ +NK Y E R IF NL+ I + + + VYG+N+
Sbjct: 808 HLYSLEEAPTLFEQFIKDYNKEYDE-SEKEERFKIFVNNLKDINAMNE-RSSNAVYGINK 865
Query: 91 FSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTM 146
FSDLS EF Y G K + PS D + N+T P FDWR+ V+ VK Q
Sbjct: 866 FSDLSKDEFVKFYTGLKREESPSNEDHKKTDLPKSFNVTAPDQFDWRKKGVVSSVKFQGH 925
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGG-SISNAFDTIMSKLG 205
C S WAFS GN+E + A KT KL+ +SEQ+L+DCD+ + GC GG + S + + K G
Sbjct: 926 CVSCWAFSVAGNVESINAIKTGKLIDVSEQQLVDCDEWNFGCSGGIACSKSHFSYFHKKG 985
Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGY---VSVSRDETDMAKYLVENGPMAVAIN 262
E +YPY G + CR N +++ Y +++S DE + +YL GP+++ I+
Sbjct: 986 AMSLE--SYPYVGKEGQCRYNSSKVVIRLKDYQYFIALSEDE--IKEYLYNIGPLSIDID 1041
Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
+ + Y G+ + C + +H+VL+VGYG + V YWI+KNSWG+ W
Sbjct: 1042 SSQIHHYKGGI---VIKECQEVKKT-NHAVLLVGYGKEN------GVEYWIVKNSWGQNW 1091
Query: 323 GEKGYFRLYRG 333
GEKGYFR+ RG
Sbjct: 1092 GEKGYFRIQRG 1102
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 162/306 (52%), Gaps = 22/306 (7%)
Query: 25 VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
+G +L+ L LF F++ +NK Y E R IF NL+ I + + +
Sbjct: 504 LGQRRLYSLEEA--PTLFEQFIKDYNKEYDE-SEKEERFKIFVNNLKDINAMNE-RSSNA 559
Query: 85 VYGLNEFSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIP--NITLPRAFDWREYDAVTG 140
VYG+N+FSDLS EF Y G K + PS D + N+T P FDWR+ V+
Sbjct: 560 VYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWRKKGVVSS 619
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
+K+Q CGS WAFS GN+E ++A KT KLV +SEQ+L+DCD +D GC GG NA
Sbjct: 620 IKNQKHCGSCWAFSAAGNVESIHAIKTGKLVHVSEQQLVDCDSQDSGCSGGLTWNAMRYF 679
Query: 201 MSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAV 259
+ L K+YPY ++ CR + +++ Y +++ E + ++L G +++
Sbjct: 680 RTNGAVSL---KSYPYVAQNENCRYDSNKVVIRLKDYKHITQLSEDQIKEHLYNIGLLSI 736
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
I + L +Y G+ + C + + H+VL+V YG + + V YWI+KNSWG
Sbjct: 737 DITSTQLTWYEGGI---LIEECRRSDL-VDHAVLLVEYGKENS------VEYWIVKNSWG 786
Query: 320 EGWGEK 325
+ GEK
Sbjct: 787 QNGGEK 792
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 96/260 (36%), Positives = 141/260 (54%), Gaps = 26/260 (10%)
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIP--NITLPRAFDWREYDAV 138
+ VYG+N+FSDLS EF Y G K + PS D + N+T P FDWR+ V
Sbjct: 7 NAVYGINKFSDLSKEEFVKYYTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWRKKGVV 66
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
+ +K+Q CGS WAFS N+E ++A KT KL+ +SEQ+L+DCD+ D GC GG +D
Sbjct: 67 SSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGGL---PWD 123
Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPM 257
+ + G K+YPY + CR + ++++ Y + E + ++L GP+
Sbjct: 124 ALRYFVANGAMSLKSYPYVAKEGKCRYDSSKVEIRLKEYKHKEKLSEDQIKEHLYNIGPL 183
Query: 258 AVAINAYALQFYVTGV----SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
++AI + L Y G+ H ++H+VL+VGYG + V YWI
Sbjct: 184 SIAITSSPLASYNGGILIEECHRSYL--------INHAVLLVGYGKEN------GVKYWI 229
Query: 314 IKNSWGEGWGEKGYFRLYRG 333
+KNSWG+ WGE GYFR+ G
Sbjct: 230 VKNSWGQNWGENGYFRMKMG 249
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/171 (40%), Positives = 96/171 (56%), Gaps = 8/171 (4%)
Query: 25 VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
+G +L+ L LF F++ +NK Y E R IF NL+ I + + +
Sbjct: 287 LGQRRLYSLEEA--PTLFEQFIKDYNKEYDE-SEKEERFKIFVNNLKDINAMNE-RSSNA 342
Query: 85 VYGLNEFSDLSTAEFQAKYLGFKL-KPSYADRSVPAMIP---NITLPRAFDWREYDAVTG 140
VYG+N+FSDLS EF Y G K + + + +P NIT P FDWR+ V+
Sbjct: 343 VYGINKFSDLSKEEFIKYYTGLKRDRCTTTEHHKSTDLPKSFNITAPDQFDWRKKGVVSS 402
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGG 191
VK+Q CGS WAFS N+E ++A KT KL+ +SEQ+L+DCD+ D GC GG
Sbjct: 403 VKNQRHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGG 453
>gi|354472953|ref|XP_003498701.1| PREDICTED: cathepsin K [Cricetulus griseus]
Length = 329
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 166/303 (54%), Gaps = 23/303 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
H K Y + V+ SR I+ NL+ I + + E GV+ +N D+++ E K
Sbjct: 33 HRKQYNSKVDEISRRLIWEKNLKHISI-HNLEASLGVHTYELAMNHLGDMTSEEVVQKMT 91
Query: 105 GFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
G KL PS++ + IP P A D+R+ VT VK+Q CGS WAFS+ G +EG
Sbjct: 92 GLKLPPSHSHSNDTLYIPEWEGRAPDAIDYRKKGYVTPVKNQGECGSCWAFSSAGALEGQ 151
Query: 163 YAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
KT KL++LS Q L+DC E+ GC GG ++ AF + + GG++ E YPY G D++
Sbjct: 152 LKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTTAFRYVQTN--GGIDSEDAYPYVGQDQS 209
Query: 223 CRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQF 279
C N A K GY + E + + + GP++V+I+A + QFY GV +
Sbjct: 210 CMYNPTAKAAKCRGYREIPVGSEKALKRAVARVGPISVSIDASLTSFQFYSRGVYYDEN- 268
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCG 338
CDG +N++H+VL+VGYG K +WIIKNSWGE WG KGY L R + +CG
Sbjct: 269 -CDG--DNVNHAVLVVGYGA------QKGNKHWIIKNSWGESWGNKGYVLLARNRNNACG 319
Query: 339 IND 341
I +
Sbjct: 320 ITN 322
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 171/312 (54%), Gaps = 22/312 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
H K Y E + R+ ++ NL+KI+L + EH G + G+N F D++ EF+
Sbjct: 36 HGKKYHEKEEGWRRM-VWEKNLQKIEL-HNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMN 93
Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
G+K K R M PN + +P + DWRE VT VKDQ CGS WAFSTTG +EG
Sbjct: 94 GYKHKKERRFRGSLFMEPNFLEVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQM 153
Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
KT KLVSLSEQ L+DC + + +GC GG + AF I + GL+ E++YPY G DD
Sbjct: 154 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDQ--NGLDSEESYPYVGTDD 211
Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
+ C + K + G+V + S E + K + GP++VAI+A + QFY +G+ +
Sbjct: 212 QPCHYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEK 271
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
+ C +E L H VL VGYG + K YWI+KNSW E WG+KGY + +
Sbjct: 272 E--CS--SEELDHGVLAVGYGFEGEDVDGKK--YWIVKNSWSENWGDKGYVYMAKDRHNH 325
Query: 337 CGINDYVRSALV 348
CGI LV
Sbjct: 326 CGIATAASYPLV 337
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 134/351 (38%), Positives = 182/351 (51%), Gaps = 31/351 (8%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+A+L++ +S S D +L + + E HNK Y E + R+ ++ N
Sbjct: 5 LAVLAVCLSTVSAAPTVDREL--------DGHWQQWKEWHNKDYHEKEEGWRRM-VWEKN 55
Query: 70 LRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN-I 124
L+KI+L + EH G + +N F D+ EF+ G+K K R M PN +
Sbjct: 56 LKKIEL-HNLEHSLGKHSYRLAMNHFGDMPHEEFRQVMNGYKHKVRKI-RGSLFMEPNFL 113
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
P DWRE VT VKDQ CGS WAFSTTG +EG KT KLVSLSEQ L+DC +
Sbjct: 114 EAPSKLDWREKGYVTPVKDQGQCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRP 173
Query: 185 D--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV- 240
+ +GC GG + AF I K GGL+ EK YPY G DD+ C + + G+V +
Sbjct: 174 EGNEGCNGGLMDQAFQYI--KDNGGLDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIP 231
Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
S E + K + GP++VAI+A + QFY +G I + D +E+L H VL+VGYG
Sbjct: 232 SGKEHALMKAVTAVGPVSVAIDAGHESFQFYQSG----IYYEADCSSEDLDHGVLVVGYG 287
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
+ K YWI+KNSW E WG KGY + + CGI LV
Sbjct: 288 YEGENVDGKK--YWIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAASYPLV 336
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 136/353 (38%), Positives = 188/353 (53%), Gaps = 32/353 (9%)
Query: 10 VALLSLTVSVSSFM--VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFS 67
+ LL LT +SS + V D +L+ +H L+ + H+K Y E + R+ ++
Sbjct: 2 LPLLVLTACLSSVLSAPVLDAQLN-----EHWDLWKSW---HSKKYHEKEEGWRRM-VWE 52
Query: 68 GNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN 123
NL+KI+L + EH G + G+N F D++ EF+ G+KLK M PN
Sbjct: 53 KNLQKIEL-HNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKTQRKFTGSLFMEPN 111
Query: 124 -ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
+T P A DWRE VT VKDQ CGS WAFSTTG +EG KT KLVSLSEQ L+DC
Sbjct: 112 FMTAPSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCS 171
Query: 183 QED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVS 239
+ + +GC GG + AF + GL+ E +YPY G DD+ C + G+V
Sbjct: 172 RPEGNEGCGGGLMDQAFQYVTDNQ--GLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVD 229
Query: 240 V-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
V S E + K + GP++VAI+A + QFY +G+ + + C +E L H VL VG
Sbjct: 230 VPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKE--CS--SEELDHGVLAVG 285
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
YG + K +WI+KNSWGE WG+KGY + + CGI LV
Sbjct: 286 YGFEGEDKMGKK--FWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 166/305 (54%), Gaps = 28/305 (9%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ +L E + R ++F NL + + + LN+F+D++ EF++ Y G K+
Sbjct: 46 HHTVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLK-LNKFADMTNHEFRSTYAGSKV 104
Query: 109 KPSYADRSVP----AMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
R P A + +++P + DWR+ AVT VKDQ CGS WAFST +EG+
Sbjct: 105 NHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGI 164
Query: 163 YAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
KT KLV+LSEQEL+DCD+E++ GC GG + +AF+ I K GG+ E YPY+ +
Sbjct: 165 NQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEG 222
Query: 222 ACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQ 278
C +K V I+G+ +V ++ D V N P++VAI+A QFY GV
Sbjct: 223 TCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV----- 277
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
F + +L+H V IVGYG T YWI++NSWG WGE GY R+ R +
Sbjct: 278 -FTGDCSTDLNHGVAIVGYGT-----TVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKE 331
Query: 335 GSCGI 339
G CGI
Sbjct: 332 GLCGI 336
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/345 (35%), Positives = 169/345 (48%), Gaps = 31/345 (8%)
Query: 5 YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
Y + +ALL + + +S LH ++ ++ + Y E R
Sbjct: 7 YQYVSMALLFILAAWAS-----QATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFK 61
Query: 65 IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI 124
IF N+ +I+ + +NEF+DL+ EF++ L + K + N+
Sbjct: 62 IFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRS--LRNRFKAHICSEATTFKYENV 119
Query: 125 T-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
T +P DWR+ AVT +KDQ CG WAFS EG+ T KL+SLSEQEL+DCD
Sbjct: 120 TAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDT 179
Query: 184 --EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSV 240
E+ GC GG + +AF I GL E TYPY GDD C K+A KI GY V
Sbjct: 180 GGENQGCSGGLMDDAFRFIKIH---GLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDV 236
Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ + V + P+AVAI+A + QFY +GV F G E L H V VGYG
Sbjct: 237 PANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGV-----FTGQCGTE-LDHGVAAVGYG 290
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
+ + YW++KNSWG GWGE+GY R+ R +G CGI
Sbjct: 291 IG-----DDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGI 330
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 126/362 (34%), Positives = 187/362 (51%), Gaps = 52/362 (14%)
Query: 1 MSCFYFFAGVALLSLTVSVSSFMVVG---DEKLHHLHHVKHTALFNYFLEQHNKTYATLV 57
M+ FF +L++ ++++ + G DE + ++ +L +H K Y L
Sbjct: 4 MTILPFFLFFSLITFSLALDIQLPTGRSNDEVM---------TMYEEWLVKHQKVYNGLR 54
Query: 58 EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
E R IF NL I + ++ + + GLN+F+D++ E++ YLG + +D
Sbjct: 55 EKDQRFQIFKDNLNFIDE-HNAQNYTYIVGLNKFADMTNEEYRDMYLGTR-----SDIKR 108
Query: 118 PAMIPNIT-----------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
M IT LP DWR A+T +KDQ CGS WAFST +E +
Sbjct: 109 RIMKNKITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIV 168
Query: 167 TKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR- 224
T KLVSLSEQEL+DCD+ ++GC GG + AF+ I+ GG++ ++ YPY+G + C
Sbjct: 169 TGKLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIGN--GGIDTDQHYPYKGFEGRCDP 226
Query: 225 LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCD 282
KKA V I+GY V + + K V + P++VAI A ALQ Y +GV F
Sbjct: 227 TRKKAKIVSIDGYEDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGV------FTG 280
Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSC 337
+L H+V+IVGYG + + YW+++NSWG WGE GYF++ R G C
Sbjct: 281 KCGTSLDHAVVIVGYG------SENGLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKC 334
Query: 338 GI 339
GI
Sbjct: 335 GI 336
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 347
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/311 (35%), Positives = 169/311 (54%), Gaps = 16/311 (5%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
V+ A F + +H KTYAT EY RL +++ N ++ L + + + LN+F+DL+
Sbjct: 37 VQRAAEFERWTIKHKKTYATAEEYNWRLRVYTANHYYVKRLNEGHGPATEFELNQFADLT 96
Query: 96 TAEFQAKYLGFK---LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
AEF+ YL + + + +P N+ P A DWR+ + +T V+DQ CGS WA
Sbjct: 97 FAEFKRIYLSSSSQHCRATTGNFQMPVKKNNVEDPVAIDWRKRNVITPVRDQGSCGSCWA 156
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEE 210
FS T + A KT +L+SLS+Q+L+DC + + GC+GG S AF+ I + GG+E
Sbjct: 157 FSATSCLSAHLALKTGQLISLSKQQLLDCSRSFNNRGCKGGLPSQAFEYI--RYNGGIES 214
Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRD-ETDMAKYLVENGPMAVAINAY-ALQF 268
E+ YPY+ ++ C + G V+ ++ E D+A L GP+++ I++ +
Sbjct: 215 ERDYPYKDREEKCHFKPSLVAATVTGVVNFTQGAEDDIAVALANIGPVSIGIHSTKSFAT 274
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y G+ C ++H+VLIVGY D+T K YWI KNSWG WG GYF
Sbjct: 275 YKKGIYQGK--LCSKNPRKINHAVLIVGY--DQTASGEK---YWIGKNSWGTNWGMNGYF 327
Query: 329 RLYRGDGSCGI 339
+ RG +CG+
Sbjct: 328 WIRRGHNACGL 338
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 171/310 (55%), Gaps = 23/310 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
+N + QH K+Y VE R+ I+ NLRKI+ + E+ G + G+N+F D++
Sbjct: 28 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQ-HNFEYSLGNHTFKMGMNQFGDMTNE 85
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
EF+ G+K P+ + M P+ P+ DWR+ VT VKDQ CGS W+FS+T
Sbjct: 86 EFRQAMNGYKQDPNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSST 145
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG KT KL+S+SEQ L+DC Q + GC GG + AF + K GL+ E++Y
Sbjct: 146 GALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYV--KENKGLDSEQSY 203
Query: 215 PYRG-DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYV 270
PY DD CR + + KI G+V + R +E + + GP++VAI+A +LQFY
Sbjct: 204 PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQ 263
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G+ ++ L H+VL+VGYG YWI+KNSW + WG+KGY +
Sbjct: 264 SGI-----YYERACTSRLDHAVLVVGYGYQGADVAGNR--YWIVKNSWSDKWGDKGYIYM 316
Query: 331 YRG-DGSCGI 339
+ + CGI
Sbjct: 317 AKDKNNHCGI 326
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 130/325 (40%), Positives = 173/325 (53%), Gaps = 25/325 (7%)
Query: 28 EKLHHLHHVKHTALFNYFLEQHNKTYAT-LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
EKL A F ++ Q+ K YA + E +R ++ NL I L + S
Sbjct: 31 EKLLLDAKANPMAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYI-LAYNARTTSHWL 89
Query: 87 GLNEFSDLSTAEFQAKYLGFKLKPSYAD---RSVPAMIPNI---TLPRAFDWREYDAVTG 140
LN F+DL+T EF+ + LG+ K A +S P + N+ LP DWR+ AVT
Sbjct: 90 HLNAFADLTTDEFRNR-LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAVTE 148
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDT 199
VK+Q CGS WAF+TTG++EG+ A T +L SLSEQEL+DCD ED GC GG + A+
Sbjct: 149 VKNQGQCGSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGGLMDYAYQW 208
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMA 258
I+ GGL+ E YPY +D C KK + V I+GYV + ++ K + P+A
Sbjct: 209 IIKN--GGLDTEDDYPYTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIA 266
Query: 259 VAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
VAI A A F + G C +L+H VL+VGYG D F + YWI+KNSW
Sbjct: 267 VAIEADAKSFQLYGGGVYDDPTC---GTSLNHGVLVVGYGKD-PHFGN----YWIVKNSW 318
Query: 319 GEGWGEKGYFRLYRG----DGSCGI 339
G WG+ GY RL G G CGI
Sbjct: 319 GPEWGDNGYIRLRMGAEDVQGMCGI 343
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 166/305 (54%), Gaps = 28/305 (9%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ +L E + R ++F NL + + + LN+F+D++ EF++ Y G K+
Sbjct: 45 HHTVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLK-LNKFADMTNHEFRSTYAGSKV 103
Query: 109 KPSYADRSVP----AMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
R P A + +++P + DWR+ AVT VKDQ CGS WAFST +EG+
Sbjct: 104 NHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGI 163
Query: 163 YAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
KT KLV+LSEQEL+DCD+E++ GC GG + +AF+ I K GG+ E YPY+ +
Sbjct: 164 NQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEG 221
Query: 222 ACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQ 278
C +K V I+G+ +V ++ D V N P++VAI+A QFY GV
Sbjct: 222 TCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV----- 276
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
F + +L+H V IVGYG T YWI++NSWG WGE GY R+ R +
Sbjct: 277 -FTGDCSTDLNHGVAIVGYGT-----TVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKE 330
Query: 335 GSCGI 339
G CGI
Sbjct: 331 GLCGI 335
>gi|323454466|gb|EGB10336.1| hypothetical protein AURANDRAFT_22962 [Aureococcus anophagefferens]
Length = 416
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 124/330 (37%), Positives = 183/330 (55%), Gaps = 30/330 (9%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
+LF+ F+++++K+Y T EY R IFS NL I L +T++ ++GLN F+D + E
Sbjct: 87 SLFDQFIDEYSKSYDTTHEYNDRFTIFSKNLNYIDAL-NTQNPHALFGLNVFADQTEEER 145
Query: 100 QAKYLGFKLKPSY--------ADRSVPAMIPNI------TLPRAFDWREYDAVTGVKDQT 145
+ + +Y +D + + P LP FDWRE AVT VK+Q
Sbjct: 146 SKRRMTDPSITNYTRVGWASGSDCAACNLYPAFGEYDMGNLPDDFDWRELGAVTRVKNQA 205
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
CGS W+FST ++EG + T L S + Q+L++C+ + GC+GG A +S G
Sbjct: 206 YCGSCWSFSTAADLEGTHYLATGDLESYAPQQLVECNTMNLGCDGGYPFAAM-QYLSHFG 264
Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQ---VKINGY--VSVSRD-ETDMAKYLVENGPMAV 259
G + E T PY+ K LN+K I+G+ V++ D E+ M LV+NGP+++
Sbjct: 265 GMVTWE-TMPYK---KIELLNEKLEDGDVAHISGWQMVAMGADYESLMRVTLVKNGPLSI 320
Query: 260 AINAYALQFYVTGVSHPIQFF-CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
A NA + +YV GV F CD +L H+VL+VGYGV T K VPYW+IKNSW
Sbjct: 321 AFNANGMDYYVHGVDGDGDMFTCD--PTSLDHAVLVVGYGVQHTDGNGK-VPYWVIKNSW 377
Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ WGE GY+RL RG +CG+ + V ++V
Sbjct: 378 DDVWGEDGYYRLVRGSNACGVANMVVHSIV 407
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 123/317 (38%), Positives = 165/317 (52%), Gaps = 29/317 (9%)
Query: 35 HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSD 93
H +H N++ K Y E R IF+ N++ I+ + ++ S G+N+F+D
Sbjct: 36 HERHERWMNHY----GKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFAD 91
Query: 94 LSTAEFQAKYLGFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
L+ EF A FK S R+ N++ +P DWR+ AVT VK+Q CG W
Sbjct: 92 LTNEEFVASRNKFKGHMCSSIIRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCW 151
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
AFS EG++ T KLVSLSEQEL+DCD + D GCEGG + +AF I+ GL
Sbjct: 152 AFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH--GLN 209
Query: 210 EEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--L 266
E YPY+G D C NK + Q I GY V + + V N P++VAI+A
Sbjct: 210 TEAQYPYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDF 269
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY +GV F G E L H V VGYGV ++ YW++KNSWG WGE+G
Sbjct: 270 QFYKSGV-----FTGSCGTE-LDHGVTAVGYGV-----SNDGTKYWLVKNSWGTDWGEEG 318
Query: 327 YFRLYRG----DGSCGI 339
Y + RG +G CGI
Sbjct: 319 YIMMQRGVEAAEGLCGI 335
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 124/349 (35%), Positives = 186/349 (53%), Gaps = 40/349 (11%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
FY L L + F D + H + QH +TYA + + R
Sbjct: 3 FYLCLASLCLGLVAATPEFDQTLDSQWHQ------------WKAQHRRTYAANEDGWRRA 50
Query: 64 HIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
+ NL+ I++ + E+ +G + G+N+F D++T EF+ G+ S R+ +
Sbjct: 51 -TWEKNLKMIEM-HNLEYSAGKHSFQLGMNKFGDMTTEEFKQVMNGYNSNGS-QKRTKGS 107
Query: 120 MIPN---ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ LP++ DWRE VT VK+Q CGS WAFS TG++EG + KTKKLVSLSEQ
Sbjct: 108 LYREPLLAQLPKSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQ 167
Query: 177 ELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI 234
L+DC + ++GC GG + NAF+ + K GG++ E+ YPY G D C+ + + +
Sbjct: 168 NLVDCSTSEGNNGCSGGLMDNAFEYV--KNNGGIDTEQAYPYLGQDNECKYRAECSGANV 225
Query: 235 NGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHS 291
G+V + S +E + K + GP++VAI+A + QFY +GV + Q C + L H
Sbjct: 226 TGFVDIPSMNERALMKAVANVGPISVAIDAGNPSFQFYESGVYYEPQ--CS--SSQLDHG 281
Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGI 339
VL+VGYG + YWI+KNSWGE WG+KGY + + + CGI
Sbjct: 282 VLVVGYG------SIGKDEYWIVKNSWGEEWGKKGYVLMAKFRNNHCGI 324
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 126/331 (38%), Positives = 173/331 (52%), Gaps = 32/331 (9%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
+ H + + QH K Y T E YSR IF N KI EH S
Sbjct: 18 LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72
Query: 88 LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+N+F D+ EF + +G LK V N TLP++ DWR V+ VKDQ
Sbjct: 73 MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++ + GC GG + AF I +
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYITA 192
Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
GGL+ E++YPY DD+ C+ + + + GY V S +E + + + GP++VA
Sbjct: 193 N--GGLDTEESYPYTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250
Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A + QFY +GV Q C E L H VL VGYG +H+A +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303
Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
G WG++GY + R + CGI LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 118/309 (38%), Positives = 167/309 (54%), Gaps = 25/309 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAE 98
+N F +Q+NK Y E RL ++ NL I L D + G+NE+ D++ E
Sbjct: 27 WNIFKKQYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEE 85
Query: 99 FQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F G++++ ++ V M PN LP DWR VT +K+Q CGS W+FS T
Sbjct: 86 FTKTMNGYRMRNKTSNAPV-FMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSAT 144
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G++EG KT KLVSLSEQ L+DC Q + GCEGG + +AF I K G++ E +Y
Sbjct: 145 GSLEGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYI--KANNGIDTEASY 202
Query: 215 PYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
PY+ D C G+V + ++DE + + + GP++VAI+A + Q Y T
Sbjct: 203 PYKARDGKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRT 262
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV H +FC L H VL VGYG + +K YW++KNSWGE WG+KGY ++
Sbjct: 263 GVYH--DWFC--SQTKLDHGVLAVGYGTEDSK------DYWLVKNSWGESWGQKGYIQMS 312
Query: 332 RG-DGSCGI 339
R +CGI
Sbjct: 313 RNRRNNCGI 321
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 121/296 (40%), Positives = 157/296 (53%), Gaps = 27/296 (9%)
Query: 58 EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK-----PSY 112
E R F N+R I LN F D+ EF++ + ++ S
Sbjct: 57 EKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESP 116
Query: 113 ADRSVPA-MIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
A +VP M +T LP + DWR+ AVT VKDQ CGS WAFST ++EG+ A +T L
Sbjct: 117 AAPAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSL 176
Query: 171 VSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR--LNKK 228
VSLSEQELIDCD +++GC+GG + NAF+ I S GG+ E YPYR + C +++
Sbjct: 177 VSLSEQELIDCDTDENGCQGGLMENAFEFIKSY--GGVTTESAYPYRASNGTCDSVRSRR 234
Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNE 286
V I+G+ V D V N P++VAI+A A QFY GV F D G
Sbjct: 235 GQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGV-----FTGDCGT- 288
Query: 287 NLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS---CGI 339
+L H V VGYGV + YWI+KNSWG WGE GY R+ RG G+ CGI
Sbjct: 289 DLDHGVAAVGYGV-----SDDGTAYWIVKNSWGPSWGEGGYIRMQRGAGNGGLCGI 339
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 167/324 (51%), Gaps = 37/324 (11%)
Query: 38 HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG------LNEF 91
+ A F + +H K YAT E +RL F+ N + D SG G LN F
Sbjct: 35 YEAQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAF 94
Query: 92 SDLSTAEFQAKYLGFKL-------KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+DL+ EF+A LG PS +D + + P A DWR+ AVT VKDQ
Sbjct: 95 ADLTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAV--PDALDWRQSGAVTKVKDQ 152
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSK 203
CG+ W+FS TG +EG+ T L+SLSEQELIDCD+ + GC GG ++ A+ ++
Sbjct: 153 GSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKN 212
Query: 204 LGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI- 261
GG++ E YP+R D C NK K V I+GY V + D+ V P++V I
Sbjct: 213 --GGIDTEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGIC 270
Query: 262 -NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
+A A Q Y G+ F +L H+VLIVGYG + K YWI+KNSWGE
Sbjct: 271 GSARAFQLYSQGI------FDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNSWGE 318
Query: 321 GWGEKGYFRLYRGDGS----CGIN 340
WG KGY ++R GS CGIN
Sbjct: 319 RWGMKGYMHMHRNTGSSSGICGIN 342
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 123/337 (36%), Positives = 171/337 (50%), Gaps = 29/337 (8%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL---LQDTEHGS 83
D L + V L+ F H + Y E R +F NL+KI++ L S
Sbjct: 29 DTILRFPNQVPFEKLWQDFKTVHERNYGE-TEEMQRKEVFRNNLKKIEMHNYLHSQGKSS 87
Query: 84 GVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS------VPAMIPNITLPRAFDWREYDA 137
G+N+F+D+ EF + GF++ R + IP ++LP DWR+
Sbjct: 88 YRMGINQFADMEVKEFASVVNGFRMNNRTKVRDHLHSHYISPAIP-VSLPAEVDWRKEGY 146
Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISN 195
VT +KDQ CGS W+FSTTG +EG + KT KLVSLSEQ LIDC ++GC GG +
Sbjct: 147 VTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDY 206
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVEN 254
AF I K G + E +YPY D CR K+ GY + + DE M + +
Sbjct: 207 AFQYI--KDNDGDDTEDSYPYEAADGPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMV 264
Query: 255 GPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
GP++VAI+A + Q Y +GV ++ CD E L H VL+VGYG T YW
Sbjct: 265 GPVSVAIDASHTSFQMYQSGVYDEVE--CDP--EGLDHGVLVVGYG------TELGQDYW 314
Query: 313 IIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
++KNSWG WG++GY ++ R + CGI+ LV
Sbjct: 315 LVKNSWGTKWGDEGYIKMSRNKNNQCGISSMASYPLV 351
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 160/303 (52%), Gaps = 25/303 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF K+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + PSY S + + + +P DWRE AVT VK+Q CG WAFS G
Sbjct: 102 GINI-PSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVG 160
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y+
Sbjct: 161 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISSESDYEYQ 218
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 219 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 275
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 276 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 325
Query: 336 SCG 338
+ G
Sbjct: 326 NPG 328
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 167/324 (51%), Gaps = 37/324 (11%)
Query: 38 HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG------LNEF 91
+ A F + +H K YAT E +RL F+ N + D SG G LN F
Sbjct: 35 YEAQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAF 94
Query: 92 SDLSTAEFQAKYLGFKL-------KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+DL+ EF+A LG PS +D + + P A DWR+ AVT VKDQ
Sbjct: 95 ADLTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAV--PDALDWRQSGAVTKVKDQ 152
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSK 203
CG+ W+FS TG +EG+ T L+SLSEQELIDCD+ + GC GG ++ A+ ++
Sbjct: 153 GSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKN 212
Query: 204 LGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI- 261
GG++ E YP+R D C NK K V I+GY V + D+ V P++V I
Sbjct: 213 --GGIDTEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGIC 270
Query: 262 -NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
+A A Q Y G+ F +L H+VLIVGYG + K YWI+KNSWGE
Sbjct: 271 GSARAFQLYSQGI------FDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNSWGE 318
Query: 321 GWGEKGYFRLYRGDGS----CGIN 340
WG KGY ++R GS CGIN
Sbjct: 319 RWGMKGYMHMHRNTGSSSGICGIN 342
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 159/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y+
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYQ 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 127/344 (36%), Positives = 175/344 (50%), Gaps = 40/344 (11%)
Query: 7 FAGVALLSLTVSVS-SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
FA A L+ + ++S S MVV E+ ++ Q+ + Y T E R +I
Sbjct: 16 FATSAYLATSRTLSDSLMVVRHEQ---------------WMAQYGRVYKTEAEKTKRFNI 60
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT 125
F N+ I+ G+N F+DL+ EF+A G+KL P + P N++
Sbjct: 61 FKENVEYIESFNKAGTKPYKLGINAFADLTNQEFKASRNGYKL-PHDCSSNTPFRYENVS 119
Query: 126 -LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
+P DWR AVT VKDQ CG WAFS +EG+ T L+SLSEQEL+DCD +
Sbjct: 120 SVPTTVDWRTKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVK 179
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVS 241
D GCEGG + +AF I++ GL E YPY+G D +C + + KI+GY V
Sbjct: 180 GTDQGCEGGLMDDAFSFIINNK--GLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVP 237
Query: 242 RDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ + V N P++VAI+A QFY +GV F + G E L H V VGYG+
Sbjct: 238 ANSESALEKAVANQPVSVAIDAGGSDFQFYSSGV-----FTGECGTE-LDHGVTAVGYGI 291
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YW++KNSWG WGEKGY R+ + +G CGI
Sbjct: 292 -----AEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGI 330
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 122/325 (37%), Positives = 170/325 (52%), Gaps = 30/325 (9%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLST 96
A ++ F +H K+Y + E RL I+ N KI + G Y +NEF D+
Sbjct: 25 AEWSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLH 84
Query: 97 AEFQAKYLGFKLKPSYADRSV-------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
EF + GFK +Y D+ P I + +LP+ DWR AVT VK+Q CGS
Sbjct: 85 HEFVSTRNGFKR--NYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGS 142
Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGG 207
WAFS TG++EG + K+ +VSLSEQ L+ C + ++GCEGG + +AF I + G
Sbjct: 143 CWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYI--RANKG 200
Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY-- 264
++ EK+YPY G D C K +G+V + ET + K + GP++VAI+A
Sbjct: 201 IDTEKSYPYNGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHE 260
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
+ QFY GV + CD +E+L H VL+VGYG T YW +KNSWG WG+
Sbjct: 261 SFQFYSDGVYDEPE--CD--SESLDHGVLVVGYG------TLNGTDYWFVKNSWGTTWGD 310
Query: 325 KGYFRLYRG-DGSCGINDYVRSALV 348
+GY R+ R CGI LV
Sbjct: 311 EGYIRMSRNKKNQCGIASSASIPLV 335
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 177/346 (51%), Gaps = 34/346 (9%)
Query: 4 FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
FF+ + +LSL + + + + ++++ A++ +L + K+Y +L E R
Sbjct: 12 LLFFSTLLILSLALDIENSVQRTNDQV--------MAMYESWLVEQGKSYNSLDEKEMRF 63
Query: 64 HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN 123
IF NLR I + S GLN F+DL+ E+++ YLG K+ P D S M P
Sbjct: 64 EIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPK-TDVSNEYM-PK 121
Query: 124 I--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
+ LP DWR AV GVK+Q +C S WAFS +EG+ T L+SLSEQEL+DC
Sbjct: 122 VGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDC 181
Query: 182 --DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYV 238
Q GC G +++AF I++ GG+ E YPY D C L+ K + V I+ Y
Sbjct: 182 GRTQRTKGCNRGLMTDAFQFIINN--GGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYK 239
Query: 239 SVSRDETDMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLSHSVLIVG 296
+V + K V P++V + + +F Y +G+ FC + H V IVG
Sbjct: 240 NVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGI---FTGFCGTA---VDHGVTIVG 293
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR---GDGSCGI 339
YG +R + YWI+KNSWG WGE GY R+ R G G CGI
Sbjct: 294 YGTER------GMDYWIVKNSWGTNWGENGYIRIQRNIGGAGKCGI 333
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/306 (38%), Positives = 163/306 (53%), Gaps = 20/306 (6%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
HNK Y+ E R I+ N+ +I +++ + + +N F D++ EF+AK G L
Sbjct: 34 HNKAYSHESEENVRYAIWKDNMNRITEY-NSKSKNVILRMNHFGDMTNTEFRAKMNGLLL 92
Query: 109 KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK 168
+ + S + + P A DWR VT VK+Q CGS WAFS+TG +EG + KT
Sbjct: 93 H-KHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTG 151
Query: 169 KLVSLSEQELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN 226
+LVSLSEQ L+DC D ++GC GG + NAF I K GG++ E YPY G D CR +
Sbjct: 152 RLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYI--KANGGIDTETGYPYEGQDGTCRYS 209
Query: 227 KKATQVKINGYVSVSRDETDMAKYLVEN-GPMAVAINA--YALQFYVTGVSHPIQFFCDG 283
K + G+V + + D K V GP++VAI+A + QFY +GV Q C
Sbjct: 210 KSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQ--CS- 266
Query: 284 GNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD-GSCGINDY 342
L H VL+VGYG D K YW++KNSWG GWG +GY + R + CGI
Sbjct: 267 -PSALDHGVLVVGYGTDNGK------DYWLVKNSWGTGWGTEGYIYMSRNNQNQCGIASK 319
Query: 343 VRSALV 348
LV
Sbjct: 320 ASYPLV 325
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 159/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC+GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + I + +P DWRE AVT VK+Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I + GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--RENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG N ++H+V +GYG D YW++KNSWG WGEKG+ ++ R G
Sbjct: 277 -----DGSCANRINHAVTAIGYGTD-----ENGQKYWLLKNSWGTSWGEKGFMKIIRDYG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 125/351 (35%), Positives = 186/351 (52%), Gaps = 40/351 (11%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHT---------ALFNYFLEQHNKTYA--TLVE 58
+A+++++ +V ++ DEK H V T +++ +L +H K + +LVE
Sbjct: 13 LAMVTVSSAVDMSIISYDEK----HGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVE 68
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
R IF NLR + + ++ S GL F+DL+ E+++KYLG K++ R+
Sbjct: 69 KDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSL 127
Query: 119 AMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ LP + DWR+ AV VKDQ CGS WAFST G +EG+ T L++LSEQ
Sbjct: 128 RYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 187
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
EL+DCD ++GC GG + AF+ I+ GG++ +K YPY+G D C ++ K A V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+ Y V + K V + P+++AI A A Q Y +G+ F L H V
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI------FDGSCGTQLDHGV 299
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
+ VGYG + K YWI++NSWG+ WGE GY R+ R G CGI
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/307 (38%), Positives = 159/307 (51%), Gaps = 25/307 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH-GSGVYGLNEFSDLSTAEFQAKY 103
++ Q+ K Y E +R IF N+ I+ + + S G+N+F+DL+ EF A
Sbjct: 42 WMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFADLTNEEFIASR 101
Query: 104 LGFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
FK S R+ N++ +P DWR+ AVT VK+Q CG WAFS EG
Sbjct: 102 NKFKGHMCSSIMRTTSFKYENVSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEG 161
Query: 162 VYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
++ T KL+SLSEQEL+DCD + D GCEGG + +AF I+ GL E YPY G
Sbjct: 162 IHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH--GLSTEAQYPYEGV 219
Query: 220 DKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
D C NK + Q V I GY V + + V N P++VAI+A QFY +GV
Sbjct: 220 DGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGV--- 276
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG--- 333
F L H V VGYGV ++ YW++KNSWG WGE+GY + RG
Sbjct: 277 ---FTGACGTELDHGVTAVGYGV-----SNDGTKYWLVKNSWGTDWGEEGYIMMQRGIEA 328
Query: 334 -DGSCGI 339
+G CGI
Sbjct: 329 AEGICGI 335
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 163/310 (52%), Gaps = 25/310 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC+GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFI--KENGGISSESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S-CGINDYVR 344
+ G+ D +
Sbjct: 327 NPAGLCDIAK 336
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 154/303 (50%), Gaps = 23/303 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
+ +++ K Y E RL IF N+ I+ + +N +D + EF A +
Sbjct: 43 WTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNEEFVASHN 102
Query: 105 GFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
G+K K S++ P NIT +P A DWRE AV +KDQ CG+ WAFST EG+Y
Sbjct: 103 GYKHKGSHS--QTPFKYENITGVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIY 160
Query: 164 AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
T L+SLSEQEL+DCD D GC+GG + F+ I GG+ E YPY D
Sbjct: 161 QITTSMLMSLSEQELVDCDSVDHGCDGGYMEGGFEFIXKN--GGISSEANYPYTAVDGTY 218
Query: 224 RLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFF 280
NK+A+ +I GY +V + D + V N P++V I+ A QF +GV F
Sbjct: 219 DANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGV------F 272
Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGS 336
L H V VGYG T YWI+KNSWG WGE+GY R+ RG +G
Sbjct: 273 TGQCGTQLDHGVTAVGYGS-----TDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGL 327
Query: 337 CGI 339
CGI
Sbjct: 328 CGI 330
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 164/314 (52%), Gaps = 31/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ +H+ TY + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 41 MYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQ-HNAAADAGVHSFRLGLNRFADLTN 99
Query: 97 AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
E+++ YLG + KP +R + A N LP + DWR+ AV VKDQ CGS WAF
Sbjct: 100 EEYRSTYLGARTKPDR-ERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAF 158
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
S +EG+ T ++ LSEQEL+DCD + GC GG + AF+ I++ GG++ E+
Sbjct: 159 SAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDSEE 216
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
YPY+ D C NKK A V I+GY V + + V N P++VAI A A Q Y
Sbjct: 217 DYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLY 276
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+G+ F L H V VGYG + K YW+++NSWG WGE GY R
Sbjct: 277 KSGI------FTGTCGTALDHGVAAVGYGTENGK------DYWLVRNSWGSVWGENGYIR 324
Query: 330 LYRG----DGSCGI 339
+ R G CGI
Sbjct: 325 MERNIKASSGKCGI 338
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 123/353 (34%), Positives = 176/353 (49%), Gaps = 30/353 (8%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
+ +TV ++ + G L+ +H +LF + K Y T+ E ++ + N K
Sbjct: 1 MKVTVLLAVVLFAGCCSAMQLNQ-QHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNK 59
Query: 73 I---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-------- 121
I + + S +NE+ DL++ EF + G++ +S
Sbjct: 60 ISEHNMQYSLKQKSYRLEMNEYGDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNLLSFG 119
Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
I LP DWR++ VT VK+Q CGS W+FS TG++EG + KT KLVSLSEQ LIDC
Sbjct: 120 SQIQLPTLVDWRKHGLVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDC 179
Query: 182 D--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
+ +DGC GG + AF I K+ GG++ E YPY D CR N + G+V
Sbjct: 180 STPEGNDGCNGGLMDQAFKYI--KIQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVD 237
Query: 240 VSRDETDMAKYLVEN-GPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
+ + +M K GP++VAI+A + QFY GV + C + L H VL+VG
Sbjct: 238 IKSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYS--ETACS--STMLDHGVLVVG 293
Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
YG + K YW++KNSWGEGWGE GY ++ R D CGI LV
Sbjct: 294 YGTENGK------DYWLVKNSWGEGWGEAGYIKMSRNADNQCGIATQASYPLV 340
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 128/345 (37%), Positives = 177/345 (51%), Gaps = 31/345 (8%)
Query: 12 LLSLTVSVSSFMVVGDE-KLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
LL + +S++ +VV + H +L++ + H+ L E R ++F
Sbjct: 6 LLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVSRNLNEKQKRFNVFKS 65
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPN 123
N+ + + + LN+F+D++ EF+ Y G K+ R P M N
Sbjct: 66 NVMHVHNTNKMDKPYKLK-LNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSGTFMYEN 124
Query: 124 IT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
T P + DWR+ AVT VKDQ CGS WAFST +EG+ KT +LV LSEQELIDCD
Sbjct: 125 FTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCD 184
Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSV 240
QE+ GC GG + AF+ I K GG+ E YPY +D +C K+ V I+G+ +V
Sbjct: 185 NQENQGCNGGLMEYAFEYIKQK--GGVTTESYYPYTANDGSCDATKENVPTVSIDGHETV 242
Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
++ D V N P++VAI+A QFY GV F D G E L+H V IVGYG
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCGKE-LNHGVAIVGYG 296
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
T YWI++NSWG WGE+G R+ R +G CGI
Sbjct: 297 T-----TVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGI 336
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 115/308 (37%), Positives = 162/308 (52%), Gaps = 23/308 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
++ ++++H K Y + EY R IF N+ I + S GLN+F+DL+ +EF+
Sbjct: 37 VYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFR 96
Query: 101 AKYLGFKLKPS-YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
Y+G +P+ + + A++ + + DWR+ VT +KDQ CGS WAFS +
Sbjct: 97 GLYVGRLQRPAPFHEVGDIALVADTAT--SVDWRKKGGVTEIKDQGDCGSCWAFSAVAAV 154
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
EG+ T LVSLSEQEL+DCD + GC+GG + AF ++ GG+ + YPYR
Sbjct: 155 EGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRN--GGITSQSNYPYRA 212
Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSH 275
AC +K K ING+ ++ ++ V N P++VAI A Q Y +GV
Sbjct: 213 LRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSSGV-- 270
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--- 332
F NL H V IVGYG D YW++KNSWG GWGE GY R+ R
Sbjct: 271 ----FTGECGSNLDHGVAIVGYGTDA-----GGRQYWLVKNSWGSGWGESGYVRMERQGP 321
Query: 333 GDGSCGIN 340
G G CGIN
Sbjct: 322 GAGVCGIN 329
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 125/351 (35%), Positives = 186/351 (52%), Gaps = 40/351 (11%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHT---------ALFNYFLEQHNKTYA--TLVE 58
+A+++++ +V ++ DEK H V T +++ +L +H K + +LVE
Sbjct: 13 LAMVAVSSAVDMSIISYDEK----HGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVE 68
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
R IF NLR + + ++ S GL F+DL+ E+++KYLG K++ R+
Sbjct: 69 KDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSL 127
Query: 119 AMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ LP + DWR+ AV VKDQ CGS WAFST G +EG+ T L++LSEQ
Sbjct: 128 RYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 187
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
EL+DCD ++GC GG + AF+ I+ GG++ +K YPY+G D C ++ K A V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+ Y V + K V + P+++AI A A Q Y +G+ F L H V
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI------FDGSCGTQLDHGV 299
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
+ VGYG + K YWI++NSWG+ WGE GY R+ R G CGI
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 160/301 (53%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S +I +++ +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|2352469|gb|AAC00067.1| cysteine protease [Trypanosoma cruzi]
Length = 471
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 118/344 (34%), Positives = 173/344 (50%), Gaps = 22/344 (6%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
V L ++ V ++ + LH + T+ F F ++H + Y + L +F N
Sbjct: 8 VLLAAVLVVMACLVPAATASLHAEETL--TSQFAEFKQKHGRVYESAARRLP-LSVFREN 64
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITLP 127
L + L + +G+ FSDL+ EF+++Y G + +R+ VP + + P
Sbjct: 65 LF-LARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVKVEVVGAP 123
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D G
Sbjct: 124 AAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDFG 183
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDE 244
C GG ++NAF+ I+ + G + E +YPY G C + I G+V + +DE
Sbjct: 184 CSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDE 243
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+A + NGP+AVA++A + Y GV +E L H VL+VGY
Sbjct: 244 AQIAACVAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN------ 291
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYWIIKNSW GE+GY R+ +G C + + SA+V
Sbjct: 292 DSAAVPYWIIKNSWTTQ-GEEGYIRIAKGSNQCLVKEEASSAVV 334
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 160/305 (52%), Gaps = 26/305 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ ++ K Y +E R IF N+ I+ ++ +N +DL+ EF+A
Sbjct: 43 WMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKASRN 102
Query: 105 GFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
G+K + +A S N+T +P A DWR AVT +KDQ CGS WAFST IEG+
Sbjct: 103 GYKKIDREFATTSFK--YENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGI 160
Query: 163 YAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
T KL+SLSEQEL+DCD ED GCEGG + + F+ I+ GG+ E YPY+ D
Sbjct: 161 NQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKN--GGITSETNYPYKAAD 218
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
+C A KI GY V + V N P++V+I+A + FY +G+
Sbjct: 219 GSCNTATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYSSGI----- 273
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
+ + G E L H V VGYG + YWI+KNSWG WGEKGY R+ RG +
Sbjct: 274 YTGECGTE-LDHGVTAVGYG------SANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE 326
Query: 335 GSCGI 339
G CGI
Sbjct: 327 GLCGI 331
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 125/351 (35%), Positives = 186/351 (52%), Gaps = 40/351 (11%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHT---------ALFNYFLEQHNKTYA--TLVE 58
+A+++++ +V ++ DEK H V T +++ +L +H K + +LVE
Sbjct: 13 LAMVAVSSAVDMSIISYDEK----HGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVE 68
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
R IF NLR + + ++ S GL F+DL+ E+++KYLG K++ R+
Sbjct: 69 KDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSL 127
Query: 119 AMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ LP + DWR+ AV VKDQ CGS WAFST G +EG+ T L++LSEQ
Sbjct: 128 RYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 187
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
EL+DCD ++GC GG + AF+ I+ GG++ +K YPY+G D C ++ K A V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+ Y V + K V + P+++AI A A Q Y +G+ F L H V
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI------FDGSCGTQLDHGV 299
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
+ VGYG + K YWI++NSWG+ WGE GY R+ R G CGI
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 111/291 (38%), Positives = 154/291 (52%), Gaps = 24/291 (8%)
Query: 58 EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
E +R H+F N++ I + + + LN+F DL+ +EF Y K+ + S
Sbjct: 59 EKQNRFHVFKENVKYINEVNKMDKPYKLR-LNQFGDLTPSEFARTYANSKIIEGTRNESG 117
Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
M N+ +PR+ DWR AVT VK+Q CG WAFS +EG+ T +L+SLSEQ+
Sbjct: 118 GFMYENVEVPRSIDWRVKGAVTPVKNQGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQ 177
Query: 178 LIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKING 236
LIDCD ++ GC GG++ AF+ I + GG+ E YPY+ C+ N + V I+G
Sbjct: 178 LIDCDTQNSGCRGGTMGRAFEYIKQR--GGITSEANYPYKAQAGMCKNNLIQRPTVSIDG 235
Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQ-----FYVTGVSHPIQFFCDGGNENLSHS 291
Y ++ R E + K L P++VA++A FY GV F L+H
Sbjct: 236 YYNIRRSEDAVLKILAHQ-PVSVAVDATTWSSLDWMFYFQGV------FTGPCGTKLNHG 288
Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD---GSCGI 339
V VGYG T+ YWIIKNSWGE WGE+GY R+ RG G CGI
Sbjct: 289 VTAVGYGT-----TNDGYDYWIIKNSWGETWGERGYMRMLRGVSPYGLCGI 334
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 169/314 (53%), Gaps = 25/314 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLS 95
T +F + E+H K Y E R+ F NL+ I + ++ + SG+ GLN+F+DLS
Sbjct: 47 TEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYI-IEKNGKRKSGLEHKVGLNKFADLS 105
Query: 96 TAEFQAKYLGFKLKP-SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
EF+ YL KP + ++ + P + DWR VT VKDQ CGS W+FS
Sbjct: 106 NEEFREMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWSFS 165
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKT 213
TTG IE + A T L+SLSEQEL+DCD ++ GCEGG + +AF ++ GG++ E
Sbjct: 166 TTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGN--GGIDTEAD 223
Query: 214 YPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFYV 270
YPY G D C K+ + V I GYV V ++ + V+ P++V ++ AL Q Y
Sbjct: 224 YPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQ-PISVGMDGSALDFQLYT 282
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
G+ C G ++ H++LIVGYG + + YWI+KNSWG WG +GYF +
Sbjct: 283 GGI---YDGDCSGDPNDIDHAILIVGYGSENDE------DYWIVKNSWGTEWGMEGYFYI 333
Query: 331 YRGD----GSCGIN 340
R G C IN
Sbjct: 334 RRNTSKPYGVCAIN 347
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 128/345 (37%), Positives = 177/345 (51%), Gaps = 31/345 (8%)
Query: 12 LLSLTVSVSSFMVVGDE-KLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
LL + +S++ +VV + H +L++ + H+ L E R ++F
Sbjct: 6 LLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVSRNLNEKQKRFNVFKS 65
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPN 123
N+ + + + LN+F+D++ EF+ Y G K+ R P M N
Sbjct: 66 NVMHVHNTNKMDKPYKLK-LNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFMYEN 124
Query: 124 IT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
T P + DWR+ AVT VKDQ CGS WAFST +EG+ KT +LV LSEQELIDCD
Sbjct: 125 FTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCD 184
Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSV 240
QE+ GC GG + AF+ I K GG+ E YPY +D +C K+ V I+G+ +V
Sbjct: 185 NQENQGCNGGLMEYAFEYIKQK--GGVTTESYYPYTANDGSCDATKENVPTVSIDGHETV 242
Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
++ D V N P++VAI+A QFY GV F D G E L+H V IVGYG
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCGKE-LNHGVAIVGYG 296
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
T YWI++NSWG WGE+G R+ R +G CGI
Sbjct: 297 T-----TVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGI 336
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 125/337 (37%), Positives = 171/337 (50%), Gaps = 34/337 (10%)
Query: 15 LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
+ V+S + D ++ H ++ + K Y L E +RL IF N+ I+
Sbjct: 22 FAIQVTSRTLQDDSNIYEKHE--------QWMVHYGKVYKDLQERENRLKIFKENVNYIE 73
Query: 75 LLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPNITLPRAFD 131
+ + +Y G+N+F+DL+ EF A FK S ++ N ++P D
Sbjct: 74 ASNNAGNNK-LYKLGINQFADLTNEEFIASRNKFKGHMCSSITKTSTFKYENASVPSTVD 132
Query: 132 WREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCE 189
WR+ AVT VK+Q CG WAFS EG++ T KLVSLSEQEL+DCD + D GCE
Sbjct: 133 WRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCE 192
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMA 248
GG + +AF I+ GL E YPY+G D C NK + V I GY V +
Sbjct: 193 GGLMDDAFKFIIQNH--GLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQAL 250
Query: 249 KYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
+ V N P++VAI+A QFY +GV F G E L H V VGYGV +
Sbjct: 251 QKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHGVTAVGYGVG-----N 299
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YW++KNSWG WGE+GY ++ RG +G CGI
Sbjct: 300 DGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGI 336
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 127/344 (36%), Positives = 175/344 (50%), Gaps = 40/344 (11%)
Query: 7 FAGVALLSLTVSVS-SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
FA A L+ + ++S S MVV E+ ++ Q+ + Y VE R +I
Sbjct: 18 FATSAYLATSRTLSDSLMVVRHEQ---------------WMAQYGRVYENEVEKTKRFNI 62
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT 125
F N+ I+ G+N F+DL+ EF+A G+KL P + P N++
Sbjct: 63 FKENVEYIESFNKAGTKPYKLGINAFADLTNQEFKASRNGYKL-PHDCSSNTPFRYENVS 121
Query: 126 -LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
+P DWR AVT VKDQ CG WAFS +EG+ T L+SLSEQEL+DCD +
Sbjct: 122 SVPTTVDWRTKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVK 181
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVS 241
D GCEGG + +AF I++ GL E YPY+G D +C + + KI+GY V
Sbjct: 182 GIDQGCEGGLMDDAFSFIINNK--GLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVP 239
Query: 242 RDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+ + V N P++VAI+A QFY +GV F + G E L H V VGYG+
Sbjct: 240 ANSESALEKAVANQPVSVAIDAGGSDFQFYSSGV-----FTGECGTE-LDHGVTAVGYGI 293
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
YW++KNSWG WGEKGY R+ + +G CGI
Sbjct: 294 -----AEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGI 332
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 120/299 (40%), Positives = 158/299 (52%), Gaps = 20/299 (6%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
K LF ++ +H K Y ++ E R IF NL+ I + GLNEF+DLS
Sbjct: 42 KLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWL-GLNEFADLSH 100
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF+ KYLG K+ S S ++ LP++ DWR+ AV VK+Q CGS WAFST
Sbjct: 101 QEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFST 160
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
+EG+ T L SLSEQELIDCD+ +GC GG + AF I+ GGL +E+ Y
Sbjct: 161 VAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVEN--GGLHKEEDY 218
Query: 215 PYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVT 271
PY ++ C + K+ T+ V I+GY V ++ + N ++VAI A QFY
Sbjct: 219 PYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYSG 278
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GV F +L H V VGYG T K V Y I+KNSWG WGEKGY R+
Sbjct: 279 GV------FDGHCGSDLDHGVAAVGYG------TAKGVDYIIVKNSWGSKWGEKGYIRM 325
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 126/347 (36%), Positives = 180/347 (51%), Gaps = 41/347 (11%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+AL+++T +VS +V +E +N F +H K YA E R+ IF+ N
Sbjct: 10 IALVAMTQAVSYSELVREE-------------WNTFKLEHRKNYADSTEETFRMKIFNEN 56
Query: 70 LRKI-QLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT- 125
I + Q G Y LN+++D+ EF+ GF RS +T
Sbjct: 57 KHHIAKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTF 116
Query: 126 -------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
LP A DWR AVT VKDQ CGS WAFS+TG IEG + K+ LVSLSEQ L
Sbjct: 117 ISPEHVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNL 176
Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
+DC + ++GC GG + NAF + K GG++ EK+Y Y G D +C +K + G
Sbjct: 177 VDCSTKYGNNGCNGGLMDNAFRYV--KDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRG 234
Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
+ + + +E +A+ + GP++VAI+A + QFY GV C ENL H VL
Sbjct: 235 FADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPN--CSA--ENLDHGVL 290
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
+VGYG ++ YW++KNSWG WG+KG+ ++ R + CGI
Sbjct: 291 VVGYGTEK-----DGSDYWLVKNSWGTTWGDKGFIKMSRNKENQCGI 332
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 161/317 (50%), Gaps = 28/317 (8%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
LH ++ ++ K Y E R IF N+ I+ + G+N +
Sbjct: 29 LHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLA 88
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
DL+ EF+A GFK ++ + N+T +P A DWR AVT +KDQ CGS W
Sbjct: 89 DLTVEEFKASRNGFKRPHEFSTTTF--KYENVTAIPAAIDWRTKGAVTPIKDQGQCGSCW 146
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
AFST EG++ T KLVSLSEQEL+DCD + D GCEGG + + F+ I+ GG+
Sbjct: 147 AFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKN--GGIT 204
Query: 210 EEKTYPYRGDDKACRLNKKATQV-KINGYVSVSRDETDMAKYLVENGPMAVAINA--YAL 266
E YPY+ D C NK + V +I GY V + + V N P++V+I+A
Sbjct: 205 SETNYPYKAVDGKC--NKATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADGAGF 262
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
FY +G+ + + G E L H V VGYG T YWI+KNSWG WGEKG
Sbjct: 263 MFYSSGI-----YNGECGTE-LDHGVTAVGYG------TANGTDYWIVKNSWGTQWGEKG 310
Query: 327 YFRLYRG----DGSCGI 339
Y R+ RG G CGI
Sbjct: 311 YVRMQRGIAAKHGLCGI 327
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 115/317 (36%), Positives = 161/317 (50%), Gaps = 23/317 (7%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
L V ++ Q+ K Y E R +IF N+++I+ + + G+N+F+
Sbjct: 30 LEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIEAFNNAGNKPYKLGINQFA 89
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
DL+ EF+A+ S + R+ +++ +P + DWR+ AVT +KDQ CG W
Sbjct: 90 DLTNEEFKARNRFKGHMCSNSTRTPTFKYEDVSSVPASLDWRQKGAVTPIKDQGQCGCCW 149
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
AFS EG+ T KL+SLSEQEL+DCD + D GCEGG + +AF IM GL
Sbjct: 150 AFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQN--KGLN 207
Query: 210 EEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--L 266
E YPY+G D C N +A I G+ V + V N P++VAI+A
Sbjct: 208 TEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEF 267
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY +G+ F L H V VGYGV + YW++KNSWGE WGE+G
Sbjct: 268 QFYSSGL------FTGSCGTELDHGVTAVGYGV-----SDDGTKYWLVKNSWGEQWGEEG 316
Query: 327 YFRLYRG----DGSCGI 339
Y R+ R +G CGI
Sbjct: 317 YIRMQRDVAAEEGLCGI 333
>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 126/331 (38%), Positives = 172/331 (51%), Gaps = 32/331 (9%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
+ H + + QH K Y T E YSR IF N KI EH S
Sbjct: 18 LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72
Query: 88 LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
+N+F D+ EF + +G LK V N TLP++ DWR V+ VKDQ
Sbjct: 73 MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDSDDNGTLPKSVDWRNSHMVSEVKDQ 132
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
CG WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++ + GC GG + AF I +
Sbjct: 133 GECGPCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIPA 192
Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
GGL+ E++YPY DDK C+ + + + GY V S +E + + + GP++VA
Sbjct: 193 N--GGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250
Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A + QFY +GV Q C E L H VL VGYG +H+A +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303
Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
G WG++GY + R + CGI LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 122/312 (39%), Positives = 170/312 (54%), Gaps = 22/312 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
H+K Y E + R+ ++ NL+KI+L + EH G + G+N F D++ EF+
Sbjct: 35 HSKKYHEKEEGWRRM-VWEKNLKKIEL-HNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMN 92
Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
G+K K R + PN + P++ DWR+ VT VKDQ CGS WAFSTTG +EG +
Sbjct: 93 GYKRKAETKARGSLFLEPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQH 152
Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
KT KLVSLSEQ L+DC + + +GC GG + AF + K GL+ E +YPY G DD
Sbjct: 153 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV--KDNQGLDSEDSYPYLGTDD 210
Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
+ C + V G+V + S E + K + GP++VAI+A + QFY +G I
Sbjct: 211 QPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG----I 266
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
+ + +E L H VL+VGYG K YWI+KNSW E WG+KGY + +
Sbjct: 267 YYEKECSSEELDHGVLVVGYGFQGEDVDGKK--YWIVKNSWSEKWGDKGYIYMAKDRKNH 324
Query: 337 CGINDYVRSALV 348
CGI LV
Sbjct: 325 CGIATAASYPLV 336
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 166/316 (52%), Gaps = 22/316 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVY--GLNEFSDLSTAEFQA 101
F HNK Y + VE R+ I+ N RKI + + E Y G+N++ D+ EF
Sbjct: 32 FKLHHNKVYKSPVEEGYRMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVN 91
Query: 102 KYLGFK--LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
GF + V + P N+ LP DW + AVT VKDQ CGS WAFS+TG
Sbjct: 92 TLNGFNKSVTAGIETEGVTFISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGA 151
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
+EG + T LVSLSEQ LIDC + ++GC GG + AF I K GL+ EKTYPY
Sbjct: 152 LEGQHFRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFQYI--KDNKGLDTEKTYPY 209
Query: 217 RGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGV 273
++ CR N + + GYV + + DE + + GP++VAI+A + Q Y GV
Sbjct: 210 EAENDRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGV 269
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+ D ENL H VLIVGYG D T YW++KNSWG+ WG+KGY ++ R
Sbjct: 270 ----YYDPDCSAENLDHGVLIVGYGTDET----SGHDYWLVKNSWGKTWGQKGYIKMARN 321
Query: 334 -DGSCGINDYVRSALV 348
+ CGI LV
Sbjct: 322 KNNHCGIASSASYPLV 337
>gi|166235890|ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]
gi|341940309|sp|P49935.2|CATH_MOUSE RecName: Full=Pro-cathepsin H; AltName: Full=Cathepsin B3; AltName:
Full=Cathepsin BA; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|74151776|dbj|BAE29677.1| unnamed protein product [Mus musculus]
gi|74181999|dbj|BAE34071.1| unnamed protein product [Mus musculus]
gi|74211659|dbj|BAE29188.1| unnamed protein product [Mus musculus]
gi|74213518|dbj|BAE35569.1| unnamed protein product [Mus musculus]
gi|148688954|gb|EDL20901.1| cathepsin H, isoform CRA_b [Mus musculus]
Length = 333
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 178/339 (52%), Gaps = 33/339 (9%)
Query: 8 AGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFS 67
AG LLS T + + V EK H F +++QH KTY++ VEY RL +F+
Sbjct: 10 AGAWLLS-TGATAELTVNAIEKFH----------FKSWMKQHQKTYSS-VEYNHRLQMFA 57
Query: 68 GNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS--VPAMIPNIT 125
N RKIQ H + LN+FSD+S AE + K+L + + A +S + P
Sbjct: 58 NNWRKIQAHNQRNHTFKM-ALNQFSDMSFAEIKHKFLWSEPQNCSATKSNYLRGTGP--- 113
Query: 126 LPRAFDWREY-DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ- 183
P + DWR+ + V+ VK+Q CGS W FSTTG +E A + K++SL+EQ+L+DC Q
Sbjct: 114 YPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQA 173
Query: 184 -EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS- 241
+ GC+GG S AF+ I+ G+ EE +YPY G D +CR N + + V+++
Sbjct: 174 FNNHGCKGGLPSQAFEYIL--YNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITL 231
Query: 242 RDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
DE M + + P++ A Y +GV C + ++H+VL VGYG
Sbjct: 232 NDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS--CHKTPDKVNHAVLAVGYG-- 287
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
+ YWI+KNSWG WGE GYF + RG CG+
Sbjct: 288 ----EQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 159/300 (53%), Gaps = 24/300 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S +I +++ +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 114/309 (36%), Positives = 179/309 (57%), Gaps = 24/309 (7%)
Query: 37 KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
++ F ++ +H K+Y T E+ SR +F N+ I + + + + GLN +DL+
Sbjct: 27 QYQTAFQNWMVKHQKSY-TNDEFGSRYSVFQDNM-DIVAKWNQKGSNTILGLNVMADLTN 84
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
EF+ YLG K +Y +++ + LP + DWR AVT VK+Q CG +AFSTT
Sbjct: 85 EEFKKLYLGTKANVTYKKKTLVGVSG---LPASVDWRANGAVTAVKNQGQCGGCYAFSTT 141
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G++EG++ +++LV LSEQ+++DC + ++GC+GG ++N+F+ I++ GGL+ E +Y
Sbjct: 142 GSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAV--GGLDTEASY 199
Query: 215 PYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
PY G+ C+ NKK I GY +V S E+D+ + V P++VAI+A + Q Y +
Sbjct: 200 PYTGEVGKCKFNKKNIGATITGYKNVESGSESDL-QTAVAAQPVSVAIDASQSSFQLYAS 258
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV + + C + L H VL VGYG + YWI+KNSWG WGE G+ +
Sbjct: 259 GVYYEPE--CS--STQLDHGVLAVGYG------SQSGQDYWIVKNSWGADWGENGFILMA 308
Query: 332 RG-DGSCGI 339
R D +CGI
Sbjct: 309 RNKDNNCGI 317
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYKVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|394331830|gb|AFN27134.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 170/318 (53%), Gaps = 22/318 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
ALF F + + Y T+ E RL F NL ++ Q + +G+ +F DLS AEF
Sbjct: 36 ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94
Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
A+YL F +A + +++ +P A DWR+ A+T VK+Q CGS WAFS
Sbjct: 95 AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGALTPVKNQGACGSCWAFS 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G+I+ +A +L +LSEQ+L+ C +D+GC G + AF ++ + G + E +Y
Sbjct: 155 AVGSIQSQWALAGHRLTALSEQQLVSCHDKDNGCPGRLMLQAFVGVLQNMNGTMFTEDSY 214
Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
PY ++Q+ +I+GY+++ T MA L +NGP+++A++A + Y
Sbjct: 215 PYVSSTGYVPECSNSSQLVPGARIDGYMTMESSGTVMAACLAKNGPISIAVDASSFMSYQ 274
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+GV L+H VL+VGY +RT VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGMPLNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322
Query: 331 YRGDGSCGINDYVRSALV 348
G +C + +Y SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340
>gi|114596533|ref|XP_517502.2| PREDICTED: cathepsin O [Pan troglodytes]
gi|410212082|gb|JAA03260.1| cathepsin O [Pan troglodytes]
gi|410330245|gb|JAA34069.1| cathepsin O [Pan troglodytes]
Length = 318
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
R + L +E+ + YG+N+FS L EF+A YL + KPS R V IPN++LP
Sbjct: 49 RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 106
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
FDWR+ VT V++Q MCG WAFS G +E YA K K L LS Q++IDC + G
Sbjct: 107 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 166
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
C GGS NA + ++K+ L ++ YP++ + C + + I GY + S E
Sbjct: 167 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSNQE 225
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+MAK L+ GP+ V ++A + Q Y+ G+ IQ C G N H+VLI G+ D+T
Sbjct: 226 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 278
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T PYWI++NSWG WG GY + G CGI D V S V
Sbjct: 279 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 318
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 160/305 (52%), Gaps = 26/305 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ ++ K Y +E R IF N+ I+ ++ +N +DL+ EF+A
Sbjct: 43 WMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKASRN 102
Query: 105 GFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
G+K + +A S N+T +P A DWR AVT +KDQ CGS WAFST IEG+
Sbjct: 103 GYKKIDREFATTSFK--YENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGI 160
Query: 163 YAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
T KL+SLSEQEL+DCD ED GCEGG + + F+ I+ GG+ E YPY+ D
Sbjct: 161 NQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKN--GGITSETNYPYKAAD 218
Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
+C A KI GY V + V N P++V+I+A + FY +G+
Sbjct: 219 GSCSAATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYSSGI----- 273
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
+ + G E L H V VGYG + YWI+KNSWG WGEKGY R+ RG +
Sbjct: 274 YTGECGTE-LDHGVTAVGYG------SANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE 326
Query: 335 GSCGI 339
G CGI
Sbjct: 327 GLCGI 331
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 160/301 (53%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S +I +++ +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYKVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 159/300 (53%), Gaps = 24/300 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S +I +++ +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 160/301 (53%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S +I +++ +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 163/314 (51%), Gaps = 31/314 (9%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
++ ++ +H TY + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 42 MYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQ-HNAAADAGVHSFRLGLNRFADLTN 100
Query: 97 AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
E+++ YLG + KP +R + A N LP + DWR+ AV VKDQ CGS WAF
Sbjct: 101 EEYRSTYLGARTKPDR-ERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAF 159
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
S +EG+ T ++ LSEQEL+DCD + GC GG + AF+ I++ GG++ E+
Sbjct: 160 SAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDSEE 217
Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
YPY+ D C NKK A V I+GY V + + V N P++VAI A A Q Y
Sbjct: 218 DYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLY 277
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+G+ F L H V VGYG + K YW+++NSWG WGE GY R
Sbjct: 278 KSGI------FTGTCGTALDHGVAAVGYGTENGK------DYWLVRNSWGSVWGEDGYIR 325
Query: 330 LYRG----DGSCGI 339
+ R G CGI
Sbjct: 326 MERNIKASSGKCGI 339
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 115/296 (38%), Positives = 157/296 (53%), Gaps = 33/296 (11%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM- 120
R F N R I+ S GLN+FSDL++ EF+ ++LG L+P D V M
Sbjct: 34 RFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLG--LRPDLIDSPVLKMP 91
Query: 121 --------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
N+ LP + DWR++ AVT KDQ CG WAF+TTG IEG+ T +LVS
Sbjct: 92 RDSDIEEGFQNVDLPASVDWRQHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQLVS 151
Query: 173 LSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
LSEQELIDCD++ D GC+GG + NA+ I+ GGL+ E YPY + C + K ++
Sbjct: 152 LSEQELIDCDKKADKGCDGGLMENAYQFIVEN--GGLDTETDYPYHASESHCNMKKLNSR 209
Query: 232 -VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENL 288
V I+GY ++ + V P++VAI + Q Y +GV F E +
Sbjct: 210 VVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQHYASGV------FTGHCGEEI 263
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGIN 340
+H VLIVGYG T + YWI+KNSW WG+ G+ ++ R G C IN
Sbjct: 264 NHGVLIVGYG------TEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSIN 313
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/303 (39%), Positives = 153/303 (50%), Gaps = 37/303 (12%)
Query: 58 EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
E R ++F N R I LN+F+D++T EF+ Y G + + RS+
Sbjct: 66 EARRRFNVFVENARYIHEANRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHH---RSL 122
Query: 118 PAMIPNI------------TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAA 165
LP A DWRE AVTG+KDQ CGS WAFS +EGV
Sbjct: 123 RGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKI 182
Query: 166 KTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
KT +LV+LSEQEL+DCD D+ GC+GG + AF I K GG+ E YPYR + C
Sbjct: 183 KTGRLVTLSEQELVDCDTGDNQGCDGGLMDYAFQFI--KRNGGITTESNYPYRAEQGRCN 240
Query: 225 LNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFC 281
K ++ V I+GY V ++ + V N P+AVA+ A QFY GV F
Sbjct: 241 KAKASSHDVTIDGYEDVPANDESALQKAVANQPVAVAVEASGQDFQFYSEGV------FT 294
Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGS 336
+L H V VGYG+ T YWI+KNSWGE WGE+GY R+ RG +G
Sbjct: 295 GECGTDLDHGVAAVGYGI-----TRDGTKYWIVKNSWGEDWGERGYIRMQRGVSSDSNGL 349
Query: 337 CGI 339
CGI
Sbjct: 350 CGI 352
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 113/308 (36%), Positives = 171/308 (55%), Gaps = 26/308 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLSTAEFQA 101
F +H+K Y+ EY RL IF NL+ I+ Q+ + G Y G+N+F+D++ AE+
Sbjct: 27 FKIKHDKVYSEKEEYARRL-IFQDNLKTIESHNQEADTGKHSYWLGVNQFADMTHAEYLN 85
Query: 102 KYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+ +G L S R+ +PN+ + DWR+ VT +KDQ CGS WAFSTTG
Sbjct: 86 QVIGGCLITSNLTKTGSRATYRYMPNMQVNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTG 145
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
++EG +A T LVSLSEQ L+DC +++ GCEGG + F I+ G++ E+ YP
Sbjct: 146 SLEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQGFQYIIQN--KGIDTEQCYP 203
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN-GPMAVAINA--YALQFYVTG 272
Y+ + C+ + ++ + V+ + D K N GP++V I+A + QFY +G
Sbjct: 204 YKAKNHRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSG 263
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V + +F C + L H VL+VGYG T+ + YW++KNSWG WG +GY + R
Sbjct: 264 VYN--EFEC--SSTKLDHGVLVVGYG------TYGSKDYWLVKNSWGTVWGNEGYIMMSR 313
Query: 333 G-DGSCGI 339
D CG+
Sbjct: 314 NKDNQCGV 321
>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 357
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 122/357 (34%), Positives = 178/357 (49%), Gaps = 24/357 (6%)
Query: 4 FYFFAGVALLSLTVSVSSFMV---VGDEKLHHLHHVKHT-ALFNYFLEQHNKTYATLVEY 59
+FF + L+ + S S+F V + L L T LF + ++H Y L E
Sbjct: 11 IFFFICITLICFSSS-SNFPVQYSILGPNLDKLPSQDETIQLFQLWRKEHGLVYKDLKEM 69
Query: 60 YSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
R IF NL I Y GLN F+D S +EFQ YL P+ + +
Sbjct: 70 AKRFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPSEFQEIYLHSLDMPTDSAPKL 129
Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
+ + P + DWR AVT +K+Q CGS WAFS G IEG++A T +L+SLSEQE
Sbjct: 130 NGPLLSCIAPASLDWRNKVAVTAIKNQGSCGSCWAFSAAGAIEGIHAITTGELISLSEQE 189
Query: 178 LIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKAT-QVKIN 235
L++CD+ GC GG ++ AFD ++S GG+ E YPY G D C +K+ + I+
Sbjct: 190 LVNCDRVSKGCNGGWVNKAFDWVISN--GGITLEAEYPYTGKDGGNCNSDKQVPIKATID 247
Query: 236 GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
GY V + + + +V+ P+++ +NA Q Y +G+ Q C ++ +H VLIV
Sbjct: 248 GYEQVEQSDNGLLCSIVKQ-PISICLNATDFQLYESGIFDGQQ--CSSSSKYTNHCVLIV 304
Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD----GSCGINDYVRSALV 348
GY + YWI+KNSWG WG GY + R G CG+N + + +
Sbjct: 305 GYD------SSNGEDYWIVKNSWGTKWGINGYIWIKRNTGLPYGVCGMNAWAYNPTI 355
>gi|397504019|ref|XP_003822607.1| PREDICTED: cathepsin O [Pan paniscus]
Length = 321
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
R + L +E+ + YG+N+FS L EF+A YL + KPS R V IPN++LP
Sbjct: 52 RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 109
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
FDWR+ VT V++Q MCG WAFS G +E YA K K L LS Q++IDC + G
Sbjct: 110 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 169
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
C GGS NA + ++K+ L ++ YP++ + C + + I GY + S E
Sbjct: 170 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSNQE 228
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+MAK L+ GP+ V ++A + Q Y+ G+ IQ C G N H+VLI G+ D+T
Sbjct: 229 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 281
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T PYWI++NSWG WG GY + G CGI D V S V
Sbjct: 282 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321
>gi|395735444|ref|XP_002815290.2| PREDICTED: cathepsin O [Pongo abelii]
Length = 318
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
R + L +E+ + YG+N+FS L EF+A YL + KPS R V IPN++LP
Sbjct: 49 RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVRMSIPNVSLP 106
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
FDWR+ VT V++Q MCG WAFS G +E YA K K L LS Q++IDC + G
Sbjct: 107 LRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 166
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
C GGS NA + ++K+ L ++ YP++ + C + + I GY + S E
Sbjct: 167 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSNQE 225
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+MAK L+ GP+ V ++A + Q Y+ G+ IQ C G N H+VLI G+ D+T
Sbjct: 226 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 278
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T PYWI++NSWG WG GY + G CGI D V S V
Sbjct: 279 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 318
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 122/313 (38%), Positives = 173/313 (55%), Gaps = 23/313 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
HNK Y E + R+ ++ NL+ I+L + +H G + G+N+F D++T EF+
Sbjct: 17 HNKDYHEREESWRRV-VWEKNLKMIEL-HNLDHTLGKHSYKLGMNQFGDMTTEEFRQLMN 74
Query: 105 GFKLKPSYAD-RSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
G+ K S R + P+ + PR+ DWRE VT VKDQ CGS WAFSTTG +EG
Sbjct: 75 GYAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQ 134
Query: 163 YAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-D 219
+ KT KLVSLSEQ L+DC + + GC GG + AF + GG++ E++YPY D
Sbjct: 135 HFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDN--GGIDSEESYPYTAKD 192
Query: 220 DKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
D+ CR + G+V + + E + K + GP++VAI+A + QFY +G
Sbjct: 193 DEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSG---- 248
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DG 335
I + D +E+L H VL+VGYG + K YWI+KNSWGE WG+KGY + +
Sbjct: 249 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKK--YWIVKNSWGEKWGDKGYIYMAKDRKN 306
Query: 336 SCGINDYVRSALV 348
CGI LV
Sbjct: 307 HCGIATAASYPLV 319
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 171/310 (55%), Gaps = 23/310 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
+N + QH K+Y VE R+ I+ NLRKI+ + E+ G + G+N+F D++
Sbjct: 28 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQ-HNFEYSYGNHTFKMGMNQFGDMTNE 85
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
EF+ G+K P+ + M P+ P+ DWR+ VT VKDQ CGS W+FS+T
Sbjct: 86 EFRQAMNGYKHDPNRTSQGPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSST 145
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG KT KL+S+SEQ L+DC Q + GC GG + AF + K GL+ E++Y
Sbjct: 146 GALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYV--KENKGLDSEQSY 203
Query: 215 PYRG-DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYV 270
PY DD CR + + KI G+V + R +E + + GP++VAI+A +LQFY
Sbjct: 204 PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQ 263
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G+ ++ L H+VL+VGYG YWI+KNSW + WG+KGY +
Sbjct: 264 SGI-----YYERACTSRLDHAVLVVGYGYQGADVAGNR--YWIVKNSWSDKWGDKGYIYM 316
Query: 331 YRG-DGSCGI 339
+ + CGI
Sbjct: 317 AKDKNNHCGI 326
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 122/321 (38%), Positives = 168/321 (52%), Gaps = 45/321 (14%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEF 91
V LF+ F + NK Y + E R +FS N+ I + E GV+ +N+F
Sbjct: 24 VNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINR-HNAEAARGVHTHTVDVNQF 82
Query: 92 SDLSTAEFQAKYL--------GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
+DL+ E++ YL G + + + D PN + DWR+ AVT +K+
Sbjct: 83 ADLTNEEYRQLYLRPYPTELLGRERQEVWLDG------PNAG---SVDWRQKGAVTPIKN 133
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIM 201
Q CGS W+FSTTG++EG +A T LVSLSEQ+L+DC + GC GG + NAF I+
Sbjct: 134 QGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYII 193
Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVA 260
S GGL+ E+ YPY D C +K++ V I+GY V ++ D VE GP++VA
Sbjct: 194 SN--GGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVA 251
Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I A + Q Y +GV F NL H VL+VGY D YWI+KNSW
Sbjct: 252 IEADQQSFQMYSSGV------FSGPCGTNLDHGVLVVGYTSD----------YWIVKNSW 295
Query: 319 GEGWGEKGYFRLYRGDGSCGI 339
G WG++GY + RG S GI
Sbjct: 296 GASWGDQGYIMMKRGVSSAGI 316
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 123/308 (39%), Positives = 167/308 (54%), Gaps = 27/308 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLSTAEFQA 101
F EQH K Y + F NL +I+ + G + G N +DL E++
Sbjct: 86 FKEQHGKNYEDEETENDHMLAFLSNLEEIRKHNARYQRGESSFEMGTNHITDLPFEEYR- 144
Query: 102 KYLGFKLKPSYAD---RSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
K G+K P Y D ++P NI +P +DWR++ VT VK+Q MCGS WAFS TG
Sbjct: 145 KLNGYK--PRYDDSHRNGTKFLVPFNINVPGHWDWRDHGYVTEVKNQGMCGSCWAFSATG 202
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
+EG + K LVSLSEQ L+DC ++ ++GC GG + AF+ I K G++ E +YP
Sbjct: 203 ALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYI--KDNHGVDTEASYP 260
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTG 272
Y+G + C NKK + GYV + DE + + GP++VAI+A + Q Y G
Sbjct: 261 YKGKEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVAIDAGHPSFQMYRKG 320
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V + Q C +E+L H VL+VGYG D YWI+KNSWG GWGEKGY R+ R
Sbjct: 321 VYYEPQ--C--SSESLDHGVLVVGYGTDEIDGD-----YWIVKNSWGPGWGEKGYVRIAR 371
Query: 333 G-DGSCGI 339
D CGI
Sbjct: 372 NRDNHCGI 379
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 151/304 (49%), Gaps = 21/304 (6%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAK 102
++ +H +TY E R +F N + Y LNEF+D++ EF A
Sbjct: 54 WMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAM 113
Query: 103 YLGFKLKPSYADRSVPAMIPNITLPRA------FDWREYDAVTGVKDQTMCGSSWAFSTT 156
Y G + P+ A + N+TL A DWR+ AVTG+K+Q CG WAF+
Sbjct: 114 YTGLRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAV 173
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
+EG++ T LVSLSEQ+++DCD + ++GC GG I NAF I+ GGL E YP
Sbjct: 174 AAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGN--GGLGTEDAYP 231
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
Y C+ + I+GY V + V N P++VAI+A+ Q Y GV
Sbjct: 232 YTAAQAMCQSVQPV--AAISGYQDVPSGDEAALAAAVANQPVSVAIDAHNFQLYGGGVMT 289
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
NL+H+V VGYG PYW++KN WG+ WGE GY RL RG
Sbjct: 290 AASCSTP---PNLNHAVTAVGYGT-----AEDGTPYWLLKNQWGQNWGEGGYLRLERGAN 341
Query: 336 SCGI 339
+CG+
Sbjct: 342 ACGV 345
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 122/310 (39%), Positives = 165/310 (53%), Gaps = 31/310 (10%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ Q K+Y E R IF N+ I+L + +N F+DL+ EF+A
Sbjct: 40 WMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNEEFKASLN 99
Query: 105 GFKL---KPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
G K K + + N+T +P + DWR+ AVT +K+Q CGS WAFST +IE
Sbjct: 100 GNKKLHDKFDILNETTSFRYHNVTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIE 159
Query: 161 GVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
G++ T +LVSLSEQELIDC + GC GG + +AF I K GG+ E YPY+
Sbjct: 160 GIHQITTGELVSLSEQELIDCVRGNSSGCSGGYLEDAFKFIAKK--GGMASETNYPYKET 217
Query: 220 DKACRLNKKATQV-KINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
D+ C+ K++ V +I GY V S E D+ K V N P++V ++A Y QFY G+
Sbjct: 218 DEKCKFKKESKHVAEIKGYEKVPSNSENDLLK-AVANQPVSVYVDAGDYVFQFYSGGI-- 274
Query: 276 PIQFFCDGGNENLSHSVLIVGYGV--DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
F + H V IVGYGV D T+ YW++KNSWG GWGEKGY +L R
Sbjct: 275 ----FTGKCGTDTDHVVTIVGYGVSLDYTE-------YWLVKNSWGTGWGEKGYMKLKRN 323
Query: 334 ----DGSCGI 339
G CGI
Sbjct: 324 VDSKKGLCGI 333
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 123/343 (35%), Positives = 170/343 (49%), Gaps = 29/343 (8%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSGN 69
LL+L V+++ V + +L+ + H+ L E R ++F N
Sbjct: 7 LLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHHTVSRDLSEKNKRFNVFKEN 66
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPNI 124
+ I + + GLN+F+D++ EF++ Y G K+ R P M N+
Sbjct: 67 AKFIHEFNKKDAPYKL-GLNKFADMTNQEFRSTYAGSKIHHHRTQRGTPRATGSFMYENV 125
Query: 125 -TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD- 182
++P + DWR AV VKDQ CGS WAFST ++EG+ KT +LV LS Q+L+DCD
Sbjct: 126 HSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDCDT 185
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR 242
+++GC GG + AF+ I S GG+ E YPY + +C A V I+GY V
Sbjct: 186 DQNEGCNGGLMDYAFEFIKSN--GGITSESAYPYTAEQGSCASESSAPVVTIDGYEDVPA 243
Query: 243 DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
+ V N ++VAI A A QFY GV F GNE L H V +VGYG
Sbjct: 244 NNEAALMKAVANQVVSVAIEASGMAFQFYSEGV-----FTGSCGNE-LDHGVAVVGYGAT 297
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
R YWI++NSWG WGEKGY R+ RG G CGI
Sbjct: 298 R-----DGTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGI 335
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 121/313 (38%), Positives = 173/313 (55%), Gaps = 23/313 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
H+K Y E + R+ ++ NL+ I+L + +H G + G+N+F D++ EF+
Sbjct: 51 HSKDYHEREESWRRV-VWEKNLKMIEL-HNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMN 108
Query: 105 GFKLKPSYAD-RSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
G+K K S R + P+ + PR+ DWRE VT VKDQ CGS WAFSTTG +EG
Sbjct: 109 GYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQ 168
Query: 163 YAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-D 219
+ KT KLVSLSEQ L+DC + + GC GG + AF + GG++ E++YPY D
Sbjct: 169 HFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDN--GGIDSEESYPYTAKD 226
Query: 220 DKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
D+ CR + G+V + + E + K + GP++VAI+A + QFY +G
Sbjct: 227 DEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSG---- 282
Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DG 335
I + D +E+L H VL+VGYG + K YWI+KNSWGE WG+KGY + +
Sbjct: 283 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKK--YWIVKNSWGEKWGDKGYIYMAKDRKN 340
Query: 336 SCGINDYVRSALV 348
CGI LV
Sbjct: 341 HCGIATAASYPLV 353
>gi|45822201|emb|CAE47497.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 315
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 118/303 (38%), Positives = 162/303 (53%), Gaps = 29/303 (9%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLSTAEFQA 101
F HNK+Y ++E R +F NL+KI+ E G Y +N+F+D S+AEFQA
Sbjct: 27 FKATHNKSY-NVIEDKLRFAVFQDNLKKIEEHNAKYESGEETYYLAVNKFADWSSAEFQA 85
Query: 102 ---KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+ + K K S+ + V PN+ DWR+ AV GVKDQ CGS WAFSTTG+
Sbjct: 86 MLARQMANKPKQSFIAKHVAD--PNVQAVEEVDWRD-SAVLGVKDQGQCGSCWAFSTTGS 142
Query: 159 IEGVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
+EG A + V LSEQEL+DCD + GC GG +++AF+ + GL E Y Y
Sbjct: 143 LEGQLAIHKNQRVPLSEQELVDCDTSRNAGCNGGLMTDAFNYVKRH---GLSSESQYAYT 199
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
G D C+ + I+GYV + E +A + GP+++A++A Q Y G+
Sbjct: 200 GRDDRCKNVENKPLSSISGYVELETTEDALASAVASVGPVSIAVDADTWQLYGGGL---- 255
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
F NL+H VL VGY D +I+KNSWG WGE+GY R+ RG+ C
Sbjct: 256 -FNNKNCRTNLNHGVLAVGYTKDA----------FIVKNSWGTSWGEQGYIRVARGENLC 304
Query: 338 GIN 340
GIN
Sbjct: 305 GIN 307
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 115/310 (37%), Positives = 171/310 (55%), Gaps = 23/310 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
+N + QH K+Y VE R+ I+ NLRKI+ + E+ G + G+N+F D++
Sbjct: 28 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQ-HNFEYSYGNHTFKMGMNQFGDMTNE 85
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
EF+ G+K P+ + M P+ P+ DWR+ VT VKDQ CGS W+FS+T
Sbjct: 86 EFRQAMNGYKQDPNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSST 145
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG KT KL+S+SEQ L+DC Q + GC GG + AF + K GL+ E++Y
Sbjct: 146 GALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYV--KENKGLDSEQSY 203
Query: 215 PYRG-DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYV 270
PY DD CR + + KI G+V + + +E + + GP++VAI+A +LQFY
Sbjct: 204 PYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQ 263
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
+G+ ++ L H+VL+VGYG YWI+KNSW + WG+KGY +
Sbjct: 264 SGI-----YYERACTSRLDHAVLVVGYGYQGADVAGNR--YWIVKNSWSDKWGDKGYIYM 316
Query: 331 YRG-DGSCGI 339
+ + CGI
Sbjct: 317 AKDKNNHCGI 326
>gi|328876826|gb|EGG25189.1| hypothetical protein DFA_03437 [Dictyostelium fasciculatum]
Length = 341
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 111/308 (36%), Positives = 155/308 (50%), Gaps = 15/308 (4%)
Query: 38 HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTA 97
+T F ++ +HNK Y E+Y RL F N+ I+ + + +GLN+FSDLS
Sbjct: 28 YTTRFKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIEKMNRQYGRTATFGLNKFSDLSLD 87
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
EF+ YL KP P+ +P DWR VT VK+Q MCGS WAFS T
Sbjct: 88 EFKKHYLMPNYKPKARVTKETFNYPS-NIPATLDWRTKGYVTPVKNQLMCGSCWAFSATE 146
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
IE ++ LSEQ+++DCD D GC GG A+ + + GGL TYPY
Sbjct: 147 QIETANIMAGGQVEYLSEQQIVDCDPYDGGCGGGDPYTAYQYVQNN--GGLTLNVTYPYT 204
Query: 218 GDDKACRLNKKATQVKIN--GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
+ AC N A V++ GY S +ET + + + GP+++ +NA Y +G+
Sbjct: 205 AANGACYANSTAPAVQVTAFGYASSQGNETQLREAMAARGPLSICVNAEPWMSYQSGI-- 262
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
F +++L H V IVGY D T T PY+I++NSWG WG GY + G
Sbjct: 263 ----FSSTCSDDLDHCVQIVGYDTDATSKT----PYFIVRNSWGTDWGLLGYIYIQAGSN 314
Query: 336 SCGINDYV 343
CGI + V
Sbjct: 315 LCGITNEV 322
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 133/365 (36%), Positives = 182/365 (49%), Gaps = 46/365 (12%)
Query: 6 FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
F V+ L+ +VS F +V +E +N F QH K Y + E R+ I
Sbjct: 4 FLLLVSFLAAANAVSIFNLVKEE-------------WNAFKLQHRKKYDSESEERIRMKI 50
Query: 66 FSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADR------- 115
+ N KI + Q + G + L N+++DL EF GF + +
Sbjct: 51 YVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSKLLGREQL 110
Query: 116 -----SVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
+ + P N+ +P DWRE AVT VKDQ CGS W+FS TG +EG + KT K
Sbjct: 111 MTIEEPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGK 170
Query: 170 LVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK 227
LVSLSEQ L+DC + ++GC GG + NAF + K G++ EK YPY D C N
Sbjct: 171 LVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYV--KDNKGIDTEKAYPYEAIDDECHYNP 228
Query: 228 KATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGG 284
KA G+V + + DE + K L GP++VAI+A + QFY GV + Q CD
Sbjct: 229 KAIGATDKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQ--CD-- 284
Query: 285 NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYV 343
+E L H VL VGYG T YW++KNSWG WG++GY ++ R + CGI
Sbjct: 285 SEQLDHGVLAVGYGT-----TEDGEDYWLVKNSWGTTWGDQGYVKMARNRENHCGIATTA 339
Query: 344 RSALV 348
LV
Sbjct: 340 SYPLV 344
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 129/332 (38%), Positives = 174/332 (52%), Gaps = 29/332 (8%)
Query: 22 FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
F +VG +HH + LF ++ ++ K YA+ E R +F NL I + +
Sbjct: 46 FSIVGYSPEDLVHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEA-NKKV 104
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFK---LKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
+ GLN F+DL+ EF+A YLG + K + R + + +P + DWR+ AV
Sbjct: 105 TTYWLGLNAFADLTHDEFKATYLGLRQPETKKTTDSRFRYGGVADDDVPASVDWRKKGAV 164
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAF 197
T VK+Q CGS WAFST +EG+ T L SLSEQEL+DC + ++GC GG + NAF
Sbjct: 165 TDVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAF 224
Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYVSV-SRDETDMAKYLVEN 254
I S GGL E+ YPY ++ C + + V I+GY V + DE + K L
Sbjct: 225 SYIASS--GGLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQ 282
Query: 255 GPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
P++VAI A QFY GV F G+E L H V VGYG + K Y
Sbjct: 283 -PLSVAIEASGRHFQFYSGGV-----FNGPCGSE-LDHGVAAVGYG------SSKGQDYI 329
Query: 313 IIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
I+KNSWG WGEKGY R+ RG +G CGIN
Sbjct: 330 IVKNSWGSHWGEKGYIRMKRGTGKPEGLCGIN 361
>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
Length = 331
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 116/329 (35%), Positives = 178/329 (54%), Gaps = 25/329 (7%)
Query: 22 FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
+ +G H L+ + A ++ + H + Y L E R I+ N+R I+ + E
Sbjct: 8 LLFLGSVLAHPLNEMSLDAQWDSWKTTHLREYNGLGEEVIRRTIWEKNMRLIEA-HNEEA 66
Query: 82 GSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN---ITLPRAFDWRE 134
G++ G+N D+++ E K G ++ P DRS IP+ + +PR+ D+R+
Sbjct: 67 ALGIHSYELGMNHLGDMTSEEIAEKLTGLQV-PMNRDRS-NTWIPDNNVVKIPRSIDYRK 124
Query: 135 YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSIS 194
VT VK+Q CGS WAFS+ G +EG A T KL+ LS Q L+DC E++GC GG ++
Sbjct: 125 KGMVTPVKNQLSCGSCWAFSSAGALEGQLAKTTGKLIDLSPQNLVDCVTENNGCGGGYMT 184
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVE 253
NAF+ + GG++ E+ YPY G D C N + G+ + DE + K +V+
Sbjct: 185 NAFEYVEEN--GGIDTEEAYPYLGQDGQCAYNASGMGAQCRGFKEIPEGDEWALTKAVVK 242
Query: 254 NGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
GP+AV I+A QFY GV + C+ ++++H+VL VGYG T K + +
Sbjct: 243 VGPVAVGIDATLSTFQFYQRGVYYDPN--CN--KDDINHAVLAVGYGQ-----TAKGMKF 293
Query: 312 WIIKNSWGEGWGEKGYFRLYRGDG-SCGI 339
WI+KNSW E WG++GY + R G +CGI
Sbjct: 294 WIVKNSWSESWGKQGYIMMARNRGNACGI 322
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 171/324 (52%), Gaps = 42/324 (12%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVYGLNEF 91
L H K + N + Q ++ R +IF NLR I L ++ ++ + GL F
Sbjct: 9 LEHGKSNSNSNGIINQQDE----------RFNIFKDNLRFIDLHNENNKNATYKLGLTIF 58
Query: 92 SDLSTAEFQAKYLGFKLKP-------SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
++L+ E+++ YLG + +P + A + ++ +P DWR+ AV +KDQ
Sbjct: 59 ANLTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQ 118
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSK 203
CGS WAFST +EG+ T +LVSLSEQEL+DCD+ + GC GG + AF IM
Sbjct: 119 GTCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKN 178
Query: 204 LGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAI 261
GGL EK YPY G + C L K + V I+GY V S+DET + K V P++VAI
Sbjct: 179 --GGLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETAL-KRAVSYQPVSVAI 235
Query: 262 NA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
+A A Q Y +G+ F N+ H+V+ VGYG + V YWI++NSWG
Sbjct: 236 DAGGRAFQHYQSGI------FTGKCGTNMDHAVVAVGYG------SENGVDYWIVRNSWG 283
Query: 320 EGWGEKGYFRLYRG----DGSCGI 339
WGE GY R+ R G CGI
Sbjct: 284 TRWGEDGYIRMERNVASKSGKCGI 307
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 120/316 (37%), Positives = 166/316 (52%), Gaps = 35/316 (11%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
+++ +L +H K Y + E R IF NL I+ V GLN FSDLS E+
Sbjct: 50 SIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKV-GLNRFSDLSNEEY 108
Query: 100 QAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
++KYLG K+ PS + R P + N LP + DWR+ AV VK+Q+ C WAFS
Sbjct: 109 RSKYLGTKIDPSRMMARPSRRYSPRVADN--LPESVDWRKEGAVVRVKNQSECEGCWAFS 166
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+EG+ T L +LSEQEL+DCD+ + GC GG + AF+ I++ GG++ E+
Sbjct: 167 AIAAVEGINKIVTGNLTALSEQELLDCDRTVNAGCSGGLVDYAFEFIINN--GGIDTEED 224
Query: 214 YPYRGDDKAC---RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQF 268
YP++G D C ++N +A V I+GY V + K V N P++VAI AY Q
Sbjct: 225 YPFQGADGICDQYKINARA--VTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQL 282
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F ++ H V VGYG T + YWI+KNSWGE WGE GY
Sbjct: 283 YESGI------FTGTCGTSIDHGVTAVGYG------TENGIDYWIVKNSWGENWGEAGYV 330
Query: 329 RLYRG-----DGSCGI 339
+ R G CGI
Sbjct: 331 GMERNIAEDTAGKCGI 346
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 157/300 (52%), Gaps = 24/300 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 157/300 (52%), Gaps = 24/300 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 125/346 (36%), Positives = 179/346 (51%), Gaps = 33/346 (9%)
Query: 12 LLSLTVSVSSFM-VVGDEKLHHLHHVKHT-----ALFNYFLEQHNKTYATLVEYYSRLHI 65
LS T+S +S M ++ ++ H T A++ +L + K Y L E R +
Sbjct: 16 FLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGEREKRFQV 75
Query: 66 FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPN 123
F NLR I ++E+ + GLN F+DL+ E+++ YLG + +K + ++ P
Sbjct: 76 FKDNLRFIDE-HNSENRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSDRYAPR 134
Query: 124 I--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
+ +LP + DWR+ AV VKDQ CGS WAFST +EG+ T L+SLSEQEL+DC
Sbjct: 135 VGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDC 194
Query: 182 DQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQVKINGYVS 239
D ++GC GG + AF+ I++ GG++ E+ YPY D C K A V I+ Y
Sbjct: 195 DTSYNEGCNGGLMDYAFEFIINN--GGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYED 252
Query: 240 VSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
V + + V N P++VAI A QFY +G+ F L H V VGY
Sbjct: 253 VPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGI------FSGRCGTQLDHGVAAVGY 306
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
G + K YWI++NSWG+ WGE GY R+ R G CGI
Sbjct: 307 GTENGK------DYWIVRNSWGKSWGENGYLRMARSINSPTGICGI 346
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 121/317 (38%), Positives = 158/317 (49%), Gaps = 24/317 (7%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
LH ++ Q+ + Y E R IF N+ +I+ S +NEF+
Sbjct: 30 LHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFA 89
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
DL+ EF+A FK + + N+T +P DWR+ AVT +KDQ CGS W
Sbjct: 90 DLTNEEFRASRNRFKAHIC-STEATSFKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCW 148
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLE 209
AFS +EG+ T KL+SLSEQEL+DCD ED GC GG + +AF I + GL
Sbjct: 149 AFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFI--EQNHGLT 206
Query: 210 EEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--L 266
E YPY G D C K A KINGY V + + V + P+AVAI+A
Sbjct: 207 TEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEF 266
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY +GV F G E L H V VGYG + + YW++KNSW GWGE+G
Sbjct: 267 QFYSSGV-----FTGQCGTE-LDHGVAAVGYGT-----SDDGMKYWLVKNSWSTGWGEEG 315
Query: 327 YFRLYRG----DGSCGI 339
Y R+ R +G CGI
Sbjct: 316 YIRMQRDVTAKEGLCGI 332
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPVSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|340504799|gb|EGR31212.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 250
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
N LP FDWRE +T VK Q CG W F+TTG IE YA K KLV+ SEQ+LIDCD
Sbjct: 36 NQVLPSYFDWREQGIITPVKYQDTCGGCWTFATTGVIESQYALKYNKLVNFSEQQLIDCD 95
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY-PYRGDDKACRLNKKATQVKINGYVSVS 241
+DGC GG +++A+ I GGLE + Y Y C+++ K+ + +S
Sbjct: 96 SINDGCRGGLMTDAYKAIQEM--GGLETSEDYGEYLNSKGQCKIDSNKVSAKVINWYQIS 153
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
DE + + LV+NGP+AV +NA LQFY G+ P CD ++++H+VLIVGYG +
Sbjct: 154 EDEEAIRRELVQNGPIAVGVNARFLQFYQGGILDPK--LCD---DSINHAVLIVGYGEEN 208
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
K YWIIKN WG+ WG GYF+L RG CG++ Y A +
Sbjct: 209 GK------KYWIIKNQWGKSWGINGYFKLVRGKKQCGVHTYASIAFI 249
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 157/300 (52%), Gaps = 24/300 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC+GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 156/304 (51%), Gaps = 24/304 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H K Y +E R IF N+ I+ ++ +N +DL+ EF+A
Sbjct: 43 WMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTLDEFKASRN 102
Query: 105 GFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
G+K K + N+T +P A DWR AVT +KDQ CGS WAFST EG+
Sbjct: 103 GYK-KIDREFTTTSFKYENVTAIPAAVDWRVKGAVTPIKDQGQCGSCWAFSTVAATEGIN 161
Query: 164 AAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
T KLVSLSEQEL+DCD ED GCEGG + + F+ I+ GG+ E YPY+ D
Sbjct: 162 QITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKN--GGITSETNYPYKAADG 219
Query: 222 ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQF 279
+C KI GY V + V N P++V+I+A + FY +G+ +
Sbjct: 220 SCNTATTTPVAKITGYEKVPVNSEKSLLKAVANQPISVSIDASDSSFMFYSSGI-----Y 274
Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DG 335
+ G E L H V VGYG + YWI+KNSWG WGEKGY R+ RG +G
Sbjct: 275 TGECGTE-LDHGVTAVGYG------SANGTDYWIVKNSWGTVWGEKGYIRMQRGIAAKEG 327
Query: 336 SCGI 339
CGI
Sbjct: 328 LCGI 331
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 125/343 (36%), Positives = 180/343 (52%), Gaps = 35/343 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
+++L++ S + D KL+ +H L+ E +NK Y+ E+ R + GN
Sbjct: 4 ISVLAVLALAFSCTLAFDAKLN-----QHWKLWK---EANNKRYSDAEEHVRRA-TWEGN 54
Query: 70 LRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIP 122
L+K+Q + + GV+ G+N+++D++ EF G+ DR +
Sbjct: 55 LQKVQE-HNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFNS 113
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
I LP DWR+ VT VKDQ CGS WAFSTTG +EG + +T KLVSLSEQ L+DC
Sbjct: 114 KIALPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDCS 173
Query: 183 --QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV 240
Q + GC GG + AF+ I K G++ E +YPY D CR G+ +
Sbjct: 174 GKQGNMGCNGGLMDQAFEYI--KENNGIDTEDSYPYEAVDNQCRFKAANVGATDTGFTDI 231
Query: 241 -SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
S+DE+ + + + GP++VAI+A + Q Y GV + + FC L H VL VGY
Sbjct: 232 TSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYN--EPFC--SQTRLDHGVLAVGY 287
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD-GSCGI 339
G D K YW++KNSWGEGWG+KGY ++ R CGI
Sbjct: 288 GTDSGK------DYWLVKNSWGEGWGDKGYIKMTRNKRNQCGI 324
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 121/317 (38%), Positives = 158/317 (49%), Gaps = 24/317 (7%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
LH ++ Q+ + Y E R IF N+ +I+ S +NEF+
Sbjct: 30 LHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFA 89
Query: 93 DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
DL+ EF+A FK + + N+T +P DWR+ AVT +KDQ CGS W
Sbjct: 90 DLTNEEFRASRNRFKAHIC-STEATSFKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCW 148
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLE 209
AFS +EG+ T KL+SLSEQEL+DCD ED GC GG + +AF I + GL
Sbjct: 149 AFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFI--EQNHGLT 206
Query: 210 EEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--L 266
E YPY G D C K A KINGY V + + V + P+AVAI+A
Sbjct: 207 TEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEF 266
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
QFY +GV F G E L H V VGYG + + YW++KNSW GWGE+G
Sbjct: 267 QFYSSGV-----FTGQCGTE-LDHGVAAVGYGT-----SDDGMKYWLVKNSWSTGWGEEG 315
Query: 327 YFRLYRG----DGSCGI 339
Y R+ R +G CGI
Sbjct: 316 YIRMQRDVTVKEGLCGI 332
>gi|4557501|ref|NP_001325.1| cathepsin O preproprotein [Homo sapiens]
gi|1168795|sp|P43234.1|CATO_HUMAN RecName: Full=Cathepsin O; Flags: Precursor
gi|574804|emb|CAA54562.1| cathepsin O [Homo sapiens]
gi|29351630|gb|AAH49206.1| Cathepsin O [Homo sapiens]
gi|312153238|gb|ADQ33131.1| cathepsin O [synthetic construct]
Length = 321
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
R + L +E+ + YG+N+FS L EF+A YL + KPS R V IPN++LP
Sbjct: 52 RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 109
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
FDWR+ VT V++Q MCG WAFS G +E YA K K L LS Q++IDC + G
Sbjct: 110 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 169
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
C GGS NA + ++K+ L ++ YP++ + C + + I GY + S E
Sbjct: 170 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQE 228
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+MAK L+ GP+ V ++A + Q Y+ G+ IQ C G N H+VLI G+ D+T
Sbjct: 229 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 281
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T PYWI++NSWG WG GY + G CGI D V S V
Sbjct: 282 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 157/300 (52%), Gaps = 24/300 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC+GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFI--KENGGISSESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 117/308 (37%), Positives = 161/308 (52%), Gaps = 26/308 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F +L+ H+K Y E+ R I+ N++ I + ++ H N F+D++ +EF+A
Sbjct: 43 FEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYI-NSLHLPFKLTDNRFADMTNSEFKA 101
Query: 102 KYLGFKLKP-SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
+LG + P P +P A DWR AVT +++Q CG WAFS IE
Sbjct: 102 HFLGLNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIE 161
Query: 161 GVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
G+ KT LVSLSEQ+LIDCD + GC GG + AF+ I S GGL E YPY G
Sbjct: 162 GINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSN--GGLTTETDYPYTG 219
Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
+ C K K V I GY V+++E + + P++V I+A + Q Y +GV
Sbjct: 220 IEGTCDQEKAKNKVVTIQGYQKVAQNEASL-QIAAAQQPVSVGIDAGGFIFQLYSSGV-- 276
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
F NL+H V +VGYGV+ + YWI+KNSWG GWGE+GY R+ RG
Sbjct: 277 ----FTSYCGTNLNHGVTVVGYGVEGDQ------KYWIVKNSWGTGWGEEGYIRMERGIS 326
Query: 334 --DGSCGI 339
G CGI
Sbjct: 327 EDTGKCGI 334
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 121/309 (39%), Positives = 171/309 (55%), Gaps = 26/309 (8%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ---LLQDTEHGSGVYGLNEFSDLSTA 97
++ F H+KTYAT E R I+ +L I + D + G+NE+ DL+
Sbjct: 23 MWTLFKTTHSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGKHTFSLGMNEYGDLTQH 81
Query: 98 EFQAKYLGFKLKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
E+ A G+K+ S S + P N+ +P+ DWRE VT VK+Q CGS WAFS+T
Sbjct: 82 EY-AAMSGYKMAKSSVGSSF--LEPENLQVPKTVDWREKGYVTPVKNQGQCGSCWAFSST 138
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G++EG KT +L S+SEQ L+DC D+ + GC GG + NAF I + G++ EK+Y
Sbjct: 139 GSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNM--GIDSEKSY 196
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVT 271
PY D CR K + +G+V + DET + + GP++VAI+A + QFY T
Sbjct: 197 PYEAVDGECRYKKSDSVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDASHTSFQFYKT 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
GV + C + L H VL+VGYGV+ + YW++KNSWG WGE GY +L
Sbjct: 257 GVY--TEANCS--STQLDHGVLVVGYGVENGQ------DYWLVKNSWGASWGEAGYIKLA 306
Query: 332 RGDGS-CGI 339
R G+ CGI
Sbjct: 307 RNHGNQCGI 315
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYKVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 162/310 (52%), Gaps = 25/310 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGHVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S + + + +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC+GG ++NAFD I K GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFI--KENGGISSESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G+ CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S-CGINDYVR 344
+ G+ D +
Sbjct: 327 NPAGLCDIAK 336
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 115/299 (38%), Positives = 171/299 (57%), Gaps = 21/299 (7%)
Query: 49 HNKTYATLVEYYSRLHIFSGN--LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
+ K+Y TL E R + N L K +HG + +N F DL++AEF + Y G+
Sbjct: 34 YGKSYLTLEEEKYRRDTWEENSLLIKTHNTDSDKHGYTLE-MNSFGDLTSAEFSSLYNGY 92
Query: 107 KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
+ + + + N +P + DWR+ VT VK+Q CGS WAFSTTG++EG++A K
Sbjct: 93 RQNLETSGSVFSSSLRN-AMPSSLDWRDKKVVTDVKNQGKCGSCWAFSTTGSLEGLHALK 151
Query: 167 TKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
T LVSLSEQ+L+DC + ++GC+GG++ +AF I K GG + E++YPY +++CR
Sbjct: 152 TGHLVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYI--KDAGGDDTEESYPYTAKNESCR 209
Query: 225 LNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFC 281
+ K GYV + S DE + L E GP++VA++A QFY G+ + C
Sbjct: 210 FDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYKKGIYS--DYLC 267
Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS-CGI 339
N +L+H V ++GYG + PYW++KNSWG+ WG GYF L R G+ CG+
Sbjct: 268 S--NTHLNHGVTLIGYGE-----SSDGSPYWLVKNSWGKDWGIDGYFMLARYVGNMCGV 319
>gi|330842703|ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
gi|325076376|gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
Length = 352
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 111/315 (35%), Positives = 161/315 (51%), Gaps = 25/315 (7%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
LF+++ +Q+ K Y T E+ R F NL+KI+ L + G +G+N++SDLS EF
Sbjct: 38 LFHHWTKQNGKIYETSEEFEKRFSNFKTNLKKIENLNNLHKGKASFGMNKYSDLSEEEFS 97
Query: 101 AKYL--GFKLKPSYA-DRSVPAMIPNITLPRAF-------------DWREYDAVTGVKDQ 144
YL FK KP D P+ L + DWR VT VKDQ
Sbjct: 98 NFYLMKNFKGKPEEERDYIKKPENPSSNLIGGYLNTDDGLKAMYQVDWRNKGLVTPVKDQ 157
Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKL 204
CGS + FS T IE Y K + LSEQ+ +DCD D GC GG +N ++ I+S
Sbjct: 158 GQCGSCYIFSATEQIESEYIRAGHKAILLSEQQSVDCDTMDGGCGGGDPANVYNYIIS-- 215
Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
GG+ EK YPY D C +A + YV+ + DE + + +GP+++ ++A
Sbjct: 216 AGGVSTEKDYPYTAQDGTCFNTTRAVSITGFQYVTQNSDEDTLITTIANHGPVSICVDAS 275
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
Q Y G+ G +N+ H V +VG +D+T ++ +PY+II+NSWG WG+
Sbjct: 276 TWQSYTGGI------ITTGCEQNIDHCVQVVGLDIDKTDPSN-PIPYYIIRNSWGTSWGD 328
Query: 325 KGYFRLYRGDGSCGI 339
KGY + +G CGI
Sbjct: 329 KGYIYVAQGSNLCGI 343
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 124/309 (40%), Positives = 164/309 (53%), Gaps = 30/309 (9%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAK 102
++ + K Y L E +RL IF N+ I+ + + +Y G+N+F+DL+ EF A
Sbjct: 44 WMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK-LYKLGINQFADLTNEEFIAS 102
Query: 103 YLGFK-LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
FK S ++ N ++P DWR+ AVT VK+Q CG WAFS EG
Sbjct: 103 RNKFKGHMCSSITKTSTFKYENASVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEG 162
Query: 162 VYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
++ T KLVSLSEQEL+DCD + D GCEGG + +AF I+ GL E YPY+G
Sbjct: 163 IHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH--GLNTEAQYPYQGV 220
Query: 220 DKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
D C NK + V I GY V + + V N P++VAI+A QFY +GV
Sbjct: 221 DGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGV--- 277
Query: 277 IQFFCDGGNENLSHSVLIVGYGV--DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
F G E L H V VGYGV D TK YW++KNSWG WGE+GY ++ RG
Sbjct: 278 --FTGSCGTE-LDHGVTAVGYGVGNDGTK-------YWLVKNSWGTDWGEEGYIKMQRGV 327
Query: 334 ---DGSCGI 339
+G CGI
Sbjct: 328 DAAEGLCGI 336
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 124/306 (40%), Positives = 164/306 (53%), Gaps = 29/306 (9%)
Query: 46 LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKY 103
+ +H K+Y + E R +F NL+ I +T Y GLNEF+DLS EF+ KY
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHID---ETNKKVSSYWLGLNEFADLSHEEFKRKY 57
Query: 104 LGFKLK-PSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
LG K++ P D ++ LP++ DWR+ AV VK+Q CGS WAFST +EG
Sbjct: 58 LGLKIELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEG 117
Query: 162 VYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
+ T L +LSEQELIDCD+ ++GC GG + AF I+S GGL +E+ YPY ++
Sbjct: 118 INQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISN--GGLRKEEDYPYVMEE 175
Query: 221 KACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
C K+ + V I+GY V D + N P++VAI A + QFY G+
Sbjct: 176 GTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGI---- 231
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F G E L H V VGYG T K V Y +KNSWG WGEKGY R+ R
Sbjct: 232 -FNGHCGTE-LDHGVAAVGYG------TSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKP 283
Query: 334 DGSCGI 339
+G CGI
Sbjct: 284 EGICGI 289
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 159/301 (52%), Gaps = 24/301 (7%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
++ +H + Y VE R IF N++ I+ + + S G+NEF+D+++ EF AK+
Sbjct: 42 WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101
Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G + SY S +I +++ +P DWRE AVT VK Q CG WAFS G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++EG Y T L+ SEQEL+DC + GC GG ++NAFD I+ GG+ E Y Y
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219
Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
G CR +K V+I+ Y V ET + + + + P+++ I A LQFY G
Sbjct: 220 GQQYTCRSQEKTAAVQISSYKVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276
Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
DG + ++H+V +GYG D K YW++KNSWG WGE G+ ++ R G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 336 S 336
+
Sbjct: 327 N 327
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 166/320 (51%), Gaps = 34/320 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYGLNEFS 92
LF + +H K YA+ E +RL F+ N + G S LN F+
Sbjct: 41 LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100
Query: 93 DLSTAEFQAKYLG-FKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCG 148
DL+ AEF+A LG + + A S ++ +P A DWR+ AVT VKDQ CG
Sbjct: 101 DLTHAEFRAARLGRLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSCG 160
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGG 207
+ W+FS TG IEG+ KT L+SLSEQELIDCD+ + GC GG + A+ ++ GG
Sbjct: 161 ACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKN--GG 218
Query: 208 LEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAY 264
++ E YPYR D C NK K V I+GY V ++ D V P++V I +A
Sbjct: 219 IDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSAR 278
Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
A Q Y G+ F +L H+VLIVGYG + K YWI+KNSWGE WG
Sbjct: 279 AFQLYSQGI------FDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNSWGERWGM 326
Query: 325 KGYFRLYRGDGS----CGIN 340
KGY ++R GS CGIN
Sbjct: 327 KGYMHMHRNTGSSSGICGIN 346
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 119/332 (35%), Positives = 167/332 (50%), Gaps = 26/332 (7%)
Query: 22 FMVVGDEKL--HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
F+ VG ++ LH ++ ++ K Y E R IF N+ I+
Sbjct: 16 FLAVGISQVMPRKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAA 75
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA---MIPNIT-LPRAFDWREY 135
+ G+N +DL+ EF+ G K ++ + N+T +P A DWR
Sbjct: 76 GNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIPEAIDWRVK 135
Query: 136 DAVTGVKDQ-TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSIS 194
AVT +KDQ CGS WAFST EG+Y T L+SLSEQEL+DCD D GC+GG +
Sbjct: 136 GAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHGCDGGLME 195
Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAKYLVE 253
+ F+ I+ GG+ E YPY D C +K+A+ +I GY +V + + + V
Sbjct: 196 DGFEFIIKN--GGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEALQQAVA 253
Query: 254 NGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
N P++V+I+A QFY +GV F L H V +VGYG TH+ Y
Sbjct: 254 NQPVSVSIDAGGSGFQFYSSGV------FTGQCGTQLDHGVTVVGYGTTDDG-THE---Y 303
Query: 312 WIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
WI+KNSWG WGE+GY R+ RG +G CGI
Sbjct: 304 WIVKNSWGTQWGEEGYIRMQRGIDALEGLCGI 335
>gi|324513891|gb|ADY45690.1| Cysteine proteinase [Ascaris suum]
Length = 398
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 114/312 (36%), Positives = 170/312 (54%), Gaps = 23/312 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQ 100
F F+ +++K Y ++ R I+ N+ I L + +G S +YG N+F+D S EF+
Sbjct: 91 FMEFMHKYDKVYVDSAQFVKRFRIYVNNMANIDALNERNYGRSIIYGENQFADWSEDEFR 150
Query: 101 AK------YLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
Y F + + D+ M+P +P FDWR Y+ VT VK Q CGS WAF
Sbjct: 151 QILLPRGFYKNFHKRAIFIDQPDEIMMPRKEIIPEHFDWRPYNVVTPVKAQLNCGSCWAF 210
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
+TTG +E YA T +L SLSEQ+L+DC+ E++ C+GG I A + + GL E
Sbjct: 211 ATTGTVESAYAIGTGELKSLSEQQLLDCNVENNACDGGDIDKALRYVYEE---GLMTEYD 267
Query: 214 YPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVT 271
YPY + C L + T++K V + +DE + +L+ NGP+ V +N A ++ Y
Sbjct: 268 YPYVAHRQETCYLRGETTRIK--AAVFLHQDEASIIDWLIHNGPVNVGVNVTADMKAYKG 325
Query: 272 GVSHPIQFFCDGGNENL-SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG-EKGYFR 329
GV P ++ C+ N+ + +H++ IVGYG + YWI+KNSWG+ +G E GY
Sbjct: 326 GVYTPNKWECE--NKIIGTHAMNIVGYGT----WNKTNEKYWIVKNSWGQSYGVENGYVY 379
Query: 330 LYRGDGSCGIND 341
RG SCGI D
Sbjct: 380 FARGINSCGIED 391
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,683,889,489
Number of Sequences: 23463169
Number of extensions: 244309735
Number of successful extensions: 527360
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6328
Number of HSP's successfully gapped in prelim test: 1064
Number of HSP's that attempted gapping in prelim test: 499062
Number of HSP's gapped (non-prelim): 8631
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)