BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy2558
(348 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
GN=CG12163 PE=2 SV=2
Length = 614
Score = 287 bits (735), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 200/321 (62%), Gaps = 9/321 (2%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
H V H LF F + + Y + E RL IF NL+ I+ L E GS YG+ E
Sbjct: 299 HRFDKVDH--LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITE 356
Query: 91 FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCG 148
F+D++++E++ + ++ + A A++P LP+ FDWR+ DAVT VK+Q CG
Sbjct: 357 FADMTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCG 416
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
S WAFS TGNIEG+YA KT +L SEQEL+DCD D C GG + NA+ I K GGL
Sbjct: 417 SCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGL 474
Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQ 267
E E YPY+ C N+ + V++ G+V + + +ET M ++L+ NGP+++ INA A+Q
Sbjct: 475 EYEAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQ 534
Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
FY GVSHP + C +NL H VL+VGYGV HK +PYWI+KNSWG WGE+GY
Sbjct: 535 FYRGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 592
Query: 328 FRLYRGDGSCGINDYVRSALV 348
+R+YRGD +CG+++ SA++
Sbjct: 593 YRVYRGDNTCGVSEMATSAVL 613
>sp|Q26534|CATL_SCHMA Cathepsin L OS=Schistosoma mansoni GN=CL1 PE=2 SV=1
Length = 319
Score = 266 bits (680), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 138/289 (47%), Positives = 183/289 (63%), Gaps = 11/289 (3%)
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
R +IF N+ K QL Q GS +YG+ +SDL+T EF +L + PS + ++
Sbjct: 39 RFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 98
Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
+ +P+ FDWRE AVT VK+Q MCGS WAFSTTGN+E + KT KL+SLSEQ+L+
Sbjct: 99 GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 158
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD DDGC GG SNA+++I+ GGL E YPY ++ C L V IN V+
Sbjct: 159 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 216
Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+++DET++A +L N ++V +NA LQFY G+SHP FC L H+VL+VGYGV
Sbjct: 217 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFCS--KYLLDHAVLLVGYGV 274
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
+ K P+WI+KNSWG WGE GYFR+YRGDGSCGIN SA++
Sbjct: 275 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318
>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
Length = 363
Score = 253 bits (646), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 202/336 (60%), Gaps = 25/336 (7%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VV +E+ H L+ H F F + +K+YAT E+ R +F NL K +L Q+ +
Sbjct: 32 QVVDNEEDHLLNAEHH---FTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRD-P 87
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ +G+ +FSDL+ +EF+ ++LG K + P++A ++ ++P LP FDWRE AVT
Sbjct: 88 TAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--PILPTTNLPEDFDWREKGAVT 145
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
VKDQ CGS WAFSTTG +EG + T KLVSLSEQ+L+DCD D GC G
Sbjct: 146 PVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNG 205
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
G ++NAF+ ++ GG+ +EK Y Y G D +C+ +K ++ + V+ DE +A
Sbjct: 206 GLMNNAFEYLLE--SGGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVVTLDEDQIAAN 263
Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
LV+NGP+AVAINA +Q Y++GVS P + C L H VL+VG+G K
Sbjct: 264 LVKNGPLAVAINAAWMQTYMSGVSCP--YVC--AKSRLDHGVLLVGFGKGAYAPIRLKEK 319
Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
PYWIIKNSWG+ WGE+GY+++ RG CG++ V +
Sbjct: 320 PYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 355
>sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1
Length = 462
Score = 251 bits (640), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)
Query: 17 VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
+ SSF+ + D + L VK LF F+ +N+TY + E RL +F+ N+ + Q
Sbjct: 139 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 198
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
+Q + G+ YG+ +FSDL+ EF YL L+ + PA N P +DWR+
Sbjct: 199 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 258
Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
AVT VK+Q MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SN
Sbjct: 259 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 318
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
A+ I K GGLE E Y Y+G + C + + +V IN V +SR+E +A +L + G
Sbjct: 319 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 376
Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
P++VAINA+ +QFY G++HP + C + H+VL+VGYG +PYW IK
Sbjct: 377 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 428
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGE+GY+ LYRG G+CG+N SA+V
Sbjct: 429 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461
>sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
GN=At2g21430 PE=2 SV=2
Length = 361
Score = 242 bits (618), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 145/359 (40%), Positives = 209/359 (58%), Gaps = 37/359 (10%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYS 61
V+L+ + VSVS V GDE + V T F F ++ K Y ++ E+Y
Sbjct: 11 VSLIFVFVSVS---VCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYY 67
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
R +F NL + Q + S +G+ +FSDL+ +EF+ K+LG FKL P A+++
Sbjct: 68 RFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHLGVKGGFKL-PKDANQA- 124
Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
++P LP FDWR+ AVT VK+Q CGS W+FSTTG +EG + T KLVSLSEQ+
Sbjct: 125 -PILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 183
Query: 178 LIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNK 227
L+DCD E D GC GG +++AF+ + GGL EK YPY G D +C+L++
Sbjct: 184 LVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLDR 241
Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
++ + VS +E +A L++NGP+AVAINA +Q Y+ GVS P + C +
Sbjct: 242 SKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YIC---SRR 296
Query: 288 LSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
L+H VL+VGYG ++ K PYWIIKNSWGE WGE G++++ +G CG++ V +
Sbjct: 297 LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 355
>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
Length = 484
Score = 242 bits (618), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)
Query: 20 SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
S ++ ++ L VK ++F F+ +N+TY + E RL +F N+ + Q +Q
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224
Query: 80 EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
+ G+ YG+ +FSDL+ EF+ YL L+ ++ A P +DWR AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D C GG SNA+
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
I K GGLE E Y Y+G ++C + + +V IN V +S++E +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 402
Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
AINA+ +QFY G+S P++ C + H+VL+VGYG VP+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 454
Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483
>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
SV=1
Length = 368
Score = 240 bits (613), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 199/340 (58%), Gaps = 33/340 (9%)
Query: 23 MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
VVG + L H F+ F + K YA+ E+ R +F NLR+ + Q +
Sbjct: 35 QVVGGAEPQVLTSEDH---FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP- 90
Query: 83 SGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
S +G+ +FSDL+ +EF+ K+LG FKL P A+++ ++P LP FDWR++ AV
Sbjct: 91 SATHGVTQFSDLTRSEFRKKHLGVRSGFKL-PKDANKA--PILPTENLPEDFDWRDHGAV 147
Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
T VK+Q CGS W+FS TG +EG T KLVSLSEQ+L+DCD E D GC
Sbjct: 148 TPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCN 207
Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMA 248
GG +++AF+ + GGL +E+ YPY G D K C+L+K ++ + +S DE +A
Sbjct: 208 GGLMNSAFEYTLKT--GGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIA 265
Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFT 305
LV+NGP+AVAINA +Q Y+ GVS P + C L+H VL+VGYG +F
Sbjct: 266 ANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TRRLNHGVLLVGYGAAGYAPARFK 320
Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
K PYWIIKNSWGE WGE G++++ +G CG++ V +
Sbjct: 321 EK--PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVST 358
>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1 SV=2
Length = 343
Score = 230 bits (586), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 137/355 (38%), Positives = 192/355 (54%), Gaps = 36/355 (10%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
L TV VSS + +E+ L F ++ NK Y+ EY R IF NL
Sbjct: 8 VLAVFTVFVSSRGIPLEEQSQFLE----------FQDKFNKKYSH-EEYLERFEIFKSNL 56
Query: 71 RKIQ---LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN---I 124
KI+ L+ +G+N+F+DLS+ EF+ YL K D V + +
Sbjct: 57 GKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFIN 116
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
++P AFDWR AVT VK+Q CGS W+FSTTGN+EG + KLVSLSEQ L+DCD E
Sbjct: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176
Query: 185 ----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVK 233
D+GC GG NA++ I+ GG++ E +YPY + C N K
Sbjct: 177 CMEYEGEQACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETGTQCNFNSANIGAK 234
Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
I+ + + ++ET MA Y+V GP+A+A +A QFY+ GV F +L H +L
Sbjct: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGV-----FDIPCNPNSLDHGIL 289
Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
IVGY T F K +PYWI+KNSWG WGE+GY L RG +CG++++V ++++
Sbjct: 290 IVGYSAKNTIF-RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343
>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
Length = 371
Score = 223 bits (567), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 176/321 (54%), Gaps = 27/321 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F+++ K+Y E+ RL +F NLR+ + Q + S +G+ +FSDL+ AEF+
Sbjct: 48 FLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRR 106
Query: 102 KYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
YLG + R + ++P LP FDWR++ AV VK+Q CGS W+FS
Sbjct: 107 TYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSA 166
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
+G +EG + T KL LSEQ+ +DCD E D GC GG ++ AF + G
Sbjct: 167 SGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQK--AG 224
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
GLE EK YPY G D C+ +K + + VS DE ++ L+++GP+A+ INA +
Sbjct: 225 GLESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYM 284
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEK 325
Q Y+ GVS P + C +L H VL+VGYG K PYWIIKNSWGE WGE
Sbjct: 285 QTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGEN 339
Query: 326 GYFRLYRGD---GSCGINDYV 343
GY+++ RG CG++ V
Sbjct: 340 GYYKICRGSNVRNKCGVDSMV 360
>sp|Q8V5U0|CATV_NPVHZ Viral cathepsin OS=Heliothis zea nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 367
Score = 221 bits (563), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 120/324 (37%), Positives = 182/324 (56%), Gaps = 31/324 (9%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ-----------LLQDTEHGSGVYGLNE 90
F +FL+Q+NK+Y EY R ++F NL KI D+ S +G+N+
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116
Query: 91 FSDLSTAEFQAKYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
FSD + E GF L S + + P+I LP +DWR+ + VT +KDQ +
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPDIRLPDYYDWRDTNKVTPIKDQGV 176
Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
CGS WAF GNIE YA + KL+ LSEQ+L+DCD+ D GC GG + AF ++ L G
Sbjct: 177 CGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELL--LMG 234
Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAYA 265
G+E E YPY+G ++ C L+ + VK+N RDE + + + GP+A+A++A
Sbjct: 235 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMD 294
Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
+ Y G+ + + +L+H+VL++G+G++ VPYWIIKNSWGE WGE
Sbjct: 295 IINYRRGILNQCHIY------DLNHAVLLIGWGIENN------VPYWIIKNSWGEDWGEN 342
Query: 326 GYFRLYRGDGSCG-INDYVRSALV 348
G+ R+ R +CG +N++ S+++
Sbjct: 343 GFLRVRRNVNACGLLNEFGASSVI 366
>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
PE=3 SV=1
Length = 337
Score = 212 bits (539), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 126/348 (36%), Positives = 188/348 (54%), Gaps = 29/348 (8%)
Query: 13 LSLTVSVSSFMVVGDEKLHHLHHVKHTA--LFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
++L + + +V + HL H A F F+ +NK Y R IF NL
Sbjct: 1 MTLLMIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNL 60
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMI-------- 121
I ++ + S +Y +N+FSDLS E KY G KPS RS
Sbjct: 61 EDINE-KNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKKPSNMVRSTSNFCNVIHLDAP 119
Query: 122 PNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
P++ LP+ FDWR + +T VKDQ CGS WA + G +E +YA K L++LSEQ+LI
Sbjct: 120 PDVHDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLI 179
Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
DCD + C+GG + AF+ +M+ GGL EE YPY+G C+++ K + ++
Sbjct: 180 DCDSANMACDGGLMHTAFEQLMN--AGGLMEEIDYPYQGTKGVCKIDNKKFALSVSSCKR 237
Query: 240 -VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ ++E ++ K L+ GP+A+AI+A ++ Y G+ H FC+ N L+H+VL+VGYG
Sbjct: 238 YIFQNEENLKKELITMGPIAMAIDAASISTYSKGIIH----FCE--NLGLNHAVLLVGYG 291
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
T V YW +KNSWG WGE GYFR+ R +CG+N+ + ++
Sbjct: 292 ------TEGGVSYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAAS 333
>sp|Q91CL9|CATV_NPVAP Viral cathepsin OS=Antheraea pernyi nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 324
Score = 210 bits (534), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/315 (37%), Positives = 182/315 (57%), Gaps = 20/315 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K + F FL + NK Y++ E R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEI-INKNQNDTSAQYEINKFSDLS 80
Query: 96 TAEFQAKYLGFKL---KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L K ++ + V P+ P FDWR + VT VK+Q MCG+ WA
Sbjct: 81 KDETISKYTGLSLPLQKQNFCEVVVLDRPPDKG-PLEFDWRRLNKVTSVKNQGMCGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ+LIDCD D GC+GG + A++ +M+ GG++ E
Sbjct: 140 FATLGSLESQFAIKHDQLINLSEQQLIDCDFVDVGCDGGLLHTAYEAVMNM--GGIQAEN 197
Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY ++ CR+N V++ Y V+ E + L GP+ VAI+A + Y
Sbjct: 198 DYPYEANNGPCRVNAAKFVVRVKKCYRYVTLFEEKLKDLLRIVGPIPVAIDASDIVGYKR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ +C+ N L+H+VL+VGYGV+ +P+WI+KN+WG WGE+GYFR+
Sbjct: 258 GIIR----YCE--NHGLNHAVLLVGYGVE------NGIPFWILKNTWGADWGEQGYFRVQ 305
Query: 332 RGDGSCGINDYVRSA 346
+ +CGI + + S+
Sbjct: 306 QNINACGIKNELPSS 320
>sp|Q91BH1|CATV_NPVST Viral cathepsin OS=Spodoptera litura multicapsid
nucleopolyhedrovirus GN=VCATH PE=3 SV=1
Length = 337
Score = 209 bits (531), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 120/316 (37%), Positives = 173/316 (54%), Gaps = 27/316 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
F++QHNK Y T + + F NL + + + + VYG+N+FSD+ F ++
Sbjct: 36 FIKQHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSN-QAVYGINKFSDIDKITFVNEHA 94
Query: 105 GF----------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
G P V P+ P +FDWR+ + VT VK+Q +CGS WAF+
Sbjct: 95 GLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQGVCGSCWAFA 154
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
GNIE YA L+ LSEQ+L+DCD+ D GC+GG + AF I+ GG+E E Y
Sbjct: 155 AIGNIESQYAIMHDSLIDLSEQQLLDCDRVDQGCDGGLMHLAFQEIIRI--GGVEHEIDY 212
Query: 215 PYRGDDKACRLNKKATQVKIN-GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
PY+G + ACRL V+++ Y RDE + + L +NGP+AVAI+ + Y +G+
Sbjct: 213 PYQGIEYACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCVDIIDYRSGI 272
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+ D G L+H+VL+VGYG++ PYWI KNSWG WGE GYFR R
Sbjct: 273 ATVCN---DNG---LNHAVLLVGYGIEND------TPYWIFKNSWGSNWGENGYFRARRN 320
Query: 334 DGSCG-INDYVRSALV 348
+CG +N++ SA++
Sbjct: 321 INACGMLNEFAASAVL 336
>sp|P56203|CATW_MOUSE Cathepsin W OS=Mus musculus GN=Ctsw PE=2 SV=2
Length = 371
Score = 207 bits (528), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 122/333 (36%), Positives = 177/333 (53%), Gaps = 38/333 (11%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
+F F + N++Y EY RL IF+ NL + Q LQ + G+ +G FSDL+ EF
Sbjct: 39 VFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFG 98
Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWRE-YDAVTGVKDQTMCG 148
Y P PN+T +PR DWR+ + ++ VK+Q C
Sbjct: 99 Q---------LYGQERSPERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCK 149
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
WA + NI+ ++ K ++ V +S QEL+DC++ +GC GG + +A+ T+++ GL
Sbjct: 150 CCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLN--NSGL 207
Query: 209 EEEKTYPYRGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
EK YP++GD K R L KK +V I + +S +E +A YL +GP+ V IN L
Sbjct: 208 ASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLL 267
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR------TKFTHK-----AVPYWIIK 315
Q Y GV CD + HSVL+VG+G ++ T +H + PYWI+K
Sbjct: 268 QHYQKGVIKATPSSCDP--RQVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILK 325
Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
NSWG WGEKGYFRLYRG+ +CG+ Y +A V
Sbjct: 326 NSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358
>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
Length = 467
Score = 207 bits (526), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 18/315 (5%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
T+ F F ++H + Y + E RL +F NL + L + +G+ FSDL+ E
Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREE 93
Query: 99 FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
F+++Y G + +R+ VP + + P A DWR AVT VKDQ CGS WAFS
Sbjct: 94 FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GN+E + L +LSEQ L+ CD+ D GC GG ++NAF+ I+ + G + E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
G C + I G+V + +DE +A +L NGP+AVA++A + Y GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
+E L H VL+VGY AVPYWIIKNSW WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321
Query: 334 DGSCGINDYVRSALV 348
C + + SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336
>sp|P41721|CATV_NPVBM Viral cathepsin OS=Bombyx mori nuclear polyhedrosis virus GN=VCATH
PE=1 SV=1
Length = 323
Score = 207 bits (526), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 118/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y++ VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P+ ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y + E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ D G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei brucei PE=1 SV=1
Length = 450
Score = 207 bits (526), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 125/346 (36%), Positives = 179/346 (51%), Gaps = 27/346 (7%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
V LL++ ++S L LH + + F F +++ K Y E R F
Sbjct: 14 VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67
Query: 69 NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
N+ + ++ Q + +G+ FSD++ EF+A+Y + A + + + N+T
Sbjct: 68 NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125
Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P A DWRE AVT VK Q CGS WAFST GNIEG + LVSLSEQ L+ CD D
Sbjct: 126 APAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
GC GG + NAF+ I++ GG + E +YPY G+ C++N I +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
DE +A YL ENGP+A+A++A + Y G+ ++ L H VL+VGY +
Sbjct: 246 DEDAIAAYLAENGPLAIAVDAESFMDYNGGI------LTSCTSKQLDHGVLLVGYNDNSN 299
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
PYWIIKNSW WGE GY R+ +G C +N V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 206 bits (525), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 179/308 (58%), Gaps = 27/308 (8%)
Query: 48 QHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYL 104
QH K YA VE R+ IF+ N KI + Q G Y GLN+++D+ EF+
Sbjct: 34 QHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMN 93
Query: 105 GFK--LKPSYADRS--VPAM-IP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
G+ L+ +R+ V A IP ++T+P++ DWRE+ AVTGVKDQ CGS WAFS+TG
Sbjct: 94 GYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTG 153
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
+EG + K LVSLSEQ L+DC + ++GC GG + NAF I K GG++ EK+YP
Sbjct: 154 ALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYP 211
Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTG 272
Y G D +C NK G+V + DE M K + GP++VAI+A + Q Y G
Sbjct: 212 YEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEG 271
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
V + + CD +NL H VL+VGYG D + + YW++KNSWG WGE+GY ++ R
Sbjct: 272 VYNEPE--CD--EQNLDHGVLVVGYGTDES-----GMDYWLVKNSWGTTWGEQGYIKMAR 322
Query: 333 G-DGSCGI 339
+ CGI
Sbjct: 323 NQNNQCGI 330
>sp|Q91GE3|CATV_NPVEP Viral cathepsin OS=Epiphyas postvittana nucleopolyhedrovirus
GN=VCATH PE=3 SV=1
Length = 323
Score = 206 bits (524), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 118/316 (37%), Positives = 175/316 (55%), Gaps = 25/316 (7%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ Q+NK Y + E R IF NL I + + + VY +N+FSDLS
Sbjct: 22 LKAPNYFEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDI--ITKNRNDTAVYKINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P + ++ P P FDWR ++ +T VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PLHTQNFCEVVVLDRPPGKGPLEFDWRRFNKITSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A +L++LSEQ++IDCD D GCEGG + AF+ I+S GG++ E
Sbjct: 139 FATLASLESQFAIAHDRLINLSEQQMIDCDSVDVGCEGGLLHTAFEAIISM--GGVQIEN 196
Query: 213 TYPYRGDDKACRLNKKATQVKI---NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + CR++ V + N Y+++ E + L GP+ VAI+A + Y
Sbjct: 197 DYPYESSNNYCRMDPTKFVVGVKQCNRYITIY--EEKLKDVLRLAGPIPVAIDASDILNY 254
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
G+ +C N L+H+VL+VGYGV+ VPYWI+KNSWG WGE+G+F+
Sbjct: 255 EQGIIK----YC--ANNGLNHAVLLVGYGVENN------VPYWILKNSWGTDWGEQGFFK 302
Query: 330 LYRGDGSCGINDYVRS 345
+ + +CGI + + S
Sbjct: 303 IQQNVNACGIKNELAS 318
>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
Length = 444
Score = 206 bits (523), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 177/320 (55%), Gaps = 23/320 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213
Query: 214 YPY---RGDDKACRLNKKATQV--KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
YPY G C + + V +I+G+V + E MA +L +NGP+A+A++A +
Sbjct: 214 YPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMS 273
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY
Sbjct: 274 YKSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYV 321
Query: 329 RLYRGDGSCGINDYVRSALV 348
R+ G +C +++Y SA V
Sbjct: 322 RVVMGVNACLLSEYPVSAHV 341
>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
Length = 443
Score = 205 bits (522), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
ALF F + + Y TL E RL F NL ++ Q + +G+ +F DLS AE
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93
Query: 99 FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
F A+YL F +A + +++ +P A DWRE AVT VKDQ CGS WAF
Sbjct: 94 FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S GNIEG + +LVSLSEQ+L+ CD +DGC+GG + AFD ++ G L E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213
Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
YPY + + ++ +I+G+V + E MA +L +NGP+A+A++A + Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
+GV C G + L+H VL+VGY D T VPYW+IKNSWG WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321
Query: 330 LYRGDGSCGINDYVRSALV 348
+ G +C +++Y SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340
>sp|P41715|CATV_NPVCF Viral cathepsin OS=Choristoneura fumiferana nuclear polyhedrosis
virus GN=Vcath PE=3 SV=1
Length = 324
Score = 205 bits (521), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 175/315 (55%), Gaps = 20/315 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F FL + NK+Y++ E R IF NL +I + ++ + Y +N+F+DLS
Sbjct: 22 LKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI-INKNHNDSTAQYEINKFADLS 80
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L P ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 81 KDETISKYTGLSL-PLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K + ++LSEQ+LIDCD D GC+GG + AF+ +M+ GG++ E
Sbjct: 140 FATLGSLESQFAIKHNQFINLSEQQLIDCDFVDAGCDGGLLHTAFEAVMNM--GGIQAES 197
Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY ++ CR N VK+ Y ++ E + L GP+ VAI+A + Y
Sbjct: 198 DYPYEANNGDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYKR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ +C N L+H+VL+VGY V+ VP+WI+KN+WG WGE+GYFR+
Sbjct: 258 GIMK----YC--ANHGLNHAVLLVGYAVE------NGVPFWILKNTWGADWGEQGYFRVQ 305
Query: 332 RGDGSCGINDYVRSA 346
+ +CGI + + S+
Sbjct: 306 QNINACGIQNELPSS 320
>sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana defective polyhedrosis
virus GN=Vcath PE=3 SV=1
Length = 324
Score = 205 bits (521), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 175/316 (55%), Gaps = 22/316 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI--QLLQDTEHGSGVYGLNEFSD 93
+K + F FL NK Y++ E R IF NL +I + L DT S Y +N+FSD
Sbjct: 22 LKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDT---SAQYEINKFSD 78
Query: 94 LSTAEFQAKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
LS E +KY G L + ++ P P FDWR + VT VK+Q CG+ W
Sbjct: 79 LSKDETISKYTGLSLPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGACW 138
Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
AF+T G++E +A K +L++LSEQ+LIDCD D GC+GG + A++ +M+ GG++ E
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVMNM--GGIQAE 196
Query: 212 KTYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
YPY ++ CRLN VK+ Y V E + L GP+ VAI+A + Y
Sbjct: 197 NDYPYEANNGDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPLPVAIDASDIVNYK 256
Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
GV +C N L+H+VL+VGY V+ VP+WI+KN+WG WGE+GYFR+
Sbjct: 257 RGVIR----YC--ANHGLNHAVLLVGYAVEN------GVPFWILKNTWGTDWGEQGYFRV 304
Query: 331 YRGDGSCGINDYVRSA 346
+ +CGI + + S+
Sbjct: 305 QQNINACGIQNELPSS 320
>sp|O91466|CATV_GVCPM Viral cathepsin OS=Cydia pomonella granulosis virus (isolate
Mexico/1963) GN=VCATH PE=3 SV=1
Length = 333
Score = 204 bits (518), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 189/348 (54%), Gaps = 29/348 (8%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
LL+ + S V + L++ LF F ++NKTY + E +L F NL+
Sbjct: 4 LLNFVILASVLTVTAHALTYDLNNSDE--LFKNFAIKYNKTYVSDEERAIKLENFKNNLK 61
Query: 72 KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL----KPSYADRSVPAMI-----P 122
I ++ V+ +NE+SDL+ + GF+L PS + +++ P
Sbjct: 62 MINE-KNMASKYAVFDINEYSDLNKNALLRRTTGFRLGLKKNPSAFTMTECSVVVIKDEP 120
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
LP DWR+ VT VK+Q CGS WAFST NIE +Y K K ++LSEQ L++CD
Sbjct: 121 QALLPETLDWRDKHGVTPVKNQMECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNCD 180
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS-VS 241
++GC GG + A ++I+ + GG+ + PY G D C+ K ++ I+G V
Sbjct: 181 NINNGCAGGLMHWALESILQE--GGVVSAENEPYYGFDGVCK--KSPFELSISGSRRYVL 236
Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
++E + + LV NGP++VAI+ L Y G++ C+ NE L+H+VL+VGYGV
Sbjct: 237 QNENKLRELLVVNGPISVAIDVSDLINYKAGIAD----ICE-NNEGLNHAVLLVGYGVKN 291
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
VPYWI+KNSWG WGE+GYFR+ R SCG +N+Y SA++
Sbjct: 292 D------VPYWILKNSWGAEWGEEGYFRVQRDKNSCGMMNEYASSAIL 333
>sp|Q8QLK1|CATV_NPVMC Viral cathepsin OS=Mamestra configurata nucleopolyhedrovirus
GN=VCATH PE=3 SV=1
Length = 337
Score = 203 bits (516), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 183/343 (53%), Gaps = 21/343 (6%)
Query: 12 LLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
LL V S VV +L+++ L F F+ Q+NK Y++ E R +IF N+
Sbjct: 9 LLVSAVLTSHDQVVAVTIKPNLYNINSAPLYFEKFISQYNKQYSSEDEKKYRYNIFRHNI 68
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLP 127
I +++ + S VY +N F+D++ E ++ G + ++ + V P
Sbjct: 69 ESINA-KNSRNDSAVYKINRFADMTKNEVVNRHTGLASGDIGANFCETIVVDGPGQRQRP 127
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
FDWR Y+ VT VKDQ MCG+ WAF+ G +E YA K +L+ L+EQ+L+DCD D G
Sbjct: 128 ANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMG 187
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETD 246
C+GG I A++ IM GG+E+E YPY+ C + V + N Y V E
Sbjct: 188 CDGGLIHTAYEQIMHI--GGVEQEYDYPYKAVRLPCAVKPHKFAVGVRNCYRYVLLSEER 245
Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
+ L GP+A+A++A L Y GV FC+ N L+H+VL+VGYG++
Sbjct: 246 LEDLLRHVGPIAIAVDAVDLTDYYGGVIS----FCE--NNGLNHAVLLVGYGIENN---- 295
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
VPYW IKNSWG +GE GY R+ RG SCG IN+ SA +
Sbjct: 296 --VPYWTIKNSWGSDYGENGYVRIRRGVNSCGMINELASSAQI 336
>sp|Q9J8B9|CATV_NPVSE Viral cathepsin OS=Spodoptera exigua nuclear polyhedrosis virus
(strain US) GN=VCATH PE=3 SV=1
Length = 337
Score = 203 bits (516), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 182/323 (56%), Gaps = 23/323 (7%)
Query: 33 LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
L+++ L F F+ Q+NK Y + E R +IF N+ I +++ + S VY +N F
Sbjct: 30 LYNINSAPLYFEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRF 88
Query: 92 SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
+D+ E ++ G +L ++ + V P +FDWR + +T VKDQ MCG
Sbjct: 89 ADMPKNEIVIRHTGLASGELGLNFCETIVVDGPAQRQRPVSFDWRSMNKITSVKDQGMCG 148
Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
+ W F++ G +E YA K +L+ LSEQ+L+DCD D GC+GG I A++ IM GG+
Sbjct: 149 ACWRFASLGALESQYAIKYDRLIDLSEQQLVDCDFVDMGCDGGLIHTAYEQIMKM--GGV 206
Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
E+E Y Y+ + + C L +K AT V+ N Y V +E + L GP+A+A++A L
Sbjct: 207 EQEFDYSYKAERQPCALKPHKFATGVR-NCYRYVILNEERLEDLLRYVGPIAIAVDAVDL 265
Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
Y G+ FC+ N L+H+VL+VGYGV+ VPYWIIKNSWG +GE G
Sbjct: 266 TDYYGGIVS----FCE--NNGLNHAVLLVGYGVENN------VPYWIIKNSWGSDYGEDG 313
Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
Y R+ RG SCG IN+ SA V
Sbjct: 314 YVRVRRGVNSCGMINELASSAQV 336
>sp|P56202|CATW_HUMAN Cathepsin W OS=Homo sapiens GN=CTSW PE=1 SV=2
Length = 376
Score = 203 bits (516), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/328 (36%), Positives = 176/328 (53%), Gaps = 27/328 (8%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F F Q N++Y + E+ RL IF+ NL + Q LQ+ + G+ +G+ FSDL+ EF
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
Y G++ PS R + + P ++P + DWR+ A++ +KDQ C WA +
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAA 159
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
GNIE ++ V +S QEL+DC + DGC GG + +AF T+++ GL EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217
Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
+G +A R + K Q I ++ + +E +A+YL GP+ V IN LQ Y GV
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVI 277
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
CD + + HSVL+VG+G +++ PYWI+KNSWG
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335
Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
WGEKGYFRL+RG +CGI + +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363
>sp|P25783|CATV_NPVAC Viral cathepsin OS=Autographa californica nuclear polyhedrosis
virus GN=VCATH PE=1 SV=1
Length = 323
Score = 202 bits (515), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 174/317 (54%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y + VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y ++ E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE G+FR+
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
polyhedrosis virus GN=VCATH PE=3 SV=1
Length = 356
Score = 202 bits (514), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 114/314 (36%), Positives = 173/314 (55%), Gaps = 21/314 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD--TEHGSGVYGLNEFSDLSTAEF 99
F F+E +NK Y + E R IF NL +I T+ + Y +N+FSDLS +E
Sbjct: 56 FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115
Query: 100 QAKYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
AK+ G + ++ +++ P P FDWRE + VT +K+Q CG+ WAF+T
Sbjct: 116 IAKFTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLA 175
Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
++E +A + +L+ LSEQ+LIDCD D GC GG + AF+ IM GG++ E YP+
Sbjct: 176 SVESQFAMRHNRLIDLSEQQLIDCDSVDMGCNGGLLHTAFEEIMRM--GGVQTELDYPFV 233
Query: 218 GDDKACRLNKKATQVK--INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
G ++ C L++ V + Y V +E + L GP+ +AI+A + Y GV
Sbjct: 234 GRNRRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVIS 293
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
+ N L+H+VL+VGYGV+ VPYW+ KN+WG+ WGE GYFR+ +
Sbjct: 294 SCE------NNGLNHAVLLVGYGVE------NGVPYWVFKNTWGDDWGENGYFRVRQNVN 341
Query: 336 SCG-INDYVRSALV 348
+CG +ND +A++
Sbjct: 342 ACGMVNDLASTAVL 355
>sp|Q8B9D5|CATV_NPVR1 Viral cathepsin OS=Rachiplusia ou multiple nucleopolyhedrovirus
(strain R1) GN=VCATH PE=3 SV=1
Length = 323
Score = 202 bits (514), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 175/317 (55%), Gaps = 21/317 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F F+ + NK Y + VE R IF NL +I + ++ S Y +N+FSDLS
Sbjct: 22 LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIII--KNQNDSAKYEINKFSDLS 79
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E AKY G L P ++ P P FDWR + VT VK+Q MCG+ WA
Sbjct: 80 KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ++IDCD D GC GG + AF+ I+ GG++ E
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196
Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY D+ CR+N V++ + Y ++ E + L GP+ +AI+A + Y
Sbjct: 197 DYPYEADNNNCRMNTNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ I++ + G L+H+VL+VGYGV+ +PYW KN+WG WGE+G+FR+
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEEGFFRVQ 304
Query: 332 RGDGSCGINDYVRSALV 348
+ +CG+ + + S V
Sbjct: 305 QNINACGMRNELASTAV 321
>sp|Q9WGE0|CATV_NPVHC Viral cathepsin OS=Hyphantria cunea nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 324
Score = 201 bits (511), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 178/318 (55%), Gaps = 21/318 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K + F FL + NK Y++ E R IF NL +I ++++ + Y +N+FSDLS
Sbjct: 22 LKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEI-IIKNQNDTTAQYEINKFSDLS 80
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L P ++ P P FDWR + VT VK+Q +CG+ WA
Sbjct: 81 KDETISKYTGLAL-PLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T ++E +A K +L++LSEQ+LIDCD D GC GG + A++ +M GG++ E
Sbjct: 140 FATLASLESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTAYEAVMQM--GGVQAEN 197
Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY G D CR++ VK+ Y ++ E + L GP+ VAI+A + Y
Sbjct: 198 DYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAIDASDIVNYRR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ +C N +H+VL+VGYGV+ VPYWI+KN+WGE WGE+GYFR+
Sbjct: 258 GIMR----YC--SNYGFNHAVLLVGYGVENN------VPYWILKNTWGEDWGEQGYFRVQ 305
Query: 332 RGDGSCGI-NDYVRSALV 348
+ +CGI N+ + SA +
Sbjct: 306 QNINACGIRNELLASAEI 323
>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
virus GN=VCATH PE=3 SV=1
Length = 324
Score = 200 bits (509), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/318 (36%), Positives = 172/318 (54%), Gaps = 21/318 (6%)
Query: 36 VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
+K F FL + NK Y++ E R IF NL +I + ++ + Y +N+FSDLS
Sbjct: 22 LKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEI-INKNQNDSTAQYEINKFSDLS 80
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E +KY G L P +I P P FDWR+++ VT VK+Q +CG+ WA
Sbjct: 81 KEEAISKYTGLSL-PHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGACWA 139
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
F+T G++E +A K +L++LSEQ+ IDCD+ + GC+GG + AF++ M GG++ E
Sbjct: 140 FATLGSLESQFAIKYNRLINLSEQQFIDCDRVNAGCDGGLLHTAFESAMEM--GGVQMES 197
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
YPY + CR+N V + + E + L GP+ VAI+A + Y
Sbjct: 198 DYPYETANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRR 257
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ N L+H+VL+VGY V+ +PYWI+KN+WG WGE GYFR+
Sbjct: 258 GIMRQC------ANHGLNHAVLLVGYAVENN------IPYWILKNTWGTDWGEDGYFRVQ 305
Query: 332 RGDGSCGI-NDYVRSALV 348
+ +CGI N+ V SA +
Sbjct: 306 QNINACGIRNELVSSAEI 323
>sp|Q9PYY5|CATV_GVXN Viral cathepsin OS=Xestia c-nigrum granulosis virus GN=VCATH PE=3
SV=1
Length = 346
Score = 197 bits (502), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 121/337 (35%), Positives = 175/337 (51%), Gaps = 30/337 (8%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
VALL+L V S++ + + + + LFN F+ ++NK Y E +R IF N
Sbjct: 19 VALLTLNVCAVSYIA------YDMSNAQE--LFNEFVVKYNKVYKDDQEKEARFEIFKQN 70
Query: 70 LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT---- 125
L I E S ++ +N +D+S+ E K G KL ++ P +
Sbjct: 71 LADINARNALED-SAMFEINSRADISSNELLQKLTGLKLSLMRGEKKNSFCTPTVISGDS 129
Query: 126 ---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
+P +FDWR+ ++VT VK Q CGS WAFS NIE +Y K + LSEQ+L+DCD
Sbjct: 130 SGKVPDSFDWRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCD 189
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR 242
+ ++GC GG +S AF+ I+ GG+ E YPY G D C+ + Q+ Y R
Sbjct: 190 KVNNGCNGGLMSWAFEGIIR--AGGISYEAPYPYTGVDGVCKNTTRYVQLS-GCYAYDLR 246
Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
E + + L E GP++VAI+ L Y +GV+ + L+H VL+VGYG +
Sbjct: 247 SEKKLRQVLHEKGPVSVAIDVVDLTNYKSGVAKHCSV-----DHGLNHGVLLVGYGQEND 301
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
V YW +KNSWG WGE+G+FR+ R SCGI
Sbjct: 302 ------VKYWTLKNSWGSDWGEQGFFRIKRDVNSCGI 332
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 197 bits (501), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 129/309 (41%), Positives = 171/309 (55%), Gaps = 36/309 (11%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ +L E R ++F N++ I + S LN+F D+++ EF+ Y G +
Sbjct: 44 HHTVARSLEEKAKRFNVFKHNVKHIHETNKKDK-SYKLKLNKFGDMTSEEFRRTYAGSNI 102
Query: 109 K-------PSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
K A +S M N+ TLP + DWR+ AVT VK+Q CGS WAFST +E
Sbjct: 103 KHHRMFQGEKKATKSF--MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVE 160
Query: 161 GVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
G+ +TKKL SLSEQEL+DCD ++ GC GG + AF+ I K GGL E YPY+
Sbjct: 161 GINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEK--GGLTSELVYPYKAS 218
Query: 220 DKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
D+ C NK+ A V I+G+ V ++ D V N P++VAI+A QFY GV
Sbjct: 219 DETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGV--- 275
Query: 277 IQFFCDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
F G E L+H V +VGYG +D TK YWI+KNSWGE WGEKGY R+ RG
Sbjct: 276 --FTGRCGTE-LNHGVAVVGYGTTIDGTK-------YWIVKNSWGEEWGEKGYIRMQRGI 325
Query: 334 ---DGSCGI 339
+G CGI
Sbjct: 326 RHKEGLCGI 334
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 196 bits (498), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 129/331 (38%), Positives = 177/331 (53%), Gaps = 28/331 (8%)
Query: 22 FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
F +VG H + K LF ++ +H+K Y ++ E R +F NL I ++ E
Sbjct: 31 FSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQ-RNNEI 89
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM---IPNIT-LPRAFDWREYDA 137
S GLNEF+DL+ EF+ +YLG KP ++ + P+ +IT LP++ DWR+ A
Sbjct: 90 NSYWLGLNEFADLTHEEFKGRYLGLA-KPQFSRKRQPSANFRYRDITDLPKSVDWRKKGA 148
Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNA 196
V VKDQ CGS WAFST +EG+ T L SLSEQELIDCD + GC GG + A
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208
Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENG 255
F I+S GGL +E YPY ++ C+ K+ +V I+GY V ++ + + +
Sbjct: 209 FQYIIST--GGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQ 266
Query: 256 PMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
P++VAI A QFY GV F +L H V VGYG + K Y I
Sbjct: 267 PVSVAIEASGRDFQFYKGGV------FNGKCGTDLDHGVAAVGYG------SSKGSDYVI 314
Query: 314 IKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
+KNSWG WGEKG+ R+ R +G CGIN
Sbjct: 315 VKNSWGPRWGEKGFIRMKRNTGKPEGLCGIN 345
>sp|Q9YWK4|CATV_NPVBS Viral cathepsin OS=Buzura suppressaria nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 331
Score = 194 bits (494), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 169/311 (54%), Gaps = 19/311 (6%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F FL +NK Y E R IF L +I ++ + S VY +N+F+DLS E +
Sbjct: 31 FETFLANYNKMYNDTSEKERRFSIFQQTLEEINY-KNRLNDSAVYQINKFADLSKNEIIS 89
Query: 102 KYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
KY G + + +I P P FDWR+ + VT +K+Q CG+ WAF+T +I
Sbjct: 90 KYTGLNMPVQTTNFCKTIVIDQPPGKGPLNFDWRQQNKVTSIKNQKACGACWAFATLASI 149
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
E YA K + LSEQ++IDCD D GC+GG + AF+ ++ G L +E YPY G
Sbjct: 150 ESQYAIKNNVHIDLSEQQMIDCDYVDMGCDGGLLHTAFEQMIQM--GELVQEHEYPYAGV 207
Query: 220 DKACRLNKKAT-QVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
+K C L T VK+ G Y V E + L GP+ +AI+A + Y G+ H
Sbjct: 208 NKPCELRGDETGVVKVKGCYRYVVFREEKLKDLLRAVGPIPMAIDASGIVNYHHGIIH-- 265
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
+C+ N L+H+VL+VGYGV+ VP+W KN+WG+ WGE+GYFR+ + +C
Sbjct: 266 --YCE--NYGLNHAVLLVGYGVENN------VPFWTFKNTWGKDWGEEGYFRVRQNVDAC 315
Query: 338 GINDYVRSALV 348
G+ + + S+ V
Sbjct: 316 GMTNELASSAV 326
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 194 bits (493), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)
Query: 45 FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
F +H K Y E RL IF+ N KI + Q G + L N+++DL EF+
Sbjct: 62 FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 121
Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
GF + AD S + ++TLP++ DWR AVT VKDQ CGS WAF
Sbjct: 122 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 181
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
S+TG +EG + K+ LVSLSEQ L+DC + ++GC GG + NAF I K GG++ E
Sbjct: 182 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 239
Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
K+YPY D +C NK G+ + + DE MA+ + GP++VAI+A + QF
Sbjct: 240 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 299
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y GV + Q CD +NL H VL+VG+G D + YW++KNSWG WG+KG+
Sbjct: 300 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 350
Query: 329 RLYRG-DGSCGI 339
++ R + CGI
Sbjct: 351 KMLRNKENQCGI 362
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 194 bits (492), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 130/351 (37%), Positives = 181/351 (51%), Gaps = 37/351 (10%)
Query: 7 FAGVALLSLT-VSVSSFMVVGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRL 63
F +AL++L+ +S++ + ++ L +L+N + H+ L E R
Sbjct: 6 FIALALVALSFLSIAQSIPFTEKDL-----ASEDSLWNLYEKWRTHHTVARDLDEKNRRF 60
Query: 64 HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA---- 119
++F N++ I + LN+F D++ EF++KY G K++ + R +
Sbjct: 61 NVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGS 120
Query: 120 -MIPNI-TLPRA-FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
M N+ +LP A DWR AVTGVKDQ CGS WAFST ++EG+ KT +LVSLSEQ
Sbjct: 121 FMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQ 180
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN-KKATQVKI 234
EL+DCD ++GC GG + AF+ I G+ E +YPY D C N + V I
Sbjct: 181 ELVDCDTSYNEGCNGGLMDYAFEFIQKN---GITTEDSYPYAEQDGTCASNLLNSPVVSI 237
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+G+ V + + V N P++V+I A Y QFY GV F G E L H V
Sbjct: 238 DGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGV-----FTGRCGTE-LDHGV 291
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
IVGYG T YWI+KNSWGE WGE GY R+ RG G CGI
Sbjct: 292 AIVGYGA-----TRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGI 337
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 192 bits (487), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 167/315 (53%), Gaps = 33/315 (10%)
Query: 41 LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
L+ + +H K+Y + E R F NLR I + +GV+ GLN F+DL+
Sbjct: 39 LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97
Query: 97 AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
E++ YLG + KP +DR + A N LP + DWR AV +KDQ CGS WA
Sbjct: 98 EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
FS +EG+ T L+SLSEQEL+DCD ++GC GG + AFD I++ GG++ E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213
Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
YPY+G D+ C +N+K A V I+ Y V+ + + V N P++VAI A A Q
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 273
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y +G+ F L H V VGYG + K YWI++NSWG+ WGE GY
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321
Query: 329 RLYRG----DGSCGI 339
R+ R G CGI
Sbjct: 322 RMERNIKASSGKCGI 336
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 190 bits (483), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 159/306 (51%), Gaps = 23/306 (7%)
Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
F ++ ++ + Y E R IF N++ I+ S G+N+F+D++ +EF A
Sbjct: 37 FEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVA 96
Query: 102 KYLGFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
+Y G L P +R ++ + P++ DWR+Y AV VK+Q CGS W+F+
Sbjct: 97 QYTGVSL-PLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIAT 155
Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
+EG+Y KT LVSLSEQE++DC GC+GG ++ A+D I+S G+ E+ YPY
Sbjct: 156 VEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISN--NGVTTEENYPYLA 212
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
C N I GY V R++ Y V N P+A I+A Q+Y GV
Sbjct: 213 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGV---- 268
Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
F +L+H++ I+GYG D + YWI++NSWG WGE GY R+ RG
Sbjct: 269 --FSGPCGTSLNHAITIIGYGQDSS-----GTKYWIVRNSWGSSWGEGGYVRMARGVSSS 321
Query: 334 DGSCGI 339
G CGI
Sbjct: 322 SGVCGI 327
>sp|P35591|CYSP1_LEIPI Cysteine proteinase 1 OS=Leishmania pifanoi GN=CYS1 PE=2 SV=2
Length = 354
Score = 190 bits (483), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 168/319 (52%), Gaps = 25/319 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
+A + F ++H K + E R + F N++ L +T++ Y ++ +F+DL+
Sbjct: 39 SAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97
Query: 98 EFQAKYL-----GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF YL LK D V P+ + + DWR+ AVT VK+Q +CGS WA
Sbjct: 98 EFAKLYLNPDYYARHLKDHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS GNIEG +AA LVSLSEQ L+ CD D+GC GG + A + IM G + E
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215
Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
+YPY G C ++ KI G++S+ DE +A+++ + GP+AVA++A Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLY 274
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV C +L+H VLIVG+ + PYWI+KNSWG WGEKGY R
Sbjct: 275 FGGVVS----LCLA--WSLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322
Query: 330 LYRGDGSCGINDYVRSALV 348
L G C + +Y SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341
>sp|P25775|LMCPA_LEIME Cysteine proteinase A OS=Leishmania mexicana GN=LMCPA PE=2 SV=1
Length = 354
Score = 190 bits (483), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 168/319 (52%), Gaps = 25/319 (7%)
Query: 39 TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
+A + F ++H K + E R + F N++ L +T++ Y ++ +F+DL+
Sbjct: 39 SAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97
Query: 98 EFQAKYL-----GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
EF YL LK D V P+ + + DWR+ AVT VK+Q +CGS WA
Sbjct: 98 EFAKLYLNPDYYARHLKNHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155
Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
FS GNIEG +AA LVSLSEQ L+ CD D+GC GG + A + IM G + E
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215
Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
+YPY G C ++ KI G++S+ DE +A+++ + GP+AVA++A Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLY 274
Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV C +L+H VLIVG+ + PYWI+KNSWG WGEKGY R
Sbjct: 275 FGGVVS----LCLA--WSLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322
Query: 330 LYRGDGSCGINDYVRSALV 348
L G C + +Y SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 189 bits (480), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 130/326 (39%), Positives = 173/326 (53%), Gaps = 43/326 (13%)
Query: 35 HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY-GLNEFSD 93
H K LF ++ K Y T+ E + R +F NL+ I + + G + GLNEF+D
Sbjct: 44 HDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHID--ETNKKGKSYWLGLNEFAD 101
Query: 94 LSTAEFQAKYLGFKL-------KPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKD 143
LS EF+ YLG K + SYA+ R V A +P++ DWR+ AV VK+
Sbjct: 102 LSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA------VPKSVDWRKKGAVAEVKN 155
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMS 202
Q CGS WAFST +EG+ T L +LSEQELIDCD ++GC GG + AF+ I+
Sbjct: 156 QGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK 215
Query: 203 KLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVA 260
GGL +E+ YPY ++ C + K ++ V ING+ V + DE + K L P++VA
Sbjct: 216 N--GGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQ-PLSVA 272
Query: 261 INAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
I+A QFY GV F +L H V VGYG + K Y I+KNSW
Sbjct: 273 IDASGREFQFYSGGV------FDGRCGVDLDHGVAAVGYG------SSKGSDYIIVKNSW 320
Query: 319 GEGWGEKGYFRLYRG----DGSCGIN 340
G WGEKGY RL R +G CGIN
Sbjct: 321 GPKWGEKGYIRLKRNTGKPEGLCGIN 346
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 189 bits (479), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 166/305 (54%), Gaps = 28/305 (9%)
Query: 49 HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
H+ +L E + R ++F NL + + + LN+F+D++ EF++ Y G K+
Sbjct: 46 HHTVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLK-LNKFADMTNHEFRSTYAGSKV 104
Query: 109 KPSYADRSVP----AMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
R P A + +++P + DWR+ AVT VKDQ CGS WAFST +EG+
Sbjct: 105 NHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGI 164
Query: 163 YAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
KT KLV+LSEQEL+DCD+E++ GC GG + +AF+ I K GG+ E YPY+ +
Sbjct: 165 NQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEG 222
Query: 222 ACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQ 278
C +K V I+G+ +V ++ D V N P++VAI+A QFY GV
Sbjct: 223 TCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV----- 277
Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
F + +L+H V IVGYG T YWI++NSWG WGE GY R+ R +
Sbjct: 278 -FTGDCSTDLNHGVAIVGYGT-----TVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKE 331
Query: 335 GSCGI 339
G CGI
Sbjct: 332 GLCGI 336
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 188 bits (477), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 125/351 (35%), Positives = 186/351 (52%), Gaps = 40/351 (11%)
Query: 10 VALLSLTVSVSSFMVVGDEKLHHLHHVKHT---------ALFNYFLEQHNKTYA--TLVE 58
+A+++++ +V ++ DEK H V T +++ +L +H K + +LVE
Sbjct: 13 LAMVAVSSAVDMSIISYDEK----HGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVE 68
Query: 59 YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
R IF NLR + + ++ S GL F+DL+ E+++KYLG K++ R+
Sbjct: 69 KDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSL 127
Query: 119 AMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
+ LP + DWR+ AV VKDQ CGS WAFST G +EG+ T L++LSEQ
Sbjct: 128 RYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 187
Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
EL+DCD ++GC GG + AF+ I+ GG++ +K YPY+G D C ++ K A V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
+ Y V + K V + P+++AI A A Q Y +G+ F L H V
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI------FDGSCGTQLDHGV 299
Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
+ VGYG + K YWI++NSWG+ WGE GY R+ R G CGI
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 188 bits (477), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 178/339 (52%), Gaps = 33/339 (9%)
Query: 8 AGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFS 67
AG LLS T + + V EK H F +++QH KTY++ VEY RL +F+
Sbjct: 10 AGAWLLS-TGATAELTVNAIEKFH----------FKSWMKQHQKTYSS-VEYNHRLQMFA 57
Query: 68 GNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS--VPAMIPNIT 125
N RKIQ H + LN+FSD+S AE + K+L + + A +S + P
Sbjct: 58 NNWRKIQAHNQRNHTFKM-ALNQFSDMSFAEIKHKFLWSEPQNCSATKSNYLRGTGP--- 113
Query: 126 LPRAFDWREY-DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ- 183
P + DWR+ + V+ VK+Q CGS W FSTTG +E A + K++SL+EQ+L+DC Q
Sbjct: 114 YPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQA 173
Query: 184 -EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS- 241
+ GC+GG S AF+ I+ G+ EE +YPY G D +CR N + + V+++
Sbjct: 174 FNNHGCKGGLPSQAFEYIL--YNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITL 231
Query: 242 RDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
DE M + + P++ A Y +GV C + ++H+VL VGYG
Sbjct: 232 NDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS--CHKTPDKVNHAVLAVGYG-- 287
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
+ YWI+KNSWG WGE GYF + RG CG+
Sbjct: 288 ----EQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322
>sp|P43234|CATO_HUMAN Cathepsin O OS=Homo sapiens GN=CTSO PE=2 SV=1
Length = 321
Score = 187 bits (475), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)
Query: 71 RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
R + L +E+ + YG+N+FS L EF+A YL + KPS R V IPN++LP
Sbjct: 52 RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 109
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
FDWR+ VT V++Q MCG WAFS G +E YA K K L LS Q++IDC + G
Sbjct: 110 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 169
Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
C GGS NA + ++K+ L ++ YP++ + C + + I GY + S E
Sbjct: 170 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQE 228
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+MAK L+ GP+ V ++A + Q Y+ G+ IQ C G N H+VLI G+ D+T
Sbjct: 229 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 281
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
T PYWI++NSWG WG GY + G CGI D V S V
Sbjct: 282 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 187 bits (474), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/275 (43%), Positives = 152/275 (55%), Gaps = 43/275 (15%)
Query: 88 LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWREYD 136
LN F D+ AEF+A ++G L+ R PA P++ LP + DWR+
Sbjct: 91 LNRFGDMDQAEFRATFVG-DLR-----RDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKG 144
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED-DGCEGGSISN 195
AVTGVKDQ CGS WAFST ++EG+ A +T LVSLSEQELIDCD D DGC+GG + N
Sbjct: 145 AVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDN 204
Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ----VKINGYVSV-SRDETDMAKY 250
AF+ I K GGL E YPYR C + + A V I+G+ V + E D+A+
Sbjct: 205 AFEYI--KNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLAR- 261
Query: 251 LVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKA 308
V N P++VA+ A A FY GV F D G E L H V +VGYGV
Sbjct: 262 AVANQPVSVAVEASGKAFMFYSEGV-----FTGDCGTE-LDHGVAVVGYGV-----AEDG 310
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
YW +KNSWG WGE+GY R+ + G+ CGI
Sbjct: 311 KAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGI 345
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 186 bits (473), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/296 (39%), Positives = 161/296 (54%), Gaps = 33/296 (11%)
Query: 62 RLHIFSGNLRKIQLL-QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-------A 113
R +IF NLR I L +D ++ + GL +F+DL+ E++ YLG + +P+
Sbjct: 73 RFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNV 132
Query: 114 DRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
++ A + +P DWR+ AV +KDQ CGS WAFSTT +EG+ T +L+SL
Sbjct: 133 NQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISL 192
Query: 174 SEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQ 231
SEQEL+DCD+ + GC GG + AF IM GGL EK YPYRG C K +
Sbjct: 193 SEQELVDCDKSYNQGCNGGLMDYAFQFIMKN--GGLNTEKDYPYRGFGGKCNSFLKNSRV 250
Query: 232 VKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENL 288
V I+GY V ++DET + K + P++VAI A Q Y +G+ F NL
Sbjct: 251 VSIDGYEDVPTKDETALKK-AISYQPVSVAIEAGGRIFQHYQSGI------FTGSCGTNL 303
Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
H+V+ VGYG + V YWI++NSWG WGE+GY R+ R G CGI
Sbjct: 304 DHAVVAVGYG------SENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGI 353
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 186 bits (472), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/336 (34%), Positives = 180/336 (53%), Gaps = 33/336 (9%)
Query: 15 LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
L + ++SF + +E L ++ + + + K Y + V+ SR I+ NL+ I
Sbjct: 8 LLLPMASFALYPEEIL--------DTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHIS 59
Query: 75 LLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPR 128
+ + E GV+ +N D+++ E K G K+ PS++ + IP+ P
Sbjct: 60 I-HNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPD 118
Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
+ D+R+ VT VK+Q CGS WAFS+ G +EG KT KL++LS Q L+DC E+DGC
Sbjct: 119 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGC 178
Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDM 247
GG ++NAF + G++ E YPY G D++C N K GY + +E +
Sbjct: 179 GGGYMTNAFQYVQKNR--GIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 236
Query: 248 AKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGN-ENLSHSVLIVGYGVDRTKF 304
+ + GP++VAI+A + QFY GV ++ + N +NL+H+VL VGYG+
Sbjct: 237 KRAVARVGPISVAIDASLTSFQFYSKGV-----YYDENCNSDNLNHAVLAVGYGI----- 286
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
K +WIIKNSWGE WG KGY + R + +CGI
Sbjct: 287 -QKGNKHWIIKNSWGENWGNKGYILMARNKNNACGI 321
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.136 0.414
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 133,978,118
Number of Sequences: 539616
Number of extensions: 5768565
Number of successful extensions: 13044
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 217
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 11996
Number of HSP's gapped (non-prelim): 264
length of query: 348
length of database: 191,569,459
effective HSP length: 118
effective length of query: 230
effective length of database: 127,894,771
effective search space: 29415797330
effective search space used: 29415797330
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)
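
The footer figures above can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST output. It assumes the usual Karlin-Altschul relations: effective lengths are obtained by subtracting the effective HSP length, and bit score = (Lambda * raw - ln K) / ln 2. The function names are hypothetical, and because Lambda and K are printed rounded, the results only approximately match the report.

```python
import math

# Values copied from the report footer above.
QUERY_LEN = 348
DB_LETTERS = 191_569_459
DB_SEQS = 539_616
EFF_HSP_LEN = 118
LAMBDA_UNGAPPED, K_UNGAPPED = 0.319, 0.136   # first Lambda/K/H row
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410      # second Lambda/K/H row


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The HSP length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits (assumed Karlin-Altschul normalization)."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X-dropoff value to bits (no K term for dropoffs)."""
    return lam * raw / math.log(2)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print("effective length of query:", eff_query)
print("effective length of database:", eff_db)
print("effective search space:", space)
print("S1 bits:", round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
print("S2 bits:", round(bit_score(62, LAMBDA_GAPPED, K_GAPPED), 1))
print("X1 bits:", round(dropoff_bits(16, LAMBDA_UNGAPPED), 1))
print("X2 bits:", round(dropoff_bits(38, LAMBDA_GAPPED), 1))
```

The three effective-length figures are exact integer arithmetic and reproduce the footer values. The bit conversions are consistent with X1 and S1 using the ungapped parameters and X2, X3 and S2 using the gapped ones, to within the rounding of the printed Lambda and K.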