BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036910
(314 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 311/361 (86%), Positives = 314/361 (86%), Gaps = 47/361 (13%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL
Sbjct: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN
Sbjct: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNKFADWSWEEFQR 120
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVKDQGHCGSCWTFSTTGSLEAA
Sbjct: 121 HRLGAAQNCSATTKGNHKLTADVLPETKDWRESGIVSPVKDQGHCGSCWTFSTTGSLEAA 180
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV
Sbjct: 181 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 240
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG
Sbjct: 241 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 300
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFK++MGKNMCGIATCASYPVV
Sbjct: 301 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKIKMGKNMCGIATCASYPVV 360
Query: 314 A 314
A
Sbjct: 361 A 361
>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
Length = 358
Score = 503 bits (1294), Expect = e-140, Method: Compositional matrix adjust.
Identities = 251/361 (69%), Positives = 276/361 (76%), Gaps = 50/361 (13%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
MAR S +I+L+ C A AS SAS+FDD NPIR V SD LR+FETS+L V+G +RHAL
Sbjct: 1 MARTS--FSLLIILIACVAGAS-SASTFDDENPIRTVVSDALREFETSILSVLGDSRHAL 57
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SFARFA RYGK YE+ EE KLRFA FS+NL LIRS N KGLSY LG+N
Sbjct: 58 SFARFAHRYGKRYETAEETKLRFAIFSENLKLIRSHNKKGLSYTLGVNHFADWTWEEFRR 117
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVKDQGHCGSCWTFSTTG+LEAA
Sbjct: 118 HRLGAAQNCSATTKGNHKLTEEALPEMKDWRVSGIVSPVKDQGHCGSCWTFSTTGALEAA 177
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
Y QAFGKGISLSEQQLVDCA AFNN GC+GGLPSQAFEY+KYNGGLDTEEAYPYTGK+G
Sbjct: 178 YKQAFGKGISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYPYTGKNGE 237
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKFSSENVGVQVLDSVNITLGAEDEL+HAV VRPVSVAF+VV+GFR YK GVY+S CG
Sbjct: 238 CKFSSENVGVQVLDSVNITLGAEDELKHAVAFVRPVSVAFQVVNGFRLYKEGVYTSDTCG 297
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
TPMDVNHAV+AVGYGVE+GVPYWLIKNSWG +WGD GYFKMEMGKNMCG+ATCASYPV+
Sbjct: 298 RTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDSGYFKMEMGKNMCGVATCASYPVI 357
Query: 314 A 314
A
Sbjct: 358 A 358
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 500 bits (1288), Expect = e-139, Method: Compositional matrix adjust.
Identities = 245/355 (69%), Positives = 277/355 (78%), Gaps = 49/355 (13%)
Query: 7 LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA 66
++ SV+L++ AA+A+A FD+SNPIR+VS DGLR+ E SV+Q++GQ+RH LSFARF
Sbjct: 6 ILPSVVLVILIAASAAADIG-FDESNPIRMVS-DGLREIEESVVQILGQSRHVLSFARFT 63
Query: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------ 108
RYGK Y++ EE+KLRF+ F +NLDLIRSTN K LSY+LG+N
Sbjct: 64 HRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKKRLSYKLGVNQFADLTWQEFQRNKLGAA 123
Query: 109 -----------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
+SPVKDQG CGSCWTFSTTG+LEAAYHQAFG
Sbjct: 124 QNCSATLKGSHKLTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFG 183
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
KGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK NGGLDTEEAYPYTGKDG CK+S+E
Sbjct: 184 KGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKDGTCKYSAE 243
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDV 259
NVGVQVLDSVNITLGAEDEL+HAVGLVRPVS+AFEVV FR YKSGVY+ + CGNTPMDV
Sbjct: 244 NVGVQVLDSVNITLGAEDELKHAVGLVRPVSIAFEVVKSFRLYKSGVYTDSHCGNTPMDV 303
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NHAV+AVGYG+EDGVPYWLIKNSWG +WGD GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 304 NHAVLAVGYGIEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCGIATCASYPVVA 358
>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
Length = 358
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 246/355 (69%), Positives = 279/355 (78%), Gaps = 49/355 (13%)
Query: 7 LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA 66
++SSV+L++ AA+A+A+ FD+SNPIR+VS DGLR+ E SV Q++GQ+RH LSFARF
Sbjct: 6 ILSSVVLVVLFAASAAANIG-FDESNPIRMVS-DGLREVEESVSQILGQSRHVLSFARFT 63
Query: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------ 108
RYGK Y++VEEMKLRF+ F +NLDLIRSTN KGLSY+LG+N
Sbjct: 64 HRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAA 123
Query: 109 -----------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
+SPVKDQG CGSCWTFSTTG+LEAAYHQAFG
Sbjct: 124 QNCSATLKGSHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFG 183
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
KGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK NGGLDTE+AYPYTGKD CKFS+E
Sbjct: 184 KGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAE 243
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDV 259
NVGVQVL+SVNITLGAEDEL+HAVGLVRPVS+AFEV+ FR YKSGVY+ + CG+TPMDV
Sbjct: 244 NVGVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDV 303
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NHAV+AVGYGVEDGVPYWLIKNSWG +WGD GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 304 NHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCGIATCASYPVVA 358
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 246/361 (68%), Positives = 279/361 (77%), Gaps = 50/361 (13%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
M+ L S+V+L+L AA++A + FD+SNPIR+VS D LR+ E SV+Q++GQ+RH +
Sbjct: 2 MSVRTILPSAVLLILI--AASTAESIGFDESNPIRMVS-DRLREVEESVVQILGQSRHVI 58
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SFARFA RYGK YE+ EEMKLRF+ F +NLDLIRSTN KGLSY+LG+N
Sbjct: 59 SFARFAHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADMTWQEFQR 118
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVKDQG CGSCWTFSTTG+LEAA
Sbjct: 119 TKLGAAQNCSATLKGTHKLTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAA 178
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
YHQAFGKGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK NGGLDTEEAYPYTG+DG
Sbjct: 179 YHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGEDGT 238
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CK+S+ENVGV+VLDSVNITLGAEDEL+HAVGLVRPVS+AFEV+ FR YKSGVYS + CG
Sbjct: 239 CKYSAENVGVEVLDSVNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYSDSHCG 298
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
TPMDVNHAV+AVGYG+EDGVPYWLIKNSWG +WGD GYFKMEMGKNMCGIATCASYPVV
Sbjct: 299 QTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCGIATCASYPVV 358
Query: 314 A 314
A
Sbjct: 359 A 359
>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 250/362 (69%), Positives = 281/362 (77%), Gaps = 53/362 (14%)
Query: 1 MARPVQLV-SSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHA 59
MAR LV SS++ LLCC AA S SFD+SNPI+LVS D L DFE+S ++V+GQ+R A
Sbjct: 1 MARVAGLVVSSILFLLCCVAAGS----SFDESNPIKLVS-DRLHDFESSFVKVLGQSRRA 55
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
LSFARFA R+GK YE+ EMKLRFA FS++LDLIRSTN KGL Y LGLN
Sbjct: 56 LSFARFAHRHGKRYETEGEMKLRFAIFSESLDLIRSTNKKGLPYTLGLNQFADWTWQEFQ 115
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
+SPVK+QGHCGSCWTFSTTG+LEA
Sbjct: 116 KYRLGAAQNCSATTRGNHKLTNALLPETKDWREEGIVSPVKNQGHCGSCWTFSTTGALEA 175
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
AYHQAFGKGISLSEQQLVDCA+AFNN GCNGGLPSQAFEYIK+NGGLDTEEAYPYTGKD
Sbjct: 176 AYHQAFGKGISLSEQQLVDCARAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKDD 235
Query: 193 VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKC 252
CKFSSENVGV+V++SVNITLGAEDEL+HAV VRPVSVAFEVV FR YK GVY+++ C
Sbjct: 236 ACKFSSENVGVRVVESVNITLGAEDELKHAVAFVRPVSVAFEVVGSFRLYKEGVYTTSTC 295
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPV 312
G+TPMDVNHAV+AVGYGVE+G+PYWLIKNSWGE+WGD+GYFKMEMGKNMCGIATCASYPV
Sbjct: 296 GSTPMDVNHAVLAVGYGVENGIPYWLIKNSWGEDWGDNGYFKMEMGKNMCGIATCASYPV 355
Query: 313 VA 314
VA
Sbjct: 356 VA 357
>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
Length = 358
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 243/340 (71%), Positives = 269/340 (79%), Gaps = 48/340 (14%)
Query: 22 SASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKL 81
+ S S+FD+SNPIRLVS D LRDFE SV +V+G +R ALSF+RF R+GK Y+S +EMK+
Sbjct: 20 AVSGSNFDESNPIRLVS-DRLRDFEASVTKVVGHSRRALSFSRFVYRHGKRYQSEDEMKM 78
Query: 82 RFATFSKNLDLIRSTNCKGLSYRLGLN--------------------------------- 108
RFA FS+NLD IRSTN KGLSY L +N
Sbjct: 79 RFAIFSENLDFIRSTNRKGLSYTLAVNDFADLTWQEFQKHRLGAAQNCSATTKGNHKLTG 138
Query: 109 --------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQ 154
+SPVK+QGHCGSCWTFSTTG+LEAAYHQAFGKGISLSEQQLVDCA
Sbjct: 139 VALPDTKDWREVGIVSPVKNQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAG 198
Query: 155 AFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLG 214
AFNN GC+GGLPSQAFEYIKYNGGL+TEEAYPYTG+DG CKFSSENVG+QVLDSVNITLG
Sbjct: 199 AFNNFGCHGGLPSQAFEYIKYNGGLETEEAYPYTGEDGACKFSSENVGIQVLDSVNITLG 258
Query: 215 AEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV 274
AEDEL+ AVGLVRPVSVAFEVV GFRFYKSGVY+S CG+TPMDVNHAV+AVGYGVEDGV
Sbjct: 259 AEDELKEAVGLVRPVSVAFEVVSGFRFYKSGVYTSDTCGSTPMDVNHAVLAVGYGVEDGV 318
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
PYWL+KNSWGENWGDHGYFKMEMGKNMCG+ATCASYPVVA
Sbjct: 319 PYWLVKNSWGENWGDHGYFKMEMGKNMCGVATCASYPVVA 358
>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 358
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 242/361 (67%), Positives = 277/361 (76%), Gaps = 50/361 (13%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
M+ + L SS++L+L AAA++ FD+SNPI++VS D L + E +V+Q++GQ+RH L
Sbjct: 1 MSVKLNLSSSILLILF--AAAASKEIGFDESNPIKMVS-DNLHELEDTVVQILGQSRHVL 57
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SF+RF RYGK Y+SVEEMKLRF+ F +NLDLIRSTN KGLSY+L LN
Sbjct: 58 SFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQR 117
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVK+QGHCGSCWTFSTTG+LEAA
Sbjct: 118 YKLGAAQNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
YHQAFGKGISLSEQQLVDCA FNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYTGKDG
Sbjct: 178 YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKFS++N+GVQV DSVNITLGAEDEL+HAVGLVRPVSVAFEVV FRFYK GV++S CG
Sbjct: 238 CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCG 297
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NTPMDVNHAV+AVGYGVED VPYWLIKNSWG WGD+GYFKMEMGKNMCG+ATC+SYPVV
Sbjct: 298 NTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGVATCSSYPVV 357
Query: 314 A 314
A
Sbjct: 358 A 358
>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
Full=Senescence-associated gene product 2; Flags:
Precursor
gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 358
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 238/334 (71%), Positives = 263/334 (78%), Gaps = 48/334 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FD+SNPIR+VS DGLR+ E SV Q++GQ+RH LSFARF RYGK Y++VEEMKLRF+ F
Sbjct: 26 FDESNPIRMVS-DGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
+NLDLIRSTN KGLSY+LG+N
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKGSHKVTEAALPET 144
Query: 109 --------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+SPVKDQG CGSCWTFSTTG+LEAAYHQAFGKGISLSEQQLVDCA AFNN G
Sbjct: 145 KDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYG 204
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGLPSQAFEYIK NGGLDTE+AYPYTGKD CKFS+ENVGVQVL+SVNITLGAEDEL+
Sbjct: 205 CNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNSVNITLGAEDELK 264
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
HAVGLVRPVS+AFEV+ FR YKSGVY+ + CG+TPMDVNHAV+AVGYGVEDGVPYWLIK
Sbjct: 265 HAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIK 324
Query: 281 NSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NSWG +WGD GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 325 NSWGADWGDKGYFKMEMGKNMCGIATCASYPVVA 358
>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 245/364 (67%), Positives = 276/364 (75%), Gaps = 52/364 (14%)
Query: 1 MARPVQLVSSVILLLCCAAAASAS---ASSFDDSNPIRLVSSDGLRDFETSVLQVIGQAR 57
MAR + +V++V++LLC A+ A SSFD+ NPIRLVS D +RD E+SVL++IG R
Sbjct: 1 MAR-LSVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVS-DSIRDLESSVLRLIGDTR 58
Query: 58 HALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------- 108
HA SFA FA RYGK Y++V+E+KLRF FS+NL LIRSTN KGL Y L +N
Sbjct: 59 HAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLAVNQFADWTWEE 118
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
+SP+KDQGHCGSCWTFSTTG+L
Sbjct: 119 FRRHRLGAAQNCSATLKGNHKLTDVILPETKDWREDGIVSPIKDQGHCGSCWTFSTTGAL 178
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
EAAY QAFGKGISLSEQQLVDCA AFNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYTG
Sbjct: 179 EAAYAQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGL 238
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSST 250
DG CKFSSEN+GVQVLDSVNITLGAEDEL+HAV VRPVSVAFEVV FRFYK GVY+S
Sbjct: 239 DGTCKFSSENIGVQVLDSVNITLGAEDELKHAVAFVRPVSVAFEVVHDFRFYKKGVYTSG 298
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
CG+TPMDVNHAV+AVGYGVEDGV YWLIKNSWGENWGD+GYFKME+GKNMCG+ATC+SY
Sbjct: 299 TCGSTPMDVNHAVLAVGYGVEDGVAYWLIKNSWGENWGDNGYFKMELGKNMCGVATCSSY 358
Query: 311 PVVA 314
PVVA
Sbjct: 359 PVVA 362
>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
Length = 359
Score = 486 bits (1252), Expect = e-135, Method: Compositional matrix adjust.
Identities = 239/357 (66%), Positives = 269/357 (75%), Gaps = 48/357 (13%)
Query: 5 VQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFAR 64
V+ + + LL A ++A + F +SNPIR+V D L + E SV+Q++GQ RH LSFAR
Sbjct: 4 VRTILPSVALLILIAVSTAESIGFYESNPIRMVF-DRLLEVEESVVQILGQTRHVLSFAR 62
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---------------- 108
F RYGK YE+ EEMKLRF+ F +NLDLIRSTN KGLSY+LG+N
Sbjct: 63 FTHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFTDMTWQEFQRTKLG 122
Query: 109 -------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQA 137
+SPVKDQG CGSCWTFSTTG+LEAAYHQA
Sbjct: 123 AAQNCSATLKGTHKLTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQA 182
Query: 138 FGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFS 197
FGKGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK NGGLDTEEAYPYTG+DG CK+S
Sbjct: 183 FGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGEDGTCKYS 242
Query: 198 SENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPM 257
+ENVGVQVLDSVNITLGAEDEL+HAVGL+RPVS+AFEV+ FR YKSGVYS + CG TPM
Sbjct: 243 AENVGVQVLDSVNITLGAEDELKHAVGLLRPVSIAFEVIHSFRLYKSGVYSDSHCGQTPM 302
Query: 258 DVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
DVNHAV+AVGYG+EDGVPYWLIKNSWG +WGD GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 303 DVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCGIATCASYPVVA 359
>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 357
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 241/361 (66%), Positives = 276/361 (76%), Gaps = 51/361 (14%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
M+ + L SS++L+L AAA++ FD+SNPI++VS D L + E +V+Q++GQ+RH L
Sbjct: 1 MSVKLNLSSSILLILF--AAAASKEIGFDESNPIKMVS-DNLHELEDTVVQILGQSRHVL 57
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SF+RF RYGK Y+SVEEMKLRF+ F +NLDLIRSTN KGLSY+L LN
Sbjct: 58 SFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQR 117
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVK+QGHCGSCWTFSTTG+LEAA
Sbjct: 118 YKLGAAQNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
YHQAFGKGISLSEQQLVDCA FNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYTGKDG
Sbjct: 178 YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKFS++N+GVQV DSVNITLGAEDEL+HAVGLVRPVSVAFEVV FRFYK GV++S CG
Sbjct: 238 CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCG 297
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NTPMDVNHAV+AVGYGVED VPYWLIKNSWG WGD+GYFKMEMGKNMC +ATC+SYPVV
Sbjct: 298 NTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC-VATCSSYPVV 356
Query: 314 A 314
A
Sbjct: 357 A 357
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 245/361 (67%), Positives = 271/361 (75%), Gaps = 49/361 (13%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
MAR V S +++L+ C A ASA SSF D NPI+ V SDGLR+ E SVLQVIGQ RH+L
Sbjct: 1 MAR-VSPASFLLILIACVAGASA-GSSFADQNPIKQVVSDGLRELEASVLQVIGQTRHSL 58
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
+FARFA RYGK YE+ EEMK RF+ F +L +IRS N KGLSY LG+N
Sbjct: 59 AFARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGLSYTLGVNEFADLTWEEFRK 118
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
++PVK+QGHCGSCWTFSTTG+LEAA
Sbjct: 119 HRLGAAQNCSATLKGNHKLTNGLLPLKKDWREVGIVTPVKNQGHCGSCWTFSTTGALEAA 178
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
Y QAFGK I LSEQQLVDCA+A+NN GCNGGLPSQAFEYIK NGGLDTEEAYPYTG DGV
Sbjct: 179 YVQAFGKAIFLSEQQLVDCARAYNNFGCNGGLPSQAFEYIKANGGLDTEEAYPYTGVDGV 238
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKFSSEN+GVQVLDSVNITLGAEDEL+ AV VRPVSVAFEVV GFR YKSGVY+S CG
Sbjct: 239 CKFSSENIGVQVLDSVNITLGAEDELKDAVAFVRPVSVAFEVVSGFRLYKSGVYTSDTCG 298
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NTPMDVNHAVVAVGYGVE+ VPYWLIKNSWG +WGD+GYFKMEMGKNMCG+ATCASYPVV
Sbjct: 299 NTPMDVNHAVVAVGYGVENDVPYWLIKNSWGADWGDNGYFKMEMGKNMCGVATCASYPVV 358
Query: 314 A 314
A
Sbjct: 359 A 359
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 483 bits (1242), Expect = e-134, Method: Compositional matrix adjust.
Identities = 240/356 (67%), Positives = 272/356 (76%), Gaps = 51/356 (14%)
Query: 10 SVILLLCCA----AAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARF 65
S++L L A A+A A ++F D NPIR V SDGL + E ++LQV+G+ RHALSFARF
Sbjct: 5 SLLLALVVAGGLFASALAGPATFADENPIRQVVSDGLHELENAILQVVGKTRHALSFARF 64
Query: 66 ARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------------- 108
A RYGK YESVEE+K RF F NL +IRS N KGLSY+LG+N
Sbjct: 65 AHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGA 124
Query: 109 ------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
+SPVK+QG CGSCWTFSTTG+LEAAY QAF
Sbjct: 125 AQNCSATTKGNLKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAF 184
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSS 198
GKGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK NGGLDTEEAYPYTGK+G+CKFSS
Sbjct: 185 GKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNGLCKFSS 244
Query: 199 ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMD 258
ENVGV+V+DSVNITLGAEDEL++AV LVRPVS+AFEV+ GF+ YKSGVY+ST+CGNTPMD
Sbjct: 245 ENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYTSTECGNTPMD 304
Query: 259 VNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VNHAV+AVGYGVE+GVPYWLIKNSWG +WGD+GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 305 VNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASYPVVA 360
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 240/361 (66%), Positives = 269/361 (74%), Gaps = 52/361 (14%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
MA + VSS++L+L CA A S FDDSNPIR+VS D LR+ E V++V+GQ HAL
Sbjct: 1 MASRLFFVSSLLLVLSCAVAGSV----FDDSNPIRMVS-DRLRELELEVVRVLGQVPHAL 55
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
FARFA RYGK YE+ EEMKLRF F ++L+LI+STN +GLSY+LG+N
Sbjct: 56 RFARFAHRYGKKYETAEEMKLRFGIFLESLELIKSTNKQGLSYKLGVNQFADWTWEEFRK 115
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVKDQGHCGSCWTFSTTG+LEAA
Sbjct: 116 HRLGAAQNCSATTKGSHKLTDTALPESKDWRKDGIVSPVKDQGHCGSCWTFSTTGALEAA 175
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
Y QA GKGISLSEQQLVDC + FNN GCNGGLPSQAFEYIKYNGGLDTEEAYPYTG DG
Sbjct: 176 YAQAHGKGISLSEQQLVDCGRGFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGVDGS 235
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKF ENVGVQV+DSVNITLGAEDEL+HAV VRPVSVAFEVV GFR Y GVY+S CG
Sbjct: 236 CKFVPENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCG 295
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
+TPMDVNHAV+AVGYGVEDG+PYWLIKNSWG NWGD+GYFKMEMGKNMCG+ATCASYP+V
Sbjct: 296 STPMDVNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDNGYFKMEMGKNMCGVATCASYPIV 355
Query: 314 A 314
A
Sbjct: 356 A 356
>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 357
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 237/334 (70%), Positives = 262/334 (78%), Gaps = 49/334 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FD+SNPIR+VS DGLR+ E SV Q++GQ+RH LSFARF RYGK Y++VEEMKLRF+ F
Sbjct: 26 FDESNPIRMVS-DGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
+NLDLIRSTN KGLSY+LG+N
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKGSHKVTEAALPET 144
Query: 109 --------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+SPVKDQG CGSCWTFSTTG+LEAAYHQAFGKGISLSEQQLVDCA AFNN G
Sbjct: 145 KDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYG 204
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGLPSQAFEYIK NGGLDTE+AYPYTGKD CKFS+ENVGVQVL+SVNITLGAEDEL+
Sbjct: 205 CNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNSVNITLGAEDELK 264
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
HAVGLVRPVS+AFEV+ FR YKSGVY+ + CG+TPMDVNHAV+AVGYGVEDGVPYWLIK
Sbjct: 265 HAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIK 324
Query: 281 NSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NSWG +WGD GYFKMEMGKNMC IATCASYPVVA
Sbjct: 325 NSWGADWGDKGYFKMEMGKNMC-IATCASYPVVA 357
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 479 bits (1233), Expect = e-133, Method: Compositional matrix adjust.
Identities = 239/356 (67%), Positives = 271/356 (76%), Gaps = 51/356 (14%)
Query: 10 SVILLLCCA----AAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARF 65
S++L L A A+A A ++F D NPIR V SDGL + E ++LQV+G+ RHALS ARF
Sbjct: 5 SLLLALVVAGGLFASALAGPATFADENPIRQVVSDGLHELENAILQVVGKTRHALSSARF 64
Query: 66 ARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------------- 108
A RYGK YESVEE+K RF F NL +IRS N KGLSY+LG+N
Sbjct: 65 AHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGA 124
Query: 109 ------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
+SPVK+QG CGSCWTFSTTG+LEAAY QAF
Sbjct: 125 AQNCSATTKGNLKVTNVVLPETKGWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAF 184
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSS 198
GKGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK NGGLDTEEAYPYTGK+G+CKFSS
Sbjct: 185 GKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNGLCKFSS 244
Query: 199 ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMD 258
ENVGV+V+DSVNITLGAEDEL++AV LVRPVS+AFEV+ GF+ YKSGVY+ST+CGNTPMD
Sbjct: 245 ENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYTSTECGNTPMD 304
Query: 259 VNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VNHAV+AVGYGVE+GVPYWLIKNSWG +WGD+GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 305 VNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASYPVVA 360
>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 233/344 (67%), Positives = 266/344 (77%), Gaps = 49/344 (14%)
Query: 7 LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA 66
++SSV+L++ AA+A+A FD+ NPIR+VS DGLR+ E +V Q++GQ+RH L+FARF
Sbjct: 6 VLSSVVLVILIAASAAADIG-FDELNPIRMVS-DGLREVEETVSQILGQSRHVLTFARFT 63
Query: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------ 108
RYGK Y++VEEMKLRF+ F +NLDLIRSTN KGLSY+LG+N
Sbjct: 64 HRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAA 123
Query: 109 -----------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
+SPVKDQG CGSCWTFSTTG+LEAAYHQAFG
Sbjct: 124 QNCSATLKGSHKLTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFG 183
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
KGISLSEQQLVDCA A+NN GCNGGLPSQAFEYIK NGGLDTEEAYPY GKDG CKFS+E
Sbjct: 184 KGISLSEQQLVDCAGAYNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYIGKDGTCKFSAE 243
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDV 259
NVGVQVLDSVNITLGAEDEL+HAVGLVRPVS+AFEV+ FR YKSGVY+ + CG+TPMDV
Sbjct: 244 NVGVQVLDSVNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDV 303
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCG 303
NHAV+AVGYGVEDGVPYWLIKNSWG +WGD GYFKMEMGKNMCG
Sbjct: 304 NHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCG 347
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 473 bits (1217), Expect = e-131, Method: Compositional matrix adjust.
Identities = 238/355 (67%), Positives = 269/355 (75%), Gaps = 47/355 (13%)
Query: 7 LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA 66
L+ ++++ AAA A ++F NPIR V SDGL + E +LQV+GQ+RHALSF RFA
Sbjct: 6 LLLALVVAGGLFAAALAGPATFAVENPIRQVVSDGLHELENGILQVVGQSRHALSFVRFA 65
Query: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------ 108
RYGK YESVEE+K RF F NL +IRS N KGLSY+LG+N
Sbjct: 66 HRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAA 125
Query: 109 -----------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
+SPVK+QG CGSCWTFSTTG+LEAAY QAFG
Sbjct: 126 QNCSATTKGNVKLTNAVLPETKDWREDGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAFG 185
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
KGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK NGGLDTEEAYPYTGK+G+CKFSSE
Sbjct: 186 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNGLCKFSSE 245
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDV 259
NVGV+V+DSVNITLGAEDEL++AV LVRPVS+AFEV+ GF+ YKSGVYSST+CGNTPMDV
Sbjct: 246 NVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYSSTECGNTPMDV 305
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NHAV+AVGYGVE+GVPYWLIKNSWG +WGD GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 306 NHAVLAVGYGVENGVPYWLIKNSWGADWGDDGYFKMEMGKNMCGIATCASYPVVA 360
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 235/355 (66%), Positives = 270/355 (76%), Gaps = 47/355 (13%)
Query: 7 LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA 66
L+ ++++ AAA A ++F D NPIR + SDGL + E +LQV+G+ RHAL FARFA
Sbjct: 6 LLLALVVAGGLFAAALAGPATFADENPIRQIVSDGLHELENGILQVVGKTRHALLFARFA 65
Query: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------ 108
RYGK YE+VEE+K RF F NL +IRS N KGLSY+LG+N
Sbjct: 66 HRYGKRYETVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDITWDEFRRDRLGAA 125
Query: 109 -----------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
+SPVK+QG CGSCWTFSTTG+LEAAY QAFG
Sbjct: 126 QNCSATTKGNLKLTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYGQAFG 185
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
KGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK NGGLDTEEAYPYTGK+G+CKFSSE
Sbjct: 186 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNGLCKFSSE 245
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDV 259
NVGV+V+DSVNITLGAEDEL++AV LVRPVS+AFEV+ GF+ YKSGVY+ST+CGNTPMDV
Sbjct: 246 NVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYTSTECGNTPMDV 305
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NHAV+AVGYGVE+GVPYWLIKNSWG +WGD+GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 306 NHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASYPVVA 360
>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
Length = 377
Score = 469 bits (1207), Expect = e-130, Method: Compositional matrix adjust.
Identities = 232/349 (66%), Positives = 265/349 (75%), Gaps = 50/349 (14%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
M+ + L SS++L+L AAA++ FD+SNPI++VS D L + E +V+Q++GQ+RH L
Sbjct: 1 MSVKLNLSSSILLILF--AAAASKEIGFDESNPIKMVS-DNLHELEDTVVQILGQSRHVL 57
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SF+RF RYGK Y+SVEEMKLRF+ F +NLDLIRSTN KGLSY+L LN
Sbjct: 58 SFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQR 117
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVK+QGHCGSCWTFSTTG+LEAA
Sbjct: 118 YKLGAAQNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
YHQAFGKGISLSEQQLVDCA FNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYTGKDG
Sbjct: 178 YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKFS++N+GVQV DSVNITLGAEDEL+HAVGLVRPVSVAFEVV FRFYK GV++S CG
Sbjct: 238 CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCG 297
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMC 302
NTPMDVNHAV+AVGYGVED VPYWLIKNSWG WGD+GYFKMEMGKNMC
Sbjct: 298 NTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC 346
>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
Length = 358
Score = 466 bits (1200), Expect = e-129, Method: Compositional matrix adjust.
Identities = 235/361 (65%), Positives = 267/361 (73%), Gaps = 50/361 (13%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
MAR ++S+ ++L+ A + A+ASSFD+SNPIRLVS DGLR+ E V+QV+G +R AL
Sbjct: 1 MARVTLVLSAALVLV--AISCGAAASSFDESNPIRLVS-DGLRELEQQVVQVLGNSRRAL 57
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
FARFA RYGK YESVEEMKLR+ FS+N LIRSTN KGL Y L +N
Sbjct: 58 HFARFAHRYGKKYESVEEMKLRYEIFSENKKLIRSTNKKGLPYTLAVNRFADWSWEEFRR 117
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
++PVKDQGHCGSCWTFSTTG+LEAA
Sbjct: 118 QRLGAAQNCSATTKGSHELTDAVLPESKNWREEGIVTPVKDQGHCGSCWTFSTTGALEAA 177
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
Y QAF K ISLSEQQLVDCA AFNN GC+GGLPSQAFEYIKYNGGLDTE AYPY G DG
Sbjct: 178 YVQAFRKQISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEAAYPYVGTDGA 237
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKFS+ENVGVQVLDSVNITLG E EL+HAV VRPVSVAF+VV FR YKSGVY+S CG
Sbjct: 238 CKFSAENVGVQVLDSVNITLGDEQELKHAVAFVRPVSVAFQVVKSFRIYKSGVYTSDTCG 297
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
++PMDVNHAV+AVGYG E GVP+WLIKNSWGE+WGD+GYFKME GKNMCG+ATCASYP+V
Sbjct: 298 SSPMDVNHAVLAVGYGEEGGVPFWLIKNSWGESWGDNGYFKMEFGKNMCGVATCASYPIV 357
Query: 314 A 314
A
Sbjct: 358 A 358
>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
Length = 350
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 229/352 (65%), Positives = 266/352 (75%), Gaps = 53/352 (15%)
Query: 10 SVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRY 69
S++++L C A+A SF DSNPIR+VS D E +LQVIG++RHA+SFARFA RY
Sbjct: 5 SLLIVLFCVTTAAA-GFSFHDSNPIRMVS-----DAEEQLLQVIGESRHAVSFARFANRY 58
Query: 70 GKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------------------- 108
GK+Y+SV+EMKLRF FS+NL+LIRSTN + LSY+LG+N
Sbjct: 59 GKLYDSVDEMKLRFKIFSENLELIRSTNKRRLSYKLGVNHFADWTWEEFKSHRLGAAQNC 118
Query: 109 --------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI 142
+S VKDQGHCGSCWTFSTTG+LE+AY QAFGK I
Sbjct: 119 SATLKGNHKITDANLPDEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNI 178
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVG 202
SLSEQQLVDCA AFNN GC+GGLPSQAFEYIKYNGGL+TEE YPYTG +G+CKF+SENV
Sbjct: 179 SLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLETEETYPYTGSNGLCKFTSENVA 238
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
++VL SVNITLG+EDEL+HAV RPVSVAFEVV FR YKSGVY+ST CGNTPMDVNHA
Sbjct: 239 LKVLGSVNITLGSEDELKHAVAFARPVSVAFEVVHDFRLYKSGVYTSTACGNTPMDVNHA 298
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
V+AVGYG+EDG+PYW IKNSWG +WGDHGYFKMEMGKNMCG+ATC+SYPVVA
Sbjct: 299 VLAVGYGIEDGIPYWHIKNSWGGDWGDHGYFKMEMGKNMCGVATCSSYPVVA 350
>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 361
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 227/323 (70%), Positives = 252/323 (78%), Gaps = 48/323 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FD+SNPIR+VS DGLR+ E SV Q++GQ+RH LSFARF RYGK Y++VEEMKLRF+ F
Sbjct: 26 FDESNPIRMVS-DGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
+NLDLIRSTN KGLSY+LG+N
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKGSHKVTEAALPET 144
Query: 109 --------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+SPVKDQG CGSCWTFSTTG+LEAAYHQAFGKGISLSEQQLVDCA AFNN G
Sbjct: 145 KDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYG 204
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGLPSQAFEYIK NGGLDTE+AYPYTGKD CKFS+ENVGVQVL+SVNITLGAEDEL+
Sbjct: 205 CNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNSVNITLGAEDELK 264
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
HAVGLVRPVS+AFEV+ FR YKSGVY+ + CG+TPMDVNHAV+AVGYGVEDGVPYWLIK
Sbjct: 265 HAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIK 324
Query: 281 NSWGENWGDHGYFKMEMGKNMCG 303
NSWG +WGD GYFKMEMGKNMCG
Sbjct: 325 NSWGADWGDKGYFKMEMGKNMCG 347
>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
Length = 350
Score = 463 bits (1192), Expect = e-128, Method: Compositional matrix adjust.
Identities = 232/351 (66%), Positives = 259/351 (73%), Gaps = 54/351 (15%)
Query: 11 VILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYG 70
+I+ C A AA+ SF DSNPIR+VS D E +LQVIG++RHA+SFARFA RYG
Sbjct: 7 LIVFFCVATAAAGL--SFHDSNPIRMVS-----DMEKQLLQVIGESRHAVSFARFANRYG 59
Query: 71 KIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---------------------- 108
K Y++V+EMK RF FS+NL LI STN K L Y LG+N
Sbjct: 60 KRYDTVDEMKRRFKIFSENLQLIESTNKKRLGYTLGVNHFADWTWEEFRSHRLGAAQNCS 119
Query: 109 -------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGIS 143
+S VKDQGHCGSCWTFSTTG+LE+AY QAFGK IS
Sbjct: 120 ATLKGNHRITDVVLPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNIS 179
Query: 144 LSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV 203
LSEQQLVDCA AFNN GCNGGLPSQAFEYIKYNGGL+TEEAYPYTG++G CKF+SE+V V
Sbjct: 180 LSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTGQNGPCKFTSEDVAV 239
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAV 263
QVL SVNITLGAEDEL+HAV RPVSVAFEVVD FR YK GVY+ST CGNTPMDVNHAV
Sbjct: 240 QVLGSVNITLGAEDELKHAVAFARPVSVAFEVVDDFRLYKKGVYTSTTCGNTPMDVNHAV 299
Query: 264 VAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
+AVGYG+EDGVPYWLIKNSWG WGDHGYFKMEMGKNMCG+ATC+SYPVVA
Sbjct: 300 LAVGYGIEDGVPYWLIKNSWGGEWGDHGYFKMEMGKNMCGVATCSSYPVVA 350
>gi|297819034|ref|XP_002877400.1| hypothetical protein ARALYDRAFT_323209 [Arabidopsis lyrata subsp.
lyrata]
gi|297323238|gb|EFH53659.1| hypothetical protein ARALYDRAFT_323209 [Arabidopsis lyrata subsp.
lyrata]
Length = 317
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 223/302 (73%), Positives = 253/302 (83%), Gaps = 7/302 (2%)
Query: 19 AAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEE 78
AAA+ FD+SNPI++VS D L + E +V+Q++GQ+RH LSF+RFA RYGK Y+SVEE
Sbjct: 17 AAAATKEIRFDESNPIKMVS-DNLHELEDNVVQILGQSRHVLSFSRFAHRYGKKYQSVEE 75
Query: 79 MKLRFATFSKNLDLIRSTNCKGLSYRLGLN------ISPVKDQGHCGSCWTFSTTGSLEA 132
MKLRF+ F +NLDLIRSTN KGLSY+L LN + +TTG+LEA
Sbjct: 76 MKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLLLLLLLVNTTGALEA 135
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
AYHQAFGKGISLSEQQLVDCA FNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYTGKDG
Sbjct: 136 AYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 195
Query: 193 VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKC 252
CKFS++N+GVQVLDSVNITLGAEDEL+HAVGLVRPVSVAFEVV FRFYK GV++S C
Sbjct: 196 GCKFSAKNIGVQVLDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTC 255
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPV 312
GNTPMDVNHAV+AVGYGVED VPYWLIKNSWG +WGD+GYFKMEMGKNMCG+ATC+SYPV
Sbjct: 256 GNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGDWGDNGYFKMEMGKNMCGVATCSSYPV 315
Query: 313 VA 314
VA
Sbjct: 316 VA 317
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 229/352 (65%), Positives = 267/352 (75%), Gaps = 53/352 (15%)
Query: 10 SVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRY 69
S++++L C A+A+A SF DSNPIR+VS D E +LQVIG++RHA+SFARFA RY
Sbjct: 5 SLLIVLFCVASAAA-GFSFHDSNPIRMVS-----DVEEQLLQVIGESRHAVSFARFANRY 58
Query: 70 GKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------------------- 108
GK Y+SV+EMKLRF FS+NL+LIRS+N + LSY+LG+N
Sbjct: 59 GKRYDSVDEMKLRFKIFSENLELIRSSNKRRLSYKLGVNHFADWTWEEFRSHRLGAAQNC 118
Query: 109 --------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI 142
+S VKDQG CGSCWTFSTTG+LE+AY QAFGK I
Sbjct: 119 SATLKGNHKITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKNI 178
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVG 202
SLSEQQLVDCA AFNN GC+GGLPSQAFEYIKYNGGL+TEEAYPYTG +G+CKF SE+V
Sbjct: 179 SLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSNGLCKFRSEHVA 238
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
V+VL SVNITLGAEDEL+HA+ RPVSVAFEVV FR YKSGVY+ST CG+TPMDVNHA
Sbjct: 239 VKVLGSVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYTSTACGSTPMDVNHA 298
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
V+AVGYG+EDG+PYWLIKNSWG +WGDHGYFKMEMGKNMCG+ATC+SYPVVA
Sbjct: 299 VLAVGYGIEDGIPYWLIKNSWGGDWGDHGYFKMEMGKNMCGVATCSSYPVVA 350
>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
Length = 350
Score = 460 bits (1183), Expect = e-127, Method: Compositional matrix adjust.
Identities = 228/352 (64%), Positives = 267/352 (75%), Gaps = 53/352 (15%)
Query: 10 SVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRY 69
S++++L C A+A+A SF DSNPIR+VS D E +LQVIG++RHA+SFARFA RY
Sbjct: 5 SLLIVLFCVASAAA-GFSFHDSNPIRMVS-----DVEEQLLQVIGESRHAVSFARFANRY 58
Query: 70 GKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------------------- 108
GK Y+SV+EMKLRF FS+N++LIRS+N + LSY+LG+N
Sbjct: 59 GKRYDSVDEMKLRFKIFSENIELIRSSNKRRLSYKLGVNHFADWTWEEFRSHRLGAAQNC 118
Query: 109 --------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI 142
+S VKDQG CGSCWTFSTTG+LE+AY QAFGK I
Sbjct: 119 SATLKGNHKITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKNI 178
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVG 202
SLSEQQLVDCA AFNN GC+GGLPSQAFEYIKYNGGL+TEEAYPYTG +G+CKF SE+V
Sbjct: 179 SLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSNGLCKFRSEHVA 238
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
V+VL SVNITLGAEDEL+HA+ RPVSVAFEVV FR YKSGVY+ST CG+TPMDVNHA
Sbjct: 239 VKVLGSVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYTSTACGSTPMDVNHA 298
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
V+AVGYG+EDG+PYWLIKNSWG +WGDHGYFKMEMGKNMCG+ATC+SYPVVA
Sbjct: 299 VLAVGYGIEDGIPYWLIKNSWGGDWGDHGYFKMEMGKNMCGVATCSSYPVVA 350
>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
Length = 352
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 226/342 (66%), Positives = 254/342 (74%), Gaps = 52/342 (15%)
Query: 20 AASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEM 79
AA+A +SSF+DSNPIRLVS D E VLQVIGQ RHA+SFARFA +YGK Y+SVEE+
Sbjct: 16 AAAAGSSSFEDSNPIRLVS-----DLEEQVLQVIGQTRHAVSFARFASKYGKRYDSVEEI 70
Query: 80 KLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------------------- 108
+ RF FS+NL+LI+STN K LSY+LGLN
Sbjct: 71 QHRFRIFSENLELIKSTNKKRLSYKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHKL 130
Query: 109 ----------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDC 152
+S VKDQ HCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDC
Sbjct: 131 TDAVLPAEKDWRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDC 190
Query: 153 AQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT 212
A AFNN GCNGGLPSQAFEYIKYNGG+ E+ YPYT KD CKF++ENV V+VLDSVNIT
Sbjct: 191 AGAFNNFGCNGGLPSQAFEYIKYNGGIALEKEYPYTAKDEACKFTAENVAVRVLDSVNIT 250
Query: 213 LGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED 272
LGAEDEL+HAV RPVSVAF+VVDGFR YK GVY+S CGNTPMDVNHAV+AVGYGVE+
Sbjct: 251 LGAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVEN 310
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VPYW+IKNSWG WGDHGYFKME+GKNMCG+ATCASYP+VA
Sbjct: 311 NVPYWIIKNSWGSTWGDHGYFKMELGKNMCGVATCASYPIVA 352
>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
Length = 353
Score = 459 bits (1181), Expect = e-127, Method: Compositional matrix adjust.
Identities = 236/362 (65%), Positives = 266/362 (73%), Gaps = 57/362 (15%)
Query: 1 MARPVQLVSSVILLLCCAAAASAS-ASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHA 59
MAR +S +I C A A A SSFDD+NPIRL S D E+ VL VIGQ+RHA
Sbjct: 1 MAR----LSLLIFAFCAVAVAVAVAGSSFDDANPIRLAS-----DLESQVLDVIGQSRHA 51
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
LSFARFARR+GK Y SV+E++ RF FS NL LIRSTN + L+Y LG+N
Sbjct: 52 LSFARFARRHGKRYRSVDEIRNRFRIFSDNLKLIRSTNRRSLTYTLGVNHFADWTWEEFT 111
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
+S VKDQG+CGSCWTFSTTG+LEA
Sbjct: 112 RHKLGAPQNCSATLKGNHRLTDAVLPDEKDWRKEGIVSQVKDQGNCGSCWTFSTTGALEA 171
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
AY QAFGK ISLSEQQLVDCA AFNN GCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG
Sbjct: 172 AYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 231
Query: 193 VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKC 252
VCKF+++NV V+V+DS+NITLGAEDEL+ AV VRPVSVAFEV FRFY +GVY+ST C
Sbjct: 232 VCKFTAKNVAVRVIDSINITLGAEDELKQAVAFVRPVSVAFEVAKDFRFYNNGVYTSTIC 291
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPV 312
G+TPMDVNHAV+AVGYGVEDGVPYW+IKNSWG NWGD+GYFKME+GKNMCG+ATCASYPV
Sbjct: 292 GSTPMDVNHAVLAVGYGVEDGVPYWIIKNSWGSNWGDNGYFKMELGKNMCGVATCASYPV 351
Query: 313 VA 314
VA
Sbjct: 352 VA 353
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 227/343 (66%), Positives = 257/343 (74%), Gaps = 47/343 (13%)
Query: 19 AAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEE 78
A A A ++F D NPIR V SD + E+ +L V+GQ RHALSFARFARRYGK Y+SVEE
Sbjct: 16 AVAFARTANFADENPIRQVVSDSFHELESGILHVVGQTRHALSFARFARRYGKRYDSVEE 75
Query: 79 MKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------------------ 108
+K RF F NL++I S N KGLSY+LG+N
Sbjct: 76 IKQRFDIFLDNLEMINSHNDKGLSYKLGVNEFSDLTWDEFRRDRLGAAQNCSATTKGNLK 135
Query: 109 -----------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVD 151
+SPVK+QG CGSCWTFSTTG+LEAAY Q FGKGISLSEQQLVD
Sbjct: 136 LRDAVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVD 195
Query: 152 CAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNI 211
CA AFNN GCNGGLPSQAFEYIK NGGL+TEEAYPYTGK+G+CKFSS+NVGV+V DSVNI
Sbjct: 196 CAGAFNNFGCNGGLPSQAFEYIKSNGGLETEEAYPYTGKNGLCKFSSQNVGVKVTDSVNI 255
Query: 212 TLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE 271
TLGAEDEL++AV LVRPVSVAFEVV GF+ YKSGVY+ST+CG TPMDVNHAV+AVGYGVE
Sbjct: 256 TLGAEDELKYAVALVRPVSVAFEVVKGFKQYKSGVYTSTECGTTPMDVNHAVLAVGYGVE 315
Query: 272 DGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
GVP+WLIKNSWG +WGD+ YFKMEMG +MCGIATCASYPVVA
Sbjct: 316 YGVPFWLIKNSWGADWGDNAYFKMEMGNDMCGIATCASYPVVA 358
>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
Length = 352
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 225/342 (65%), Positives = 252/342 (73%), Gaps = 52/342 (15%)
Query: 20 AASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEM 79
AA+A +SSF+DSNPIRLVS D E VLQVIGQ RHA SFARFA +YGK Y+SVEE+
Sbjct: 16 AAAAGSSSFEDSNPIRLVS-----DLEEQVLQVIGQTRHAASFARFASKYGKRYDSVEEI 70
Query: 80 KLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------------------- 108
+ RF FS+NL+LI+STN K LSY+LGLN
Sbjct: 71 QHRFRIFSENLELIKSTNKKRLSYKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHKL 130
Query: 109 ----------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDC 152
+S VKDQ HCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDC
Sbjct: 131 TDAVLSAEKDWRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDC 190
Query: 153 AQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT 212
A AFNN GCNGGLPSQAFEYIKYNGG+ E+ YPYT KD KF++ENV V+VLDSVNIT
Sbjct: 191 AGAFNNFGCNGGLPSQAFEYIKYNGGIALEKEYPYTAKDEASKFTAENVAVRVLDSVNIT 250
Query: 213 LGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED 272
LGAEDEL+HAV RPVSVAF+VVDGFR YK GVY+S CGNTPMDVNHAV+AVGYGVE+
Sbjct: 251 LGAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVEN 310
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VPYW+IKNSWG WGDHGYFKME+GKNMCG+ATCASYP+VA
Sbjct: 311 NVPYWIIKNSWGSTWGDHGYFKMELGKNMCGVATCASYPIVA 352
>gi|155970232|gb|ABU41785.1| cysteine protease [Rosa x borboniana]
Length = 357
Score = 453 bits (1165), Expect = e-125, Method: Compositional matrix adjust.
Identities = 228/340 (67%), Positives = 255/340 (75%), Gaps = 48/340 (14%)
Query: 22 SASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKL 81
+ +ASSFD+S+PIRLV DGLR+ E V+QV+GQ H SFARFA RY K YESVEEM
Sbjct: 19 TCAASSFDESSPIRLVP-DGLRELEDQVVQVLGQVCHVRSFARFAYRYEKRYESVEEMGR 77
Query: 82 RFATFSKNLDLIRSTNCKGLSYRLGLN--------------------------------- 108
RF F++N LIRSTN KGLSY+LG+N
Sbjct: 78 RFEIFAENKKLIRSTNRKGLSYKLGVNRFADWTWEEFQRHRLGAAQNCSATTKGNHKLTD 137
Query: 109 --------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQ 154
++PVKDQGHCGSCWTFSTTG+LEAAY QAFGK IS SEQQLVDCA
Sbjct: 138 AVPPLTKNWRDEGIVTPVKDQGHCGSCWTFSTTGALEAAYVQAFGKQISPSEQQLVDCAG 197
Query: 155 AFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLG 214
AFNN GC+GGLPSQAFEYIKYNGGLDTE+AYPYT DG CKFSSENVGV+VLDSVNITL
Sbjct: 198 AFNNFGCSGGLPSQAFEYIKYNGGLDTEQAYPYTAVDGACKFSSENVGVRVLDSVNITLN 257
Query: 215 AEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV 274
E+EL+HAV VRPVSVAF+VV FR YKSGVY+S CGNTPMDVNHAV+AVGYGVE+GV
Sbjct: 258 DEEELKHAVAFVRPVSVAFQVVQDFRLYKSGVYTSETCGNTPMDVNHAVLAVGYGVENGV 317
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
PYWLIKNSWG++WGD+GYFKME GKNMCG+ATCASYPVVA
Sbjct: 318 PYWLIKNSWGQSWGDNGYFKMEYGKNMCGVATCASYPVVA 357
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 224/353 (63%), Positives = 253/353 (71%), Gaps = 49/353 (13%)
Query: 11 VILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYG 70
V+ ++ A A+ S F DSNPIR V+ E++V +G+ R AL FARFA RYG
Sbjct: 8 VLAVVVLADTAAVVNSGFADSNPIRPVTDRAASALESTVFAALGRTRDALRFARFAVRYG 67
Query: 71 KIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---------------------- 108
K YES E+ RF FS++L L+RSTN KGLSYRLG+N
Sbjct: 68 KSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRATRLGAAQNCS 127
Query: 109 ---------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG 141
+SPVK+QGHCGSCWTFSTTG+LEAAY QA GK
Sbjct: 128 ATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKP 187
Query: 142 ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV 201
ISLSEQQLVDC AFNN GCNGGLPSQAFEYIKYNGGLDTEE+YPY G +G+CKF +ENV
Sbjct: 188 ISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGICKFKNENV 247
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNH 261
GV+VLDSVNITLGAEDEL+ AVGLVRPVSVAFEV+ GFR YKSGVY+S CG TPMDVNH
Sbjct: 248 GVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNH 307
Query: 262 AVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
AV+AVGYGVEDGVPYWLIKNSWG +WGD GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 308 AVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCASYPIVA 360
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 228/361 (63%), Positives = 263/361 (72%), Gaps = 51/361 (14%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
MAR + ++ V L +A + + +FD++N I+ V+ + + ETS+L V+GQ R+AL
Sbjct: 1 MARFLAFLALVFL---SSAILARANHAFDEANLIQSVT-ERIDSLETSLLGVLGQTRNAL 56
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
FARFA RYGK Y+SVEEMKLRFA F +NL+LIRSTN +GL Y+LG+N
Sbjct: 57 HFARFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNRRGLPYKLGINRYADMSWEEFRA 116
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVKDQG CGSCWTFSTTG+LEAA
Sbjct: 117 SRLGAAQNCSATLKGNHKMTDELLPKTKDWREDGIVSPVKDQGSCGSCWTFSTTGALEAA 176
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
Y QA GKGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIKYNGGLDTEE+YPY G +G
Sbjct: 177 YTQATGKGISLSEQQLVDCAYAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYAGVNGF 236
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
C F ENVGV+V++SVNITLGAEDEL HAVGLVRPVS+AFEVV GFRFYK GVY+S CG
Sbjct: 237 CHFKPENVGVKVVESVNITLGAEDELLHAVGLVRPVSIAFEVVSGFRFYKGGVYTSDTCG 296
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
T MDVNHAV+AVGYGVE+GVPYWLIKNSWGE WG GYFKME+GKNMCGIATCASYP+V
Sbjct: 297 RTQMDVNHAVLAVGYGVENGVPYWLIKNSWGEEWGVDGYFKMELGKNMCGIATCASYPIV 356
Query: 314 A 314
A
Sbjct: 357 A 357
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 220/338 (65%), Positives = 246/338 (72%), Gaps = 49/338 (14%)
Query: 26 SSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFAT 85
S F DSNPIR V+ E++V +G+ R AL FARFA RYGK YES E+ RF
Sbjct: 23 SGFADSNPIRPVTDRAASALESTVFAALGRTRDALRFARFAVRYGKSYESAAEVHKRFRI 82
Query: 86 FSKNLDLIRSTNCKGLSYRLGLN------------------------------------- 108
FS++L L+RSTN KGLSYRLG+N
Sbjct: 83 FSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRATRLGAAQNCSATLTGNHRMRAAAVA 142
Query: 109 ------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
+SPVK+QGHCGSCWTFSTTG+LEAAY QA GK ISLSEQQL+DC AF
Sbjct: 143 LPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAF 202
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
NN GCNGGLPSQAFEYIKYNGGLDTEE+YPY G +G+CKF +ENVGV+VLDSVNITLGAE
Sbjct: 203 NNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGICKFKNENVGVKVLDSVNITLGAE 262
Query: 217 DELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
DEL+ AVGLVRPVSVAFEV+ GFR YKSGVY+S CG TPMDVNHAV+AVGYGVEDGVPY
Sbjct: 263 DELKDAVGLVRPVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPY 322
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
WLIKNSWG +WGD GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 323 WLIKNSWGADWGDEGYFKMEMGKNMCGVATCASYPIVA 360
>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
Length = 360
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 220/338 (65%), Positives = 245/338 (72%), Gaps = 49/338 (14%)
Query: 26 SSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFAT 85
S F DSNPIR V+ E++V +G+ R AL FARFA RYGK YES E+ RF
Sbjct: 23 SGFADSNPIRPVTDRAASALESTVFAALGRTRDALRFARFAVRYGKSYESAAEVHKRFRI 82
Query: 86 FSKNLDLIRSTNCKGLSYRLGLN------------------------------------- 108
FS++L L+RSTN KGLSYRLG+N
Sbjct: 83 FSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRATRLGAAQNCSATLTGNHRMRAAAVA 142
Query: 109 ------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
+SPVK+QGHCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDC AF
Sbjct: 143 LPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGLAF 202
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
NN GCNGGLPSQAFEYIKYNGGLDTEE+YPY G +G+ KF +ENVGV+VLDSVNITLGAE
Sbjct: 203 NNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGISKFKNENVGVKVLDSVNITLGAE 262
Query: 217 DELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
DEL+ AVGLVRPVSVAFEV+ GFR YKSGVY+S CG TPMDVNHAV+AVGYGVEDGVPY
Sbjct: 263 DELKDAVGLVRPVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPY 322
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
WLIKNSWG +WGD GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 323 WLIKNSWGADWGDEGYFKMEMGKNMCGVATCASYPIVA 360
>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
Length = 360
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 219/338 (64%), Positives = 245/338 (72%), Gaps = 49/338 (14%)
Query: 26 SSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFAT 85
S F DSNPIR V+ E++V +G+ R AL FARFA RYGK YES E+ RF
Sbjct: 23 SGFADSNPIRPVTDRAASALESTVFAALGRTRDALRFARFAVRYGKSYESAAEVHKRFRI 82
Query: 86 FSKNLDLIRSTNCKGLSYRLGLN------------------------------------- 108
FS++L L+RSTN KGLSYRLG+N
Sbjct: 83 FSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRATRLGAAQNCSATLTGNHRMRAAAVA 142
Query: 109 ------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
+SPVK+QGHCGSCWTFSTTG+LEAAY QA GK ISLSEQQL+DC AF
Sbjct: 143 LPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAF 202
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
NN GCNGGLPSQAFEYIKYNGGLDTEE+YPY G +G+CKF +ENVG +VLDSVNITLGAE
Sbjct: 203 NNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGICKFKNENVGFKVLDSVNITLGAE 262
Query: 217 DELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
DEL+ AVGLVRPVSVAFEV+ GFR YKSGVY+S CG TPMDVNHAV+AVGYGVEDGVPY
Sbjct: 263 DELKDAVGLVRPVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPY 322
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
WLIKNSWG +WGD GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 323 WLIKNSWGADWGDEGYFKMEMGKNMCGVATCASYPIVA 360
>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 356
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 225/355 (63%), Positives = 260/355 (73%), Gaps = 53/355 (14%)
Query: 10 SVILLLCCA--AAASASASSFDDSNPIR-LVSSDGLRDFETSVLQVIGQARHALSFARFA 66
S++L+L A A A ++F D NPIR +V D + E +LQV+GQ R ALSFARFA
Sbjct: 5 SLVLILVAGLFATALAGPATFADKNPIRQVVFPD---ELENGILQVVGQTRSALSFARFA 61
Query: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------ 108
R+ K Y+SVEE+K RF F NL +IRS N KGLSY+LG+N
Sbjct: 62 IRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRKHKLGAS 121
Query: 109 -----------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
+SPVK QG CGSCWTFSTTG+LEAAY QAFG
Sbjct: 122 QNCSATTKGNLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQAFG 181
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
KGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK+NGGLDTEEAYPYTGK+G+CKFS
Sbjct: 182 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGICKFSQA 241
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDV 259
N+GV+V+ SVNITLGAE EL++AV LVRPVSVAFEVV GF+ YKSGVY+ST+CG+TPMDV
Sbjct: 242 NIGVKVISSVNITLGAEYELKYAVALVRPVSVAFEVVKGFKQYKSGVYASTECGDTPMDV 301
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NHAV+AVGYGVE+G PYWLIKNSWG +WG+ GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 302 NHAVLAVGYGVENGTPYWLIKNSWGADWGEDGYFKMEMGKNMCGVATCASYPIVA 356
>gi|218202220|gb|EEC84647.1| hypothetical protein OsI_31538 [Oryza sativa Indica Group]
Length = 363
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 214/335 (63%), Positives = 249/335 (74%), Gaps = 48/335 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FDDSNPIR V+ E++V+ +G+ R AL FARFA R+GK Y E++ RF FS
Sbjct: 29 FDDSNPIRSVTDQAASALESTVIAALGRTRDALRFARFAVRHGKRYGDAAEVQRRFRIFS 88
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
++L+L+RSTN +GL YRLG+N
Sbjct: 89 ESLELVRSTNRRGLPYRLGINRFADMSWEEFQASRLGAAQNCSATLAGNHRMRDAAALPE 148
Query: 109 ---------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQ 159
+SPVKDQGHCGSCWTFSTTGSLEAAY QA GK +SLSEQQLVDCA A+NN
Sbjct: 149 TKDWREDGIVSPVKDQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNF 208
Query: 160 GCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
GC+GGLPSQAFEYIKYNGGLDTEEAYPYTG +G+C + ENVGV+VLDSVNITLGAEDEL
Sbjct: 209 GCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICHYKPENVGVKVLDSVNITLGAEDEL 268
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
++AVGLVRPVSVAF+V++GFR YKSGVY+S CG +PMDVNHAV+AVGYGVE+GVPYWLI
Sbjct: 269 KNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLI 328
Query: 280 KNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
KNSWG +WGD+GYFKMEMGKNMCGIATCASYP+VA
Sbjct: 329 KNSWGADWGDNGYFKMEMGKNMCGIATCASYPIVA 363
>gi|115479391|ref|NP_001063289.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|115510968|sp|P25778.2|ORYC_ORYSJ RecName: Full=Oryzain gamma chain; Flags: Precursor
gi|51535997|dbj|BAD38077.1| putative oryzain gamma chain precursor [Oryza sativa Japonica
Group]
gi|113631522|dbj|BAF25203.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|215694919|dbj|BAG90110.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 362
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 214/335 (63%), Positives = 249/335 (74%), Gaps = 48/335 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FDDSNPIR V+ E++V+ +G+ R AL FARFA R+GK Y E++ RF FS
Sbjct: 28 FDDSNPIRSVTDHAASALESTVIAALGRTRDALRFARFAVRHGKRYGDAAEVQRRFRIFS 87
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
++L+L+RSTN +GL YRLG+N
Sbjct: 88 ESLELVRSTNRRGLPYRLGINRFADMSWEEFQASRLGAAQNCSATLAGNHRMRDAAALPE 147
Query: 109 ---------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQ 159
+SPVKDQGHCGSCWTFSTTGSLEAAY QA GK +SLSEQQLVDCA A+NN
Sbjct: 148 TKDWREDGIVSPVKDQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNF 207
Query: 160 GCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
GC+GGLPSQAFEYIKYNGGLDTEEAYPYTG +G+C + ENVGV+VLDSVNITLGAEDEL
Sbjct: 208 GCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICHYKPENVGVKVLDSVNITLGAEDEL 267
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
++AVGLVRPVSVAF+V++GFR YKSGVY+S CG +PMDVNHAV+AVGYGVE+GVPYWLI
Sbjct: 268 KNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLI 327
Query: 280 KNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
KNSWG +WGD+GYFKMEMGKNMCGIATCASYP+VA
Sbjct: 328 KNSWGADWGDNGYFKMEMGKNMCGIATCASYPIVA 362
>gi|149392541|gb|ABR26073.1| oryzain gamma chain precursor [Oryza sativa Indica Group]
Length = 367
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 214/335 (63%), Positives = 249/335 (74%), Gaps = 48/335 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FDDSNPIR V+ E++V+ +G+ R AL FARFA R+GK Y E++ RF FS
Sbjct: 33 FDDSNPIRSVTDQAASALESTVIAALGRTRDALRFARFAVRHGKRYGDAAEVQRRFRIFS 92
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
++L+L+RSTN +GL YRLG+N
Sbjct: 93 ESLELVRSTNRRGLPYRLGINRFADMSWEEFQASRLGAAQNCSATLAGNHRMRDAAALPE 152
Query: 109 ---------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQ 159
+SPVKDQGHCGSCWTFSTTGSLEAAY QA GK +SLSEQQLVDCA A+NN
Sbjct: 153 TKDWREDGIVSPVKDQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNF 212
Query: 160 GCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
GC+GGLPSQAFEYIKYNGGLDTEEAYPYTG +G+C + ENVGV+VLDSVNITLGAEDEL
Sbjct: 213 GCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICHYKPENVGVKVLDSVNITLGAEDEL 272
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
++AVGLVRPVSVAF+V++GFR YKSGVY+S CG +PMDVNHAV+AVGYGVE+GVPYWLI
Sbjct: 273 KNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLI 332
Query: 280 KNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
KNSWG +WGD+GYFKMEMGKNMCGIATCASYP+VA
Sbjct: 333 KNSWGADWGDNGYFKMEMGKNMCGIATCASYPIVA 367
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 440 bits (1131), Expect = e-121, Method: Compositional matrix adjust.
Identities = 217/338 (64%), Positives = 246/338 (72%), Gaps = 49/338 (14%)
Query: 26 SSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFAT 85
S F DSN IR V+ E++V +G+ R AL FARFA RYGK YES E++ RF
Sbjct: 26 SDFADSNTIRSVTDRAASALESTVFGALGRTRDALRFARFAVRYGKSYESAAEVQKRFRI 85
Query: 86 FSKNLDLIRSTNCKGLSYRLGLN------------------------------------- 108
FS++L L+RSTN KGLSYRLG+N
Sbjct: 86 FSESLQLVRSTNRKGLSYRLGINRFSDMSWEEFRATRLGAAQNCSATLAGNHRMRAAAVA 145
Query: 109 ------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
+SPVK+QGHCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDC + F
Sbjct: 146 LPKTKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPF 205
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
NN GCNGGLPSQAFEYIKYNGGLDTEE+YPY G +G+C F +ENVGV+VLDSVNITLGAE
Sbjct: 206 NNFGCNGGLPSQAFEYIKYNGGLDTEESYPYKGVNGICDFKAENVGVKVLDSVNITLGAE 265
Query: 217 DELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
DEL+ AV LVRPVSVAF+VV+GFR YKSGVY+S CGNTPMDVNHAV+AVGYGVE+GVPY
Sbjct: 266 DELKDAVALVRPVSVAFQVVNGFRQYKSGVYTSDSCGNTPMDVNHAVLAVGYGVENGVPY 325
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
WLIKNSWG +WGD GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 326 WLIKNSWGADWGDKGYFKMEMGKNMCGVATCASYPIVA 363
>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
Length = 343
Score = 439 bits (1129), Expect = e-121, Method: Compositional matrix adjust.
Identities = 222/351 (63%), Positives = 252/351 (71%), Gaps = 61/351 (17%)
Query: 11 VILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYG 70
+I+ C A AA+ SF DSNPIR+VS D E +LQVIG++R FA RYG
Sbjct: 7 LIVFFCVATAAAGL--SFHDSNPIRMVS-----DMEEQLLQVIGESR-------FANRYG 52
Query: 71 KIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---------------------- 108
K Y++V+EMK RF FS+NL LI+STN K L Y LG+N
Sbjct: 53 KRYDTVDEMKRRFKIFSENLQLIKSTNKKRLGYTLGVNHFADWTWEEFRSHRLGAAQNCS 112
Query: 109 -------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGIS 143
+S VKDQGHCGSCWTFSTTG+LE+AY QAFGK IS
Sbjct: 113 ATLKGNHRITDVVLPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNIS 172
Query: 144 LSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV 203
LSEQQLVDCA A+NN GCNGGLPSQAFEYIKYNGGL+TEE YPYTG++G+CKF+SENV V
Sbjct: 173 LSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLETEEVYPYTGQNGLCKFTSENVAV 232
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAV 263
QVL SVNITLGAEDEL+HAV RPVSVAF+VVD FR YK GVY+ T CG+TPMDVNHAV
Sbjct: 233 QVLGSVNITLGAEDELKHAVAFARPVSVAFQVVDDFRLYKKGVYTGTTCGSTPMDVNHAV 292
Query: 264 VAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
+AVGYG+EDGVPYWLIKNSWG WGDHGYFKMEMGKNMCG+ATC+SYPVVA
Sbjct: 293 LAVGYGIEDGVPYWLIKNSWGGEWGDHGYFKMEMGKNMCGVATCSSYPVVA 343
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 212/335 (63%), Positives = 242/335 (72%), Gaps = 48/335 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
F DSN IR V+ E++++ +G++RHAL FARFA RYGK YES E++ RF FS
Sbjct: 28 FTDSNLIRPVTERAATALESTIVAALGRSRHALRFARFAVRYGKSYESAAEVQRRFRIFS 87
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
++L+ +RSTN KGLSYRLG+N
Sbjct: 88 ESLEEVRSTNQKGLSYRLGINRYSDMSWEEFQASRLGAAQTCSATLRGNHRMQDANALPE 147
Query: 109 ---------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQ 159
+SPVKDQ HCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDCA A+NN
Sbjct: 148 TKDWREDGIVSPVKDQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNF 207
Query: 160 GCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
GCNGGLPSQAFEYIKYNGGLDTEE+YPY G +GVC + EN VQVLDSVNITL AEDEL
Sbjct: 208 GCNGGLPSQAFEYIKYNGGLDTEESYPYKGVNGVCHYKPENAAVQVLDSVNITLNAEDEL 267
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
Q+AVGLVRPVSVAFEV++GFR YKSGVY+S CG TP DVNHAV+AVGYGVE+G PYWLI
Sbjct: 268 QNAVGLVRPVSVAFEVINGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGTPYWLI 327
Query: 280 KNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
KNSWGE+WGD GYFKME GKNMC +ATCASYP+VA
Sbjct: 328 KNSWGESWGDKGYFKMERGKNMCAVATCASYPIVA 362
>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
Length = 365
Score = 429 bits (1103), Expect = e-118, Method: Compositional matrix adjust.
Identities = 211/331 (63%), Positives = 241/331 (72%), Gaps = 48/331 (14%)
Query: 32 NPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLD 91
NPIR V+ E++VL +G+ RHAL FARFA RYGK YES E++ RF FS++L+
Sbjct: 34 NPIRPVTERAASTLESTVLAALGRTRHALRFARFAVRYGKSYESAAEVRRRFRIFSESLE 93
Query: 92 LIRSTNCKGLSYRLGLN------------------------------------------- 108
+RSTN KGLSYRLG+N
Sbjct: 94 EVRSTNRKGLSYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDW 153
Query: 109 -----ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+SPVKDQ HCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDCA FNN GC+G
Sbjct: 154 REDGIVSPVKDQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCSG 213
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYIKYNGG+DTEE+YPY G +GVC + +EN VQVLDSVNITL AEDEL++AV
Sbjct: 214 GLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAVVQVLDSVNITLNAEDELKNAV 273
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
GLVRPVSVAFEV++GFR YKSGVYSS CG TP DVNHAV+AVGYGVE+GVPYWLIKNSW
Sbjct: 274 GLVRPVSVAFEVINGFRQYKSGVYSSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSW 333
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
G +WGD+GYFKMEMGKNMC +ATCASYP+VA
Sbjct: 334 GADWGDNGYFKMEMGKNMCAVATCASYPIVA 364
>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
Length = 362
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 211/331 (63%), Positives = 240/331 (72%), Gaps = 48/331 (14%)
Query: 32 NPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLD 91
NPIR V+ E++VL +G+ RHAL FARFA RYGK YES E++ RF FS++L+
Sbjct: 31 NPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVRYGKSYESAAEVRRRFRIFSESLE 90
Query: 92 LIRSTNCKGLSYRLGLN------------------------------------------- 108
+RSTN KGL YRLG+N
Sbjct: 91 EVRSTNRKGLPYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDW 150
Query: 109 -----ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+SPVK+Q HCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDCA FNN GCNG
Sbjct: 151 REDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNG 210
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYIKYNGG+DTEE+YPY G +GVC + +EN VQVLDSVNITL AEDEL++AV
Sbjct: 211 GLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAV 270
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
GLVRPVSVAF+V+DGFR YKSGVY+S CG TP DVNHAV+AVGYGVE+GVPYWLIKNSW
Sbjct: 271 GLVRPVSVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSW 330
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
G +WGD+GYFKMEMGKNMC IATCASYPVVA
Sbjct: 331 GADWGDNGYFKMEMGKNMCAIATCASYPVVA 361
>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
Length = 359
Score = 426 bits (1094), Expect = e-117, Method: Compositional matrix adjust.
Identities = 210/336 (62%), Positives = 242/336 (72%), Gaps = 48/336 (14%)
Query: 27 SFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATF 86
SF DSNPIR V+ E++VL +G+ RHAL FARFA R+GK Y S E++ RF F
Sbjct: 23 SFADSNPIRPVTERAASAVESTVLGALGRTRHALRFARFAVRHGKSYGSAAEVQRRFRIF 82
Query: 87 SKNLDLIRSTNCKGLSYRLGLN-------------------------------------- 108
S++LD +RSTN KGLSY+LG+N
Sbjct: 83 SESLDEVRSTNRKGLSYKLGINRFSDMTWEEFQATKLGAAQTCSATLAGNHLMRDANALP 142
Query: 109 ----------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+SPVKDQ CGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDCA A+NN
Sbjct: 143 ETKDWRETGIVSPVKDQASCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNN 202
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGLPSQAFEYIKYNGG+DTEE+YPY G +GVCK+ EN VQV DSVNITL AEDE
Sbjct: 203 FGCNGGLPSQAFEYIKYNGGIDTEESYPYKGVNGVCKYRPENAAVQVADSVNITLNAEDE 262
Query: 219 LQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
L++AVGLVRPVSVAFEV+DGF+ YKSGVY+S CG TP DVNHAV+AVGYGVE+GVPYWL
Sbjct: 263 LKNAVGLVRPVSVAFEVIDGFKQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWL 322
Query: 279 IKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
IKNSWG +WG+ GYFKMEMGKNMC +ATCASYP++A
Sbjct: 323 IKNSWGADWGEDGYFKMEMGKNMCAVATCASYPILA 358
>gi|326516056|dbj|BAJ88051.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 210/331 (63%), Positives = 239/331 (72%), Gaps = 48/331 (14%)
Query: 32 NPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLD 91
NPIR V+ E++VL +G+ RHAL FARFA YGK YES E++ RF FS++L+
Sbjct: 31 NPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVGYGKSYESAAEVRRRFRIFSESLE 90
Query: 92 LIRSTNCKGLSYRLGLN------------------------------------------- 108
+RSTN KGL YRLG+N
Sbjct: 91 EVRSTNRKGLPYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDW 150
Query: 109 -----ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+SPVK+Q HCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDCA FNN GCNG
Sbjct: 151 REDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNG 210
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYIKYNGG+DTEE+YPY G +GVC + +EN VQVLDSVNITL AEDEL++AV
Sbjct: 211 GLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAV 270
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
GLVRPVSVAF+V+DGFR YKSGVY+S CG TP DVNHAV+AVGYGVE+GVPYWLIKNSW
Sbjct: 271 GLVRPVSVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSW 330
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
G +WGD+GYFKMEMGKNMC IATCASYPVVA
Sbjct: 331 GADWGDNGYFKMEMGKNMCAIATCASYPVVA 361
>gi|218185|dbj|BAA14404.1| oryzain gamma precursor [Oryza sativa Japonica Group]
Length = 362
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 206/335 (61%), Positives = 241/335 (71%), Gaps = 48/335 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FDDSNPIR V+ E++V+ +G+ R AL FARFA R+GK Y E++ RF FS
Sbjct: 28 FDDSNPIRSVTDHAASALESTVIAALGRTRGALRFARFAVRHGKRYGDAAEVQRRFRIFS 87
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
++L+L+RSTN +GL YRLG+N
Sbjct: 88 ESLELVRSTNRRGLPYRLGINRFADMSWEEFQASRLGAAQNCSATLAGNHRMRDAPALPE 147
Query: 109 ---------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQ 159
+SPVKDQGHCGSCW FSTTGSLEA Y QA G +SLSEQQL DCA +NN
Sbjct: 148 TKDWREDGIVSPVKDQGHCGSCWPFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNF 207
Query: 160 GCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
GC+GGLPSQAFEYIKYNGGLDTEEAYPYTG +G+C + EN GV+VLDSVNITL AEDEL
Sbjct: 208 GCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICHYKPENAGVKVLDSVNITLVAEDEL 267
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
++AVGLVRPVSVAF+V++GFR YKSGVY+S CG +PMDVNHAV+AVGYGVE+GVPYWLI
Sbjct: 268 KNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLI 327
Query: 280 KNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
KNSWG +WGD+GYF MEMGKNMCGIATCASYP+VA
Sbjct: 328 KNSWGADWGDNGYFTMEMGKNMCGIATCASYPIVA 362
>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
Length = 354
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 211/333 (63%), Positives = 243/333 (72%), Gaps = 52/333 (15%)
Query: 29 DDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSK 88
+ +NPIR+VS E V++VIG+ R AL FARF R+GK Y+S EEMK R+ FS+
Sbjct: 27 EAANPIRMVSG-----VEAEVVRVIGECRRALKFARFVSRFGKSYQSEEEMKERYEIFSQ 81
Query: 89 NLDLIRSTNCKGLSYRLGLN---------------------------------------- 108
NL IRS N K L Y L +N
Sbjct: 82 NLRFIRSHNKKRLPYTLSVNHFADWTWEEFKRHRLGAAQNCSATLNGNHKLTDAVLPPTK 141
Query: 109 -------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGC 161
+S VKDQG CGSCWTFSTTG+LEAAY QAFGK ISLSEQQLVDCA FNN GC
Sbjct: 142 DWRKEGIVSSVKDQGSCGSCWTFSTTGALEAAYAQAFGKSISLSEQQLVDCAGPFNNFGC 201
Query: 162 NGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQH 221
+GGLPSQAFEYIKYNGGL+TEEAYPYTGKDGVCKFS+ENV VQVLDSVNITLGAEDEL+H
Sbjct: 202 HGGLPSQAFEYIKYNGGLETEEAYPYTGKDGVCKFSAENVAVQVLDSVNITLGAEDELKH 261
Query: 222 AVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKN 281
AV VRPVSVAF+VV+GF FY++GV++S CG+T DVNHAV+AVGYGVE+GVPYWLIKN
Sbjct: 262 AVAFVRPVSVAFQVVNGFHFYENGVFTSDTCGSTSQDVNHAVLAVGYGVENGVPYWLIKN 321
Query: 282 SWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
SWGE+WG++GYFKME+GKNMCG+ATCASYP+VA
Sbjct: 322 SWGESWGENGYFKMELGKNMCGVATCASYPIVA 354
>gi|222641669|gb|EEE69801.1| hypothetical protein OsJ_29533 [Oryza sativa Japonica Group]
Length = 314
Score = 406 bits (1043), Expect = e-111, Method: Compositional matrix adjust.
Identities = 199/291 (68%), Positives = 229/291 (78%), Gaps = 8/291 (2%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA----RRYGKIYESVEEMKLRF 83
FDDSNPIR V+ E++V+ +G+ R AL FARFA RR G + +
Sbjct: 28 FDDSNPIRSVTDHAASALESTVIAALGRTRDALRFARFAVRSFRRAGS--GAAQNCSATL 85
Query: 84 ATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGIS 143
A + D K +R +SPVKDQGHCGSCWTFSTTGSLEAAY QA GK +S
Sbjct: 86 AGNHRMRDAAALPETK--DWREDGIVSPVKDQGHCGSCWTFSTTGSLEAAYTQATGKPVS 143
Query: 144 LSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV 203
LSEQQLVDCA A+NN GC+GGLPSQAFEYIKYNGGLDTEEAYPYTG +G+C + ENVGV
Sbjct: 144 LSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICHYKPENVGV 203
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAV 263
+VLDSVNITLGAEDEL++AVGLVRPVSVAF+V++GFR YKSGVY+S CG +PMDVNHAV
Sbjct: 204 KVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAV 263
Query: 264 VAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
+AVGYGVE+GVPYWLIKNSWG +WGD+GYFKMEMGKNMCGIATCASYP+VA
Sbjct: 264 LAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASYPIVA 314
>gi|116779845|gb|ABK21448.1| unknown [Picea sitchensis]
gi|116791731|gb|ABK26088.1| unknown [Picea sitchensis]
gi|224286276|gb|ACN40847.1| unknown [Picea sitchensis]
Length = 357
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/361 (56%), Positives = 250/361 (69%), Gaps = 51/361 (14%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
MAR + +V S +L L A +A A SF+++ I +V+ D +++ E+S+ +++G ++
Sbjct: 1 MARILAIVLSTLLALAIAVSA---ARSFEETEYIDMVT-DKIQNLESSLFKILGTNPKSV 56
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
FA FA RYGK Y+SV ++ RF F KN++LI S N L Y L +N
Sbjct: 57 QFAEFALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLAINEFADITWEEFHG 116
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
+SPVK+Q HCGSCWTFSTTG+LEAAY
Sbjct: 117 QYLGASQNCSATKSNHKFTDAQPPTKKDWREEGIVSPVKNQAHCGSCWTFSTTGALEAAY 176
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
QA GK + LSEQQLVDCA AFNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYT KDGVC
Sbjct: 177 TQATGKTVILSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTAKDGVC 236
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGN 254
+ NVGV+V DSVNI+LGAEDEL+ AVGLVRPVSVAF+V+ FRFYK GV++ST CG
Sbjct: 237 NYDVNNVGVKVADSVNISLGAEDELKSAVGLVRPVSVAFQVIQDFRFYKEGVFTSTTCGQ 296
Query: 255 TPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
PMDVNHAV+AVGYGV E+G P+W+IKNSWG++WG GYFKMEMGKNMCG+ATCASYPVV
Sbjct: 297 GPMDVNHAVLAVGYGVSEEGTPHWIIKNSWGKSWGVEGYFKMEMGKNMCGVATCASYPVV 356
Query: 314 A 314
+
Sbjct: 357 S 357
>gi|148908373|gb|ABR17300.1| unknown [Picea sitchensis]
Length = 357
Score = 402 bits (1034), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/361 (56%), Positives = 250/361 (69%), Gaps = 51/361 (14%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
MAR + +V S +L L A +A A SF+++ I +V+ D +++ E+S+ +++G ++
Sbjct: 1 MARILAIVLSTLLALAIAVSA---ARSFEETEYIDMVT-DKIQNLESSLFKILGTNPKSV 56
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
FA FA RYGK Y+SV ++ RF F KN++LI S N L Y L +N
Sbjct: 57 QFAEFALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLAINEFADITWEEFHG 116
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
+SPVK+Q HCGSCWTFSTTG+LEAAY
Sbjct: 117 QYLGASQNCSATKSNHKFTDAQPPTKKDWREEGIVSPVKNQAHCGSCWTFSTTGALEAAY 176
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
QA GK + LSEQQLVDCA AFNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYT KDGVC
Sbjct: 177 TQATGKTVILSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTAKDGVC 236
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGN 254
+ NVGV+V DSVNI+LGAED+L+ AVGLVRPVSVAF+V+ FRFYK GV++ST CG
Sbjct: 237 NYDVNNVGVKVADSVNISLGAEDKLKSAVGLVRPVSVAFQVIQDFRFYKEGVFTSTTCGQ 296
Query: 255 TPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
PMDVNHAV+AVGYGV E+G P+W+IKNSWG++WG GYFKMEMGKNMCG+ATCASYPVV
Sbjct: 297 GPMDVNHAVLAVGYGVSEEGTPHWIIKNSWGKSWGVEGYFKMEMGKNMCGVATCASYPVV 356
Query: 314 A 314
+
Sbjct: 357 S 357
>gi|351724281|ref|NP_001237820.1| cysteine protease-like precursor [Glycine max]
gi|149393486|gb|ABR26679.1| putative cysteine protease [Glycine max]
Length = 355
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 202/334 (60%), Positives = 233/334 (69%), Gaps = 53/334 (15%)
Query: 29 DDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSK 88
+ +NPIR+V+ E V++VIGQ R AL FARF R+GK Y S EEM+ R+ FS+
Sbjct: 27 EAANPIRMVAG-----VEAEVVRVIGQCRRALKFARFMSRFGKSYRSEEEMRERYEIFSQ 81
Query: 89 NLDLIRSTNCKGLSYRLGLN---------------------------------------- 108
NL IRS N L Y L +N
Sbjct: 82 NLRFIRSHNKNRLPYTLSVNHFADWTWEEFKRHRLGAAQNCSATLNGNHKLTDAVLPPTK 141
Query: 109 -------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGC 161
+S VKDQG CGSCWTFSTTG+LEAA QAFGK ISLSEQQLVDCA FNN GC
Sbjct: 142 DWRKEGIVSDVKDQGSCGSCWTFSTTGALEAACAQAFGKSISLSEQQLVDCAGRFNNFGC 201
Query: 162 NGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQH 221
NGGLPSQAFEYIKYNGGL+TEEAYPYTGKDGVCKFS+ENV VQV+DSVNITLGAE+EL+H
Sbjct: 202 NGGLPSQAFEYIKYNGGLETEEAYPYTGKDGVCKFSAENVAVQVIDSVNITLGAENELKH 261
Query: 222 AVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKN 281
AV VRPVSVAF+VV+GF FY++GVY+S CG+T DVNHAV+AVGYGVE+GVPYWLIK
Sbjct: 262 AVAFVRPVSVAFQVVNGFHFYENGVYTSDICGSTSQDVNHAVLAVGYGVENGVPYWLIKK 321
Query: 282 SWGENWG-DHGYFKMEMGKNMCGIATCASYPVVA 314
GE G ++G K+E+GKNMCG+ATCASYPVVA
Sbjct: 322 FMGEKVGVENGLLKLELGKNMCGVATCASYPVVA 355
>gi|359484377|ref|XP_003633102.1| PREDICTED: thiol protease aleurain-like isoform 2 [Vitis vinifera]
Length = 318
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 208/364 (57%), Positives = 238/364 (65%), Gaps = 96/364 (26%)
Query: 1 MARPVQLVSSVILLLCCAAAASAS---ASSFDDSNPIRLVSSDGLRDFETSVLQVIGQAR 57
MAR + +V++V++LLC A+ A SSFD+ NPIRLVS D +RD E+SVL++IG R
Sbjct: 1 MAR-LSVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVS-DSIRDLESSVLRLIGDTR 58
Query: 58 HALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------- 108
HA SFA FA RYGK Y++V+E+KLRF FS+NL LIRSTN KGL Y L +N
Sbjct: 59 HAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLAVNQFADWTWEE 118
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
+SP+KDQGHCGSCWTFSTTG+L
Sbjct: 119 FRRHRLGAAQNCSATLKGNHKLTDVILPETKDWREDGIVSPIKDQGHCGSCWTFSTTGAL 178
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
EAAY QAFGKGISLSEQQLVDCA AFNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYTG
Sbjct: 179 EAAYAQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGL 238
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSST 250
DG CKFSSEN+GVQVLDSVNITL ++ HA
Sbjct: 239 DGTCKFSSENIGVQVLDSVNITL----DVNHA---------------------------- 266
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
V+AVGYGVEDGV YWLIKNSWGENWGD+GYFKME+GKNMCG+ATC+SY
Sbjct: 267 ------------VLAVGYGVEDGVAYWLIKNSWGENWGDNGYFKMELGKNMCGVATCSSY 314
Query: 311 PVVA 314
PVVA
Sbjct: 315 PVVA 318
>gi|385298943|gb|AFI60244.1| cysteine protease/senescence-enhanced 1, partial [Panicum virgatum]
Length = 282
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 186/281 (66%), Positives = 213/281 (75%), Gaps = 48/281 (17%)
Query: 82 RFATFSKNLDLIRSTNCKGLSYRLGLN--------------------------------- 108
RF FS++L+L+RSTN KGL YRLG+N
Sbjct: 2 RFRIFSESLELVRSTNXKGLPYRLGINRFADMSWEXFRSTRLGAAQNCSATLAGNHRMRA 61
Query: 109 ---------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA 153
+SPVK+QGHCGSCWTFSTTG+LEAAY QA GK +SLSEQQLVDCA
Sbjct: 62 AAALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPVSLSEQQLVDCA 121
Query: 154 QAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITL 213
A+NN GCNGGLPSQAFEYIK+NGGLDTEE+YPY G +G+C+F + NVGV+VLDSVNITL
Sbjct: 122 GAYNNFGCNGGLPSQAFEYIKHNGGLDTEESYPYKGVNGLCQFKASNVGVKVLDSVNITL 181
Query: 214 GAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG 273
GAE+EL+ AVGLVRPVSVAFEV++GFR YKSGVY+S CG TPMDVNHAV+AVGYGVE+G
Sbjct: 182 GAENELKDAVGLVRPVSVAFEVINGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVENG 241
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VPYWLIKNSWG +WGD GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 242 VPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCASYPIVA 282
>gi|414589597|tpg|DAA40168.1| TPA: hypothetical protein ZEAMMB73_868349 [Zea mays]
Length = 252
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 171/206 (83%), Positives = 188/206 (91%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QGHCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDC AFNN GC GGLPSQ
Sbjct: 47 VSPVKNQGHCGSCWTFSTTGALEAAYTQATGKAISLSEQQLVDCGFAFNNFGCKGGLPSQ 106
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIKYNGGLDTEE+YPY G +G+C+F +ENVGV+VLDSVNITLGAEDEL+ AVGLVRP
Sbjct: 107 AFEYIKYNGGLDTEESYPYQGVNGICQFKAENVGVKVLDSVNITLGAEDELKDAVGLVRP 166
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VSVAFEV+ GFR YK+GVY+S CG TPMDVNHAV+AVGYGVE+GVPYWLIKNSWG +WG
Sbjct: 167 VSVAFEVISGFRLYKTGVYTSDHCGTTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG 226
Query: 289 DHGYFKMEMGKNMCGIATCASYPVVA 314
D GYFKMEMGKNMCG+ATCASYPVVA
Sbjct: 227 DEGYFKMEMGKNMCGVATCASYPVVA 252
>gi|302790930|ref|XP_002977232.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
gi|300155208|gb|EFJ21841.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
Length = 353
Score = 347 bits (889), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 177/337 (52%), Positives = 219/337 (64%), Gaps = 49/337 (14%)
Query: 23 ASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLR 82
++A DDS+ I +V DG+ +++G+ F FA R+ ++Y S+ E++ R
Sbjct: 15 STARFLDDSSAISMVI-DGIS--PARFTELLGEGHKVARFHEFATRHKRVYGSLVELRER 71
Query: 83 FATFSKNLDLIRSTNCKGLSYRLGLN---------------------------------- 108
F TFS+NL+LI TN K L Y L +N
Sbjct: 72 FVTFSRNLELIEETNRKELPYTLAVNQFADMSWEEFKKHNLFSSQNCSATTTNSVRAFLT 131
Query: 109 ------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
+SPVK+Q HCGSCWTFSTTG+LE+A+ QA GK + LSEQQLVDCA +
Sbjct: 132 PPSKKDWRDDKIVSPVKNQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGY 191
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
NN GCNGGLPSQAFEYI+YNGGLDTE++YPYTG DG C ++ ++G +V D VNIT GAE
Sbjct: 192 NNFGCNGGLPSQAFEYIRYNGGLDTEDSYPYTGHDGKCTYNQNSIGAKVYDVVNITEGAE 251
Query: 217 DELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
DEL HAV RPVS+A+EV+ FRFYKSGVY+S CG P VNHAV+AVGY + VPY
Sbjct: 252 DELIHAVAFNRPVSIAYEVLKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYNRDAPVPY 311
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
W+IKNSWGE++G GYF MEMGKNMCGIATCASYPVV
Sbjct: 312 WIIKNSWGESFGLDGYFYMEMGKNMCGIATCASYPVV 348
>gi|302763927|ref|XP_002965385.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
gi|300167618|gb|EFJ34223.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
Length = 353
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 175/337 (51%), Positives = 218/337 (64%), Gaps = 49/337 (14%)
Query: 23 ASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLR 82
++A DDS+ I +V DG+ +++G+ F FA R+ ++Y S+ E++ R
Sbjct: 15 STARFLDDSSAISMVI-DGIS--PARFTELLGEGHKVARFHEFATRHKRVYGSLVELRER 71
Query: 83 FATFSKNLDLIRSTNCKGLSYRLGLN---------------------------------- 108
F TFS+NL+LI TN K L Y L +N
Sbjct: 72 FVTFSRNLELIEETNRKELPYTLAVNQFADMSWEEFKKHNLFSSQNCSATATNSVRAFLT 131
Query: 109 ------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
+SPVK+Q HCGSCWTFSTTG+LE+A+ QA GK + LSEQQLVDCA +
Sbjct: 132 PPSKKDWRDDKIVSPVKNQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGY 191
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
NN GC+GGLPSQAFEYI+YNGGLDTE++YPYT DG C ++ ++G +V D VNIT GAE
Sbjct: 192 NNFGCSGGLPSQAFEYIRYNGGLDTEDSYPYTAHDGKCMYNQNSIGAKVYDVVNITEGAE 251
Query: 217 DELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
DEL HAV RPVS+A+EV+ FRFYKSGVY+S CG P VNHAV+AVGY + VPY
Sbjct: 252 DELIHAVAFNRPVSIAYEVLKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYNRDAPVPY 311
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
W+IKNSWGE++G GYF MEMGKNMCGIATCASYPVV
Sbjct: 312 WIIKNSWGESFGLDGYFYMEMGKNMCGIATCASYPVV 348
>gi|356569685|ref|XP_003553027.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 3-like [Glycine
max]
Length = 428
Score = 339 bits (870), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 180/312 (57%), Positives = 208/312 (66%), Gaps = 62/312 (19%)
Query: 16 CCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYES 75
C ++ S+FDD+NPIRL S D E+ VL VIG +RHALSFARFA R+ K Y S
Sbjct: 13 CGRKPSTCCCSTFDDANPIRLAS-----DLESQVLDVIGXSRHALSFARFACRHDKRYHS 67
Query: 76 VEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------------------------- 108
V E++ F FS NL LIRSTN + L+Y LG+N
Sbjct: 68 VGEIRNDFQIFSDNLKLIRSTNRRSLTYTLGVNHFADWTWEEFTRHKLDAPQNCSATLKG 127
Query: 109 --------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
+S VKDQG+CGSCWTFSTTG+LEAAY QAFGK ISLSEQQ
Sbjct: 128 NHRLTDVVLPDEKDWRKEGIVSQVKDQGNCGSCWTFSTTGALEAAYTQAFGKNISLSEQQ 187
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS 208
LVDCA AFNN GCNGGLPS+ LDTEEAYPYTGKDGVCKF+++N+ VQV+DS
Sbjct: 188 LVDCAGAFNNFGCNGGLPSR----------LDTEEAYPYTGKDGVCKFTAKNIAVQVIDS 237
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+NITLGAEDEL+ V V PVSVAFEVV FRFY +GVY+ST CG+TPMDVNH V+AVGY
Sbjct: 238 INITLGAEDELKQVVAFVWPVSVAFEVVKDFRFYNNGVYTSTICGSTPMDVNHVVLAVGY 297
Query: 269 GVEDGVPYWLIK 280
GVEDGVPYW+IK
Sbjct: 298 GVEDGVPYWIIK 309
>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 336
Score = 330 bits (847), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 165/311 (53%), Positives = 201/311 (64%), Gaps = 48/311 (15%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
+++G +R L FA FA +Y K Y++VEE+K RF TF +++ L+ + N SY L +N
Sbjct: 18 EILGHSRDVLHFAGFAAKYKKEYKTVEELKHRFVTFLESVKLVETHNKGQHSYSLAVNEF 77
Query: 109 ---------------------------------------------ISPVKDQGHCGSCWT 123
+S VK+Q CGSCWT
Sbjct: 78 ADMTFEEFRDSRLMKGEQNCSATVGNHVLTGESLPKTKDWREEGIVSQVKNQASCGSCWT 137
Query: 124 FSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEE 183
FSTTG+LEAA+ QA GK + LSEQQLVDCA FNN GC GGLPSQAFEYI+YNGG+DTE+
Sbjct: 138 FSTTGALEAAHAQATGKMVLLSEQQLVDCAGEFNNFGCGGGLPSQAFEYIRYNGGIDTED 197
Query: 184 AYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYK 243
+YPY KD C+F +G QV D VNIT GAE +L+HA+ +RPVSVAFEVV FR Y
Sbjct: 198 SYPYNAKDSQCRFHKNTIGAQVWDVVNITEGAETQLKHAIATMRPVSVAFEVVHDFRLYN 257
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGKNMC 302
GVY+S C P VNHAV+AVGYG E+GVPYW+IKNSWG +WG +GYF MEMGKNMC
Sbjct: 258 GGVYTSLNCHTGPQTVNHAVLAVGYGEDENGVPYWIIKNSWGADWGMNGYFNMEMGKNMC 317
Query: 303 GIATCASYPVV 313
G+ATCASYPVV
Sbjct: 318 GVATCASYPVV 328
>gi|37655265|gb|AAQ96835.1| cysteine proteinase [Glycine max]
Length = 215
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 152/184 (82%), Positives = 170/184 (92%)
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNG 177
CGSCW FSTTG+LEAAY QAFGK ISLSEQQLVDCA FNN GC+GGLPSQAFEYIKYNG
Sbjct: 1 CGSCWAFSTTGALEAAYAQAFGKSISLSEQQLVDCAGPFNNFGCHGGLPSQAFEYIKYNG 60
Query: 178 GLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD 237
GL+TEEAYPYTGKDGVCKFS+ENV VQVLDSVNITLGAEDEL+HAV VRPVSVAF+VV+
Sbjct: 61 GLETEEAYPYTGKDGVCKFSAENVAVQVLDSVNITLGAEDELKHAVAFVRPVSVAFQVVN 120
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM 297
GF FY++GV++S CG+T DVNHAV+AVGYGVE+GVPYWLIKNSWGE+WG++GYFKME+
Sbjct: 121 GFHFYENGVFTSDTCGSTSQDVNHAVLAVGYGVENGVPYWLIKNSWGESWGENGYFKMEL 180
Query: 298 GKNM 301
GKNM
Sbjct: 181 GKNM 184
>gi|6635844|gb|AAF20005.1|AF213939_1 cysteine protease [Prunus dulcis]
Length = 178
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 147/178 (82%), Positives = 159/178 (89%)
Query: 114 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 173
DQGHCGSCWTFSTTG+LEAAY QAFGK ISLSEQQLVDCA AFNN GC+GGLPSQAFEYI
Sbjct: 1 DQGHCGSCWTFSTTGALEAAYVQAFGKQISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 60
Query: 174 KYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAF 233
KYNGGLDTE AYPY G DG CKFS+ENVG QVLDSVNITLG E EL+HAV VRPVSVAF
Sbjct: 61 KYNGGLDTEAAYPYVGTDGACKFSAENVGAQVLDSVNITLGDEQELKHAVAFVRPVSVAF 120
Query: 234 EVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHG 291
+VV FRFYKSGVY+S CG++PMDVNHAV+AVGYG E GVP+WLIKNSWGE+WGD+G
Sbjct: 121 QVVKSFRFYKSGVYTSDTCGSSPMDVNHAVLAVGYGEEGGVPFWLIKNSWGESWGDNG 178
>gi|356570072|ref|XP_003553215.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 3-like, partial
[Glycine max]
Length = 301
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 156/274 (56%), Positives = 182/274 (66%), Gaps = 52/274 (18%)
Query: 26 SSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFAT 85
S+FDD NPIRL S D E+ VL VI Q+RHALSFA FA + K Y S++E++ F
Sbjct: 2 STFDDVNPIRLAS-----DLESQVLDVIMQSRHALSFACFACHHDKRYHSIDEIRNGFQI 56
Query: 86 FSKNLDLIRSTNCKGLSYRLGLN------------------------------------- 108
FS NL LIRSTN + L+Y LG+N
Sbjct: 57 FSDNLKLIRSTNRRSLTYMLGVNHFADWTWEEFTRHKLGAPQNCSATLKGNHRLTDVVLP 116
Query: 109 ----------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+S VKDQG+C S WTFSTTG+LEAAY QAFGK ISLSEQQLVDC AFNN
Sbjct: 117 DEKDWRKEGIVSQVKDQGNCRSSWTFSTTGALEAAYAQAFGKNISLSEQQLVDCVGAFNN 176
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCN GLPS+AFEYIKYNGGLDTEEAYPYTGKDGV KF+++NV +QV+DS+NITLGAEDE
Sbjct: 177 FGCNDGLPSKAFEYIKYNGGLDTEEAYPYTGKDGVYKFAAKNVAIQVIDSINITLGAEDE 236
Query: 219 LQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKC 252
L+ AV VRPVSVAFEV F+FY +GVY++T C
Sbjct: 237 LKQAVAFVRPVSVAFEVSKDFQFYNNGVYTNTIC 270
>gi|354466410|ref|XP_003495667.1| PREDICTED: pro-cathepsin H-like [Cricetulus griseus]
Length = 333
Score = 285 bits (728), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 180/313 (57%), Gaps = 49/313 (15%)
Query: 48 SVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGL 107
+ ++ A F + ++ K Y SVE R TF+ N I + N + ++++GL
Sbjct: 19 ATTELTVNAIEKFHFKSWMTQHQKTYSSVE-YNYRLKTFANNWRKIHAHNQRNHTFKMGL 77
Query: 108 N------------------------------------------------ISPVKDQGHCG 119
N +S VK+QG CG
Sbjct: 78 NQFSDMTFAEIKRKYLWSEPQNCSATKGNYLRGTGPLPPSMDWRKKGNFVSAVKNQGSCG 137
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+
Sbjct: 138 SCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGI 197
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGF 239
E+ YPY GKDG CKF + V D NITL E + AV L PVS AFEV D F
Sbjct: 198 MGEDTYPYRGKDGHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVTDDF 257
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK 299
Y+ G+YSST C TP VNHAV+AVGYG +DG+PYW++KNSWG NWGD GYF +E GK
Sbjct: 258 MLYQKGIYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGTNWGDKGYFLIERGK 317
Query: 300 NMCGIATCASYPV 312
NMCG+A CASYP+
Sbjct: 318 NMCGLAACASYPI 330
>gi|348671668|gb|EGZ11488.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 396
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 135/212 (63%), Positives = 160/212 (75%)
Query: 103 YRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCN 162
+R +SPVK+QG CGSCWTFSTTG LE+ G+ LSEQ L+DCAQAF+N GCN
Sbjct: 185 WRADGAVSPVKNQGKCGSCWTFSTTGCLESHLKLKHGQFKILSEQNLLDCAQAFDNHGCN 244
Query: 163 GGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHA 222
GGLPS AFEY+KYNGGLDTEE YPY K+G CKF++ +VG QV VNIT E EL+ A
Sbjct: 245 GGLPSHAFEYVKYNGGLDTEETYPYEAKEGKCKFNTYHVGAQVEQVVNITSRNEKELKAA 304
Query: 223 VGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
VG PVS+AF+VV FRFYKSGVY ST+C + DVNHAV+AVGYGVEDG +W++KNS
Sbjct: 305 VGSTGPVSIAFQVVSDFRFYKSGVYESTECHSGEKDVNHAVLAVGYGVEDGKKHWIVKNS 364
Query: 283 WGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
WG WG G+F++ G NMCG+A CASYPVVA
Sbjct: 365 WGAEWGMDGFFQIARGSNMCGLADCASYPVVA 396
>gi|344238391|gb|EGV94494.1| Ras-specific guanine nucleotide-releasing factor 1 [Cricetulus
griseus]
Length = 1632
Score = 283 bits (724), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 128/198 (64%), Positives = 146/198 (73%)
Query: 115 QGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIK 174
QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI
Sbjct: 1432 QGSCGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYIL 1491
Query: 175 YNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFE 234
YN G+ E+ YPY GKDG CKF + V D NITL E + AV L PVS AFE
Sbjct: 1492 YNKGIMGEDTYPYRGKDGHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFE 1551
Query: 235 VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFK 294
V D F Y+ G+YSST C TP VNHAV+AVGYG +DG+PYW++KNSWG NWGD GYF
Sbjct: 1552 VTDDFMLYQKGIYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGTNWGDKGYFL 1611
Query: 295 MEMGKNMCGIATCASYPV 312
+E GKNMCG+A CASYP+
Sbjct: 1612 IERGKNMCGLAACASYPI 1629
>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 143/303 (47%), Positives = 186/303 (61%), Gaps = 50/303 (16%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK-GLSYRLGLN----------- 108
+F ++ + K+YE+ EE ++R TFSKN ++I S N + +++ +GLN
Sbjct: 41 AFRQYEVEFSKMYETAEERRIRAQTFSKNFEMITSHNQREDVTWTMGLNFDADLTFSEFQ 100
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
+SPVK+QGHCGSCWTFSTTG LE+
Sbjct: 101 SRYLMVSQDCSATSTRDLDIDILSLPENFDWREHGGVSPVKNQGHCGSCWTFSTTGCLES 160
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
A+ K +LSEQQLVDCAQ F+N GCNGGLPS AFEYI Y GGL+ E+ Y Y ++G
Sbjct: 161 AHLIHHKKAYNLSEQQLVDCAQDFDNHGCNGGLPSHAFEYIHYVGGLEEEQDYSYHAEEG 220
Query: 193 VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKC 252
+C+F V + NIT ED+L A+ PVSVAFEVVDGFRFYK GVY S C
Sbjct: 221 LCEFDPTKTAGTVREVFNITETDEDQLTIALAYFNPVSVAFEVVDGFRFYKEGVYQSDTC 280
Query: 253 GNTPMDVNHAVVAVGYGV--EDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
+ P DVNHAV+AVGYG+ + PY+++KNSWG WGD G+FK++ G+NMCGIATCAS+
Sbjct: 281 KSGPEDVNHAVLAVGYGMCKKCETPYFIVKNSWGAEWGDEGFFKIKRGENMCGIATCASF 340
Query: 311 PVV 313
P+V
Sbjct: 341 PIV 343
>gi|301103045|ref|XP_002900609.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262101872|gb|EEY59924.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 376
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 133/211 (63%), Positives = 159/211 (75%)
Query: 103 YRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCN 162
+R +SPVK+QG CGSCWTFSTTG LE+ G+ LSEQ L+DCAQ F+N GCN
Sbjct: 165 WRADGAVSPVKNQGKCGSCWTFSTTGCLESHVKLKHGEFTILSEQNLLDCAQNFDNHGCN 224
Query: 163 GGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHA 222
GGLPS AFEYIKYNGGLDTEE YPY K+G CKF++ +VGVQV VNIT E+EL+ A
Sbjct: 225 GGLPSHAFEYIKYNGGLDTEETYPYEAKEGKCKFNTYHVGVQVDQVVNITTRNENELRAA 284
Query: 223 VGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
VG PVS+AF+VV FRFY+SGVY S +C + DVNHAV+AVGYGVEDG +W++KNS
Sbjct: 285 VGSTGPVSIAFQVVSDFRFYESGVYESKECRSDEKDVNHAVLAVGYGVEDGKDHWIVKNS 344
Query: 283 WGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
WG WG G+F++ G NMCG+A CASYPVV
Sbjct: 345 WGSQWGMDGFFQIARGSNMCGVAVCASYPVV 375
>gi|53748483|emb|CAH59426.1| cysteine protease 1 [Plantago major]
Length = 149
Score = 279 bits (714), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 127/149 (85%), Positives = 140/149 (93%)
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
PSQAFEYIKYNGGL+TE AYPYTGKDGVCKFSSENVGV+V DSVNITLGAEDEL+HAV
Sbjct: 1 PSQAFEYIKYNGGLETESAYPYTGKDGVCKFSSENVGVRVFDSVNITLGAEDELKHAVAF 60
Query: 226 VRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGE 285
RPVSVAFEVV GFR YKSGVY+ST CGN+PMDVNHAV+AVGYGVE+G+PYWL+KNSWG
Sbjct: 61 ARPVSVAFEVVTGFRAYKSGVYTSTTCGNSPMDVNHAVLAVGYGVENGIPYWLVKNSWGA 120
Query: 286 NWGDHGYFKMEMGKNMCGIATCASYPVVA 314
+WGD+GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 121 DWGDNGYFKMEMGKNMCGVATCASYPIVA 149
>gi|410960470|ref|XP_003986812.1| PREDICTED: pro-cathepsin H [Felis catus]
Length = 321
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 177/307 (57%), Gaps = 49/307 (15%)
Query: 54 GQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----- 108
GQ + F + ++ K Y S EE + R TF N I + N ++++GLN
Sbjct: 13 GQHHEKVHFKSWMVQHQKRYSS-EEYQRRLQTFVGNWRRISAHNAGNHTFKMGLNQFSDM 71
Query: 109 -------------------------------------------ISPVKDQGHCGSCWTFS 125
+SPVK+QG CGSCWTFS
Sbjct: 72 SFAEIKHKYLWSEPQNCSATRGNYLRGTGPYPPFVDWRTKGKYVSPVKNQGGCGSCWTFS 131
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
TTG+LE+A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E+ Y
Sbjct: 132 TTGALESAIAIKTGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTY 191
Query: 186 PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSG 245
PY G+DG CKF V D NIT+ E+ + AV L PVS AFEV D F Y+ G
Sbjct: 192 PYKGQDGDCKFQPSKAIAFVKDVANITINDEEAMVEAVALYNPVSFAFEVTDDFMMYRKG 251
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIA 305
VYSST C TP VNHAV+AVGYG +DG+PYW++KNSWG WG GYF +E GKNMCG+A
Sbjct: 252 VYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGPQWGMKGYFLIERGKNMCGLA 311
Query: 306 TCASYPV 312
CASYP+
Sbjct: 312 ACASYPI 318
>gi|348551380|ref|XP_003461508.1| PREDICTED: pro-cathepsin H-like [Cavia porcellus]
Length = 335
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 130/204 (63%), Positives = 154/204 (75%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 129 VSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQDFNNHGCEGGLPSQ 188
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GKDG C+F + V D VNITL E+ + AV L P
Sbjct: 189 AFEYILYNKGIMGEDTYPYQGKDGHCRFQPQKAIAFVKDVVNITLNDEEAMVEAVALYNP 248
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV + F Y+SG+YSST C TP VNHAV+AVGYGV++GVPYW++KNSWG WG
Sbjct: 249 VSFAFEVTEDFISYQSGIYSSTSCHKTPDKVNHAVLAVGYGVQNGVPYWIVKNSWGTAWG 308
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
GYF +E GKNMCG+A CAS+P+
Sbjct: 309 QDGYFLIERGKNMCGLAACASFPI 332
>gi|148688953|gb|EDL20900.1| cathepsin H, isoform CRA_a [Mus musculus]
Length = 291
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 139/269 (51%), Positives = 176/269 (65%), Gaps = 14/269 (5%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------I 109
A F + +++ K Y SVE R F+ N I++ N + ++++ LN
Sbjct: 24 AIEKFHFKSWMKQHQKTYSSVE-YNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSF 82
Query: 110 SPVKD-------QGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCN 162
+ +K QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQAFNN GC
Sbjct: 83 AEIKHKFLWSEPQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCK 142
Query: 163 GGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHA 222
GGLPSQAFEYI YN G+ E++YPY GKD C+F+ + V + VNITL E + A
Sbjct: 143 GGLPSQAFEYILYNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEA 202
Query: 223 VGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
V L PVS AFEV + F YKSGVYSS C TP VNHAV+AVGYG ++G+ YW++KNS
Sbjct: 203 VALYNPVSFAFEVTEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNS 262
Query: 283 WGENWGDHGYFKMEMGKNMCGIATCASYP 311
WG WG++GYF +E GKNMCG+A CASYP
Sbjct: 263 WGSQWGENGYFLIERGKNMCGLAACASYP 291
>gi|6978721|ref|NP_037071.1| pro-cathepsin H precursor [Rattus norvegicus]
gi|115729|sp|P00786.1|CATH_RAT RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|55886|emb|CAA68699.1| cathepsin H pre-pro-peptide [Rattus norvegicus]
gi|55391460|gb|AAH85352.1| Cathepsin H [Rattus norvegicus]
gi|149018921|gb|EDL77562.1| cathepsin H, isoform CRA_a [Rattus norvegicus]
gi|226475|prf||1514114A cathepsin H
Length = 333
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 143/305 (46%), Positives = 181/305 (59%), Gaps = 49/305 (16%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------- 108
A F + +++ K Y S E R F+ N I++ N + ++++GLN
Sbjct: 27 AIEKFHFTSWMKQHQKTYSS-REYSHRLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSF 85
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
+SPVK+QG CGSCWTFSTT
Sbjct: 86 AEIKHKYLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTT 145
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE+A A GK ++L+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E++YPY
Sbjct: 146 GALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPY 205
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVY 247
GK+G CKF+ E V + VNITL E + AV L PVS AFEV + F YKSGVY
Sbjct: 206 IGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVY 265
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATC 307
SS C TP VNHAV+AVGYG ++G+ YW++KNSWG NWG++GYF +E GKNMCG+A C
Sbjct: 266 SSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAAC 325
Query: 308 ASYPV 312
ASYP+
Sbjct: 326 ASYPI 330
>gi|338717354|ref|XP_001492337.3| PREDICTED: pro-cathepsin H-like [Equus caballus]
Length = 323
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 178/308 (57%), Gaps = 49/308 (15%)
Query: 53 IGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---- 108
+ +A F + ++ K Y S EE R TF N I + N ++R+GLN
Sbjct: 14 LSRACEKFHFKSWMVQHQKKYSS-EEYHHRLQTFVSNWRKINAHNTGNHTFRMGLNQFSA 72
Query: 109 --------------------------------------------ISPVKDQGHCGSCWTF 124
+SPVK+QG CGSCWTF
Sbjct: 73 MNFAELKHKYLWSEPQNCSATKGNYLRGAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTF 132
Query: 125 STTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEA 184
STTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E+
Sbjct: 133 STTGALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDT 192
Query: 185 YPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKS 244
YPY G+DG CKF V D NITL E + AV L PVS AFEV + F Y+
Sbjct: 193 YPYKGQDGDCKFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVTEDFMMYRK 252
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGI 304
G+YSST C TP VNHAV+AVGYG E+G+PYW++KNSWG +WG +GYF +E GKNMCG+
Sbjct: 253 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNGYFLIERGKNMCGL 312
Query: 305 ATCASYPV 312
A CASYP+
Sbjct: 313 AACASYPI 320
>gi|118388791|ref|XP_001027491.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89309261|gb|EAS07249.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 356
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 136/230 (59%), Positives = 170/230 (73%), Gaps = 11/230 (4%)
Query: 88 KNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI---SL 144
KN+ + S N K L+ +SPVKDQ +CGSCWTFSTTG++E+ H A + + SL
Sbjct: 123 KNVQVPESINWKDLN-----KVSPVKDQQNCGSCWTFSTTGAIES--HYAIFEDVEPTSL 175
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQ 204
SEQQL+DCA AFNN GC+GGLPSQAFEYIKYNGG+ E +Y Y +D C+FS E VG +
Sbjct: 176 SEQQLIDCAGAFNNNGCSGGLPSQAFEYIKYNGGISYENSYYYIAQDQECQFSPETVGAR 235
Query: 205 VLD-SVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAV 263
V S NIT G ED+L+ AVG V PVS+AF+V+ F+ YKSGVYS+ C ++P VNHAV
Sbjct: 236 VRGGSFNITQGDEDQLKQAVGTVGPVSIAFQVMGDFKLYKSGVYSNPDCSSSPQTVNHAV 295
Query: 264 VAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
+AVGYG E+GV YW +KNSW E WGD GYFK++ G NMCG+ATCASYP++
Sbjct: 296 LAVGYGSENGVDYWYVKNSWSEFWGDEGYFKIQRGVNMCGVATCASYPLL 345
>gi|166235890|ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]
gi|341940309|sp|P49935.2|CATH_MOUSE RecName: Full=Pro-cathepsin H; AltName: Full=Cathepsin B3; AltName:
Full=Cathepsin BA; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|74151776|dbj|BAE29677.1| unnamed protein product [Mus musculus]
gi|74181999|dbj|BAE34071.1| unnamed protein product [Mus musculus]
gi|74211659|dbj|BAE29188.1| unnamed protein product [Mus musculus]
gi|74213518|dbj|BAE35569.1| unnamed protein product [Mus musculus]
gi|148688954|gb|EDL20901.1| cathepsin H, isoform CRA_b [Mus musculus]
Length = 333
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 142/305 (46%), Positives = 180/305 (59%), Gaps = 49/305 (16%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------- 108
A F + +++ K Y SVE R F+ N I++ N + ++++ LN
Sbjct: 27 AIEKFHFKSWMKQHQKTYSSVE-YNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSF 85
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
+SPVK+QG CGSCWTFSTT
Sbjct: 86 AEIKHKFLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTT 145
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE+A A GK +SL+EQQLVDCAQAFNN GC GGLPSQAFEYI YN G+ E++YPY
Sbjct: 146 GALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPY 205
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVY 247
GKD C+F+ + V + VNITL E + AV L PVS AFEV + F YKSGVY
Sbjct: 206 IGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVY 265
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATC 307
SS C TP VNHAV+AVGYG ++G+ YW++KNSWG WG++GYF +E GKNMCG+A C
Sbjct: 266 SSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAAC 325
Query: 308 ASYPV 312
ASYP+
Sbjct: 326 ASYPI 330
>gi|351700981|gb|EHB03900.1| Cathepsin H [Heterocephalus glaber]
Length = 334
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 129/204 (63%), Positives = 153/204 (75%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+S VK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 128 VSAVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ 187
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GKDG C+F + V D VNITL E+ + AV L P
Sbjct: 188 AFEYILYNKGIMGEDTYPYEGKDGHCRFQPQKAIAFVKDIVNITLNDEEAMVEAVALYNP 247
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS A+EV + F YK G+YSST C TP VNHAV+AVGYGV+ GVPYW++KNSWG WG
Sbjct: 248 VSFAYEVTEDFMSYKRGIYSSTSCHKTPDKVNHAVLAVGYGVDHGVPYWIVKNSWGTQWG 307
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
++GYF +E GKNMCG+A CASYP+
Sbjct: 308 NNGYFLIERGKNMCGLAACASYPI 331
>gi|355692920|gb|EHH27523.1| Cathepsin H, partial [Macaca mulatta]
Length = 305
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/301 (47%), Positives = 175/301 (58%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 3 FHFKSWMSKHHKTY-STEEYHHRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 61
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 62 HKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 121
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 122 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 181
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ AE+ + AV L PVS AFEV F YK+G+YSST
Sbjct: 182 GDCKFRPGKAIGFVKDVANITIYAEEAMVEAVALYNPVSFAFEVTQDFMMYKTGIYSSTS 241
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 242 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 301
Query: 312 V 312
+
Sbjct: 302 I 302
>gi|47522632|ref|NP_999094.1| pro-cathepsin H precursor [Sus scrofa]
gi|5915886|sp|O46427.1|CATH_PIG RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|2735659|gb|AAB93957.1| preprocathepsin H [Sus scrofa]
gi|172050733|gb|ACB70168.1| cathepsin H [Sus scrofa]
Length = 335
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/301 (47%), Positives = 176/301 (58%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L F + ++ K Y S+EE R F N I + N +++LGLN
Sbjct: 33 LHFKSWMVQHQKKY-SLEEYHHRLQVFVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIR 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKGNYLRGTGPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E+ YPY G+D
Sbjct: 152 SAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
CKF + V D NIT+ E+ + AV L PVS AFEV + F Y+ G+YSST
Sbjct: 212 DHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>gi|203341|gb|AAA63484.1| cathepsin H [Rattus norvegicus]
Length = 298
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 129/204 (63%), Positives = 155/204 (75%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK ++L+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 92 VSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQ 151
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E++YPY GK+G CKF+ E V + VNITL E + AV L P
Sbjct: 152 AFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNP 211
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV + F YKSGVYSS C TP VNHAV+AVGYG ++G+ YW++KNSWG NWG
Sbjct: 212 VSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWG 271
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
++GYF +E GKNMCG+A CASYP+
Sbjct: 272 NNGYFLIERGKNMCGLAACASYPI 295
>gi|417399160|gb|JAA46608.1| Putative pro-cathepsin h [Desmodus rotundus]
Length = 336
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 140/301 (46%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE + R TF+ N I+ N + ++++G+N
Sbjct: 34 FHFKSWMEQHQKTY-SAEEYRHRLQTFASNQRKIKEHNARNHTFKMGINPFSDMTFAEFK 92
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 93 RRYLWSEPQNCSATKSNYLRGHGPYPTSVDWRKKGRFVSPVKNQGGCGSCWTFSTTGALE 152
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A GK +SLSEQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E++YPY GKD
Sbjct: 153 SAIAIKTGKMLSLSEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMEEDSYPYEGKD 212
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
C+F E V D NITL E + AV L PVS AFEV F Y+ G+YSST
Sbjct: 213 SNCRFQPEKAIAFVKDVANITLNDEAAMVEAVALYNPVSFAFEVTSDFMLYRKGIYSSTS 272
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG ++G PYW++KNSWG WG +GYF +E G NMCG+A CASYP
Sbjct: 273 CHKTPDKVNHAVLAVGYGEQNGKPYWIVKNSWGPYWGMNGYFLIERGTNMCGLAACASYP 332
Query: 312 V 312
+
Sbjct: 333 I 333
>gi|454101|gb|AAA82966.1| cathepsin H prepropeptide [Mus musculus]
Length = 333
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 141/305 (46%), Positives = 179/305 (58%), Gaps = 49/305 (16%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------- 108
A F + +++ K Y SVE R F+ N I++ N + ++++ LN
Sbjct: 27 AIEKFHFKSWMKQHQKTYSSVE-YNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSF 85
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
+SPVK+QG C SCWTFSTT
Sbjct: 86 AEIKHKFLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACASCWTFSTT 145
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE+A A GK +SL+EQQLVDCAQAFNN GC GGLPSQAFEYI YN G+ E++YPY
Sbjct: 146 GALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPY 205
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVY 247
GKD C+F+ + V + VNITL E + AV L PVS AFEV + F YKSGVY
Sbjct: 206 IGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVY 265
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATC 307
SS C TP VNHAV+AVGYG ++G+ YW++KNSWG WG++GYF +E GKNMCG+A C
Sbjct: 266 SSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAAC 325
Query: 308 ASYPV 312
ASYP+
Sbjct: 326 ASYPI 330
>gi|198435380|ref|XP_002128293.1| PREDICTED: similar to cathepsin H [Ciona intestinalis]
Length = 438
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 149/303 (49%), Positives = 180/303 (59%), Gaps = 51/303 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK-GLSYRLGLN------------ 108
F + +GK Y + EE + RF FSK+L I+ N + ++ +GLN
Sbjct: 135 FKGWQIEHGKQYINQEEAEKRFQIFSKSLKTIKEFNNRVDRTWEMGLNEFSDRTFEEFAS 194
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
++ VK+QG CGSCWTFSTTG LE+A
Sbjct: 195 IRLMMPQNCSATKGNHVSLGFEPPAQINCLEKGNFVTAVKNQGSCGSCWTFSTTGCLESA 254
Query: 134 --YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
H+ +SLSEQQLVDCAQAFN+ GCNGGLPSQAFEYI YN GL TE YPY G D
Sbjct: 255 TAIHKEGNPLVSLSEQQLVDCAQAFNDHGCNGGLPSQAFEYIHYNKGLMTEADYPYQGVD 314
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G C F + V VNIT G ED ++ AVGL+ PVS+AF+V FR YKSGVYSST
Sbjct: 315 GKCHFVASKASAFVKQIVNITKGNEDGIKEAVGLLNPVSIAFDVAKDFRHYKSGVYSSTL 374
Query: 252 CGNTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
CGN +VNHAV+AVGYG +G YWL+KNSWG WG +GYFK+E G NMCG+A CASY
Sbjct: 375 CGNKASEVNHAVLAVGYGYTSNGQDYWLVKNSWGPQWGINGYFKIERGSNMCGLADCASY 434
Query: 311 PVV 313
PV+
Sbjct: 435 PVI 437
>gi|301775254|ref|XP_002923050.1| PREDICTED: cathepsin H-like [Ailuropoda melanoleuca]
Length = 307
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 129/209 (61%), Positives = 152/209 (72%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G +SPVK+QG CGSCWTFSTTG+LE+A GK +SL+EQQLVDCAQ FNN GC G
Sbjct: 96 KKGKFVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQG 155
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI+YN G+ E++YPY G+DG CKF V D NIT+ E + AV
Sbjct: 156 GLPSQAFEYIRYNRGIMGEDSYPYKGQDGDCKFQPSKAIAFVKDVANITINDEQAMVEAV 215
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
L PVS AFEV F Y+ GVYSST C TP VNHAV+AVGYG ++GVPYW++KNSW
Sbjct: 216 ALFNPVSFAFEVTGDFMMYRKGVYSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSW 275
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G WG HGYF +E GKNMCG+A CASYP+
Sbjct: 276 GPQWGMHGYFLIERGKNMCGLAACASYPI 304
>gi|281350252|gb|EFB25836.1| hypothetical protein PANDA_012122 [Ailuropoda melanoleuca]
Length = 294
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 129/209 (61%), Positives = 152/209 (72%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G +SPVK+QG CGSCWTFSTTG+LE+A GK +SL+EQQLVDCAQ FNN GC G
Sbjct: 83 KKGKFVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQG 142
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI+YN G+ E++YPY G+DG CKF V D NIT+ E + AV
Sbjct: 143 GLPSQAFEYIRYNRGIMGEDSYPYKGQDGDCKFQPSKAIAFVKDVANITINDEQAMVEAV 202
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
L PVS AFEV F Y+ GVYSST C TP VNHAV+AVGYG ++GVPYW++KNSW
Sbjct: 203 ALFNPVSFAFEVTGDFMMYRKGVYSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSW 262
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G WG HGYF +E GKNMCG+A CASYP+
Sbjct: 263 GPQWGMHGYFLIERGKNMCGLAACASYPI 291
>gi|13905172|gb|AAH06878.1| Cathepsin H [Mus musculus]
Length = 333
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 141/305 (46%), Positives = 179/305 (58%), Gaps = 49/305 (16%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------- 108
A F + +++ K Y SVE R F+ N I++ N + ++++ LN
Sbjct: 27 AIEKFHFKSWMKQHQKTYSSVE-YNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSF 85
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
+SPV +QG CGSCWTFSTT
Sbjct: 86 AEIKHKFLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVINQGACGSCWTFSTT 145
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE+A A GK +SL+EQQLVDCAQAFNN GC GGLPSQAFEYI YN G+ E++YPY
Sbjct: 146 GALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPY 205
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVY 247
GKD C+F+ + V + VNITL E + AV L PVS AFEV + F YKSGVY
Sbjct: 206 IGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVY 265
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATC 307
SS C TP VNHAV+AVGYG ++G+ YW++KNSWG WG++GYF +E GKNMCG+A C
Sbjct: 266 SSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAAC 325
Query: 308 ASYPV 312
ASYP+
Sbjct: 326 ASYPI 330
>gi|296213765|ref|XP_002753411.1| PREDICTED: pro-cathepsin H [Callithrix jacchus]
Length = 336
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 142/301 (47%), Positives = 171/301 (56%), Gaps = 48/301 (15%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y EE R TF+ N I + N ++++ +N
Sbjct: 33 FHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIK 92
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 93 RKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALE 152
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 153 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKD 212
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
CKF V D NIT+ ED + AV L PVS AFEV F YK G+YSST
Sbjct: 213 SDCKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKRGIYSSTS 272
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 273 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 332
Query: 312 V 312
V
Sbjct: 333 V 333
>gi|4139678|pdb|8PCH|A Chain A, Crystal Structure Of Porcine Cathepsin H Determined At 2.1
Angstrom Resolution: Location Of The Mini-Chain
C-Terminal Carboxyl Group Defines Cathepsin H
Aminopeptidase Function
gi|28948781|pdb|1NB3|A Chain A, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948784|pdb|1NB3|B Chain B, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948787|pdb|1NB3|C Chain C, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948790|pdb|1NB3|D Chain D, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948793|pdb|1NB5|A Chain A, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948796|pdb|1NB5|B Chain B, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948799|pdb|1NB5|C Chain C, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948802|pdb|1NB5|D Chain D, Crystal Structure Of Stefin A In Complex With Cathepsin H
Length = 220
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 126/204 (61%), Positives = 152/204 (74%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 14 VSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQ 73
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI+YN G+ E+ YPY G+D CKF + V D NIT+ E+ + AV L P
Sbjct: 74 AFEYIRYNKGIMGEDTYPYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNP 133
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV + F Y+ G+YSST C TP VNHAV+AVGYG E+G+PYW++KNSWG WG
Sbjct: 134 VSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG 193
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF +E GKNMCG+A CASYP+
Sbjct: 194 MNGYFLIERGKNMCGLAACASYPI 217
>gi|297297049|ref|XP_002804951.1| PREDICTED: cathepsin H [Macaca mulatta]
Length = 323
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 143/301 (47%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 21 FHFKSWMSKHHKTY-STEEYHHRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 79
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 80 HKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 139
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 140 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 199
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F YK+G+YSST
Sbjct: 200 GDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMIYKTGIYSSTS 259
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 260 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 319
Query: 312 V 312
+
Sbjct: 320 I 320
>gi|172050735|gb|ACB70169.1| cathepsin H transcript variant 3 [Sus scrofa]
Length = 251
Score = 273 bits (698), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 126/204 (61%), Positives = 152/204 (74%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 45 VSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQ 104
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI+YN G+ E+ YPY G+D CKF + V D NIT+ E+ + AV L P
Sbjct: 105 AFEYIRYNKGIMGEDTYPYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNP 164
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV + F Y+ G+YSST C TP VNHAV+AVGYG E+G+PYW++KNSWG WG
Sbjct: 165 VSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG 224
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF +E GKNMCG+A CASYP+
Sbjct: 225 MNGYFLIERGKNMCGLAACASYPI 248
>gi|402875039|ref|XP_003901328.1| PREDICTED: pro-cathepsin H [Papio anubis]
Length = 335
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 143/301 (47%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 33 FHFKSWMSKHHKTY-STEEYHHRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 152 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F YK+G+YSST
Sbjct: 212 GDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYKTGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>gi|109082090|ref|XP_001108862.1| PREDICTED: cathepsin H isoform 2 [Macaca mulatta]
Length = 335
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 143/301 (47%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 33 FHFKSWMSKHHKTY-STEEYHHRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 152 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F YK+G+YSST
Sbjct: 212 GDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMIYKTGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>gi|355778231|gb|EHH63267.1| Cathepsin H, partial [Macaca fascicularis]
Length = 305
Score = 273 bits (697), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 143/301 (47%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 3 FHFKSWMSKHHKTY-STEEYHHRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 61
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 62 HKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 121
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 122 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 181
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F YK+G+YSST
Sbjct: 182 GDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYKTGIYSSTS 241
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 242 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 301
Query: 312 V 312
+
Sbjct: 302 I 302
>gi|403258371|ref|XP_003921746.1| PREDICTED: pro-cathepsin H [Saimiri boliviensis boliviensis]
Length = 336
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 171/301 (56%), Gaps = 48/301 (15%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y EE R TF+ N I + N ++++ +N
Sbjct: 33 FHFKSWMAKHHKTYSREEEYHHRLQTFASNWRKINAHNNGNHTFKMAVNQFADMSFAEIK 92
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 93 RKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 152
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 153 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 212
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
CKF V D NIT+ ED + AV L PVS AFEV F YK G+YSST
Sbjct: 213 SDCKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKRGIYSSTS 272
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 273 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 332
Query: 312 V 312
+
Sbjct: 333 I 333
>gi|395502422|ref|XP_003755580.1| PREDICTED: pro-cathepsin H [Sarcophilus harrisii]
Length = 334
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 150/204 (73%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG LE+A A GK +SL+EQQLVDCAQ FNN GCNGGLPSQ
Sbjct: 128 VSPVKNQGGCGSCWTFSTTGGLESAVAIATGKLLSLAEQQLVDCAQDFNNHGCNGGLPSQ 187
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GKDG CKF V D NIT E+ + AV P
Sbjct: 188 AFEYIMYNKGIMGEDTYPYEGKDGTCKFQPNKAIAFVKDVANITAYDEEAMTEAVAHHNP 247
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV D F Y G+YS+ KC +P VNHAV+AVGYG E+G+PYW++KNSWG +WG
Sbjct: 248 VSFAFEVTDDFLSYHKGIYSNPKCSKSPDKVNHAVLAVGYGKENGIPYWIVKNSWGTSWG 307
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
++GYF +E GKNMCG+A CASYP+
Sbjct: 308 NNGYFLIERGKNMCGLADCASYPI 331
>gi|291410711|ref|XP_002721635.1| PREDICTED: cathepsin H [Oryctolagus cuniculus]
Length = 333
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 142/301 (47%), Positives = 176/301 (58%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF +N I + N ++++GLN
Sbjct: 31 FHFKSWMSQHHKKY-SAEEYPRRLQTFVRNWRKINAHNNGNHTFQMGLNQFSDMSFAEIK 89
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 90 HKYLWTEPQNCSATKSNYLRGTGPYPSSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 149
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E++YPY +
Sbjct: 150 SAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPYRAME 209
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF + V D NITL E+ + AV L PVS AFEV + F Y+ G+YSST
Sbjct: 210 GRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTEDFMQYRKGIYSSTS 269
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+GVPYW++KNSWG +WG +GYF +E GKNMCG+A CASYP
Sbjct: 270 CHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNGYFYIERGKNMCGLAACASYP 329
Query: 312 V 312
+
Sbjct: 330 I 330
>gi|48145879|emb|CAG33162.1| CTSH [Homo sapiens]
Length = 335
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 33 FHFKSWTSKHRKTY-STEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 152 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F Y++G+YSST
Sbjct: 212 GYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG ++G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>gi|61372279|gb|AAX43816.1| cathepsin H [synthetic construct]
Length = 336
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 33 FHFKSWMSKHRKTY-STEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 152 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F Y++G+YSST
Sbjct: 212 GYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG ++G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>gi|114658412|ref|XP_001153217.1| PREDICTED: pro-cathepsin H isoform 6 [Pan troglodytes]
gi|397478882|ref|XP_003810764.1| PREDICTED: pro-cathepsin H [Pan paniscus]
gi|12803323|gb|AAH02479.1| Cathepsin H [Homo sapiens]
gi|60655259|gb|AAX32193.1| cathepsin H [synthetic construct]
gi|123979560|gb|ABM81609.1| cathepsin H [synthetic construct]
gi|123994193|gb|ABM84698.1| cathepsin H [synthetic construct]
gi|189054474|dbj|BAG37247.1| unnamed protein product [Homo sapiens]
gi|410254318|gb|JAA15126.1| cathepsin H [Pan troglodytes]
gi|410294916|gb|JAA26058.1| cathepsin H [Pan troglodytes]
gi|410331109|gb|JAA34501.1| cathepsin H [Pan troglodytes]
Length = 335
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 33 FHFKSWMSKHRKTY-STEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 152 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F Y++G+YSST
Sbjct: 212 GYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG ++G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>gi|426379977|ref|XP_004056662.1| PREDICTED: pro-cathepsin H [Gorilla gorilla gorilla]
Length = 335
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 151/204 (74%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 129 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ 188
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GKDG CKF V D NIT+ E+ + AV L P
Sbjct: 189 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP 248
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV F Y++G+YSST C TP VNHAV+AVGYG ++G+PYW++KNSWG WG
Sbjct: 249 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWG 308
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF +E GKNMCG+A CASYP+
Sbjct: 309 MNGYFLIERGKNMCGLAACASYPI 332
>gi|23110955|ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]
gi|288558851|sp|P09668.4|CATH_HUMAN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|119619549|gb|EAW99143.1| cathepsin H [Homo sapiens]
Length = 335
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 33 FHFKSWMSKHRKTY-STEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 152 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F Y++G+YSST
Sbjct: 212 GYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG ++G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>gi|380798253|gb|AFE71002.1| pro-cathepsin H preproprotein, partial [Macaca mulatta]
Length = 242
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 130/209 (62%), Positives = 153/209 (73%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G +SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC G
Sbjct: 31 KKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQG 90
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN G+ E+ YPY GKDG CKF V D NIT+ E+ + AV
Sbjct: 91 GLPSQAFEYILYNKGIMGEDTYPYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAV 150
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
L PVS AFEV F YK+G+YSST C TP VNHAV+AVGYG E+G+PYW++KNSW
Sbjct: 151 ALYNPVSFAFEVTQDFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSW 210
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G WG +GYF +E GKNMCG+A CASYP+
Sbjct: 211 GPQWGMNGYFLIERGKNMCGLAACASYPI 239
>gi|60827884|gb|AAX36817.1| cathepsin H [synthetic construct]
Length = 336
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 33 FHFKSWMSKHRKTY-STEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 152 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F Y++G+YSST
Sbjct: 212 GYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG ++G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 142/301 (47%), Positives = 177/301 (58%), Gaps = 49/301 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F + ++GK+Y + EE + R F KN+ I + N +G SY L +N
Sbjct: 35 FKEWQEKHGKVYSTEEESQSRLKVFMKNVIYIDNHNKQGHSYELEVNEYADMTLDEFKDQ 94
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
++PVK+QG CGSCWTFSTTG LE+
Sbjct: 95 YLMEPQHCSATHSLKSDPPKYRDPPKAIDWRSKGAVTPVKNQGQCGSCWTFSTTGCLESH 154
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
+ G+ +SLSEQQLVDCAQAFNN GCNGGLPSQAFEYI YNGGLD+EE+YPY D
Sbjct: 155 HFLKTGQLVSLSEQQLVDCAQAFNNNGCNGGLPSQAFEYIHYNGGLDSEESYPYRAHDEK 214
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
C F V V + VNIT E +L +AVG V PVS+A++V FRFYK GVY S +C
Sbjct: 215 CHFVPSEVSATVSNVVNITSKDEMQLYNAVGTVGPVSIAYDVSADFRFYKKGVYKSKECK 274
Query: 254 NTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPV 312
P VNHAV+AVGY E G YW++KNSWG +G +GYF + G+NMCG+A CASYP+
Sbjct: 275 TDPEHVNHAVLAVGYNTTESGEDYWIVKNSWGTKFGINGYFWIARGENMCGLADCASYPI 334
Query: 313 V 313
V
Sbjct: 335 V 335
>gi|16506815|gb|AAL23962.1|AF426248_1 truncated cathepsin H [Homo sapiens]
Length = 323
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 151/204 (74%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 117 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQ 176
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GKDG CKF V D NIT+ E+ + AV L P
Sbjct: 177 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP 236
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV F Y++G+YSST C TP VNHAV+AVGYG ++G+PYW++KNSWG WG
Sbjct: 237 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG 296
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF +E GKNMCG+A CASYP+
Sbjct: 297 MNGYFLIERGKNMCGLAACASYPI 320
>gi|344284284|ref|XP_003413898.1| PREDICTED: pro-cathepsin H-like [Loxodonta africana]
Length = 335
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 129/209 (61%), Positives = 151/209 (72%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G +SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCA+ FNN GC G
Sbjct: 124 KKGHFVSPVKNQGACGSCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQG 183
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN G+ E+ YPY G+D VCKF + V D NITL E+ + AV
Sbjct: 184 GLPSQAFEYILYNKGIMGEDTYPYKGQDDVCKFQPKKAIAFVKDVANITLNDEEAMVEAV 243
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
L PVS AFEV D F Y G+YSST C TP VNHAV+AVGYG E G+PYW++KNSW
Sbjct: 244 ALYNPVSFAFEVTDDFMKYSKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSW 303
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G WG GYF +E GKNMCG+A CASYP+
Sbjct: 304 GPYWGMDGYFLIERGKNMCGLAACASYPI 332
>gi|387015020|gb|AFJ49629.1| Cathepsin H [Crotalus adamanteus]
Length = 337
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 176/307 (57%), Gaps = 48/307 (15%)
Query: 54 GQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----- 108
G A F +A ++ + Y S EE + R F N I N S+R+GLN
Sbjct: 28 GSATGEQLFKAWASQHRRAYRSEEEFRHRLQIFLDNKQKIDKHNAGNSSFRMGLNQFSDM 87
Query: 109 -------------------------------------------ISPVKDQGHCGSCWTFS 125
+SPVK+QG CGSCWTFS
Sbjct: 88 TFTEFRKKYLWQEPQNCSATMGNFPRSAGPCPKAIDWRKKGKFVSPVKNQGSCGSCWTFS 147
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
TTG LE+A GK ++L+EQQL+DCAQ FNN GC+GGLPSQAFEYI YN GL EEAY
Sbjct: 148 TTGCLESAIAIKTGKLLNLAEQQLIDCAQNFNNFGCSGGLPSQAFEYILYNKGLMDEEAY 207
Query: 186 PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSG 245
PY ++G CKF + + D VNI+L E L AVG PVS+AFEV + F Y+ G
Sbjct: 208 PYRAQNGTCKFQPQKAVAFIKDVVNISLYDEQGLVQAVGTYNPVSIAFEVREDFVHYQEG 267
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIA 305
VY+ST C TP VNHAV+AVGYG E GVP+W++KNSWG +WG GYF +E GKNMCG+A
Sbjct: 268 VYTSTDCDKTPDKVNHAVLAVGYGEEGGVPFWIVKNSWGTSWGLDGYFNIERGKNMCGLA 327
Query: 306 TCASYPV 312
CAS+PV
Sbjct: 328 DCASFPV 334
>gi|332252750|ref|XP_003275518.1| PREDICTED: pro-cathepsin H [Nomascus leucogenys]
Length = 335
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 150/204 (73%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 129 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ 188
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GKDG CKF V D NIT+ E+ + AV L P
Sbjct: 189 AFEYILYNKGIMGEDTYPYQGKDGYCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNP 248
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV F Y+ G+YSST C TP VNHAV+AVGYG ++G+PYW++KNSWG WG
Sbjct: 249 VSFAFEVTQDFMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG 308
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF +E GKNMCG+A CASYP+
Sbjct: 309 MNGYFLIERGKNMCGLAACASYPI 332
>gi|77735725|ref|NP_001029557.1| pro-cathepsin H precursor [Bos taurus]
gi|115312126|sp|Q3T0I2.1|CATH_BOVIN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|74267711|gb|AAI02387.1| Cathepsin H [Bos taurus]
gi|296475480|tpg|DAA17595.1| TPA: cathepsin H precursor [Bos taurus]
Length = 335
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 140/310 (45%), Positives = 176/310 (56%), Gaps = 49/310 (15%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
++ + F + ++ K Y S EE R F+ NL I + N + ++++GLN
Sbjct: 24 ELAANSLEKFHFQSWMVQHQKKYSS-EEYYHRLQAFASNLREINAHNARNHTFKMGLNQF 82
Query: 109 ----------------------------------------------ISPVKDQGHCGSCW 122
++PVK+QG CGSCW
Sbjct: 83 SDMSFDELKRKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCW 142
Query: 123 TFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTE 182
TFSTTG+LE+A A GK L+EQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E
Sbjct: 143 TFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGE 202
Query: 183 EAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFY 242
+ YPY G+DG CK+ V D NITL E+ + AV L PVS AFEV F Y
Sbjct: 203 DTYPYRGQDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMY 262
Query: 243 KSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMC 302
+ G+YSST C TP VNHAV+AVGYG E G+PYW++KNSWG NWG GYF +E GKNMC
Sbjct: 263 RKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNMC 322
Query: 303 GIATCASYPV 312
G+A CAS+P+
Sbjct: 323 GLAACASFPI 332
>gi|29710|emb|CAA34734.1| unnamed protein product [Homo sapiens]
Length = 335
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 151/204 (74%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 129 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQ 188
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GKDG CKF V D NIT+ E+ + AV L P
Sbjct: 189 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP 248
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV F Y++G+YSST C TP VNHAV+AVGYG ++G+PYW++KNSWG WG
Sbjct: 249 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG 308
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF +E GKNMCG+A CASYP+
Sbjct: 309 MNGYFLIERGKNMCGLAACASYPI 332
>gi|440910969|gb|ELR60703.1| Cathepsin H, partial [Bos grunniens mutus]
Length = 329
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 140/310 (45%), Positives = 176/310 (56%), Gaps = 49/310 (15%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
++ + F + ++ K Y S EE R F+ NL I + N + ++++GLN
Sbjct: 18 ELAANSLEKFHFQSWMVQHQKKYSS-EEYYHRLQVFASNLREINAHNARNHTFKMGLNQF 76
Query: 109 ----------------------------------------------ISPVKDQGHCGSCW 122
++PVK+QG CGSCW
Sbjct: 77 SDMSFDELKRKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCW 136
Query: 123 TFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTE 182
TFSTTG+LE+A A GK L+EQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E
Sbjct: 137 TFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGE 196
Query: 183 EAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFY 242
+ YPY G+DG CK+ V D NITL E+ + AV L PVS AFEV F Y
Sbjct: 197 DTYPYRGQDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMY 256
Query: 243 KSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMC 302
+ G+YSST C TP VNHAV+AVGYG E G+PYW++KNSWG NWG GYF +E GKNMC
Sbjct: 257 RKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNMC 316
Query: 303 GIATCASYPV 312
G+A CAS+P+
Sbjct: 317 GLAACASFPI 326
>gi|16506813|gb|AAL23961.1|AF426247_1 cathepsin H [Homo sapiens]
Length = 335
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 151/204 (74%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 129 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQ 188
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GKDG CKF V D NIT+ E+ + AV L P
Sbjct: 189 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP 248
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV F Y++G+YSST C TP VNHAV+AVGYG ++G+PYW++KNSWG WG
Sbjct: 249 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG 308
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF +E GKNMCG+A CASYP+
Sbjct: 309 MNGYFLIERGKNMCGLAACASYPI 332
>gi|146386356|gb|ABQ23966.1| cathepsin H [Oryctolagus cuniculus]
Length = 215
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 152/204 (74%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQ
Sbjct: 10 VSPVKNQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQ 69
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E++YPY +G CKF + V D NITL E+ + AV L P
Sbjct: 70 AFEYILYNKGIMGEDSYPYRAMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNP 129
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV + F Y+ G+YSST C TP VNHAV+AVGYG E+GVPYW++KNSWG +WG
Sbjct: 130 VSFAFEVTEDFMQYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWG 189
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF +E GKNMCG+A CASYP+
Sbjct: 190 MNGYFYIERGKNMCGLAACASYPI 213
>gi|426248750|ref|XP_004018122.1| PREDICTED: pro-cathepsin H [Ovis aries]
Length = 355
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/301 (46%), Positives = 173/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R F+ NL I + N + ++++GLN
Sbjct: 53 FHFQSWMVQHQKKYSS-EEYHHRLQVFASNLREINAHNARNHTFKMGLNQFSDMSFAELK 111
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVK+QG CGSCWTFSTTG+LE
Sbjct: 112 RKYLWSEPQNCSATKSNYLRGTGPYPPSMDWREKGNFVTPVKNQGSCGSCWTFSTTGALE 171
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK L+EQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E+ YPY G+D
Sbjct: 172 SAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGED 231
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CK+ V D NITL E+ + AV L PVS AFEV F Y+ G+YSST
Sbjct: 232 GDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTADFMMYRKGIYSSTS 291
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E G+PYW++KNSWG +WG GYF +E GKNMCG+A CAS+P
Sbjct: 292 CHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPHWGMKGYFLIERGKNMCGLAACASFP 351
Query: 312 V 312
+
Sbjct: 352 I 352
>gi|334314327|ref|XP_001368532.2| PREDICTED: cathepsin H-like [Monodelphis domestica]
Length = 344
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 126/209 (60%), Positives = 155/209 (74%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G +SPVK+QG CGSCWTFSTTG LE+A A GK +SL+EQQLVDCAQAFNN GCNG
Sbjct: 133 KKGNYVSPVKNQGGCGSCWTFSTTGGLESAVAIATGKLLSLAEQQLVDCAQAFNNHGCNG 192
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN G+ E+ YPY GKDG C+F + V D VNIT+ E+ + AV
Sbjct: 193 GLPSQAFEYIMYNNGIMGEDTYPYEGKDGTCRFKPDKAIAFVKDVVNITIYDEEAMTEAV 252
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
PVS AFEV + F Y+ G+YS+ +C +P VNHAV+AVGYG +G+ YW++KNSW
Sbjct: 253 AHHNPVSFAFEVTEDFMSYRDGIYSNPRCDKSPDKVNHAVLAVGYGKNNGILYWIVKNSW 312
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G +WG++GYF +E GKNMCG+A CASYPV
Sbjct: 313 GTSWGNNGYFLIERGKNMCGLADCASYPV 341
>gi|29708|emb|CAA30428.1| cathepsin H [Homo sapiens]
Length = 248
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 128/209 (61%), Positives = 153/209 (73%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G +SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC G
Sbjct: 37 KKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQG 96
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN G+ E+ YPY GKDG CKF V D NIT+ E+ + AV
Sbjct: 97 GLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAV 156
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
L PVS AFEV F Y++G+YSST C TP VNHAV+AVGYG ++G+PYW++KNSW
Sbjct: 157 ALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSW 216
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G WG +GYF +E GKNMCG+A CASYP+
Sbjct: 217 GPQWGMNGYFLIERGKNMCGLAACASYPI 245
>gi|171948778|gb|ACB59246.1| cathepsin H [Sus scrofa]
Length = 297
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 126/207 (60%), Positives = 152/207 (73%), Gaps = 3/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLP-- 166
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLP
Sbjct: 88 VSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPGL 147
Query: 167 -SQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
SQAFEYI+YN G+ E+ YPY G+D CKF + V D NIT+ E+ + AV L
Sbjct: 148 PSQAFEYIRYNKGIMGEDTYPYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVAL 207
Query: 226 VRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGE 285
PVS AFEV + F Y+ G+YSST C TP VNHAV+AVGYG E+G+PYW++KNSWG
Sbjct: 208 YNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGP 267
Query: 286 NWGDHGYFKMEMGKNMCGIATCASYPV 312
WG +GYF +E GKNMCG+A CASYP+
Sbjct: 268 QWGMNGYFLIERGKNMCGLAACASYPI 294
>gi|449270628|gb|EMC81287.1| Cathepsin H, partial [Columba livia]
Length = 261
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 129/209 (61%), Positives = 152/209 (72%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G ++PVK+QG CGSCWTFSTTG LE+A A GK +SL+EQQLVDCAQAFNN GC+G
Sbjct: 50 KKGNYVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSG 109
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN GL E+ YPY ++G CKF E V D +NIT ED + AV
Sbjct: 110 GLPSQAFEYILYNRGLMGEDTYPYRAENGTCKFQPEKAIAFVRDVINITQYDEDGMVEAV 169
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
G PVS AFEV F Y+ GVYS+ +C +TP VNHAV+AVGYG EDG P+W++KNSW
Sbjct: 170 GKHNPVSFAFEVTSNFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGTPFWIVKNSW 229
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G WG GYF +E GKNMCG+A CASYPV
Sbjct: 230 GPLWGMDGYFLIERGKNMCGLAACASYPV 258
>gi|209732040|gb|ACI66889.1| Cathepsin H precursor [Salmo salar]
Length = 330
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 137/248 (55%), Positives = 158/248 (63%), Gaps = 16/248 (6%)
Query: 81 LRFATFSKNLDLIRSTNCKG---------------LSYR-LGLNISPVKDQGHCGSCWTF 124
+ FA F K L NC + +R G +SPVK QGHCGSCWTF
Sbjct: 80 MSFAEFRKTFLLTEPQNCSATKGSHISSHGPYPGSVDWREKGNYVSPVKYQGHCGSCWTF 139
Query: 125 STTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEA 184
STTG LE+ A GK LSEQQLVDCAQ FNN GC GGLPSQAFEY+KYN GL TE+
Sbjct: 140 STTGCLESVTAIATGKLPLLSEQQLVDCAQDFNNHGCMGGLPSQAFEYVKYNNGLMTEDD 199
Query: 185 YPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKS 244
YPYTG DG C F E V D VNIT E + AV + PVS +EV D F YK
Sbjct: 200 YPYTGHDGSCNFKPELAAAFVKDVVNITSYDEKGMVDAVARLNPVSFGYEVTDDFLHYKD 259
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGI 304
GVYSST C NT +VNHAV+AVGYG ++ PYW++KNSWG NWG GYF +E G+NMCG+
Sbjct: 260 GVYSSTTCKNTTDNVNHAVLAVGYGEKNSTPYWIVKNSWGTNWGMDGYFLIERGRNMCGL 319
Query: 305 ATCASYPV 312
A C+SYP+
Sbjct: 320 AACSSYPL 327
>gi|363737841|ref|XP_001232765.2| PREDICTED: pro-cathepsin H [Gallus gallus]
Length = 327
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 128/204 (62%), Positives = 150/204 (73%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCWTFSTTG LE+A A GK +SL+EQQLVDCAQAFNN GC+GGLPSQ
Sbjct: 121 VTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSGGLPSQ 180
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN GL E+AYPY ++G CKF + V D +NIT E + AVG P
Sbjct: 181 AFEYILYNKGLMGEDAYPYRAQNGTCKFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNP 240
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV F Y+ GVYS+ +C +TP VNHAV+AVGYG EDG PYW++KNSWG WG
Sbjct: 241 VSFAFEVTSDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWG 300
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
GYF +E GKNMCG+A CASYPV
Sbjct: 301 MDGYFLIERGKNMCGLAACASYPV 324
>gi|449471885|ref|XP_004186123.1| PREDICTED: LOW QUALITY PROTEIN: pro-cathepsin H [Taeniopygia
guttata]
Length = 334
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 131/210 (62%), Positives = 153/210 (72%), Gaps = 6/210 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK QG CGSCWTFSTTG LE+A A GK +SL+EQQLVDCAQAFNN GC+GGLPSQ
Sbjct: 122 VTPVKIQGACGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSGGLPSQ 181
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSEN---VGVQ---VLDSVNITLGAEDELQHA 222
AFEYI YN GL E++YPY K+G C+F +N VG V D +NIT ED + A
Sbjct: 182 AFEYILYNRGLMGEDSYPYRAKNGTCRFQPDNDIRVGKAIAFVKDVINITQYDEDGMVEA 241
Query: 223 VGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
VG PVS AFEV F Y+ GVYS+ +C +TP VNHAV+AVGYG EDG PYW++KNS
Sbjct: 242 VGRHNPVSFAFEVTSDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGQEDGTPYWIVKNS 301
Query: 283 WGENWGDHGYFKMEMGKNMCGIATCASYPV 312
WG WG GYF +E GKNMCG+A CASYPV
Sbjct: 302 WGRLWGMQGYFLIERGKNMCGLAACASYPV 331
>gi|431920312|gb|ELK18347.1| Cathepsin H [Pteropus alecto]
Length = 232
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 123/198 (62%), Positives = 144/198 (72%)
Query: 115 QGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIK 174
G CGSCWTFSTTG+LE+A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI+
Sbjct: 32 HGGCGSCWTFSTTGALESAIAIKTGKMLSLAEQQLVDCAQNFNNHGCKGGLPSQAFEYIR 91
Query: 175 YNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFE 234
YN G+ E+ YPY GKDG CKF E V D NIT+ E+ + AV L PVS AFE
Sbjct: 92 YNKGIMGEDTYPYQGKDGTCKFQPEKAIAFVKDVANITINDEEAMVEAVALYNPVSFAFE 151
Query: 235 VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFK 294
V + F Y+ G+YSST C TP VNHAV+AVGYG E+G PYW++KNSWG WG +GYF
Sbjct: 152 VTEDFMLYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGKPYWIVKNSWGPQWGMNGYFL 211
Query: 295 MEMGKNMCGIATCASYPV 312
+E GKNMCG+A CASYP+
Sbjct: 212 IERGKNMCGLAACASYPI 229
>gi|225706914|gb|ACO09303.1| Cathepsin H precursor [Osmerus mordax]
Length = 328
Score = 264 bits (674), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 127/210 (60%), Positives = 152/210 (72%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G ++ VK+QG CGSCWTFSTTG LE+ + GK + LSEQQLVDCAQAFNN GCNG
Sbjct: 119 KKGNYVTNVKNQGPCGSCWTFSTTGCLESVTAISTGKLLQLSEQQLVDCAQAFNNHGCNG 178
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYIKYN GL TE+ YPYT +DG CKF E V D VNIT+ E + AV
Sbjct: 179 GLPSQAFEYIKYNKGLMTEDDYPYTAQDGTCKFKPERAAAFVKDVVNITMYDEMGMVDAV 238
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
+ PVS+A+EV F Y SGVYSS++C NT VNHAV+AVGY E+ PYW++KNSW
Sbjct: 239 ARLNPVSMAYEVTSDFMHYHSGVYSSSECHNTTDTVNHAVLAVGYDEENVTPYWIVKNSW 298
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVV 313
G WG GYF +E GKNMCG++ C+SYP+V
Sbjct: 299 GPFWGMKGYFFIERGKNMCGLSACSSYPLV 328
>gi|345798093|ref|XP_536212.3| PREDICTED: pro-cathepsin H [Canis lupus familiaris]
Length = 350
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 173/302 (57%), Gaps = 50/302 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
+ F +A ++ K Y S EE R TF N I + N ++++GLN
Sbjct: 47 VHFKSWAVQHQKKYSS-EEYLQRLQTFVGNWRKINAHNAGNHTFKMGLNQFSDMNFAEIK 105
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 106 HKYLWSEPQNCSATKGNYLRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGALE 165
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG-GLPSQAFEYIKYNGGLDTEEAYPYTGK 190
+A GK +SL+EQQLVDCAQ FNN GC G G P QAFEYI+YN G+ E++YPY G+
Sbjct: 166 SAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGYGAPLQAFEYIRYNKGIMGEDSYPYKGQ 225
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSST 250
DG CK+ V D NIT+ E + AV L PVS AFEV F Y+ G+YSST
Sbjct: 226 DGDCKYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRKGIYSST 285
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
C TP VNHAV+AVGYG ++G+PYW++KNSWG WG +GYF ME GKNMCG+A CASY
Sbjct: 286 SCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMCGLAACASY 345
Query: 311 PV 312
P+
Sbjct: 346 PI 347
>gi|330376140|gb|AEC13302.1| cathepsin H [Gallus gallus]
Length = 329
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 149/204 (73%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCWTFSTTG LE+A A GK +SL+EQ LVDCAQAFNN GC+GGLPSQ
Sbjct: 123 VTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQ 182
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN GL E+AYPY ++G CKF + V D +NIT E + AVG P
Sbjct: 183 AFEYILYNKGLMGEDAYPYRAQNGTCKFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNP 242
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV F Y+ GVYS+ +C +TP VNHAV+AVGYG EDG PYW++KNSWG WG
Sbjct: 243 VSFAFEVTSDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWG 302
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
GYF +E GKNMCG+A CASYPV
Sbjct: 303 MDGYFLIERGKNMCGLAACASYPV 326
>gi|209731972|gb|ACI66855.1| Cathepsin H precursor [Salmo salar]
Length = 328
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 133/248 (53%), Positives = 160/248 (64%), Gaps = 16/248 (6%)
Query: 81 LRFATFSKNLDLIRSTNC---------------KGLSYRLGLN-ISPVKDQGHCGSCWTF 124
L FA F K+ L NC + + +R N ++ VK+QG CGSCWTF
Sbjct: 78 LTFAEFRKSFLLTEPQNCSATKGSHVSSNGPYPESVDWRKKGNYVTAVKNQGSCGSCWTF 137
Query: 125 STTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEA 184
STTG LE+ A GK + LSEQQLVDCAQAFNN GCNGGLPSQAFEYIK+N G+ TE+
Sbjct: 138 STTGCLESVTAIATGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKFNKGIMTEDD 197
Query: 185 YPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKS 244
YPYT D CKF ++ V D VNIT E + AV PVS+A+EV F Y
Sbjct: 198 YPYTAHDDTCKFKTDLAAAFVKDVVNITKYDEMGMVDAVARFNPVSLAYEVTSDFMHYDG 257
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGI 304
GVY+S +C NT VNHAV+AVGYG E G PYW++KNSWG +WG GYF +E GKNMCG+
Sbjct: 258 GVYTSKECHNTTDTVNHAVLAVGYGEEKGTPYWIVKNSWGSSWGMKGYFFIERGKNMCGL 317
Query: 305 ATCASYPV 312
A C+SYP+
Sbjct: 318 AACSSYPL 325
>gi|387915132|gb|AFK11175.1| cathspsin H [Callorhinchus milii]
Length = 330
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 138/304 (45%), Positives = 176/304 (57%), Gaps = 49/304 (16%)
Query: 57 RHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------- 108
+ +SF + ++ K Y S EE R TF +N + N SYR+GLN
Sbjct: 25 QEIVSFKTWMTQHNKHYSS-EEYSYRLRTFIQNKRKVEEHNSGRHSYRMGLNQFSDMTFS 83
Query: 109 ----------------------------------------ISPVKDQGHCGSCWTFSTTG 128
++PVK+QG CGSCWTFSTTG
Sbjct: 84 EFKKLYLLREPQNCSATRGNHVLSMGPYPDFVDWRTKGNYVTPVKNQGGCGSCWTFSTTG 143
Query: 129 SLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
LE+A GK +SL+EQQLVDCA A+ N GCNGGLPSQAFEYIKYNGGL+ E+ YPYT
Sbjct: 144 CLESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQAFEYIKYNGGLEAEKDYPYT 203
Query: 189 GKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYS 248
+D C++ V + VNIT E+ + AV + PVS+AFEV D F Y+ GVYS
Sbjct: 204 AQDQHCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIAFEVTDDFFQYEGGVYS 263
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCA 308
++ C +TP VNHAV+AVGYGV++G YW++KNSWG WG +GYF + GKNMCG+A C
Sbjct: 264 NSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWGLNGYFYIIRGKNMCGLAACP 323
Query: 309 SYPV 312
SYP+
Sbjct: 324 SYPI 327
>gi|326926970|ref|XP_003209669.1| PREDICTED: cathepsin H-like [Meleagris gallopavo]
Length = 323
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 127/209 (60%), Positives = 148/209 (70%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
R G QG CGSCWTFSTTG LE+A A GK +SL+EQQLVDCAQAFNN GC+G
Sbjct: 112 RCGATPDRFSTQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSG 171
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN GL E+AYPY ++G CKF + V D +NIT E + AV
Sbjct: 172 GLPSQAFEYILYNKGLMGEDAYPYRAQNGTCKFQPDKAVAFVRDVINITQYDEASMVEAV 231
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
G PVS AFEV + F Y+ GVYS+ +C +TP VNHAV+AVGYG EDG+PYW++KNSW
Sbjct: 232 GKHNPVSFAFEVTNDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGLPYWIVKNSW 291
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G WG GYF +E GKNMCG+A CASYPV
Sbjct: 292 GSLWGMDGYFLIERGKNMCGLAACASYPV 320
>gi|348505824|ref|XP_003440460.1| PREDICTED: pro-cathepsin H-like [Oreochromis niloticus]
Length = 324
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/210 (60%), Positives = 148/210 (70%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G ++PVK+QG CGSCWTFSTTG LE+ GK + LSEQQLVDCAQ FNN GCNG
Sbjct: 115 KKGNYVTPVKNQGGCGSCWTFSTTGCLESVTAINKGKLVPLSEQQLVDCAQDFNNHGCNG 174
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN GL TE+ YPYT +G C + V VNIT E E+ AV
Sbjct: 175 GLPSQAFEYIMYNKGLMTEQDYPYTAFEGKCVYKPGKAAAFVNSVVNITAYNELEMVDAV 234
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
G PVS AFEV F Y GVY+ST+C NT VNHAV+AVGYG E+G PYW++KNSW
Sbjct: 235 GTHNPVSFAFEVTSDFMSYHQGVYTSTECHNTTDKVNHAVLAVGYGQENGTPYWIVKNSW 294
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVV 313
G +WG +GYF +E GKNMCG+A CAS+PVV
Sbjct: 295 GSSWGMNGYFLIERGKNMCGLAACASFPVV 324
>gi|318844127|ref|NP_001187181.1| cathspsin H precursor [Ictalurus punctatus]
gi|196475594|gb|ACG76366.1| cathspsin H [Ictalurus punctatus]
Length = 326
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 126/210 (60%), Positives = 149/210 (70%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G ++ VK+QG CGSCWTFSTTG LE+ A GK L+EQQLVDCA AFNN GCNG
Sbjct: 117 KKGNYVTEVKNQGACGSCWTFSTTGCLESVTAIATGKLPLLAEQQLVDCAGAFNNHGCNG 176
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN GL TE+ YPY G+DG CKF + V D VNIT E + AV
Sbjct: 177 GLPSQAFEYIMYNKGLMTEDDYPYVGRDGPCKFDPKLAAAFVKDVVNITKYDEMGIVDAV 236
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
+ PVS+AFEV+ F YK GVY+S +C NT VNHAV+AVGY E+G PYW++KNSW
Sbjct: 237 ARLNPVSIAFEVLPEFMHYKDGVYTSNECHNTTETVNHAVLAVGYAEENGTPYWIVKNSW 296
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVV 313
G WG GYF +E G+NMCG+A CASYP+V
Sbjct: 297 GPQWGIDGYFYIERGQNMCGLAACASYPLV 326
>gi|308322047|gb|ADO28161.1| cathepsin H [Ictalurus furcatus]
Length = 326
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 126/210 (60%), Positives = 149/210 (70%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G ++ VK+QG CGSCWTFSTTG LE+ A GK L+EQQLVDCA AFNN GCNG
Sbjct: 117 KKGNYVTEVKNQGACGSCWTFSTTGCLESVTAIATGKLPLLAEQQLVDCAGAFNNHGCNG 176
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN GL TE+ YPY G+DG CKF + V D VNIT E + AV
Sbjct: 177 GLPSQAFEYIMYNKGLMTEDDYPYVGRDGPCKFDPKLAAAFVKDVVNITKYDEMGIVDAV 236
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
+ PVS+AFEV+ F YK GVY+S +C NT VNHAV+AVGY E+G PYW++KNSW
Sbjct: 237 ARLNPVSIAFEVLPEFMHYKDGVYTSNECHNTTETVNHAVLAVGYAEENGTPYWIVKNSW 296
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVV 313
G WG GYF +E G+NMCG+A CASYP+V
Sbjct: 297 GPQWGIDGYFYIERGQNMCGLAACASYPLV 326
>gi|410907221|ref|XP_003967090.1| PREDICTED: pro-cathepsin H-like [Takifugu rubripes]
Length = 324
Score = 257 bits (656), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 137/276 (49%), Positives = 161/276 (58%), Gaps = 24/276 (8%)
Query: 62 FARFARRYGKIYESVEEMKLR--------FATFSKNLDLIRSTNC--------------- 98
F RR K E +R FA F K+ NC
Sbjct: 49 FTENKRRIDKHNEGNHSFAMRLNQYSDMTFAEFRKHFLWAEPQNCSATKGSYIQTNSPHP 108
Query: 99 KGLSYRLGLN-ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFN 157
+ + +R N ++PVK+QG CGSCWTFSTTG LE+ GK + LSEQQLVDCAQ FN
Sbjct: 109 ESIDWRKKGNYVTPVKNQGSCGSCWTFSTTGCLESVTAINSGKLVPLSEQQLVDCAQDFN 168
Query: 158 NQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAED 217
N GCNGGLPSQAFEYIKYN GL TE YPYT + C + E V + VNIT E
Sbjct: 169 NHGCNGGLPSQAFEYIKYNKGLMTESDYPYTAFEDKCTYKPELAAAFVKNVVNITAYDEK 228
Query: 218 ELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
E++ AV PVS AFEV F Y SGVYSS+ C T VNHAV+AVGYG E+G PYW
Sbjct: 229 EMEDAVATRNPVSFAFEVTPDFMHYSSGVYSSSTCHTTTDKVNHAVLAVGYGSENGTPYW 288
Query: 278 LIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
++KNSWG WG GYF + GKNMCG+A C+S+P V
Sbjct: 289 IVKNSWGPGWGQDGYFLIMRGKNMCGLAACSSFPEV 324
>gi|146168075|ref|XP_001016705.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145247|gb|EAR96460.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 343
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 138/297 (46%), Positives = 191/297 (64%), Gaps = 26/297 (8%)
Query: 32 NPIRLVSSDGLRDFETSVLQVIGQARHALS-FARFARRYGKI-YESVEEMKLR------F 83
NP+ SD R F+ + +I +H L+ +F ++ K +++ EE++
Sbjct: 52 NPL----SDRFRLFKKRLTNII---KHNLNPHKKFTQKINKFTFKTQEEIRSLNAAQNCS 104
Query: 84 ATFSKNLDLIRSTNCKGL----SYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
AT +N+ + ++ N K L +R ++PVKDQG CGSCWTFSTTG+LE+ H A
Sbjct: 105 ATARENMSVKKTYNLKDLPQYVDWRTKGVVTPVKDQGECGSCWTFSTTGALES--HWALH 162
Query: 140 KG---ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF 196
G + LSEQQL+DCA AFNN GC+GGLPSQA+EYI Y GGL+TE YPY G D C+F
Sbjct: 163 TGNAPLLLSEQQLIDCAGAFNNFGCDGGLPSQAYEYISYAGGLETEGDYPYEGTDNSCEF 222
Query: 197 SSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTP 256
+ V +V+ S NIT E+EL + + V PVS+A+E D F Y+ G+YS+ C +P
Sbjct: 223 NRAQVAAKVVSSYNITFQDENELIYHLATVGPVSIAYECTDDFMDYEGGIYSNPSCSKSP 282
Query: 257 MDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
DVNHAV+AVGY + Y+++KNSWGE+WG +GYF +E+G NMCG+A CASYP+V
Sbjct: 283 EDVNHAVLAVGYNLTGN--YYIVKNSWGEDWGINGYFYIELGSNMCGLADCASYPIV 337
>gi|313221004|emb|CBY31836.1| unnamed protein product [Oikopleura dioica]
Length = 323
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 174/314 (55%), Gaps = 51/314 (16%)
Query: 50 LQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN- 108
L + A+ L F + + K YE+ EE K R F +N+ I N + S+ GLN
Sbjct: 7 LGLFASAKAGL-FEDWTAEHWKSYETAEEEKFRKGVFEENVAKIEQINKENRSWTAGLNK 65
Query: 109 ------------------------------------------------ISPVKDQGHCGS 120
+SPVKDQG CGS
Sbjct: 66 FSDLTWDEFQHFYLMQAEQDCSATSYNSKEYLAKQPMPTSWDWRKDNKVSPVKDQGQCGS 125
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLD 180
CWTFSTTG++EA + +LSEQQLVDCA AFNN GCNGGLPSQAFEYI G+
Sbjct: 126 CWTFSTTGNVEAGEAIHLNEYHTLSEQQLVDCAGAFNNHGCNGGLPSQAFEYIAAAPGIM 185
Query: 181 TEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFR 240
TE YPYT KDG C F + V V SVNIT G E E+ A+ + +P+S+AFEVVD F
Sbjct: 186 TEADYPYTAKDGNCVFDQKKAAVHVYGSVNITRGDEVEMAEAMVMYQPISIAFEVVDDFM 245
Query: 241 FYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYFKMEMGK 299
YKSG YSS C +P DVNHAV+AVG+G + G +W +KNSW ++WG+ GYF ++ G
Sbjct: 246 HYKSGTYSSKDCKGSPTDVNHAVLAVGFGTDGAGTDFWTVKNSWSKDWGNQGYFNIQRGV 305
Query: 300 NMCGIATCASYPVV 313
NMCG++ C S+ ++
Sbjct: 306 NMCGLSQCTSFALI 319
>gi|47224192|emb|CAG13112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 327
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 132/254 (51%), Positives = 165/254 (64%), Gaps = 8/254 (3%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-ISPVKDQGHC 118
++FA F +R+ ++ + ++ K S + + +R N ++PVK+QG C
Sbjct: 79 MTFAEFRKRF--LWSEPQNCSATKGSYMK----TNSPQPESIDWRTKGNYVTPVKNQGAC 132
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCWTFSTTG LE+ GK + LSEQQLVDCA FNN GCNGGLPSQAFEYIKYN G
Sbjct: 133 GSCWTFSTTGCLESVTAINTGKLVPLSEQQLVDCAWDFNNHGCNGGLPSQAFEYIKYNKG 192
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG 238
L TE YPYT +G CK+ E V + VNIT E ++ AV PVS AFEV D
Sbjct: 193 LMTESGYPYTAFEGKCKYKPELAAAFVKNVVNITAYDEKGMEDAVATHNPVSFAFEVTDD 252
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYFKMEM 297
F YK GVYSS++C T VNHAV+AVGYG + VPYW++KNSWG WG++GYF +E
Sbjct: 253 FMHYKGGVYSSSRCHKTTDKVNHAVLAVGYGNNNSSVPYWIVKNSWGPYWGENGYFLIER 312
Query: 298 GKNMCGIATCASYP 311
GKNMCG+A C+SYP
Sbjct: 313 GKNMCGLAACSSYP 326
>gi|37788267|gb|AAO64473.1| cathepsin H precursor [Fundulus heteroclitus]
Length = 345
Score = 255 bits (652), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/234 (55%), Positives = 152/234 (64%), Gaps = 1/234 (0%)
Query: 81 LRFATFSKNLDLIRSTNCKGLSYRLGLN-ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
+ F F K + + +R N I+PVK QG CGSCWTFSTTG LE+ A
Sbjct: 112 MTFNEFRKAFLMSEGPQPDSIDWRKKGNYITPVKTQGSCGSCWTFSTTGCLESVTAIATV 171
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
K + LSEQQLVDCAQ FNN GCNGGLPSQAFEYI YN GL TE+ YPY +G+C +
Sbjct: 172 KLVPLSEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYNKGLMTEQDYPYKFVEGICSYKPS 231
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDV 259
V + NIT E + AVG + PVS AFEV D F Y+ GVY+ST C NT V
Sbjct: 232 LAAAFVKEVRNITAYDEMGMVDAVGTLNPVSFAFEVTDDFMHYREGVYTSTTCHNTTDKV 291
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NHAV+AVGYG E G PYW++KNSWG +WG GYF +E GKNMCG+A C+S PVV
Sbjct: 292 NHAVLAVGYGQEKGTPYWIVKNSWGSSWGIDGYFLIERGKNMCGLAACSSSPVV 345
>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 337
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/257 (52%), Positives = 166/257 (64%), Gaps = 9/257 (3%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
L+FA F + Y + E + AT + + + + +R I+PVKDQG CG
Sbjct: 87 LTFAEFKKIY------LTEPQHCSATNGNFQKPVNARDPVAVDWREKNVITPVKDQGKCG 140
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCWTFSTTG LEA + G+ ISLSEQQLVDCA AFNN GCNGGLPSQAFEYIKYNGG+
Sbjct: 141 SCWTFSTTGCLEAHHAIKTGQLISLSEQQLVDCAGAFNNHGCNGGLPSQAFEYIKYNGGI 200
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGF 239
++E Y YT KDGVC+F+S V V D VNIT AE ++ AV V PVS+AFEV F
Sbjct: 201 ESESNYNYTAKDGVCRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVSIAFEVTKSF 260
Query: 240 RFYKSGVYSS--TKCGNTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGENWGDHGYFKME 296
+ YK GVY C +P VNHAV+ VGY + G YW++KNSW +WG GYF +
Sbjct: 261 QHYKKGVYQGEIEVCSQSPDKVNHAVLVVGYNQTKLGEEYWIVKNSWSASWGMDGYFWIR 320
Query: 297 MGKNMCGIATCASYPVV 313
G N CG+ATCASYP+V
Sbjct: 321 RGHNACGLATCASYPIV 337
>gi|392873946|gb|AFM85805.1| cathepsin H [Callorhinchus milii]
Length = 259
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 120/204 (58%), Positives = 150/204 (73%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCWTFSTTG LE+A GK +SL+EQQLVDCA A+ N GCNGGLPSQ
Sbjct: 53 VTPVKNQGGCGSCWTFSTTGCLESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQ 112
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIKYNGGL+ E+ YPYT +D C++ V + VNIT E+ + AV + P
Sbjct: 113 AFEYIKYNGGLEAEKDYPYTAQDQHCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNP 172
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS+AFEV D F Y+ GVYS++ C +TP VNHAV+AVGYGV++G YW++KNSWG WG
Sbjct: 173 VSIAFEVTDDFFQYEGGVYSNSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWG 232
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+GYF + GKNMCG+A C SYP+
Sbjct: 233 LNGYFYIIRGKNMCGLAACPSYPI 256
>gi|313213098|emb|CBY36961.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 174/317 (54%), Gaps = 54/317 (17%)
Query: 50 LQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN- 108
L + A+ L F + + K YE+ E+ K R F +N+ I N + S+ GLN
Sbjct: 7 LGLFASAKAGL-FEDWTSEHWKSYETAEDEKFRKGVFEENIAKIEQINKENRSWTAGLNK 65
Query: 109 ---------------------------------------------------ISPVKDQGH 117
+SPVKDQG
Sbjct: 66 FSDLTWDEFQHFYLMQAGQDCSATSYNSKEYLAKGVEQPMPTSWDWRKDNKVSPVKDQGQ 125
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNG 177
CGSCWTFSTTG++EA + +LSEQQLVDCA AFNN GCNGGLPSQAFEYI
Sbjct: 126 CGSCWTFSTTGNVEAGEAIHLNEYHTLSEQQLVDCAGAFNNHGCNGGLPSQAFEYIAAAP 185
Query: 178 GLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD 237
G+ TE YPYT KDG C F + V V SVNIT G E E+ A+ + +P+S+AFEVVD
Sbjct: 186 GIMTEADYPYTAKDGNCVFDQKKAAVHVYGSVNITRGDEVEMAEAMVMYQPISIAFEVVD 245
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYFKME 296
F YKSG YSS C +P DVNHAV+AVG+G + G +W +KNSW ++WG+ GYF ++
Sbjct: 246 DFMHYKSGTYSSKDCKGSPTDVNHAVLAVGFGTDGAGTDFWTVKNSWSKDWGNQGYFNIQ 305
Query: 297 MGKNMCGIATCASYPVV 313
G NMCG++ C S+ ++
Sbjct: 306 RGVNMCGLSQCTSFALI 322
>gi|298708365|emb|CBJ48428.1| Cathepsin H [Ectocarpus siliculosus]
Length = 668
Score = 253 bits (646), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 138/293 (47%), Positives = 176/293 (60%), Gaps = 29/293 (9%)
Query: 45 FETSVLQVIGQARHALSFARFARRYGKI-YESVEEMKLRFATFSKNLDLIRSTNCK---- 99
F ++ Q + A S++ R+ + +E + +L F + L S NC
Sbjct: 380 FRDNLRQAVDDAATPRSYSLGLNRFSDMTWEEFQATRLGFGSA-----LSASQNCSATHV 434
Query: 100 GLSYR-LGLN----------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI 142
G YR LGL+ +S VK+Q HCGSCWTFSTTG LE+ ++ G+ +
Sbjct: 435 GSQYRALGLSKGRAPPAARDWRDLGAVSVVKNQDHCGSCWTFSTTGCLESHHYLRTGEMV 494
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENV 201
LSEQQL+DCA A++N GCNGGLPS AFEYI GGLDTEE YPY ++ G+C F+ +
Sbjct: 495 LLSEQQLLDCAGAYDNHGCNGGLPSHAFEYIASAGGLDTEEVYPYMAEESGLCSFADRGI 554
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNH 261
G V+ SVNIT E EL AVG PVSVAF+V F+ Y GVY + C P VNH
Sbjct: 555 GADVMRSVNITFQDERELLEAVGNTGPVSVAFQVAPDFKAYAGGVYDNPSCSTLPEQVNH 614
Query: 262 AVVAVGYG-VEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
AV+ VGYG E+GV YW+IKNSWG WG G+F M GKNMCG+A CAS+P+V
Sbjct: 615 AVLCVGYGTTEEGVDYWIIKNSWGPEWGMDGFFHMARGKNMCGVADCASFPLV 667
>gi|313229615|emb|CBY18430.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 253 bits (645), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 174/317 (54%), Gaps = 54/317 (17%)
Query: 50 LQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN- 108
L + A+ L F + + K YE+ E+ K R F +N+ I N + S+ GLN
Sbjct: 7 LGLFASAKAGL-FEDWTAEHWKSYETAEDEKFRKGVFEENVAKIEKINKENRSWTAGLNK 65
Query: 109 ---------------------------------------------------ISPVKDQGH 117
+SPVKDQG
Sbjct: 66 FSDLTWDEFQHFYLMQAGQDCSATSYNSKEYLAKGVEQPMPTSWDWRKDNKVSPVKDQGQ 125
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNG 177
CGSCWTFSTTG++EA + +LSEQQLVDCA AFNN GCNGGLPSQAFEYI
Sbjct: 126 CGSCWTFSTTGNVEAGEAIHLNEYHTLSEQQLVDCAGAFNNHGCNGGLPSQAFEYIAAAP 185
Query: 178 GLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD 237
G+ TE YPYT KDG C F + V V SVNIT G E E+ A+ + +P+S+AFEVVD
Sbjct: 186 GIMTEADYPYTAKDGNCVFDQKKAAVHVYGSVNITRGDEVEMAEAMVMYQPISIAFEVVD 245
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYFKME 296
F YKSG YSS C +P DVNHAV+AVG+G + G +W +KNSW ++WG+ GYF ++
Sbjct: 246 DFMHYKSGTYSSKDCKGSPTDVNHAVLAVGFGTDGAGTDFWTVKNSWSKDWGNQGYFNIQ 305
Query: 297 MGKNMCGIATCASYPVV 313
G NMCG++ C S+ ++
Sbjct: 306 RGVNMCGLSQCTSFALI 322
>gi|260821804|ref|XP_002606293.1| hypothetical protein BRAFLDRAFT_57270 [Branchiostoma floridae]
gi|229291634|gb|EEN62303.1| hypothetical protein BRAFLDRAFT_57270 [Branchiostoma floridae]
Length = 246
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 118/205 (57%), Positives = 148/205 (72%), Gaps = 1/205 (0%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+S VKDQGHCGSCWTFS TG LE+ FG ++LSEQQLV CAQ FNN GC GGLPSQ
Sbjct: 37 VSGVKDQGHCGSCWTFSATGCLESVTAITFGAPMNLSEQQLVSCAQGFNNHGCEGGLPSQ 96
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A+EY+K+ G+++E+ YPYT KDG C F++ V D VNIT G EDE+ AVG + P
Sbjct: 97 AWEYVKWAQGIESEKDYPYTAKDGKCMFNTNKTIAYVRDVVNITQGDEDEILQAVGTLNP 156
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV-PYWLIKNSWGENW 287
VS+A++VV F+ YK GVYSS C VNHAV+ VGYG ++ V PYW++KNSWG +W
Sbjct: 157 VSIAYQVVADFKLYKKGVYSSKLCHRDQEHVNHAVLVVGYGEDESVIPYWIVKNSWGPSW 216
Query: 288 GDHGYFKMEMGKNMCGIATCASYPV 312
G GYF +E +NMCG+A CA+YP+
Sbjct: 217 GMDGYFLIERNQNMCGLAECAAYPL 241
>gi|47086663|ref|NP_997853.1| cathepsin H precursor [Danio rerio]
gi|45709087|gb|AAH67615.1| Cathepsin H [Danio rerio]
Length = 330
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 127/248 (51%), Positives = 155/248 (62%), Gaps = 16/248 (6%)
Query: 81 LRFATFSKNLDLIRSTNCKG---------------LSYRL-GLNISPVKDQGHCGSCWTF 124
+ FA F K L NC + +R G I+ VK+QG CGSCWTF
Sbjct: 80 MTFAEFKKTYLLTEPQNCSATRGNHVSSNGLYPDAIDWRTKGHYITDVKNQGPCGSCWTF 139
Query: 125 STTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEA 184
STTG LE+ A GK + L+EQQL+DCA F+N GCNGGLPS AFEYI YN GL TE+
Sbjct: 140 STTGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDD 199
Query: 185 YPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKS 244
YPY K G C+F + V + VNIT E + AV + PVS A+EV F YK
Sbjct: 200 YPYQAKGGQCRFKPQLAAAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEVTSDFMHYKD 259
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGI 304
G+Y+ST+C NT VNHAV+AVGY E+G PYW++KNSWG NWG GYF +E GKNMCG+
Sbjct: 260 GIYTSTECHNTTDMVNHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERGKNMCGL 319
Query: 305 ATCASYPV 312
A C+SYP+
Sbjct: 320 AACSSYPI 327
>gi|403364285|gb|EJY81901.1| Cathepsin H [Oxytricha trifallax]
Length = 363
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 124/211 (58%), Positives = 152/211 (72%), Gaps = 4/211 (1%)
Query: 103 YRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCN 162
+R ++PVKDQG CGSCWTFST G+LEA + + + +LSEQQLVDCA A++N GCN
Sbjct: 141 WREHNGVTPVKDQGSCGSCWTFSTVGTLEAHFLIKYQQSRNLSEQQLVDCAGAYDNYGCN 200
Query: 163 GGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF--SSENVGVQVLDSVNITLGAEDELQ 220
GGLPS AF+YI NGG+ TE AYPY KD C S ++VGV V SVN+T +EDEL
Sbjct: 201 GGLPSHAFQYISDNGGIATEAAYPYFAKDRPCTIQQSQKSVGV-VGGSVNLT-KSEDELA 258
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
A+ PVS+A+EV+D F Y SGVY++ C N P DVNHAVVAVG+G E+GV YWL+K
Sbjct: 259 IAIFQHGPVSIAYEVIDDFMDYHSGVYTTKDCKNGPDDVNHAVVAVGFGTENGVDYWLVK 318
Query: 281 NSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
NSW WGD+GYFK++ G NMCGI C SYP
Sbjct: 319 NSWSTKWGDNGYFKIQRGVNMCGINNCNSYP 349
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 176/310 (56%), Gaps = 59/310 (19%)
Query: 60 LSFARFARRYGKIY-ESVEEMKLRFATFSKNLDLIRSTNCKGL-SYRLGLNI-------- 109
+ F + R +GK Y ++VEE+ R A + N L+ + N G+ SY LG+NI
Sbjct: 28 MEFEAWKRTFGKSYSDAVEEINRR-AVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEE 86
Query: 110 --------------------------------------------SPVKDQGHCGSCWTFS 125
+PVKDQG CGSCW+FS
Sbjct: 87 FKRFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFS 146
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
TTGS+E + + G+ +SLSEQ LVDC++A NQGCNGGL AF+YI N G+DTE +Y
Sbjct: 147 TTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASY 206
Query: 186 PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKS 244
PYT KDG CKF++ NVG + +IT G+E +LQ+AV V PVSVA + + F+ Y S
Sbjct: 207 PYTAKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTS 266
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCG 303
GVY+ KC +T +D H V+A GYG +G PYWL+KNSWG +WG GY M N CG
Sbjct: 267 GVYNEKKCSSTSLD--HGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNANNQCG 324
Query: 304 IATCASYPVV 313
IAT ASYP+V
Sbjct: 325 IATSASYPIV 334
>gi|395822883|ref|XP_003784735.1| PREDICTED: pro-cathepsin H [Otolemur garnettii]
Length = 308
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 120/204 (58%), Positives = 139/204 (68%), Gaps = 23/204 (11%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCA+ FNN GC GGLPSQ
Sbjct: 125 VSPVKNQGSCGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAKDFNNHGCQGGLPSQ 184
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN G+ E+ YPY GK E+ + AV L P
Sbjct: 185 AFEYILYNKGIMGEDTYPYQGKYD-----------------------EEAMVEAVALYNP 221
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
VS AFEV D F YK G+YSST C TP VNHAV+AVGYG E+GVPYW++KNSWG WG
Sbjct: 222 VSFAFEVTDDFLMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSQWG 281
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
GYF +E GKNMCG+A CASYP+
Sbjct: 282 MDGYFLIERGKNMCGLAACASYPI 305
>gi|28974202|gb|AAO61485.1| cathepsin H [Sterkiella histriomuscorum]
Length = 366
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/206 (58%), Positives = 148/206 (71%), Gaps = 5/206 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFST G +E+ Y +G +LSEQQLVDCA ++N GC+GGLPS
Sbjct: 147 VSPVKNQGKCGSCWTFSTVGCVESHYLLKYGAFRNLSEQQLVDCAGDYDNHGCSGGLPSH 206
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSS--ENVGVQVLDSVNITLGAEDELQHAVGLV 226
AFEYIK NGGL E YPY +G C ++VG++ +VNI+L ED+L+ A+ L
Sbjct: 207 AFEYIKDNGGLALETTYPYKAANGQCSIQKGQQSVGIRG-GAVNISLN-EDDLKQAIYLH 264
Query: 227 RPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGE 285
PVSVAF V+DGFR YKSGVY+ C N P DVNHAV+AVG+G E+ V YW+IKNSWG
Sbjct: 265 GPVSVAFRVIDGFRDYKSGVYAVEGCANGPNDVNHAVLAVGFGTDENKVDYWIIKNSWGA 324
Query: 286 NWGDHGYFKMEMGKNMCGIATCASYP 311
WGD G+FKM+ G NMCGI C SYP
Sbjct: 325 AWGDQGFFKMKRGVNMCGIQNCNSYP 350
>gi|118366977|ref|XP_001016704.1| Cysteine proteinase 3 precursor, putative [Tetrahymena thermophila]
gi|89298471|gb|EAR96459.1| Cysteine proteinase 3 precursor, putative [Tetrahymena thermophila
SB210]
Length = 343
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/291 (44%), Positives = 182/291 (62%), Gaps = 22/291 (7%)
Query: 38 SSDGLRDFETSVLQVIGQARHALSFAR-FARRYGKI-YESVEEMKLR------FATFSKN 89
SS+ + F+ ++ +I +H L+ + + ++ K + + EE+ + AT +N
Sbjct: 54 SSERFKIFKQRLIDII---KHNLNPHKTYTQKINKFSFYTQEELSVLNAAQNCSATAKEN 110
Query: 90 LDLIRSTNCKGL----SYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG---I 142
+ + N K + +R ++PVK+QG CGSCWTFSTTG+LE+ H A G +
Sbjct: 111 MAPKKKYNLKDIPEFVDWRTKGIVTPVKNQGQCGSCWTFSTTGALES--HWALHTGNAPL 168
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVG 202
LSEQQL+DCA FNN GC+GGLPSQAFEYI Y GGLDTE YPY D C+F +
Sbjct: 169 LLSEQQLIDCAGDFNNFGCSGGLPSQAFEYISYAGGLDTEGDYPYEATDNECEFKRSHAA 228
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
+V+ S NIT EDEL + + P+S+A++V D F Y G+YS+ C +P VNHA
Sbjct: 229 AKVVRSFNITFQDEDELIYHLATAGPISIAYQVTDDFFKYDGGIYSNPYCSTSPDMVNHA 288
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
V+AVGY + Y+++KNSWGE+WG+ GYF +E+G NMCG+A CASYP+V
Sbjct: 289 VLAVGYNLTG--RYYIVKNSWGEHWGNEGYFNIELGSNMCGLADCASYPIV 337
>gi|108755401|emb|CAI77919.1| cathepsin H [Guillardia theta]
gi|122890320|emb|CAJ73711.1| Cathepsin H [Guillardia theta]
Length = 353
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 120/217 (55%), Positives = 142/217 (65%), Gaps = 11/217 (5%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+S VK+QG CGSCWTFST +LE+ + G+ + LSEQQLVDCA F N GCNGGLPSQ
Sbjct: 135 VSMVKNQGTCGSCWTFSTAAALESLHAIKTGEMVLLSEQQLVDCAADFKNNGCNGGLPSQ 194
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFS-----------SENVGVQVLDSVNITLGAED 217
AFEYI YNGGL E YPY DG C + +VG +V N T G E
Sbjct: 195 AFEYIMYNGGLSKMEEYPYVCGDGHCNVTGGPCAFDPVGKPWSVGAKVSKVANFTPGDEI 254
Query: 218 ELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
++ VG P+SVAFEVV R Y SGVYSS C TP VNHAV+AVGYG E G+PYW
Sbjct: 255 SMKTVVGSHNPISVAFEVVADLRHYSSGVYSSPTCVGTPDKVNHAVLAVGYGTEGGIPYW 314
Query: 278 LIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
IKNSWG WGD+GYFK++ G N CGI+ CAS+P+ +
Sbjct: 315 TIKNSWGFAWGDNGYFKIQRGSNKCGISVCASFPITS 351
>gi|428175797|gb|EKX44685.1| hypothetical protein GUITHDRAFT_71985 [Guillardia theta CCMP2712]
Length = 354
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 121/218 (55%), Positives = 144/218 (66%), Gaps = 12/218 (5%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+S VK+QG CGSCWTFST +LE+ + G+ + LSEQQLVDCA F N GCNGGLPSQ
Sbjct: 135 VSMVKNQGTCGSCWTFSTAAALESLHAIKTGEMVLLSEQQLVDCAADFKNNGCNGGLPSQ 194
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFS-----------SENVGVQVLDSV-NITLGAE 216
AFEYI YNGGL E YPY DG C + +VG + + V N T G E
Sbjct: 195 AFEYIMYNGGLSKMEEYPYVCGDGHCNVTGGPCAFDPVGKPWSVGAKKVSKVANFTPGDE 254
Query: 217 DELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
++ VG P+SVAFEVV R Y SGVYSS C TP VNHAV+AVGYG E G+PY
Sbjct: 255 ISMKTVVGSHNPISVAFEVVADLRHYSSGVYSSPTCVGTPDKVNHAVLAVGYGTEGGIPY 314
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
W IKNSWG WGD+GYFK++ G NMCGI+ CAS+P+ +
Sbjct: 315 WTIKNSWGFAWGDNGYFKIQRGSNMCGISVCASFPITS 352
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 134/306 (43%), Positives = 170/306 (55%), Gaps = 59/306 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTN---CKGL-SYRLGLN------------ 108
F + K Y+S E LRF FS+N L+ N +GL SY+LG+N
Sbjct: 30 FKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPHEFAR 89
Query: 109 ---------------------------------------ISPVKDQGHCGSCWTFSTTGS 129
++PVK+QG CGSCW FSTTGS
Sbjct: 90 MFNGYRGARTAGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGS 149
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
LE + G +SLSEQ LVDC++ F N GC GGL AF+YIK NGG+DTE++YPY
Sbjct: 150 LEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEA 209
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYS 248
+DG C+F +NVG V+I G+ED+L+ AV V PVSVA + F+ Y GVY
Sbjct: 210 EDGECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYD 269
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
T+C + +D H V+ VGYGVEDG YWL+KNSW E+WGD+GY KM K N CGIA+
Sbjct: 270 ETECSSEQLD--HGVLVVGYGVEDGKKYWLVKNSWAESWGDNGYIKMSRDKDNQCGIASA 327
Query: 308 ASYPVV 313
ASYP+V
Sbjct: 328 ASYPLV 333
>gi|291224892|ref|XP_002732436.1| PREDICTED: cathepsin H-like [Saccoglossus kowalevskii]
Length = 302
Score = 239 bits (611), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 117/206 (56%), Positives = 141/206 (68%), Gaps = 1/206 (0%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK QG CGSCW+FSTTG+LE+A A ISLSEQQL+DCAQAFNN GCNGGLP+Q
Sbjct: 95 VTDVKSQGSCGSCWSFSTTGALESATAIAKSTLISLSEQQLIDCAQAFNNHGCNGGLPAQ 154
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI YN GL + Y Y KDG CK+ V VNIT G ED + +AV P
Sbjct: 155 AFEYIHYNDGLMADIDYQYKAKDGKCKYDPSKAAAFVSKIVNITKGDEDGILNAVYKHGP 214
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGENW 287
VS+A++V F Y SGVYSST C P VNHAV+A G+ +G+ YW++KNSWG +W
Sbjct: 215 VSIAYDVASDFHLYHSGVYSSTVCKIDPEHVNHAVLATGFNETAEGLKYWMVKNSWGPDW 274
Query: 288 GDHGYFKMEMGKNMCGIATCASYPVV 313
G GYF +E KNMCG+A CASYP+V
Sbjct: 275 GLDGYFWIERNKNMCGLADCASYPIV 300
>gi|403355691|gb|EJY77431.1| Cathepsin H [Oxytricha trifallax]
Length = 363
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 117/206 (56%), Positives = 151/206 (73%), Gaps = 5/206 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVK+QG CGSCWTFST G+LE+ + +G+ +LSEQQLVDCA ++N GCNGGLPS
Sbjct: 147 VSPVKNQGKCGSCWTFSTVGALESHFLLKYGQFRNLSEQQLVDCAGNYDNHGCNGGLPSH 206
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVC--KFSSENVGVQVLDSVNITLGAEDELQHAVGLV 226
AFEY+K NGG+ E +YPY C K S++VGV+ +VN++L +ED+L+ A+
Sbjct: 207 AFEYLKDNGGIAEETSYPYVAVTNTCALKKGSQSVGVKG-GAVNVSL-SEDDLKQAIYSH 264
Query: 227 RPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGE 285
PVS+AF+V FR Y++GVY+S C N P DVNHAV+AVG+G E+ V YW+IKNSWG
Sbjct: 265 GPVSIAFQVASDFRDYRAGVYTSKVCKNGPQDVNHAVLAVGFGTDENKVDYWIIKNSWGA 324
Query: 286 NWGDHGYFKMEMGKNMCGIATCASYP 311
WGD GYFKME G NMCG++ C SYP
Sbjct: 325 VWGDQGYFKMERGVNMCGVSNCNSYP 350
>gi|156046107|gb|ABU42573.1| cathepsin H variant 2 [Sus scrofa]
Length = 321
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/301 (43%), Positives = 164/301 (54%), Gaps = 63/301 (20%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L F + ++ K Y S+EE R F N I + N +++LGLN
Sbjct: 33 LHFKSWMVQHQKKY-SLEEYHHRLQVFVSNWRKIDAHNAGNHTFKLGLNQFSDMSFDEIR 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKGNYLRGTGPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ F EYI+YN G+ E+ YPY G+D
Sbjct: 152 SAVAIATGKMLSLAEQQLVDCAQNF--------------EYIRYNKGIMGEDTYPYKGQD 197
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
CKF + V D NIT+ E+ + AV L PVS AFEV + F Y+ G+YSST
Sbjct: 198 DHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTS 257
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 258 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 317
Query: 312 V 312
+
Sbjct: 318 I 318
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/306 (43%), Positives = 170/306 (55%), Gaps = 59/306 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F ++ K Y S E LRF F++N L+ N K GL SY+L +N
Sbjct: 30 FKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAK 89
Query: 109 ---------------------------------------ISPVKDQGHCGSCWTFSTTGS 129
++PVK+QG CGSCW FSTTGS
Sbjct: 90 MVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGS 149
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
LE + + GK +SLSEQ LVDC+ F NQGCNGGL F+YIK NGG+DTEE++PYT
Sbjct: 150 LEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGGIDTEESHPYTA 209
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYS 248
+DG CKF +VG V+I G+ED+L+ AV V PVSVA + G F+ Y GVY
Sbjct: 210 QDGDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYD 269
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
C ++ +D H V+ VGYGV++G YWL+KNSWG +WGD+GY M K N CGIA+
Sbjct: 270 EPDCSSSQLD--HGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGYILMSRDKDNQCGIASS 327
Query: 308 ASYPVV 313
ASYP+V
Sbjct: 328 ASYPLV 333
>gi|391328503|ref|XP_003738728.1| PREDICTED: digestive cysteine proteinase 3-like [Metaseiulus
occidentalis]
Length = 506
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 121/217 (55%), Positives = 142/217 (65%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
KG Y L ++PVKDQG CGSCW FSTTGSLE + +A GK +SLSEQ LVDC+ N
Sbjct: 292 KGKDYWLEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGDEGN 351
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL Q F YIK NGG+DTEE+YPY +DG C F S VG +V V+I G+E
Sbjct: 352 NGCEGGLMDQGFTYIKNNGGIDTEESYPYNAEDGDCAFKSNAVGARVTGFVDIDSGSEKA 411
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
LQ AV V PVSVA + D F+ YK G+Y C +T +D H V+AVGYG E+GV YW
Sbjct: 412 LQKAVATVGPVSVAIDASNDSFQLYKEGIYDEPACSSTQLD--HGVLAVGYGSENGVDYW 469
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSW WG GY KM K N CGIA+ ASYP V
Sbjct: 470 LVKNSWNTVWGQDGYIKMARNKDNQCGIASQASYPTV 506
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 88/181 (48%), Positives = 116/181 (64%), Gaps = 5/181 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + YR +++PVK+QG CGSCW FS TGSLE G +SLSEQ L+DC++ N
Sbjct: 122 KKVDYRKSGHVTPVKNQGLCGSCWAFSATGSLEGQLSIQNGTLVSLSEQNLLDCSR--EN 179
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGC+GG +AFEYIK NGG+DTEE+YPYTG+ G C F +N+G +V V++ E
Sbjct: 180 QGCDGGYMDKAFEYIKKNGGIDTEESYPYTGRKGKCMFKKKNIGARVTGHVDVPAEDEQA 239
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV + P+SV + D FRFYK G+Y + C + +D H V+ VGYG E G YW
Sbjct: 240 LKLAVAKIGPISVGIDASKDSFRFYKEGIYDESSCSTSQLD--HGVLVVGYGSEKGKDYW 297
Query: 278 L 278
L
Sbjct: 298 L 298
>gi|375152052|gb|AFA36484.1| cysteine protease, partial [Lolium perenne]
Length = 142
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 107/141 (75%), Positives = 124/141 (87%)
Query: 174 KYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAF 233
+YNGG+DTEE+YPY G +GVCK+ EN VQV DSVNITL AEDEL++AV LVRPVSVAF
Sbjct: 1 RYNGGIDTEESYPYKGVNGVCKYRPENAAVQVADSVNITLNAEDELKNAVELVRPVSVAF 60
Query: 234 EVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYF 293
EV+DGF+ YKSGVY+S CG TP DVNHAV+AVGYGVE+GVPYWLIKNSWG +WG+ GYF
Sbjct: 61 EVIDGFKQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGEDGYF 120
Query: 294 KMEMGKNMCGIATCASYPVVA 314
KMEMGKNMC +ATCASYP++A
Sbjct: 121 KMEMGKNMCAVATCASYPILA 141
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 119/238 (50%), Positives = 153/238 (64%), Gaps = 4/238 (1%)
Query: 78 EMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQA 137
E K R +TF ++ S+ K + +R ++PVKDQG CGSCW FS TGSLE +
Sbjct: 77 ERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLK 136
Query: 138 FGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFS 197
GK +SLSEQ L+DC+ +F N+GC GGL AF+YIK N G+DTEE+YPY DG C+F
Sbjct: 137 SGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEESYPYEAMDGDCRFK 196
Query: 198 SENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTP 256
E+VG V+I G+ED+LQ AV V P+SVA + F+ Y GVY C +
Sbjct: 197 KEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYSEGVYDEPNCSSEE 256
Query: 257 MDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+D H V+AVGYGV++G YWL+KNSW E WGD+GY M K N CGIA+ ASYP+V
Sbjct: 257 LD--HGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCGIASSASYPLV 312
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 173/311 (55%), Gaps = 56/311 (18%)
Query: 53 IGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---- 108
+ Q R ++ F +GK Y EE LR A ++ NL++++ N + SY+L +N
Sbjct: 21 LSQDRQWHAWKDF---HGKTYTG-EEEDLRRAIWNDNLEIVKKHNAENHSYKLDMNHFAD 76
Query: 109 --------------------------------------------ISPVKDQGHCGSCWTF 124
++ VK+QG CGSCW F
Sbjct: 77 LTVTEFKQRFMGYRAASNSTGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAF 136
Query: 125 STTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEA 184
S+TGSLE + + GK +SLSEQ LVDC++ + N GC GGL AF+YIK N G+DTE++
Sbjct: 137 SSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQS 196
Query: 185 YPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYK 243
YPYT +DG C F +VG V ++ G+E +LQ AV V P+SVA + F+ YK
Sbjct: 197 YPYTARDGQCHFKPGSVGATVTGYTDVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYK 256
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMC 302
+GVYS C +T +D H V+AVGYG EDG YWL+KNSWGE WG +GY KM K N C
Sbjct: 257 TGVYSEPDCSSTQLD--HGVLAVGYGAEDGKDYWLVKNSWGEGWGMNGYIKMSRNKDNQC 314
Query: 303 GIATCASYPVV 313
GIAT ASYP+V
Sbjct: 315 GIATQASYPLV 325
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 164/305 (53%), Gaps = 55/305 (18%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F + +G Y +V E R + NLD I N +G SY+L +N
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++P+KDQG CGSCW+FSTTGS+
Sbjct: 82 YLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGSV 141
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + + G+ +SLSEQ LVDC+ A N GCNGGL QAF+YI N G+DTE +YPYT +
Sbjct: 142 EGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPYTAQ 201
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSS 249
DG C+F+S NVG V +I G+E +LQ+AV V P+SVA + F+FY SGVY+
Sbjct: 202 DGTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGVYNE 261
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCA 308
C ++ +D H V+AVGYG YWL+KNSWG +WG GY M N CGIAT A
Sbjct: 262 PACSSSQLD--HGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQCGIATAA 319
Query: 309 SYPVV 313
SYP+V
Sbjct: 320 SYPLV 324
>gi|323452413|gb|EGB08287.1| hypothetical protein AURANDRAFT_3602, partial [Aureococcus
anophagefferens]
Length = 312
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 165/312 (52%), Gaps = 61/312 (19%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SF + + +GK Y S A F + + + N + LS+R GLN
Sbjct: 1 SFDAYVQHFGKTYASDAHRDAASAHFEASKRRVAAHNARALSWRAGLNQFSDMSDDEFEA 60
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEA-- 132
+S VK+QGHCGSCWTFST G+LEA
Sbjct: 61 AVLMDPQECSATGGVGAGAAADLPDALDWRSRGVVSEVKNQGHCGSCWTFSTVGALEAHL 120
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
A Q + LSEQQLVDCA AF+ +GC GGLPS AFEY+KY GGL TE +YPY G D
Sbjct: 121 ALKQDAWRAPRLSEQQLVDCAGAFDTKGCAGGLPSHAFEYVKYAGGLSTEFSYPYRGVDQ 180
Query: 193 VCKF-----------SSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRF 241
C F S+ V SVNIT G E L++ + PVSVAF+V FR
Sbjct: 181 ACAFNATASSSGLPTSAGVGVVVPGGSVNITKGDEASLKYHLATKGPVSVAFQVASDFRD 240
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVE--DGVPYWLIKNSWGENWGDHGYFKMEMGK 299
Y SGVYSST C N MDVNHAV+AVGYG + + YW IKNSW +WGD G+FKME
Sbjct: 241 YASGVYSSTVCKNGAMDVNHAVLAVGYGTDPVSNMTYWTIKNSWDYSWGDEGFFKMESFV 300
Query: 300 NMCGIATCASYP 311
NMCG+A C +YP
Sbjct: 301 NMCGVANCNAYP 312
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/234 (50%), Positives = 151/234 (64%), Gaps = 4/234 (1%)
Query: 82 RFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG 141
R +T+ +L S+ K + +R ++PVKDQG CGSCW FS+TGSLE + GK
Sbjct: 102 RGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKL 161
Query: 142 ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV 201
+SLSEQ LVDC+ A+ NQGCNGGL +F YIK NGG+DTE++YPY +DG C++ E+V
Sbjct: 162 VSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEAEDGDCRYKKEDV 221
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVN 260
G V+I G+E +LQ AV V PVSVA + F+ Y GVY C + +D
Sbjct: 222 GATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNCSSESLD-- 279
Query: 261 HAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
H V+AVGYGV++G YWL+KNSW E WG GY M K N CGIA+ ASYP+V
Sbjct: 280 HGVLAVGYGVKNGKKYWLVKNSWAETWGQDGYILMSRDKNNQCGIASSASYPLV 333
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 168/327 (51%), Gaps = 60/327 (18%)
Query: 45 FETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGL--- 101
F VL V + + F +GK Y+S +E +R A F N +I+ N +
Sbjct: 3 FLILVLSVTMATAMDVEWEAFKLTHGKQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMGR 62
Query: 102 -SYRLGLN---------------------------------------------------I 109
SY +G+N +
Sbjct: 63 RSYFMGMNQFGDLAHSEYLELVVGPGLLPLNLSTPSENVFESTPGLQVDDTVDWRQKGAV 122
Query: 110 SPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQA 169
+P+KDQGHCGSCW FSTTGSLE + GK +SLSEQ L+DC++ F N+GC GGL QA
Sbjct: 123 TPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQA 182
Query: 170 FEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
F YIK NGG+DTEE YPY KD VC + + G + +I E L AVG V P
Sbjct: 183 FRYIKSNGGIDTEECYPYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVGTVGP 242
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + RFYKSG+Y +C T +D H V+AVGYG DG+ YWL+KNSWG W
Sbjct: 243 VSVAIDASHKSLRFYKSGIYDEPECSRTKLD--HGVLAVGYGSMDGMDYWLVKNSWGSAW 300
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GD GY KM K N CGIAT ASYPVV
Sbjct: 301 GDMGYVKMTRNKNNQCGIATKASYPVV 327
>gi|308322281|gb|ADO28278.1| cathepsin L [Ictalurus furcatus]
Length = 359
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/312 (42%), Positives = 170/312 (54%), Gaps = 60/312 (19%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KGL-SYRLGLN------- 108
+ F + +++GKIY+SVEE R T+ +N L+ + N KG+ SYRLG+N
Sbjct: 23 IEFQEWKQKFGKIYKSVEEESQRKKTWQENHKLVMNHNILADKGIKSYRLGMNYFADMSN 82
Query: 109 ---------------------------------------------ISPVKDQGHCGSCWT 123
++ V++Q C SCW
Sbjct: 83 QEYRQSVFKGCLSFNRTLNHSAATFLRQVGGPALPNTVNWTQMGYVTEVEEQKQCNSCWA 142
Query: 124 FSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEE 183
FS TG+LE + GK +SLS+QQLVDC++ F N GC GGL + AFEY+K NGGL TEE
Sbjct: 143 FSATGALEGQTFKKTGKLVSLSKQQLVDCSKKFGNNGCKGGLMNWAFEYVKENGGLHTEE 202
Query: 184 AYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFY 242
+YPY KDG C+ + VGV V I E+ LQ AV + P+SVA + F+ Y
Sbjct: 203 SYPYEAKDGSCRDNLGTVGVTCTGHVQINSEDENALQEAVATIGPISVAIDANHTSFQLY 262
Query: 243 KSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NM 301
+SG+Y C T D+NH V+AVGYG +DG YWLIKNSWG NWGD GY KM K N
Sbjct: 263 ESGLYDEPDCSCT--DMNHGVLAVGYGTDDGKDYWLIKNSWGINWGDKGYIKMSRNKNNQ 320
Query: 302 CGIATCASYPVV 313
CGIAT ASYP+V
Sbjct: 321 CGIATAASYPLV 332
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 164/310 (52%), Gaps = 50/310 (16%)
Query: 52 VIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--- 108
+I + S+ R+ + K Y E +R+ + N IR N +G + L +N
Sbjct: 17 IIERPTEDDSWIRWKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQGGDFLLEMNQFG 76
Query: 109 -------------------------------------------ISPVKDQGHCGSCWTFS 125
++PVKDQG CGSCW FS
Sbjct: 77 DMTNNEFKDFNGYLSHKHVSGSTFLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFS 136
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
TTGSLE + GK +SLSEQ LVDC+ A+ N GCNGGL AF YIK N G+D+E +Y
Sbjct: 137 TTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENNGIDSEASY 196
Query: 186 PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKS 244
PYT KDG C F+ NV V+I G E++L+ AV V P+SVA + F+FY+
Sbjct: 197 PYTAKDGKCAFTKPNVAATDTGFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQFYRK 256
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCG 303
GVY+ KC +T +D H V+ VGYG E G YWL+KNSW +WGD GY KM KN CG
Sbjct: 257 GVYNERKCSSTELD--HGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMSRNAKNQCG 314
Query: 304 IATCASYPVV 313
IAT ASYP+V
Sbjct: 315 IATNASYPLV 324
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 115/207 (55%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FSTTGSLE + +A GK +SLSEQ LVDC++ N GCNGGL
Sbjct: 119 VTPVKNQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDN 178
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
F YI+ NGG+DTEE+YPYTGKDG C F+ +VG +V V++ E LQ AV V P
Sbjct: 179 GFTYIQQNGGIDTEESYPYTGKDGDCAFNENSVGARVKGFVDVPQRDEAALQAAVASVGP 238
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + D F++YK GVY C + +D H V+ VGYG E+GV YWL+KNSWG W
Sbjct: 239 VSVAIDASNDSFQYYKEGVYDEPSCSFSQLD--HGVLVVGYGTENGVDYWLVKNSWGPTW 296
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY KM K N CGIA+ ASYP V
Sbjct: 297 GQDGYIKMMRNKENQCGIASMASYPTV 323
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 132/327 (40%), Positives = 171/327 (52%), Gaps = 57/327 (17%)
Query: 42 LRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGL 101
++ F L + A+ FA + + + Y S +E LR + NL+LI N G
Sbjct: 1 MKAFTAVALLALVACATAMPFAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGR 60
Query: 102 -SYRLGLN---------------------------------------------------I 109
SY LG+N +
Sbjct: 61 HSYTLGMNEFGDLAHHEFAAKYLGVRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIV 120
Query: 110 SPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQA 169
+PVK+QG CGSCW+FSTTGS+E + + G +SLSEQ LVDC+ N+GCNGGL A
Sbjct: 121 TPVKNQGQCGSCWSFSTTGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDA 180
Query: 170 FEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPV 229
FEYI NGG+DTE +YPYT G CKF++ N+G V +I G+E +LQ+AV V PV
Sbjct: 181 FEYIIKNGGIDTEASYPYTATTGTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPV 240
Query: 230 SVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENW 287
SVA + F+FY +GVY+ KC T +D H V+AVGYG +G YWL+KNSWG W
Sbjct: 241 SVAIDASHINFQFYFTGVYNEKKCSTTQLD--HGVLAVGYGTSTEGKDYWLVKNSWGATW 298
Query: 288 GDHGYFKMEM-GKNMCGIATCASYPVV 313
G GY M N CGIAT ASYP+V
Sbjct: 299 GKAGYIWMSRNADNQCGIATSASYPLV 325
>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 347
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 115/255 (45%), Positives = 150/255 (58%), Gaps = 3/255 (1%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
L+FA F R Y + S + + F + + + +R I+PV+DQG CG
Sbjct: 95 LTFAEFKRIY--LSSSSQHCRATTGNFQMPVKKNNVEDPVAIDWRKRNVITPVRDQGSCG 152
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCW FS T L A G+ ISLS+QQL+DC+++FNN+GC GGLPSQAFEYI+YNGG+
Sbjct: 153 SCWAFSATSCLSAHLALKTGQLISLSKQQLLDCSRSFNNRGCKGGLPSQAFEYIRYNGGI 212
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGF 239
++E YPY ++ C F V V VN T GAED++ A+ + PVS+ F
Sbjct: 213 ESERDYPYKDREEKCHFKPSLVAATVTGVVNFTQGAEDDIAVALANIGPVSIGIHSTKSF 272
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
YK G+Y C P +NHAV+ VGY G YW+ KNSWG NWG +GYF + G
Sbjct: 273 ATYKKGIYQGKLCSKNPRKINHAVLIVGYDQTASGEKYWIGKNSWGTNWGMNGYFWIRRG 332
Query: 299 KNMCGIATCASYPVV 313
N CG+ATCASYPVV
Sbjct: 333 HNACGLATCASYPVV 347
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 167/306 (54%), Gaps = 56/306 (18%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN--------- 108
+ +F RYGK Y S +E R + + +N + I S N + GL S+ L +N
Sbjct: 22 WQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEE 81
Query: 109 ---------------------------------------ISPVKDQGHCGSCWTFSTTGS 129
++PVKDQ CGSCW FS TGS
Sbjct: 82 INAAMNGFLSAGKKVPRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACGSCWAFSATGS 141
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
LE + + GK +SLSEQ LVDC+ + N GC GGL AF YIK N G+DTEE+YPY
Sbjct: 142 LEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEESYPYEA 201
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYS 248
K+G C+F+S+NVG + V+I G+ED+LQ AV PVSVA + F FY G+Y
Sbjct: 202 KNGPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHFYSRGIYY 261
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
KC ++ +D H V+AVGYG +D YWL+KNSW E WGD GY KM + N CGIA+
Sbjct: 262 DEKCSSSFLD--HGVLAVGYGTDDSSDYWLVKNSWNETWGDSGYIKMSRNRNNNCGIASQ 319
Query: 308 ASYPVV 313
ASYPVV
Sbjct: 320 ASYPVV 325
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 131/305 (42%), Positives = 166/305 (54%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K YES E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAK 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FS TGSL
Sbjct: 90 IFNGYRGQRTSRGSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+D EE+YPY
Sbjct: 150 EGQHFLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDAEESYPYEAM 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSS 249
D C+F E+VG V+I G+ED+L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DDKCRFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCA 308
+C + +D H V+AVGYGV+DG YWL+KNSWG +WGD+GY M K N CGIA+ A
Sbjct: 270 PECSSEELD--HGVLAVGYGVKDGKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|118363827|ref|XP_001015137.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89296904|gb|EAR94892.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 429
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 109/210 (51%), Positives = 146/210 (69%), Gaps = 7/210 (3%)
Query: 109 ISPVKDQG----HCGSCWTFSTTGSLEAAYHQAFGKG-ISLSEQQLVDCAQAFNNQGCNG 163
+S VKDQ CGSCWTFS TG++E+ GK +LS+QQLVDCA F+NQGC+G
Sbjct: 134 VSSVKDQDAVGDDCGSCWTFSATGAIESHLALKTGKAPFNLSQQQLVDCAGKFDNQGCDG 193
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPS+AFEYI Y GG+++ YPY GKDG CKF + V +V S NIT E+EL + +
Sbjct: 194 GLPSRAFEYIAYAGGIESSRDYPYKGKDGKCKFKPQKVVAKVQSSFNITFQDENELIYHL 253
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
PVS+A++V D F Y+ G+YS+ +C P +VNHAV+AVGY + Y+++KNSW
Sbjct: 254 AKNGPVSIAYQVTDDFENYEGGIYSNPECSTDPQEVNHAVLAVGYNLTG--RYYIVKNSW 311
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVV 313
G++WG GYF +E+G NMCG+A CASYP++
Sbjct: 312 GKDWGMDGYFYIELGSNMCGLADCASYPIL 341
>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
Length = 372
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/279 (41%), Positives = 167/279 (59%), Gaps = 22/279 (7%)
Query: 41 GLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG 100
G+ +F + + R S R A+ G + S E KL
Sbjct: 110 GVNNFTDKTEYELRKLRGYRSACRIAKPKGSTFISSEHAKLP----------------DR 153
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS+TG++E +++ + ++LSEQQL+DC++++ N G
Sbjct: 154 VDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNG 213
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----VCKFSSENVGVQVLDSVNITLGAE 216
C GGL AF+Y++ N G+D+E +YPY DG C F+S N+ QV +NI G E
Sbjct: 214 CEGGLMDLAFQYVRDNKGIDSEISYPYISGDGDENVRCLFNSTNIMAQVTGYINIHEGDE 273
Query: 217 DELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP 275
L +AV + PVSVA + F YKSG+YS +C + D++H V+ VGYG+EDG P
Sbjct: 274 RALMNAVATIGPVSVAINAGLPSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKP 333
Query: 276 YWLIKNSWGENWGDHGYFK-MEMGKNMCGIATCASYPVV 313
YWLIKNSWGE+WGD GY K ++ KNMCG+A+ ASYP+V
Sbjct: 334 YWLIKNSWGEDWGDKGYVKILKDSKNMCGVASAASYPLV 372
>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/279 (41%), Positives = 167/279 (59%), Gaps = 22/279 (7%)
Query: 41 GLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG 100
G+ +F + + R S R A+ G + S E KL
Sbjct: 110 GVNNFTDKTEYELRKLRGYRSACRIAKPKGSTFISSEHAKLP----------------DR 153
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS+TG++E +++ + ++LSEQQL+DC++++ N G
Sbjct: 154 VDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNG 213
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----VCKFSSENVGVQVLDSVNITLGAE 216
C GGL AF+Y++ N G+D+E +YPY DG C F+S N+ QV +NI G E
Sbjct: 214 CEGGLMDLAFQYVRDNEGIDSEISYPYISGDGDENVRCLFNSTNIMAQVTGYINIHEGDE 273
Query: 217 DELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP 275
L +AV + PVSVA + F YKSG+YS +C + D++H V+ VGYG+EDG P
Sbjct: 274 RALMNAVATIGPVSVAINAGLSSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKP 333
Query: 276 YWLIKNSWGENWGDHGYFK-MEMGKNMCGIATCASYPVV 313
YWLIKNSWGE+WGD GY K ++ KNMCG+A+ ASYP+V
Sbjct: 334 YWLIKNSWGEDWGDKGYVKILKDSKNMCGVASAASYPLV 372
>gi|118363825|ref|XP_001015136.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89296903|gb|EAR94891.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 355
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 188/343 (54%), Gaps = 42/343 (12%)
Query: 8 VSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQAR------HALS 61
+ ++I L A SA + DSN L+S GL ++ + L I Q+ +
Sbjct: 1 MRNIIFLTLSALCLSAVVAQ--DSNQEILISR-GLVNYTDADLLSIYQSYGYEPDPSSER 57
Query: 62 FARFARRYGKIYE---------SVEEMKLRFATFSKNLDLIRSTNCKG------------ 100
F F R KI E S + KL F T S+ S NC
Sbjct: 58 FQLFKSRLAKIIEHNSNPDKKYSQKINKLTFQTGSELKKFRASQNCSATAQANTRSFRKY 117
Query: 101 --------LSYRLGLNISPVKDQGH-CGSCWTFSTTGSLEAAYHQAFGKG-ISLSEQQLV 150
+ +R ++ VK+QG CGSCW F+ +LE+ Y GK I SEQQLV
Sbjct: 118 DLSQLPQYVDWREKGVVTQVKNQGEDCGSCWAFAAVAALESHYALKTGKKPIQFSEQQLV 177
Query: 151 DCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVN 210
DCA+ F+ QGC+GGLPS+ FEY+ Y GG+ TE YPY GKD C+F+S QV S N
Sbjct: 178 DCARKFDTQGCDGGLPSKGFEYLAYAGGIQTEADYPYEGKDKKCRFNSSKAVAQVEKSFN 237
Query: 211 ITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV 270
IT E+EL + + PV++A+EV D F YK GV++S+ C P DVNHAV+AVGY +
Sbjct: 238 ITFQDENELIYHLANYGPVAIAYEVNDDFDNYKDGVFTSSNCSTDPEDVNHAVLAVGYNM 297
Query: 271 EDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
Y+++KNSWG++WG +GYF +E+G NMCG+A CASYP++
Sbjct: 298 TG--KYFIVKNSWGKDWGMNGYFYIELGSNMCGLADCASYPII 338
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 129/305 (42%), Positives = 164/305 (53%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K Y+S E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAR 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FSTTGSL
Sbjct: 90 IFNGYHGSRKSGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY
Sbjct: 150 EGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAV 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
DG C+F E+VG V I G ED+L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DGECRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCA 308
+C + D++H V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ A
Sbjct: 270 PEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 114/216 (52%), Positives = 143/216 (66%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW+FS TGSLE + + GK +SLSEQ LVDC++ F N G
Sbjct: 124 IDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNG 183
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF YIK NGG+DTE+AYPY +D C + +N G V+I G ED+LQ
Sbjct: 184 CNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQ 243
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWL 278
AV V PVSVA + F+ Y GVY +C +P ++H V+ VGYG E DG YWL
Sbjct: 244 SAVATVGPVSVAIDASHQSFQLYSGGVYYEPEC--SPSQLDHGVLVVGYGTEDDGTDYWL 301
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG++WGD GY KM + N CGIAT ASYP+V
Sbjct: 302 VKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPLV 337
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 128/305 (41%), Positives = 164/305 (53%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K Y+S E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAR 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FS TGSL
Sbjct: 90 IFNGHHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY
Sbjct: 150 EGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAV 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
DG C+F E+VG V I G+ED+L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCA 308
+C + D++H V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ A
Sbjct: 270 PEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|323446652|gb|EGB02738.1| hypothetical protein AURANDRAFT_34950 [Aureococcus anophagefferens]
Length = 235
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 124/226 (54%), Positives = 145/226 (64%), Gaps = 15/226 (6%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEA--AYHQAFGKGISLSEQQLVDCAQAFNN 158
L +R +S VK+QGHCGSCWTFST G+LEA A Q + LSEQQLVDCA AF+
Sbjct: 5 LDWRSRGVVSEVKNQGHCGSCWTFSTVGALEAHLALKQDAWRAPRLSEQQLVDCAGAFDT 64
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF-----------SSENVGVQVLD 207
+GC GGLPS AFEY+KY GGL TE +YPY G D C F S+ V
Sbjct: 65 KGCAGGLPSHAFEYVKYAGGLSTEFSYPYRGVDQACAFNATASSSGLPTSAGVGVVVPGG 124
Query: 208 SVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
SVNIT G E L++ + PVSVAF+V FR Y SGVYSST C N MDVNHAV+AVG
Sbjct: 125 SVNITKGDEAALKYHLATKGPVSVAFQVASDFRDYASGVYSSTVCKNGAMDVNHAVLAVG 184
Query: 268 YGVE--DGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
YG + + YW IKNSW +WGD G+FKME NMCG+A C +YP
Sbjct: 185 YGTDPVSNMTYWTIKNSWDYSWGDEGFFKMESFVNMCGVANCNAYP 230
>gi|229595078|ref|XP_001020175.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225566400|gb|EAR99930.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 375
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 187/342 (54%), Gaps = 42/342 (12%)
Query: 8 VSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQAR------HALS 61
+ ++I L A SA + DSN L+S GL D+ + L I Q+ +
Sbjct: 1 MRNIIFLTLSALCLSAVIAQ--DSNQEILISR-GLVDYTDADLLSIYQSYGYEPDPSSER 57
Query: 62 FARFARRYGKIYE---------SVEEMKLRFATFSKNLDLIRSTNCKG------------ 100
F F R KI E S + KL F T S+ S NC
Sbjct: 58 FQLFKSRLAKIIEHNSNPDKKYSQKINKLTFQTGSELKKFRASQNCSATAQANTRSFRKY 117
Query: 101 --------LSYRLGLNISPVKDQGH-CGSCWTFSTTGSLEAAYHQAFGKG-ISLSEQQLV 150
+ +R ++ VK+QG CGSCW F+ +LE+ Y GK I SEQQLV
Sbjct: 118 DLSQLPQYVDWREKGVVTQVKNQGEDCGSCWAFAAVAALESHYALKTGKKPIQFSEQQLV 177
Query: 151 DCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVN 210
DCA+ F+ QGC+GGLPS+ FEY+ Y GG+ TE YPY GKD C+F+S QV S N
Sbjct: 178 DCARKFDTQGCDGGLPSKGFEYLAYAGGIQTEADYPYEGKDKKCRFNSSKAVAQVEKSFN 237
Query: 211 ITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV 270
IT E+EL + + PV++A+EV D F Y+ GV++S+ C P DVNHAV+AVGY +
Sbjct: 238 ITFQDENELIYHLANYGPVAIAYEVNDDFDNYEDGVFTSSNCSTDPEDVNHAVLAVGYNM 297
Query: 271 EDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPV 312
Y+++KNSWG++WG +GYF +E+G NMCG+A CASYP+
Sbjct: 298 TG--KYFIVKNSWGKDWGMNGYFYIELGSNMCGLADCASYPI 337
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 128/305 (41%), Positives = 164/305 (53%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K Y+S E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAR 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FS TGSL
Sbjct: 90 IFNGHHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY
Sbjct: 150 EGRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAV 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
DG C+F E+VG V I G+ED+L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCA 308
+C + D++H V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ A
Sbjct: 270 PEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|348542774|ref|XP_003458859.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 330
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 112/207 (54%), Positives = 143/207 (69%), Gaps = 5/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FS TG+LE Y + GK +SLSEQQLVDC++ F N GC GG P
Sbjct: 127 VTHVKDQKECGSCWAFSATGALEGQYFKKTGKLVSLSEQQLVDCSRKFRNNGCEGGEPHW 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YI+YNGGLDTEE+Y Y KDG C ++ ++VG + VN++ ED L+ AV + P
Sbjct: 187 AFQYIRYNGGLDTEESYHYEAKDGQCHYNPDSVGAKCSGYVNVS-PFEDALKEAVATIGP 245
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA ++ F+ Y SGVY C N +++NHAV+AVGYG E+G YWL+KNSWG W
Sbjct: 246 ISVAIDISRVSFQLYHSGVYDEPWCSN--INLNHAVLAVGYGTENGHDYWLVKNSWGSEW 303
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY KM K N CGIAT ASYP+V
Sbjct: 304 GNKGYIKMTRNKDNQCGIATEASYPLV 330
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 124/301 (41%), Positives = 162/301 (53%), Gaps = 50/301 (16%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
S+ ++ + K+Y E +R+ + N IR N KG + L +N
Sbjct: 26 SWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFKA 85
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
++PVKDQG CGSCW FSTTGSLE +
Sbjct: 86 FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQH 145
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
+ GK +SLSEQ LVDC+ A+ N GCNGGL AF YIK N G+D+E +YPYT +DG C
Sbjct: 146 FKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPYTAEDGKC 205
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCG 253
F +V V++ G E++L+ AV V P+SVA + + F+FY SGVY+ C
Sbjct: 206 VFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYNEPSCS 265
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCASYPV 312
+T +D H V+ VGYG E G YWL+KNSW +WGD GY KM KN CGIAT ASYP+
Sbjct: 266 STELD--HGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPL 323
Query: 313 V 313
V
Sbjct: 324 V 324
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 113/232 (48%), Positives = 148/232 (63%), Gaps = 4/232 (1%)
Query: 84 ATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGIS 143
+TF ++ S+ K + +R ++PVKDQG CGSCW FS TGSLE + G+ +S
Sbjct: 103 STFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVS 162
Query: 144 LSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV 203
LSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY DG C+F E+VG
Sbjct: 163 LSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGA 222
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHA 262
V I G+ED+L+ AV V P+SVA + F+ Y GVY +C + D++H
Sbjct: 223 TDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPEC--SSEDLDHG 280
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCASYPVV 313
V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ ASYP+V
Sbjct: 281 VLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 115/207 (55%), Positives = 138/207 (66%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FSTTGSLE + GK +SLSEQ LVDC+ + NQGCNGGL Q
Sbjct: 126 VTEVKNQGQCGSCWAFSTTGSLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQ 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YIK NGG+DTE AYPYTG DG C+F VG V V++ G E+ L+ AV V P
Sbjct: 186 AFTYIKKNGGIDTEAAYPYTGSDGTCRFLENKVGATVSGFVDVKSGDENALKEAVATVGP 245
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+FY+ GVY+ C +T +D H V+ VGYG E G YWL+KNSWG +W
Sbjct: 246 ISVAIDASSIFFQFYRGGVYNPWFCSSTELD--HGVLVVGYGTEGGKDYWLVKNSWGSSW 303
Query: 288 GDHGYFKM-EMGKNMCGIATCASYPVV 313
G GY KM KN CGIAT ASYP V
Sbjct: 304 GLKGYIKMVRNKKNRCGIATQASYPTV 330
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 111/217 (51%), Positives = 145/217 (66%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R+ I+PVKDQG CGSCW FS+TG+LE + GK ISLSEQ L+DC+ + N
Sbjct: 124 ESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL QAF+YIK N G+DTE YPY +D VC+++ N G V+I G ED+
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVCRYNPRNRGAIDRGFVHIPSGEEDK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V PVSVA + + F+FY GVY C + D++H V+ VGYG ++G YW
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSD--DLDHGVLVVGYGSDNGKDYW 301
Query: 278 LIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
L+KNSW E+WGD GY K+ KN CGIAT ASYP+V
Sbjct: 302 LVKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPLV 338
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 113/207 (54%), Positives = 135/207 (65%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FSTTG+LE + + GK +SLSEQ LVDC+ N GCNGGL Q
Sbjct: 129 VTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDCSGKQGNMGCNGGLMDQ 188
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTE++YPY D C+F + NVG +IT E LQ AV V P
Sbjct: 189 AFEYIKENNGIDTEDSYPYEAVDNQCRFKAANVGATDTGFTDITSKDESALQQAVATVGP 248
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ YK GVY+ C T +D H V+AVGYG + G YWL+KNSWGE W
Sbjct: 249 ISVAIDAGHTSFQLYKHGVYNEPFCSQTRLD--HGVLAVGYGTDSGKDYWLVKNSWGEGW 306
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GD GY KM K N CGIAT ASYP+V
Sbjct: 307 GDKGYIKMTRNKRNQCGIATAASYPLV 333
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 114/216 (52%), Positives = 141/216 (65%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW+FS TGSLE + + GK +SLSEQ LVDC++ F N G
Sbjct: 124 IDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNG 183
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF YIK NGG+DTE+AYPY +D C + +N G V+I G ED+LQ
Sbjct: 184 CNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQ 243
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWL 278
AV V PVSVA + F+ Y GVY C + +D H V+ VGYG E DG YWL
Sbjct: 244 SAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLD--HGVLVVGYGTEDDGTDYWL 301
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG++WGD GY KM + N CGIAT ASYP+V
Sbjct: 302 VKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPLV 337
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/301 (41%), Positives = 162/301 (53%), Gaps = 50/301 (16%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
S+ ++ + K+Y E +R+ + N IR N KG + L +N
Sbjct: 26 SWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFKA 85
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
++PVKDQG CGSCW FSTTGSLE +
Sbjct: 86 FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQH 145
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
+ GK +SLSEQ LVDC+ A+ N GC+GGL AF YIK N G+D+E +YPYT +DG C
Sbjct: 146 FKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTAEDGKC 205
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCG 253
F +V V+I G E++L+ AV V P+SVA + + F+FY SGVY+ C
Sbjct: 206 VFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYNEPSCS 265
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCASYPV 312
+T +D H V+ VGYG E G YWL+KNSW +WGD GY KM KN CGIAT ASYP+
Sbjct: 266 STELD--HGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPL 323
Query: 313 V 313
V
Sbjct: 324 V 324
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 114/216 (52%), Positives = 141/216 (65%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW+FS TGSLE + + GK +SLSEQ LVDC++ F N G
Sbjct: 124 IDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDCSEKFGNNG 183
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF YIK NGG+DTE+AYPY +D C + +N G V+I G ED+LQ
Sbjct: 184 CNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQ 243
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWL 278
AV V PVSVA + F+ Y GVY C + +D H V+ VGYG E DG YWL
Sbjct: 244 SAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLD--HGVLVVGYGTEDDGTDYWL 301
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG++WGD GY KM + N CGIAT ASYP+V
Sbjct: 302 VKNSWGKSWGDQGYIKMARNRNNNCGIATEASYPLV 337
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 112/217 (51%), Positives = 140/217 (64%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FSTTGSLE + + G+ +SLSEQ LVDC+ F N
Sbjct: 144 KTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFGN 203
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YIK NGG+DTE +YPY G DG+C F +VG V+I G E
Sbjct: 204 NGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGICHFEKSDVGATDTGFVDIPEGNEQL 263
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V PVSVA + + F+FY GVY +C + +D H V+ VGYG +DG YW
Sbjct: 264 LKKAVATVGPVSVAIDASHESFQFYSQGVYDEPECSSESLD--HGVLVVGYGTKDGQDYW 321
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG WGD GY M K N CGIA+ ASYP+V
Sbjct: 322 LVKNSWGTTWGDDGYIYMTRNKENQCGIASSASYPLV 358
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 117/234 (50%), Positives = 146/234 (62%), Gaps = 8/234 (3%)
Query: 82 RFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG 141
R + NL + T + +R ++PVK+Q CGSCW FSTTGSLE + GK
Sbjct: 80 RVHQYDSNLVELPDT----VDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTGKL 135
Query: 142 ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV 201
+SLSEQ LVDC+ F NQGCNGGL AF+YIK NGG+DTE++YPY +DG C+F +V
Sbjct: 136 VSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRFKPADV 195
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVN 260
G V +I+ G E L AV V P+SVA + F+ Y GVY +C +T +D
Sbjct: 196 GATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTELD-- 253
Query: 261 HAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
H V+AVGYG E G YWL+KNSWGE WG +GY M K N CGIAT ASYP+V
Sbjct: 254 HGVLAVGYGTEGGKDYWLVKNSWGEVWGQNGYIMMSRNKNNQCGIATSASYPLV 307
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 119/256 (46%), Positives = 151/256 (58%), Gaps = 15/256 (5%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
L F R R K+ E + F N D + K + +R ISPVKDQGHCG
Sbjct: 88 LGFNRSLRATNKVPEGI--------PFRHNKDAVIQ---KEVDWRQKGAISPVKDQGHCG 136
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCW FS+TG+LEA G+ +SLSEQ L+DC+ + N GC GGL QAF+Y++ N G+
Sbjct: 137 SCWAFSSTGALEAHTFLKKGRRVSLSEQNLIDCSLNYGNNGCEGGLMEQAFQYVRDNDGI 196
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-G 238
DTEEAYPY G+D C+F NVG V I G E L AV P+S+A + +
Sbjct: 197 DTEEAYPYEGEDSECRFKKNNVGATDAGFVTIPSGDEQALMEAVATQGPLSIAIDASNPS 256
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F+FY GVY +C + +D H V+ VGYGVE YWL+KNSW E WG++GY KM
Sbjct: 257 FQFYSEGVYYEPECSSAQLD--HGVLLVGYGVEKDQKYWLVKNSWSEQWGENGYIKMARN 314
Query: 299 K-NMCGIATCASYPVV 313
K N CGIAT AS+P+V
Sbjct: 315 KDNNCGIATQASFPIV 330
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 109/207 (52%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
I+PVKDQG CGSCW FS+TG+LE + GK ISLSEQ L+DC+ + N+GCNGGL Q
Sbjct: 134 ITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQ 193
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK N G+DTE YPY +D VC+++ N G V+I G ED+L+ AV V P
Sbjct: 194 AFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGP 253
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + + F+FY GVY C + D++H V+ VGYG ++G YWL+KNSW E+W
Sbjct: 254 VSVAIDASHESFQFYSKGVYYEPSCDSD--DLDHGVLVVGYGSDNGKDYWLVKNSWSEHW 311
Query: 288 GDHGYFKMEMG-KNMCGIATCASYPVV 313
GD GY K+ KN CG+AT ASYP+V
Sbjct: 312 GDEGYIKIARNRKNHCGVATAASYPLV 338
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 109/207 (52%), Positives = 138/207 (66%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FS TGSLE + GK +SLSEQQLVDC+ + N GCNGGL
Sbjct: 131 VTGVKDQKQCGSCWAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDY 190
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YI+ NGG+DTE++YPY +DG C+F ENVG + V++T+G ED L+ AV + P
Sbjct: 191 AFKYIQENGGIDTEKSYPYEAEDGQCRFKPENVGAKCTGYVDVTVGDEDALKEAVATIGP 250
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSV + F+ Y SGVY C + D++H V+AVGYG ++G YWL+KNSWG W
Sbjct: 251 VSVGIDASHSSFQLYDSGVYDEQDC--SSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGW 308
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K N CGIAT ASYP+V
Sbjct: 309 GQEGYIMMSRNKDNQCGIATAASYPLV 335
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 224 bits (570), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 108/207 (52%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
I+PVKDQG CGSCW FS+TG+LE + GK +SLSEQ L+DC+ + N+GCNGGL Q
Sbjct: 134 ITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQ 193
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK N G+DTE YPY +DGVC+++ N G V+I G ED+L+ AV V P
Sbjct: 194 AFQYIKDNKGIDTENTYPYEAEDGVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGP 253
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + + F+FY G Y C + D++H V+ VGYG ++G YWL+KNSW E+W
Sbjct: 254 VSVAIDASHESFQFYSKGXYYEPSCDSD--DLDHGVLVVGYGSDNGEDYWLVKNSWSEHW 311
Query: 288 GDHGYFKMEMG-KNMCGIATCASYPVV 313
GD GY K+ KN CG+AT ASYP+V
Sbjct: 312 GDEGYIKIARNRKNHCGVATAASYPLV 338
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 163/305 (53%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K Y+S E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAR 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FS TGSL
Sbjct: 90 IFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY
Sbjct: 150 EGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYKAV 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
DG C+F E+VG V I G+E +L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCA 308
+C + D++H V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ A
Sbjct: 270 PEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 110/217 (50%), Positives = 142/217 (65%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QGHCGSCW FSTTG+LE + GK +SLSEQ LVDC+ ++ N
Sbjct: 120 KEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGN 179
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YIK N G+DTE++YPY G+D C+F ++G V+IT G E+
Sbjct: 180 NGCEGGLMDNAFQYIKENHGIDTEKSYPYEGEDETCRFRKTSIGATDSGFVDITQGDEEA 239
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L AV + P+SVA + F+FY GVY +C + +D H V+ VGYGVED YW
Sbjct: 240 LMQAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLD--HGVLVVGYGVEDNQKYW 297
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG WGD GY KM + N CGIAT ASYP+V
Sbjct: 298 LVKNSWGTQWGDGGYIKMARDQDNNCGIATQASYPLV 334
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 163/305 (53%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K Y+S E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAR 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FS TGSL
Sbjct: 90 IFNGHRGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY
Sbjct: 150 EGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAV 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
DG C+F E+VG V I G+E +L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCA 308
+C + D++H V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ A
Sbjct: 270 PEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 163/305 (53%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K Y+S E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAR 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FS TGSL
Sbjct: 90 IFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY
Sbjct: 150 EGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAV 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
DG C+F E+VG V I G+E +L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCA 308
+C + D++H V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ A
Sbjct: 270 PEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 110/217 (50%), Positives = 143/217 (65%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW+FSTTGSLE + + K +SLSEQ L+DC+++F N
Sbjct: 117 KTVDWRKKGAVTPVKNQGQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGN 176
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YIK N G+DTE++YPY DGVC F+ VG V+I G E++
Sbjct: 177 NGCEGGLMDYAFKYIKANKGIDTEQSYPYNATDGVCHFNKSAVGATDTGFVDIPEGDENK 236
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V PVSVA + + F+FY GVY +C + +D H V+ VGYG +DG YW
Sbjct: 237 LKKAVATVGPVSVAIDASHESFQFYSEGVYDEPECDSEQLD--HGVLVVGYGTKDGQDYW 294
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG WGD GY M K N CGIA+ ASYP+V
Sbjct: 295 LVKNSWGTTWGDGGYIYMSRNKDNQCGIASAASYPLV 331
>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 115/279 (41%), Positives = 165/279 (59%), Gaps = 22/279 (7%)
Query: 41 GLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG 100
G+ +F + + R S R A+ G + S E KL
Sbjct: 110 GVNNFTDKTEYELRKLRGYRSACRIAKPKGSTFISSEHAKLP----------------DR 153
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS+TG++E +++ + ++LSEQQL+DC++++ N G
Sbjct: 154 VDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNG 213
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----VCKFSSENVGVQVLDSVNITLGAE 216
C GGL AF+Y++ N G+D+E +YPY DG C F+ N+ QV +NI G E
Sbjct: 214 CEGGLMDLAFQYVRDNEGIDSEISYPYISGDGDENVRCLFNFTNIMAQVTGYINIHEGDE 273
Query: 217 DELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP 275
L +AV + PVSVA + F YKSG+YS +C + D++H V+ VGYG+EDG P
Sbjct: 274 RALMNAVTTIGPVSVAINAGLSSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKP 333
Query: 276 YWLIKNSWGENWGDHGYFK-MEMGKNMCGIATCASYPVV 313
YWLIKNSWGE+WGD GY K ++ KNMC +A+ ASYP+V
Sbjct: 334 YWLIKNSWGEDWGDKGYVKILKDSKNMCSVASAASYPLV 372
>gi|330800456|ref|XP_003288252.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
gi|325081708|gb|EGC35214.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
Length = 531
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 124/302 (41%), Positives = 162/302 (53%), Gaps = 51/302 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F F Y K YE+ EE +RF + + I S N K LSY+LG N
Sbjct: 225 FVAFKSEYEKSYENKEEHDMRFKNYKVAHNKIVSHNAKNLSYKLGFNHYADLSDHEFNTL 284
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQG CGSCWTF +TGSLE
Sbjct: 285 IKPKVARPSNNGAHSVHDDEDIYTIPQSVDWRNQKCVTPVKDQGVCGSCWTFGSTGSLEG 344
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
G +SLSEQQLVDCA +QGCNGG + AF+YI GG+ TE Y Y ++
Sbjct: 345 TNCVTNGYLVSLSEQQLVDCAYLMGSQGCNGGFAASAFQYIMDAGGIATESDYQYLMQNA 404
Query: 193 VCKFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSST 250
+CK S GV V VN+T G+ + L +AV PV++A + VD FR+Y+SG+YS+
Sbjct: 405 LCKDKSTTFSGVGVSSYVNVTAGSINALLNAVATQGPVAIAIDASVDDFRYYQSGIYSNP 464
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
C N P D++H V+A+GYG +GV YWL+KNSW NWG GYF +E N+CG A+ A+Y
Sbjct: 465 SCKNGPDDLDHEVLAIGYGTLNGVDYWLVKNSWSTNWGMEGYFMLERANNLCGPASQATY 524
Query: 311 PV 312
P+
Sbjct: 525 PL 526
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 108/207 (52%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
I+PVKDQG CGSCW FS+TG+LE + GK +SLSEQ L+DC+ + N+GCNGGL Q
Sbjct: 130 ITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQ 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK N G+DTE YPY +D VC+++ N G V+I G ED+L+ AV V P
Sbjct: 190 AFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGP 249
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + + F+FY GVY C + D++H V+ VGYG ++G YWL+KNSW E+W
Sbjct: 250 VSVAIDASHESFQFYSKGVYYEPSCDSD--DLDHGVLVVGYGSDNGKDYWLVKNSWSEHW 307
Query: 288 GDHGYFKMEMG-KNMCGIATCASYPVV 313
GD GY KM KN CG+A+ ASYP+V
Sbjct: 308 GDEGYIKMARNRKNHCGVASAASYPLV 334
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 163/305 (53%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K Y+S E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAR 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FS TGSL
Sbjct: 90 IFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY
Sbjct: 150 EGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAV 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
DG C+F E+VG V I G+E +L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCA 308
+C + D++H V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ A
Sbjct: 270 PEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 144/216 (66%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQGHCGSCW+FS TG+LE + + GK +SLSEQ LVDC+ + N G
Sbjct: 126 VDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNG 185
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG+ AF+YIK NGG+DTE++YPY D C F+ + VG V+I G E+ L+
Sbjct: 186 CNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALK 245
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWL 278
A+ V PVS+A + + F+FY GVY +C + +D H V+AVGYG E+G YWL
Sbjct: 246 KALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLD--HGVLAVGYGTSEEGEDYWL 303
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG WGD GY KM + N CG+ATCASYP+V
Sbjct: 304 VKNSWGTTWGDQGYVKMARNRDNHCGVATCASYPLV 339
>gi|405966500|gb|EKC31778.1| Cathepsin L [Crassostrea gigas]
Length = 271
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 112/207 (54%), Positives = 137/207 (66%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ +K+QGHCGSCW+FS TGSLE + +A K +SLSEQ LVDC+Q N GC GGL
Sbjct: 67 VTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSQREGNHGCQGGLMDN 126
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YI+ N G+DTEE+YPYT K+G C F ENVG V+I ED+LQ AV V P
Sbjct: 127 AFRYIESNKGIDTEESYPYTAKNGFCHFKKENVGATDTGYVDIPHMQEDKLQEAVATVGP 186
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y+ GVYS C ++ +D H V+AVGYG E G YWL+KNSWG +W
Sbjct: 187 ISVAIDAGHKSFQLYREGVYSEPACSSSKLD--HGVLAVGYGTESGDDYWLVKNSWGTSW 244
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K NMCGIAT ASYP V
Sbjct: 245 GMQGYVMMARNKHNMCGIATQASYPKV 271
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 111/217 (51%), Positives = 142/217 (65%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FSTTGSLE + + K +SLSEQ LVDC+++F N
Sbjct: 121 KTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGN 180
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YIK N G+DTE +YPY DGVC F+ +VG V+I G E++
Sbjct: 181 NGCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGVCHFNRSDVGATDTGFVDIPEGDENK 240
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V PVSVA + + F+FY GVY +C + +D H V+ VGYG +DG YW
Sbjct: 241 LKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLD--HGVLVVGYGTKDGQDYW 298
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG WGD GY M K N CGIA+ ASYP+V
Sbjct: 299 LVKNSWGTTWGDEGYIYMTRNKDNQCGIASSASYPLV 335
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 164/310 (52%), Gaps = 60/310 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN--------- 108
+ ++ +GK Y S EE R + KNLD++ N K +Y LG+N
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEE 87
Query: 109 ------------------------------------------ISPVKDQGHCGSCWTFST 126
++PVKDQG CGSCW FST
Sbjct: 88 FVAMMTGFRVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFST 147
Query: 127 TGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYP 186
TGSLE + +A GK +SLSEQ LVDC+ N+GC+GGL QAF+YI GG+DTEE+YP
Sbjct: 148 TGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTEESYP 207
Query: 187 YTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSG 245
Y DG C F N+G V ++T +E LQ AV + P+SVA + F+ YKSG
Sbjct: 208 YKAVDGECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLYKSG 267
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCG 303
VY+ C +T +D H V+AVGYG DG YW++KNSW E WG +GY M K N CG
Sbjct: 268 VYNEPDCSSTLLD--HGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRNKDNQCG 325
Query: 304 IATCASYPVV 313
IAT ASYP+V
Sbjct: 326 IATQASYPLV 335
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 105/207 (50%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++P+KDQGHCGSCW+FSTTG+LE + + GK +SLSEQ L+DC+ ++ N GCNGG+
Sbjct: 147 VTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDY 206
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK N G DTE++YPY DG C+F E VG ++ G E++++ AV +V P
Sbjct: 207 AFQYIKDNDGDDTEDSYPYEAADGPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGP 266
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + F+ Y+SGVY +C P ++H V+ VGYG E G YWL+KNSWG W
Sbjct: 267 VSVAIDASHTSFQMYQSGVYDEVEC--DPEGLDHGVLVVGYGTELGQDYWLVKNSWGTKW 324
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GD GY KM K N CGI++ ASYP+V
Sbjct: 325 GDEGYIKMSRNKNNQCGISSMASYPLV 351
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 163/305 (53%), Gaps = 58/305 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
F + K Y+S E LRF F++N +I N K GL SY+LG+N
Sbjct: 30 FKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAR 89
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FS TGSL
Sbjct: 90 IFNGHHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSL 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + G+ +SLSEQ LVDC+Q+F N GC GGL AF+YIK N G+DTE++YPY
Sbjct: 150 EGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAV 209
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
DG C+F E+VG V I G+E +L+ AV V P+SVA + F+ Y GVY
Sbjct: 210 DGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDE 269
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-GKNMCGIATCA 308
+C + D++H V+ VGYGV+ G YWL+KNSW E+WGD GY M N CGIA+ A
Sbjct: 270 PEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327
Query: 309 SYPVV 313
SYP+V
Sbjct: 328 SYPLV 332
>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 323
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 111/215 (51%), Positives = 140/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FSTTGSLE + G ISL+EQQLVDC++ + QG
Sbjct: 111 VDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQG 170
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG + AF+YIK N G+DTE AYPY +DG C+F S +V NI G+E LQ
Sbjct: 171 CNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQ 230
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV + P+SV + F+FY SGVY C +P ++HAV+AVGYG E G +WL+
Sbjct: 231 QAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSC--SPSYLDHAVLAVGYGSEGGQDFWLV 288
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSW +WGD GY KM + N CGIAT ASYP+V
Sbjct: 289 KNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323
>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
Length = 294
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 107/207 (51%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FS TGSLE ++ GK +SLSEQQLVDC+ + N GC GGL
Sbjct: 90 VTGVKDQKQCGSCWAFSATGSLEGQNYRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDS 149
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YI+ NGG+DTEE+YPY +DG C+F +N+G + V++T G ED L+ AV + P
Sbjct: 150 AFKYIQENGGIDTEESYPYEAEDGKCRFKPQNIGAKCTGYVDVTAGDEDALKEAVATIGP 209
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + F+ Y+SGVY +C + D++H V+AVGYG ++G YWL+KNSWG W
Sbjct: 210 VSVAIDASHSSFQLYESGVYDELEC--SSEDLDHGVLAVGYGTDNGQDYWLVKNSWGLGW 267
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K N CGIA+ ASYP+V
Sbjct: 268 GQKGYIMMSRNKHNQCGIASMASYPLV 294
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 109/207 (52%), Positives = 137/207 (66%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW+FS TGSLE + + GK +SLSEQ L+DC+ N GCNGGL Q
Sbjct: 136 VTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQ 195
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK GG+DTE YPY KD C+F+ + G V+I G E+ L+ A V P
Sbjct: 196 AFKYIKIQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVDIKSGDEEMLKEAAATVGP 255
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+FY +GVYS T C +T +D H V+ VGYG E+G YWL+KNSWGE W
Sbjct: 256 ISVAIDASHTSFQFYSNGVYSETACSSTMLD--HGVLVVGYGTENGKDYWLVKNSWGEGW 313
Query: 288 GDHGYFKMEM-GKNMCGIATCASYPVV 313
G+ GY KM N CGIAT ASYP+V
Sbjct: 314 GEAGYIKMSRNADNQCGIATQASYPLV 340
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 105/207 (50%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FS TGSLE + + G +SLSEQQLVDC+ + N GC GGL
Sbjct: 130 VTDVKDQKQCGSCWAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDY 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YI+ NGG+DTEE+YPY ++G C+++ +N+G ++ G ED L+ AV + P
Sbjct: 190 AFQYIQANGGIDTEESYPYEAENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGP 249
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + F+FY+SGVY+ C + ++++H V+AVGYG EDG YWL+KNSWG W
Sbjct: 250 ISVGIDASQMSFQFYESGVYNEPDC--SSLELDHGVLAVGYGTEDGNDYWLVKNSWGLEW 307
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GD GY KM K N CGIAT ASYP+V
Sbjct: 308 GDKGYIKMSRNKSNQCGIATAASYPLV 334
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 111/217 (51%), Positives = 138/217 (63%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FSTTGSLE + + G +SLSEQ LVDC+ AF N
Sbjct: 123 KTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCSTAFGN 182
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YIK NGG+DTE++YPY G DG C F +VG V+I G E
Sbjct: 183 NGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGTCHFKKSDVGATDTGFVDIPEGNEHL 242
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V P+SVA + F+FY GVY +C + +D H V+ VGYG +D YW
Sbjct: 243 LKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLD--HGVLVVGYGTKDDQDYW 300
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG WGD GY M K N CGIA+ ASYP+V
Sbjct: 301 LVKNSWGTTWGDGGYIYMTRNKDNQCGIASSASYPLV 337
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 111/233 (47%), Positives = 145/233 (62%), Gaps = 9/233 (3%)
Query: 83 FATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI 142
F +N DL + + + Y ++ VKDQ CGSCW FS TGSLE + GK +
Sbjct: 109 FFRLPENKDLPAAVDWRDKGY-----VTDVKDQKQCGSCWAFSATGSLEGQTFRKTGKLV 163
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVG 202
SLSEQQLVDC+ + N GC GGL AF YI+ GG+DTEE+YPY +DG C++ + VG
Sbjct: 164 SLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGGIDTEESYPYEAEDGECRYKPDAVG 223
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNH 261
V+++ G ED LQ AV + P+SV + F+ Y+SG+Y +C ++ +D H
Sbjct: 224 ATCTGYVDVSSGDEDALQEAVATIGPISVGIDASHISFQLYESGLYDEPQCSSSELD--H 281
Query: 262 AVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
V+AVGYG E+G YWL+KNSWG WGD GY KM K N CGIAT ASYP+V
Sbjct: 282 GVLAVGYGSENGQDYWLVKNSWGLTWGDQGYIKMSKNKSNQCGIATAASYPLV 334
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 125/294 (42%), Positives = 176/294 (59%), Gaps = 22/294 (7%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFARRYGKIYESV-----EEMKLRFATF-- 86
R++ LR E L+ +G+ H+L +F + + + + K+R +TF
Sbjct: 49 RVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNGYKNQKKIRGSTFLA 108
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
N + +S + + Y ++PVKDQG CGSCW FSTTG+LE +++ GK ISLSE
Sbjct: 109 PNNFESPKSVDWRKKGY-----VTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSE 163
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQV 205
Q LVDC++A NQGCNGGL QAF+Y+K NGG+D+E++YPYT KD C +
Sbjct: 164 QNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSAND 223
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V++T G+E +L +AV V PVSVA + F+FYKSG+Y +C + D++H V+
Sbjct: 224 TGFVDVTSGSEKDLMNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPEC--SSEDLDHGVL 281
Query: 265 AVGYGV----EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
VGYG EDG YW++KNSW E WG+ GY + + N CGIAT ASYP+V
Sbjct: 282 VVGYGFEGEDEDGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASYPLV 335
>gi|228244|prf||1801240B Cys protease 2
Length = 323
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 110/215 (51%), Positives = 140/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FSTTGSLE + G ISL+EQQLVDC++ + QG
Sbjct: 111 VDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQG 170
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG + AF+YIK N G+DTE +YPY +DG C+F S +V NI G+E LQ
Sbjct: 171 CNGGWMNDAFDYIKANNGIDTEASYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQ 230
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV + P+SV + F+FY SGVY C +P ++HAV+AVGYG E G +WL+
Sbjct: 231 QAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSC--SPSYLDHAVLAVGYGSEGGQDFWLV 288
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSW +WGD GY KM + N CGIAT ASYP+V
Sbjct: 289 KNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 111/240 (46%), Positives = 153/240 (63%), Gaps = 11/240 (4%)
Query: 78 EMKLRFATF--SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYH 135
E + + ATF N+ ++ S + + Y ++PVK+QG CGSCW FSTTG+LE +
Sbjct: 99 ESQPKGATFLPPANVKVVDSIDWRSKGY-----VTPVKNQGQCGSCWAFSTTGALEGQHF 153
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCK 195
+ GK +SLSEQ LVDC+ + N GC GGL AF+YIK NGG+DTE++YPY KDGVC
Sbjct: 154 RKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLAKDGVCH 213
Query: 196 FSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGN 254
++ +G + V+I G E+ LQ A+ V P+S+A + F FY GVY C +
Sbjct: 214 YNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSS 273
Query: 255 TPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
T +D H V+AVGYG +DG YWL+KNSWG +WG+ GY K+ + CG+A+ ASYP+V
Sbjct: 274 TRLD--HGVLAVGYGTDDGKDYWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKASYPLV 331
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 143/216 (66%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQGHCGSCW+FS TG+LE + + GK +SLSEQ LVDC+ + N G
Sbjct: 126 VDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNG 185
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG+ AF+YIK NGG+DTE++YPY D C F+ + VG V+I G E+ L+
Sbjct: 186 CNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALK 245
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWL 278
A+ V PVS+A + + F+FY GVY +C + +D H V+AVGYG E+G YWL
Sbjct: 246 KALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLD--HGVLAVGYGTSEEGEDYWL 303
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG WGD GY KM N CG+ATCASYP+V
Sbjct: 304 VKNSWGTTWGDQGYVKMARNHDNHCGVATCASYPLV 339
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 115/231 (49%), Positives = 151/231 (65%), Gaps = 11/231 (4%)
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
SKN++L + +R ++PVK+QG CGSCW+FS TGSLE + GK ISLSE
Sbjct: 111 SKNINLPEHVD-----WREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSE 165
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVL 206
Q LVDC++ + N GC GGL AF+YI+ N G+DTE +YPY G DG C + +N G +
Sbjct: 166 QNLVDCSRKYGNNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDI 225
Query: 207 DSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVA 265
V+I G+E +LQ A+ V P+SVA + F+FY GVYS KC +P +++H V+A
Sbjct: 226 GFVDIKKGSEKDLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKC--SPENLDHGVLA 283
Query: 266 VGYGVED--GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
VGYG ++ G YWL+KNSW E WG+ GY KM K NMCGIA+ ASYPVV
Sbjct: 284 VGYGTDEVTGEDYWLVKNSWSEKWGEDGYIKMARNKDNMCGIASSASYPVV 334
>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
Length = 336
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/221 (51%), Positives = 143/221 (64%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +S VKDQG CGSCW FSTTGSLE + GK + LSEQQLVDC++ F N
Sbjct: 118 KSVDWRNSAMVSEVKDQGECGSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGN 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGC GGL QAF+YIK NGGLDTEE+YPYT D CKF + +VG ++ ++ G E
Sbjct: 178 QGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSGNEH 237
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG--- 273
L+ AV V P+SVA + + F+FY SGVY +C + +D H V+ VGYG +
Sbjct: 238 ALKRAVATVGPISVAIDAGHESFQFYSSGVYDEPQCSSEQLD--HGVLVVGYGAMNDNSH 295
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG NWGD GY M K N CGIAT ASYP+V
Sbjct: 296 QAFWIVKNSWGPNWGDQGYIMMSRNKDNQCGIATSASYPLV 336
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 109/207 (52%), Positives = 138/207 (66%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FSTTGSLE + GK +SLSEQ LVDC+ A+ N GC GGL
Sbjct: 126 VTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDY 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK NGG+DTEE+YPY ++ C+F N+G V++T G E+ L+ A G V P
Sbjct: 186 AFKYIKENGGIDTEESYPYEARNDRCRFQKSNIGAVDTGFVDVTHGDEEALKTAAGTVGP 245
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+FY SGVY++ C +T +D H V+ VGYG G YWL+KNSWGE W
Sbjct: 246 ISVAIDAGHMSFQFYHSGVYNNAGCSSTSLD--HGVLVVGYGTYQGSDYWLVKNSWGERW 303
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K N CG+AT ASYP+V
Sbjct: 304 GMEGYIMMSRNKNNQCGVATQASYPLV 330
>gi|229595080|ref|XP_001020177.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225566401|gb|EAR99932.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 405
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 132/341 (38%), Positives = 186/341 (54%), Gaps = 46/341 (13%)
Query: 10 SVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQAR------HALSFA 63
+V LC +A + DSN L+S GL D+ + L I Q+ ++ F
Sbjct: 58 AVYFTLCLSAVIAQ------DSNQEILISR-GLVDYTDADLLSIYQSYGYEPDPNSERFQ 110
Query: 64 RFARRYGKIYESVEEMKLRFA------TFSKNLDLIR---STNCKG-------------- 100
F R KI E +++ TF +L+L + S NC
Sbjct: 111 LFKSRLAKIIEHNSNPDKKYSQIINKLTFQTDLELKKFRASQNCSATAQANTRSFRKYDL 170
Query: 101 ------LSYRLGLNISPVKDQGH-CGSCWTFSTTGSLEAAYHQAFGKG-ISLSEQQLVDC 152
+ +R ++ VK QG CGSCW F+ +LE+ Y GK I SEQQLVDC
Sbjct: 171 SQLPQYVDWREKGVVTQVKSQGKDCGSCWAFAAVAALESHYALKTGKKPIQFSEQQLVDC 230
Query: 153 AQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT 212
A+ F+ +GC+GGLPS+ FEY+ Y GG+ E YPY G+D C+F+S VQV S NIT
Sbjct: 231 ARKFDTKGCSGGLPSKGFEYLAYAGGIQNEADYPYEGEDKNCRFNSSKTVVQVQKSYNIT 290
Query: 213 LGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED 272
E+EL + + PV++A++V F YK+GV++S+ C P DVNHAV+AVGY +
Sbjct: 291 FQDENELIYHLANYGPVTIAYQVNSDFDNYKNGVFTSSNCSKDPEDVNHAVLAVGYNMTG 350
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
Y++ KNSWG +WG +GYF +E+G NMCG+A CASYP++
Sbjct: 351 --KYFIAKNSWGNDWGMNGYFYIELGSNMCGLADCASYPII 389
>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 115/221 (52%), Positives = 143/221 (64%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +S VKDQG CGSCW FSTTGSLE + GK + LSEQQLVDC++ F N
Sbjct: 116 KSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGC GGL QAF+YIK NGGLDTEE+YPYT D CKF + +VG ++ ++ G E
Sbjct: 176 QGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEH 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG--- 273
L+ AV V PVSVA + + F+FY SGVY +C +D H V+AVGYG +
Sbjct: 236 ALKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLD--HGVLAVGYGAMNDNSH 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG +WGD GY M K N CGIAT ASYP+V
Sbjct: 294 QAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 106/207 (51%), Positives = 138/207 (66%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS+TG+LE + + G+ +SLSEQ LVDC+ + N GCNGGL
Sbjct: 121 VTPVKNQGQCGSCWAFSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDN 180
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YIK NGG+DTE YPY G+DG C++S ++G V+I G ED L+ AV V P
Sbjct: 181 AFSYIKANGGIDTETGYPYEGQDGTCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGP 240
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + F+FY SGVY +C +P ++H V+ VGYG ++G YWL+KNSWG W
Sbjct: 241 VSVAIDASHMSFQFYHSGVYDEPQC--SPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGW 298
Query: 288 GDHGYFKMEM-GKNMCGIATCASYPVV 313
G GY M +N CGIA+ ASYP+V
Sbjct: 299 GTEGYIYMSRNNQNQCGIASKASYPLV 325
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 115/221 (52%), Positives = 143/221 (64%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +S VKDQG CGSCW FSTTGSLE + GK + LSEQQLVDC++ F N
Sbjct: 116 KSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGC GGL QAF+YIK NGGLDTEE+YPYT D CKF + +VG ++ ++ G E
Sbjct: 176 QGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEH 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG--- 273
L+ AV V PVSVA + + F+FY SGVY +C +D H V+AVGYG +
Sbjct: 236 ALKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLD--HGVLAVGYGAMNDNSH 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG +WGD GY M K N CGIAT ASYP+V
Sbjct: 294 QAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 107/217 (49%), Positives = 138/217 (63%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TGSLE + + G +SLSEQ LVDC+ F N
Sbjct: 121 KTVDWRTKGAVTPVKNQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFGN 180
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YI+ N G+DTE++YPY G DG C F VG V+I G+E +
Sbjct: 181 NGCEGGLMDNAFKYIRANKGIDTEKSYPYNGTDGTCHFKKSTVGATDSGFVDIKEGSETQ 240
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V P+SVA + + F+FY GVY +C + +D H V+ VGYG +G YW
Sbjct: 241 LKKAVATVGPISVAIDASHESFQFYSDGVYDEPECDSESLD--HGVLVVGYGTLNGTDYW 298
Query: 278 LIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
L+KNSWG WGD GY +M KN CGIA+ ASYP+V
Sbjct: 299 LVKNSWGTTWGDEGYIRMSRNKKNQCGIASSASYPLV 335
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 115/221 (52%), Positives = 143/221 (64%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +S VKDQG CGSCW FSTTGSLE + GK + LSEQQLVDC++ F N
Sbjct: 116 KSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGC GGL QAF+YIK NGGLDTEE+YPYT D CKF + +VG ++ ++ G E
Sbjct: 176 QGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEH 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG--- 273
L+ AV V PVSVA + + F+FY SGVY +C +D H V+AVGYG +
Sbjct: 236 ALKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLD--HGVLAVGYGAMNDNSH 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG +WGD GY M K N CGIAT ASYP+V
Sbjct: 294 QAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
Length = 331
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 110/207 (53%), Positives = 137/207 (66%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ +K+QGHCGSCW+FS TGSLE + +A K +SLSEQ LVDC++ N GC GGL
Sbjct: 127 VTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSKKEGNHGCQGGLMDN 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YI+ N G+DTEE+YPYT K+G C F +ENVG V+I ED+LQ AV V P
Sbjct: 187 AFRYIESNKGIDTEESYPYTAKNGFCHFKAENVGATDTGYVDIPHMQEDKLQEAVATVGP 246
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + F+ Y+ GVYS C ++ +D H V+AVGYG E G YWL+KNSWG +W
Sbjct: 247 ISVGIDAGHKSFQLYREGVYSEPACSSSKLD--HGVLAVGYGTESGDDYWLVKNSWGTSW 304
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K NMCGIAT ASYP V
Sbjct: 305 GMQGYVMMARNKHNMCGIATQASYPKV 331
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 220 bits (561), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 115/221 (52%), Positives = 143/221 (64%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +S VKDQG CGSCW FSTTGSLE + GK + LSEQQLVDC++ F N
Sbjct: 116 KSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGC GGL QAF+YIK NGGLDTEE+YPYT D CKF + +VG ++ ++ G E
Sbjct: 176 QGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEH 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG--- 273
L+ AV V PVSVA + + F+FY SGVY +C +D H V+AVGYG +
Sbjct: 236 ALKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLD--HGVLAVGYGAMNDNSH 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG +WGD GY M K N CGIAT ASYP+V
Sbjct: 294 QAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|444730298|gb|ELW70685.1| Pro-cathepsin H [Tupaia chinensis]
Length = 418
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 109/209 (52%), Positives = 132/209 (63%), Gaps = 24/209 (11%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G +SPVK+QG CGSCWTFSTTG+LE+A GK +SL
Sbjct: 40 KKGKFVSPVKNQGACGSCWTFSTTGALESAVAITTGKLLSL------------------- 80
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
AFEYI YN G+ E+ YPY G+DG CKF + V D NITL E+ + AV
Sbjct: 81 -----AFEYILYNKGIMGEDTYPYRGQDGHCKFQPQKAIAFVKDVANITLNDEEAMVEAV 135
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
L PVS AFEV + F Y+ G+YSST C TP VNHAV+AVGYG E+G+PYW++KNSW
Sbjct: 136 ALYNPVSFAFEVTNDFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSW 195
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPV 312
G WG +GYF +E GKNMCG+A CASYPV
Sbjct: 196 GPQWGMNGYFLIERGKNMCGLAACASYPV 224
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 144/218 (66%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQG CGSCW FS TG+LE +++ G +SLSEQ LVDC+ F N
Sbjct: 127 KSVDWREKGAVTEVKDQGSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGN 186
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF+YIK NGG+DTE++YPY +D C+++ N G V++ G E+
Sbjct: 187 NGCNGGLMDNAFQYIKVNGGIDTEKSYPYEAEDEPCRYNPANAGADDRGFVDVREGNENA 246
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPY 276
L+ A+ + PVSVA + D F+FY+ GVYS C +D H V+AVGYG EDG Y
Sbjct: 247 LKKAIATIGPVSVAIDASQDSFQFYQHGVYSDPDCSAENLD--HGVLAVGYGTTEDGQDY 304
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSW ++WGD GY K+ + NMCGIA+ ASYP+V
Sbjct: 305 WLVKNSWSKSWGDQGYIKIARNQNNMCGIASAASYPLV 342
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 111/216 (51%), Positives = 144/216 (66%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FSTTG+LE + + G +SLSEQ L+DC+ A+ N G
Sbjct: 128 VDWRKKGAVTEVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNG 187
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF+YIK NGG+DTE+AYPY G D C+++++N G + V+I G E++L
Sbjct: 188 CNGGLMDNAFKYIKDNGGIDTEKAYPYEGVDDKCRYNAKNSGADDVGFVDIPQGDEEKLM 247
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWL 278
AV V PVSVA + + F+FY GVY C +T D++H V+ VGYG E G YWL
Sbjct: 248 QAVATVGPVSVAIDASQESFQFYSDGVYYDENCSST--DLDHGVMVVGYGTDEQGGDYWL 305
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG WGD GY KM K N CGIA+ ASYP+V
Sbjct: 306 VKNSWGRTWGDLGYIKMARNKNNHCGIASSASYPLV 341
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 107/217 (49%), Positives = 142/217 (65%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R I+PVKDQG CG CW FS+TG+LE + GK +SL EQ L+DC+ + N
Sbjct: 120 ESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGN 179
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL QAF+YIK N G+DTE YPY +D VC+++ N G V+I G ED+
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 239
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V PVSVA + + F+FY GVY C + D++H V+ VGYG ++G YW
Sbjct: 240 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSD--DLDHGVLVVGYGSDNGKDYW 297
Query: 278 LIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
L+KNSW E+WGD GY K+ KN CG+AT ASYP+V
Sbjct: 298 LVKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPLV 334
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 141/216 (65%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQGHCGSCW+FS TG+LE + + GK +SLSEQ LVDC+ + N G
Sbjct: 131 IDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF+Y+K N G+DTE+AYPY D C ++ + +G V+I G E L+
Sbjct: 191 CNGGLMDNAFQYVKDNKGIDTEKAYPYEAIDDECHYNPKAIGATDKGFVDIPQGDEKALK 250
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWL 278
A+ V PVSVA + + F+FY GVY +C + +D H V+AVGYG EDG YWL
Sbjct: 251 KALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLD--HGVLAVGYGTTEDGEDYWL 308
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG WGD GY KM + N CGIAT ASYP+V
Sbjct: 309 VKNSWGTTWGDQGYVKMARNRENHCGIATTASYPLV 344
>gi|118373972|ref|XP_001020178.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89301945|gb|EAR99933.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 339
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 101/206 (49%), Positives = 139/206 (67%), Gaps = 3/206 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG-ISLSEQQLVDCAQAFNNQGCNGGLPS 167
++ VK+QG CGSCW F+ G++E+ + GK I LSEQQL+DCA+ F+N GC+GGLPS
Sbjct: 135 VTAVKNQGECGSCWAFAAVGAIESHFSLKTGKSPIQLSEQQLIDCARQFDNHGCDGGLPS 194
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
+AFEYI Y GG++ + YPYTGK+ C+F EN+ +V S NIT E EL + +
Sbjct: 195 KAFEYIAYEGGIENSKDYPYTGKNNKCQFDGENIVTKVKQSFNITYLDEKELIYHLVHKG 254
Query: 228 PVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
PV++A+E D F Y+SG+Y C P VNHAV+AVGY Y+++KNSWG+ W
Sbjct: 255 PVTLAYEAADEFDNYQSGIYEGKNCEQDPQKVNHAVLAVGYNKTG--DYYIVKNSWGDKW 312
Query: 288 GDHGYFKMEMGKNMCGIATCASYPVV 313
G +GYF + KN CG+A+CASYP++
Sbjct: 313 GMNGYFYIRANKNACGLASCASYPII 338
>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
Length = 322
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 114/227 (50%), Positives = 145/227 (63%), Gaps = 5/227 (2%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
L+ T K + +R ++PVKDQ CGSCW FSTTGSLE + GK +SLSEQ L
Sbjct: 98 LEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNL 157
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC+ F N GC GGL QAF+YIK N G+DTEE+YPY +DG C+F S NVG V
Sbjct: 158 VDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESYPYEAQDGKCRFDSSNVGATDTGFV 217
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+I G E+ L AV + P+SVA + F+FY GVY +C +T +D H V+A+GY
Sbjct: 218 DIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYYEKECSSTMLD--HGVLAIGY 275
Query: 269 G-VEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
G +DG YWL+KNSW +WGD G+ +M KN CGIA+ ASYP+V
Sbjct: 276 GETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGIASQASYPLV 322
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 171/321 (53%), Gaps = 59/321 (18%)
Query: 48 SVLQVIGQARHALSFARFAR----RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSY 103
+VL +IG A++ A R +YGK Y S+ E +R + +N D + N S+
Sbjct: 11 AVLLLIGLVSAAVNDAEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSF 70
Query: 104 RLGLN--------------------------------------------------ISPVK 113
+L +N ++PVK
Sbjct: 71 QLEVNEFADLTAEEFSSIYNGYGKGRNRENHENTTIYRYTGGAIPDSVDWRTKGLVTPVK 130
Query: 114 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 173
+Q CGSCW FSTTGSLE A+ + GK +SLSEQ LVDC + + GC GGL + AF+YI
Sbjct: 131 NQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDK--KDHGCQGGLMTTAFKYI 188
Query: 174 KYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAF 233
+ N G+DTEE+YPY K+G C+F +++G V V+I + L+ AV + P+SVA
Sbjct: 189 EENKGIDTEESYPYKAKNGRCEFKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAM 248
Query: 234 EVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGY 292
+ F+ YKSG+Y C + +D H V+ VGYG EDG YWL+KNSWG+NWG GY
Sbjct: 249 DASHSSFQLYKSGIYDPKICSSRKLD--HGVLVVGYGKEDGEEYWLVKNSWGKNWGMEGY 306
Query: 293 FKMEMGKNMCGIATCASYPVV 313
FK+ KN+CGI T A YPVV
Sbjct: 307 FKIASKKNLCGICTSACYPVV 327
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 141/216 (65%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FSTTG+LE + + G +SLSEQ LVDC+ + N G
Sbjct: 117 VDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAVYGNAG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF +IK GGL+TE++YPYTGKDG C F + +G ++ V++ E+ L+
Sbjct: 177 CNGGLMDNAFRFIKDAGGLETEKSYPYTGKDGTCHFDARGIGAKLTGFVDVPSRDEEALK 236
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWL 278
A G+V PVSVA + F+FYK GVY C +T +D H V+ VGYG DG YWL
Sbjct: 237 EAAGVVGPVSVAIDASGQNFQFYKDGVYDEITCSSTSLD--HGVLVVGYGTTRDGKDYWL 294
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG +WG GY +M K N CGIAT ASYP V
Sbjct: 295 VKNSWGSSWGQSGYIQMSRNKENQCGIATMASYPTV 330
>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
Length = 306
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 114/227 (50%), Positives = 145/227 (63%), Gaps = 5/227 (2%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
L+ T K + +R ++PVKDQ CGSCW FSTTGSLE + GK +SLSEQ L
Sbjct: 82 LEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNL 141
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC+ F N GC GGL QAF+YIK N G+DTEE+YPY +DG C+F S NVG V
Sbjct: 142 VDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESYPYEAQDGKCRFDSSNVGATDTGFV 201
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+I G E+ L AV + P+SVA + F+FY GVY +C +T +D H V+A+GY
Sbjct: 202 DIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYYEKECSSTMLD--HGVLAIGY 259
Query: 269 G-VEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
G +DG YWL+KNSW +WGD G+ +M KN CGIA+ ASYP+V
Sbjct: 260 GETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGIASQASYPLV 306
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 145/218 (66%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW+FS TG+LE + + G I LSEQ L+DC+ + N
Sbjct: 124 KTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL QAF+YIK N GLDTE YPY ++ C++++ N G + + V+I G E +
Sbjct: 184 NGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVGYVDIPQGNEKK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L+ AV + PVSVA + F+FY GVY +C + +D H V+AVGYG E+G Y
Sbjct: 244 LKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLD--HGVLAVGYGTDENGQDY 301
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE WGD+GY KM K N CGIA+ ASYP+V
Sbjct: 302 WLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPLV 339
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 108/209 (51%), Positives = 142/209 (67%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQGHCGSCW+FS TG+LE + + K +SLSEQ LVDC+ F N GCNGGL
Sbjct: 132 VTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDN 191
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YIK NGG+DTE AYPY G+D ++S++N G V+I G ED+L+ AV V P
Sbjct: 192 AFRYIKNNGGIDTEAAYPYMGEDEKFRYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGP 251
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED--GVPYWLIKNSWGE 285
+S+A + + F+ Y +GVYS C +T +D H V+ VGYG ++ G+ YWL+KNSWG+
Sbjct: 252 ISIAIDASHESFQLYSNGVYSDPTCSSTELD--HGVLVVGYGTDEKTGMDYWLVKNSWGD 309
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG GY KM + N CG+AT ASYP+V
Sbjct: 310 TWGLDGYIKMARNQDNQCGVATQASYPLV 338
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 114/221 (51%), Positives = 141/221 (63%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +S VKDQG CGSCW FSTTGSLE + GK + LSEQQLVDC++ F N
Sbjct: 116 KSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGC GGL QAF+YIK NGGLDTEE+YPYT D CKF + +VG ++ ++ E
Sbjct: 176 QGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSSNEH 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG--- 273
L+ AV V PVSVA + + F+FY SGVY +C +D H V+ VGYG +
Sbjct: 236 ALKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLD--HGVLVVGYGAMNDNSH 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG NWGD GY M K N CGIAT ASYP+V
Sbjct: 294 QAFWIVKNSWGPNWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 145/218 (66%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW+FS TG+LE + + G I LSEQ L+DC+ + N
Sbjct: 124 KTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL QAF+YIK N GLDTE YPY ++ C++++ N G + + V+I G E +
Sbjct: 184 NGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVGYVDIPQGNEKK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L+ AV + PVSVA + F+FY GVY +C + +D H V+AVGYG E+G Y
Sbjct: 244 LKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLD--HGVLAVGYGTDENGQDY 301
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE WGD+GY KM K N CGIA+ ASYP+V
Sbjct: 302 WLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPLV 339
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 165/321 (51%), Gaps = 58/321 (18%)
Query: 49 VLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYR 104
VL + A +AL + + +YGK Y E LR + NL +++ N +YR
Sbjct: 6 VLLALVVAANALDWESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYR 65
Query: 105 LGLN--------------------------------------------------ISPVKD 114
LG+N ++PVKD
Sbjct: 66 LGMNTYADLYNEEFMALKGSGGLLQAKDKSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKD 125
Query: 115 QGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIK 174
QG CGSCWTFS TGSLE + G +SLSEQQLVDCA + N GCNGGL A++YIK
Sbjct: 126 QGQCGSCWTFSATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIK 185
Query: 175 YNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFE 234
GG++ E AYPYT +DG CKF V V I +G E L AVG + PV+V+ +
Sbjct: 186 GVGGVELESAYPYTARDGRCKFDRSKVVATCKGYVVIPVGDEQALMQAVGTIGPVAVSID 245
Query: 235 VVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYF 293
F+ Y+SGVY +C +T +D H V+AVGYG E G YWL+KNSWG WGD GY
Sbjct: 246 ASGYSFQLYESGVYDFRRCSSTNLD--HGVLAVGYGTEGGQNYWLVKNSWGPGWGDQGYI 303
Query: 294 KMEMGK-NMCGIATCASYPVV 313
KM K N CGIAT + YP+V
Sbjct: 304 KMSKDKNNQCGIATDSCYPLV 324
>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 106/207 (51%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FSTTGSLE + GK +SLSEQQLVDC+ + N+GC GGL
Sbjct: 130 VTEVKDQKQCGSCWAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDS 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YI+ NGG+DTE++YPY +DG C+++S N+G V++ G ED L+ AV + P
Sbjct: 190 AFRYIQANGGIDTEDSYPYEAEDGQCRYNSANIGATCTGYVDVKQGDEDALKEAVATIGP 249
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + F+ Y+SGVY +C ++ +D H V+AVGYG ++G YWL+KNSWG W
Sbjct: 250 VSVAIDASHSSFQLYESGVYDEPECSSSELD--HGVLAVGYGSDNGHDYWLVKNSWGLGW 307
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY M K N CGIAT +SYP+V
Sbjct: 308 GNKGYIMMTRNKHNQCGIATASSYPLV 334
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 194/343 (56%), Gaps = 46/343 (13%)
Query: 12 ILLLCCAAAASASASSFD----DSNPIRLVSSDGLRDFETSV-----LQVIGQARHAL-- 60
++LL CA AA ++ FD + + +L ++E+ V +++ + +H +
Sbjct: 4 LVLLLCAVAAVSAVQFFDLVKEEWSAFKLQHR---LNYESEVEDNFRMKIYAEHKHIIAK 60
Query: 61 ----------SFARFARRYGKI--YESVEEMK--LRFATFSKNL----------DLIRST 96
S+ +YG + +E V+ M + A +KNL I
Sbjct: 61 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 120
Query: 97 NCK---GLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA 153
N K + +R ++ +KDQG CGSCW+FSTTG+LE + + G +SLSEQ L+DC+
Sbjct: 121 NVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS 180
Query: 154 QAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITL 213
+ + N GCNGGL AF+YIK NGG+DTE+ YPY G D C+++ +N G + + V+I
Sbjct: 181 EQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVGFVDIPE 240
Query: 214 GAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-E 271
G E +L AV V PVSVA + F+ Y SGVY+ +C +T D++H V+ VGYG E
Sbjct: 241 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDE 298
Query: 272 DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
GV YWL+KNSWG +WG+ GY KM K N CGIA+ ASYP+V
Sbjct: 299 QGVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 341
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 141/218 (64%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW+FS TG+LE + + GK +SLSEQ LVDC+ + N
Sbjct: 125 KTVDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGN 184
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGG+ AF+YIK NGG+DTE+AYPY D C ++ + VG V+I G E
Sbjct: 185 NGCNGGMMDFAFQYIKDNGGIDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKA 244
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L A+ PVSVA + + F+FY GVY +C + +D H V+AVGYG E+G Y
Sbjct: 245 LMKAIATAGPVSVAIDASHESFQFYSEGVYYEPQCDSENLD--HGVLAVGYGTSEEGEDY 302
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD GY KM + N CGIAT ASYP+V
Sbjct: 303 WLVKNSWGTTWGDQGYVKMARNRDNHCGIATAASYPLV 340
>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
Length = 335
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/294 (42%), Positives = 175/294 (59%), Gaps = 22/294 (7%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFARRYGKIYESV-----EEMKLRFATF-- 86
R++ LR E L+ +G+ H+L +F + + + + K+R +TF
Sbjct: 49 RVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNGYKNQKKIRGSTFLA 108
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
N + +S + + Y ++PVKDQG CGSCW FSTTG+LE +++ GK ISLSE
Sbjct: 109 PNNFESPKSVDWRKKGY-----VTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSE 163
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQV 205
Q LVDC++A NQGCNGGL QAF+Y+K NGG+D+E++YPYT KD C +
Sbjct: 164 QNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSAND 223
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V++T +E +L +AV V PVSVA + F+FYKSG+Y +C + D++H V+
Sbjct: 224 TGFVDVTSESEKDLMNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPEC--SSEDLDHGVL 281
Query: 265 AVGYGV----EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
VGYG EDG YW++KNSW E WG+ GY + + N CGIAT ASYP+V
Sbjct: 282 VVGYGFEGEDEDGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASYPLV 335
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 119/306 (38%), Positives = 163/306 (53%), Gaps = 55/306 (17%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN-------- 108
SF +F +YG+ Y + +E + R + + +N++ I + N + ++Y L +N
Sbjct: 21 SFHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNE 80
Query: 109 ---------------------------------------ISPVKDQGHCGSCWTFSTTGS 129
++PVKDQ CGSCW FS TGS
Sbjct: 81 EINAVMNGLLPASESRGVAVLGGRDDTLPAEVDWRTKGAVTPVKDQKACGSCWAFSATGS 140
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
LE + GK +SLSEQ LVDC+ + GC GGL AF YIK NGG+DTE +YPY
Sbjct: 141 LEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGIDTEASYPYEA 200
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYS 248
DG C+++ N G V V++ +ED LQ AV + P+SVA + F FY GVY
Sbjct: 201 TDGKCQYNPANSGATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRSTFHFYHKGVYY 260
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
+C +T +D H V+AVGYG +DG YWL+KNSW WG+HG+ +M + N CGIAT
Sbjct: 261 DKECSSTSLD--HGVLAVGYGTQDGTDYWLVKNSWNITWGNHGFIEMSRNRNNNCGIATQ 318
Query: 308 ASYPVV 313
ASYP+V
Sbjct: 319 ASYPLV 324
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 114/221 (51%), Positives = 142/221 (64%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +S VKDQG CGSCW FSTTGSLE + GK + LSEQQLVDC++ F N
Sbjct: 116 KSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAED 217
QGC GGL QAF+YI NGGLDTEE+YPYT D CKF + +VG ++ ++ G E
Sbjct: 176 QGCGGGLMDQAFQYITANGGLDTEESYPYTATDDEPCKFDNSSVGATLVGYKDVKSGNEH 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG--- 273
L+ AV V PVSVA + + F+FY SGVY +C +D H V+AVGYG +
Sbjct: 236 ALKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLD--HGVLAVGYGAMNDNSH 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG +WGD GY M K N CGIAT ASYP+V
Sbjct: 294 QAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 143/218 (65%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++P+KDQG CGSCW FS TG+LE + G+ +SLSEQ LVDC++ F N
Sbjct: 126 KNVDWRTKGAVTPIKDQGQCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGN 185
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AFEY+K NGG+DTEE+YPY +D C ++ G + V++ G+E
Sbjct: 186 NGCNGGLMDNAFEYVKENGGIDTEESYPYDAEDEKCHYNPRAAGAEDKGFVDVREGSEHA 245
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L+ AV V PVSVA + + F+FY GVY +C +P ++H V+ VGYG+ +DG Y
Sbjct: 246 LKKAVATVGPVSVAIDASHESFQFYSHGVYIEPEC--SPEMLDHGVLVVGYGIDDDGTDY 303
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD GY KM + N CGIA+ AS+P+V
Sbjct: 304 WLVKNSWGTTWGDQGYVKMARNRDNQCGIASSASFPLV 341
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 138/364 (37%), Positives = 182/364 (50%), Gaps = 69/364 (18%)
Query: 10 SVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQAR---HALSFARFA 66
+VI +L +AA + + F+ P ++ + L+ LQV R + ++ F
Sbjct: 6 AVICVLTVVSAAPQAVNWFE-IQPAKVEHASNLK------LQVKASTRLGPYHETWKEFK 58
Query: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN-------------- 108
+GK+Y++VEE RF F L+ I N K SY +G+N
Sbjct: 59 TLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEYLRHN 118
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVK+QG CGSCW+FSTTGSLE
Sbjct: 119 GLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSFSTTGSLEG 178
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
+ + GK ISLSEQQLVDC+ F N+GCNGGL AFEYIK GGL+ E+ YPYT K G
Sbjct: 179 QHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLEGEDDYPYTAKQG 238
Query: 193 VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTK 251
C ++ G ED L+ A+ V P+SVA + F+ Y GVY +
Sbjct: 239 KCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASHASFQSYDGGVYDEEE 298
Query: 252 CGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCAS 309
C + +D H V+ VGYG E+ G YWL+KNSWGE WG+ GY KM K N CGIAT AS
Sbjct: 299 CSSQNLD--HGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRNKDNQCGIATQAS 356
Query: 310 YPVV 313
YP V
Sbjct: 357 YPNV 360
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 112/256 (43%), Positives = 155/256 (60%), Gaps = 7/256 (2%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
L+ F + Y S+++ + +TF L+ T + +R ++P+K+QG CG
Sbjct: 75 LTRKEFVKTYNGYRLSMKKSTNKPSTFMAPLNTNMPTE---VDWRKEGYVTPIKNQGRCG 131
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCW FSTTGSLE + + GK +SLSEQ L+DC+ A N GC GG AFEYIK N G+
Sbjct: 132 SCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGI 191
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DG 238
DTE +YPY G+D +C++ N G ++I +ED+L+ AV V P+SVA +
Sbjct: 192 DTEASYPYEGRDDICRYKKTNKGAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKS 251
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F Y +GVY +C T +D H V+ VGYG E+G YWL+KNSWG +WG +GY KM
Sbjct: 252 FHMYHTGVYHEPECSQTVLD--HGVLVVGYGTENGEDYWLVKNSWGTDWGMNGYIKMSRN 309
Query: 299 K-NMCGIATCASYPVV 313
+ N CGIAT ASYP++
Sbjct: 310 RSNNCGIATNASYPLI 325
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 107/207 (51%), Positives = 137/207 (66%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FSTTGSLE + + GK +SLSEQ LVDC+ ++ N+GCNGG+
Sbjct: 146 VTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDY 205
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK N G DTE YPY DG C+F S VG ++ G E +++ AV LV P
Sbjct: 206 AFQYIKDNDGDDTEACYPYEAVDGTCRFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGP 265
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + F+ Y+SG+Y +C +P ++HAV+ VGYG E G YWL+KNSWG W
Sbjct: 266 VSVAIDASHSSFQMYQSGIYVEQEC--SPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTW 323
Query: 288 GDHGYFKMEMG-KNMCGIATCASYPVV 313
GD GY KM N CGIA+ ASYP+V
Sbjct: 324 GDEGYIKMARNMDNQCGIASQASYPLV 350
>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 105/207 (50%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FSTTGSLE + GK +SLSEQQLVDC+ + N+GC GGL
Sbjct: 130 VTDVKDQKQCGSCWAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDS 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YI+ NGG+DTE++YPY +DG C+++S N+G V++ G ED L+ A+ + P
Sbjct: 190 AFRYIQANGGIDTEDSYPYEAEDGQCRYNSANIGATCTGYVDVKQGDEDALKEALATIGP 249
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + F+ Y+SGVY +C ++ +D H V+AVGYG ++G YWL+KNSWG W
Sbjct: 250 VSVAIDASHSSFQLYESGVYDEPECSSSELD--HGVLAVGYGSDNGHDYWLVKNSWGLGW 307
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY M K N CGIAT +SYP+V
Sbjct: 308 GNKGYIMMTRNKHNQCGIATASSYPLV 334
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 119/296 (40%), Positives = 161/296 (54%), Gaps = 54/296 (18%)
Query: 69 YGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------------------- 108
+GK Y V E + R A + +NL+ I+ N + SY++ +N
Sbjct: 34 HGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYLGVRAH 93
Query: 109 ------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
++ VK+QG CGSCW FSTTGS+E + +
Sbjct: 94 HNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKT 153
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSS 198
G +SLSEQ L+DC+ ++ N GC GGL AF YI+ NGG+DTE +YPY G+ G C FSS
Sbjct: 154 GSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQQGSCHFSS 213
Query: 199 ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMD 258
+VG +V +I G+E LQ AV V PVSVA + ++FY SGVY + C +T +D
Sbjct: 214 SHVGARVTGYQDIPQGSEQALQSAVATVGPVSVAVDASQ-WQFYSSGVYDNPYCSSTQLD 272
Query: 259 VNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
H V+ +GYG +G YWL+KNSWG +WG GY M K N CGIA+ ASYP+V
Sbjct: 273 --HGVLVIGYGNYNGQDYWLVKNSWGYSWGVEGYIMMSRNKNNQCGIASSASYPLV 326
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 113/239 (47%), Positives = 149/239 (62%), Gaps = 8/239 (3%)
Query: 81 LRFATFSKNLDLIRSTNC---KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQA 137
LR +++ I N K + +R ++PVKDQG CGSCW+FSTTGSLE + +
Sbjct: 101 LRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRK 160
Query: 138 FGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFS 197
K +SLSEQ L+DC++ + N GCNGGL AF YIK NGG+DTE++YPY +D C +
Sbjct: 161 SKKLVSLSEQNLIDCSEKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKCHYK 220
Query: 198 SENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTP 256
N G V+I G E++L+ AV V P+SVA + F+ Y GVY +C +
Sbjct: 221 PRNKGATDRGFVDIESGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQ 280
Query: 257 MDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+D H V+ VGYG EDG YWL+KNSWG++WGD GY KM + N CGIAT ASYP+V
Sbjct: 281 LD--HGVLVVGYGTDEDGNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPLV 337
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 147/216 (68%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ +KDQG CGSCW+FSTTG+LE + + G +SLSEQ L+DC++ + N G
Sbjct: 131 VDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF+YIK NGG+DTE+AYPY G D C+++ +N G + + V+I G E +L
Sbjct: 191 CNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKCRYNPKNTGAEDVGFVDIPEGDEQKLM 250
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWL 278
AV V PVSVA + F+ Y SGVY+ +C +T D++H V+ VGYG E GV YWL
Sbjct: 251 EAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWL 308
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG +WG+ GY KM K N CGIA+ ASYP+V
Sbjct: 309 VKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 344
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 108/228 (47%), Positives = 149/228 (65%), Gaps = 8/228 (3%)
Query: 92 LIRSTNC---KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
++S N K + +R ++PVK+QG CGSCW+FS TGSLE + + G +SLSEQ
Sbjct: 116 FLKSENVVVPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQN 175
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS 208
L+DC++ + N GC GGL AF+YIK N GLDTE++YPY +D C+++ EN G
Sbjct: 176 LIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGF 235
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
V+I G ED L HA+ V PVS+A + + F+FYK GV+ + +C +T +D H V+AVG
Sbjct: 236 VDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELD--HGVLAVG 293
Query: 268 YGVE-DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YG + G YW++KNSWG+ WGD GY M KN CG+A+ ASYP+V
Sbjct: 294 YGTDHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 110/231 (47%), Positives = 149/231 (64%), Gaps = 9/231 (3%)
Query: 92 LIRSTNCK---GLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
IRS + K + +R ++ VK+QG CGSCW FSTTG++E +++ + ++LSEQQ
Sbjct: 140 FIRSEHTKLPSKVDWRREGAVTDVKNQGQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQ 199
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV----CKFSSENVGVQ 204
LVDC++++ N GC+GGL + AFEY++ N G+D+E +YPY DG C F++ N+ Q
Sbjct: 200 LVDCSKSYGNNGCSGGLMNSAFEYVRDNEGIDSEISYPYVSGDGTENNRCLFNASNILAQ 259
Query: 205 VLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAV 263
V VNI G E L AV PVSVA + F YKSG+YS T C T ++H V
Sbjct: 260 VTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGV 319
Query: 264 VAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+ VGYG E+G YWLIKNSWGE WG+ GY K+ G NMCG+A+ ASYP+V
Sbjct: 320 LVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKGSHNMCGVASAASYPLV 370
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 105/217 (48%), Positives = 142/217 (65%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW+FS TGSLE G+ +SLSEQ LVDC++ + N
Sbjct: 102 KSVDWRQRGAVTPVKDQGHCGSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGN 161
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL +QAF+Y++ N G+DTE +YPY ++ C+F + VG V+I +E +
Sbjct: 162 SGCEGGLMNQAFQYVRDNKGIDTEASYPYEARENNCRFKEDKVGGTDKGYVDILEASEKD 221
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
LQ AV V P+SV + + F+FY GVY C +P ++H V+ VGYG E+G YW
Sbjct: 222 LQSAVATVGPISVRIDASHESFQFYSEGVYKEQYC--SPSQLDHGVLTVGYGTENGQDYW 279
Query: 278 LIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
L+KNSWG +WG+ GY K+ KN CGIA+ ASYPVV
Sbjct: 280 LVKNSWGPSWGESGYIKIARNHKNHCGIASMASYPVV 316
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 140/218 (64%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G ISLSEQ LVDC+ + N
Sbjct: 124 KSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY G D C F+ +G SV+I G E +
Sbjct: 184 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDRGSVDIPQGDEKK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV + PVSVA + + F+FY G+Y+ +C P +++H V+ VGYG E G Y
Sbjct: 244 MAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQC--DPQNLDHGVLVVGYGTDESGQDY 301
Query: 277 WLIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM N CGIA+ +SYP+V
Sbjct: 302 WLVKNSWGTTWGDKGFIKMARNADNQCGIASASSYPLV 339
>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
Length = 334
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 165/313 (52%), Gaps = 61/313 (19%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KGL-SYRLGLN------- 108
L F + ++GKIY+SVEE R T+ +N L+ N +G+ SYRLG+
Sbjct: 24 LEFHSWKLKFGKIYKSVEEESQRKNTWLENRKLVLVHNMLADQGIKSYRLGMTYFADMDN 83
Query: 109 ----------------------------------------------ISPVKDQGHCGSCW 122
++ VKDQ +CGSCW
Sbjct: 84 QEYRQSVFKGCLGSFNRTKGHRASTFLLQAGGAVLPDTVDWRDKGYVAEVKDQKNCGSCW 143
Query: 123 TFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTE 182
FS TGSLE + GK +SLSEQQLVDC+ + N GC GGL AFEYI+ N G+DTE
Sbjct: 144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGKYGNMGCGGGLMDLAFEYIEDNKGIDTE 203
Query: 183 EAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRF 241
E+YPY DG C+F VG V+I E+ LQ AV + P+SVA + F+
Sbjct: 204 ESYPYEATDGDCRFKPATVGATCTGYVDINSEDENALQKAVANIGPISVAIDAGHISFQL 263
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-N 300
Y SG+Y+ C + D++H V+AVGYG ++ YWL+KNSWG +WGD GY KM K N
Sbjct: 264 YGSGIYNEPNC--SSEDLDHGVLAVGYGTDNQQDYWLVKNSWGLDWGDQGYIKMTRNKNN 321
Query: 301 MCGIATCASYPVV 313
CGIAT ASYP+V
Sbjct: 322 QCGIATAASYPLV 334
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 140/218 (64%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQ CGSCW FSTTGSLE + GK +SLSEQ LVDC+ F N
Sbjct: 113 KEVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGN 172
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL QAF YIK N G+DTE++YPY +DG C+F + NVG V++ G+E
Sbjct: 173 MGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESA 232
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPY 276
L+ AV + P+SVA + F+FY GVY C +T +D H V+AVGYG E G Y
Sbjct: 233 LKKAVATIGPISVAIDASQPSFQFYHDGVYYEEGCSSTMLD--HGVLAVGYGETEKGEAY 290
Query: 277 WLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
WL+KNSW +WG+ GY +M KN CGIA+ ASYP+V
Sbjct: 291 WLVKNSWNTSWGNKGYIQMSRDKKNNCGIASQASYPLV 328
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 136/339 (40%), Positives = 184/339 (54%), Gaps = 43/339 (12%)
Query: 11 VILLLCCAAAASASA--SSFDDSNPI-----------------RLVSSDGLRDFETSVLQ 51
V+L LC AA SA + D+ + R+V L+ E L+
Sbjct: 5 VVLALCVTAALSAPSLDPQLDEHWNLWKDWHSKKYHEKEEGWRRMVWEKNLKKIELHNLE 64
Query: 52 -VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATFSKN--LDLIRSTNCKGL 101
+G+ ++L F R+ Y+ + KLR + F + L+ RS + +
Sbjct: 65 HSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLKSQRKLRGSLFMEPNFLEAPRSVDWRDK 124
Query: 102 SYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGC 161
Y ++PVKDQG CGSCW FSTTG++E + + G +SLSEQ LVDC++ N+GC
Sbjct: 125 GY-----VTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRPEGNEGC 179
Query: 162 NGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQ 220
NGGL QAF+YIK NGGLD+EE+YPY G D G C + V++ G+E L
Sbjct: 180 NGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSGSERALM 239
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVP 275
AV V PVSVA + + F+FY SG+Y +C + +D H V+ VGYG E DG
Sbjct: 240 KAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELD--HGVLVVGYGFEGKDVDGKK 297
Query: 276 YWLIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
YW++KNSW ENWGD GY M + KN CGIAT ASYP+V
Sbjct: 298 YWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPLV 336
>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 341
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 105/207 (50%), Positives = 142/207 (68%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ V++Q CGSCW FS TGSLE + + GK +SLS+QQLVDC+ F N+GCNGGL
Sbjct: 137 VTNVQNQMDCGSCWAFSATGSLEGQHFRKTGKLVSLSKQQLVDCSGEFGNEGCNGGLMDS 196
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YI+ NGG+DTEE+YPY +DG C+++ ++ G V++ E+ L+ AV + P
Sbjct: 197 AFQYIQANGGIDTEESYPYEAEDGKCRYNPKSTGATCTGYVDVQPANEETLKEAVATIGP 256
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+FY+SGVY C +T +D HAV+AVGYG E+G+ YWL+KNS G W
Sbjct: 257 ISVAIDAFHPSFQFYESGVYDEPDCSSTMLD--HAVLAVGYGTENGLDYWLVKNSAGVGW 314
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY KM K N CGIAT ASYP+V
Sbjct: 315 GEKGYIKMSRNKSNQCGIATAASYPLV 341
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 106/216 (49%), Positives = 143/216 (66%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++P+KDQGHCGSCW+FS TG+LE +++ GK +SLSEQ L+DC+ + N G
Sbjct: 126 VDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRYGNNG 185
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL QAF+YIK N GLDTE +YPY ++ C+++ N G V+I G E +L+
Sbjct: 186 CNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKCRYNPRNNGATDSGYVDIPEGNEKKLK 245
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG-VPYWL 278
AV + PVSVA + + F+FY+ GVY +C + +D H V+ VGYG +D YWL
Sbjct: 246 AAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLD--HGVLVVGYGTDDNDQDYWL 303
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG WGD GY KM K N CGIA+ ASYP+V
Sbjct: 304 VKNSWGVTWGDEGYIKMARNKDNHCGIASSASYPLV 339
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 120/306 (39%), Positives = 161/306 (52%), Gaps = 56/306 (18%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN--------- 108
+ F +G+ Y SV+E + R + F +N I N + +++ L +N
Sbjct: 23 WQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEE 82
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQ CGSCW FSTTGSL
Sbjct: 83 IVATMNGFLGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSL 142
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + GK +SLSEQ LVDC+ F N GC GGL QAF YIK N G+DTE++YPY +
Sbjct: 143 EGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQ 202
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSS 249
DG C+F + NVG V++ G+E L+ AV + P+SV + F FY +GVY
Sbjct: 203 DGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHD 262
Query: 250 TKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
C +T +D H V+AVGYG E+G +WL+KNSW +WGD GY KM + N CGIA+
Sbjct: 263 DHCSSTMLD--HGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNNCGIASQ 320
Query: 308 ASYPVV 313
ASYP+V
Sbjct: 321 ASYPLV 326
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 109/207 (52%), Positives = 134/207 (64%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++P+K+QG CGSCW+FS TGSLE + GK SLSEQ LVDC+Q N GC GGL
Sbjct: 126 VTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDD 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK N G+DTE +YPY K+G C+F++ NVG +I +E +LQ AV V P
Sbjct: 186 AFQYIKDNSGIDTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGP 245
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y+SGVY C T +D H V+AVGYG E G YWL+KNSWGE+W
Sbjct: 246 ISVAIDASHMSFQLYRSGVYHEFFCSETRLD--HGVLAVGYGTESGKDYWLVKNSWGESW 303
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K N CGIAT ASYP V
Sbjct: 304 GQKGYIMMSRNKRNNCGIATSASYPTV 330
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 120/306 (39%), Positives = 161/306 (52%), Gaps = 56/306 (18%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN--------- 108
+ F +G+ Y SV+E + R + F +N I N + +++ L +N
Sbjct: 22 WQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEE 81
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQ CGSCW FSTTGSL
Sbjct: 82 IVATMNGFLGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSL 141
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + GK +SLSEQ LVDC+ F N GC GGL QAF YIK N G+DTE++YPY +
Sbjct: 142 EGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQ 201
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSS 249
DG C+F + NVG V++ G+E L+ AV + P+SV + F FY +GVY
Sbjct: 202 DGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHD 261
Query: 250 TKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
C +T +D H V+AVGYG E+G +WL+KNSW +WGD GY KM + N CGIA+
Sbjct: 262 DHCSSTMLD--HGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNNCGIASQ 319
Query: 308 ASYPVV 313
ASYP+V
Sbjct: 320 ASYPLV 325
>gi|21617827|sp|P09648.1|CATL1_CHICK RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain
Length = 218
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 111/218 (50%), Positives = 143/218 (65%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVKDQG CGSCW FSTTG+LE + + GK +SLSEQ LVDC++ N
Sbjct: 3 RSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGCNGGL QAF+Y++ NGG+D+EE+YPYT KD C++ +E V+I G E
Sbjct: 63 QGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHER 122
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
L AV V PVSVA + F+FY+SG+Y C + D++H V+ VGYG E G Y
Sbjct: 123 ALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGGKKY 180
Query: 277 WLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
W++KNSWGE WGD GY M KN CGIAT ASYP+V
Sbjct: 181 WIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218
>gi|348531585|ref|XP_003453289.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 366
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 105/207 (50%), Positives = 136/207 (65%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FSTTG LE + + GK +SLSEQQL+DC+ +F N GCNGG +
Sbjct: 162 VTEVKDQKICGSCWAFSTTGVLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNGCNGGSVKR 221
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YI+ NGG+DTE +YPY K C++ + +G + V + ED L+ AV + P
Sbjct: 222 AFQYIQANGGIDTEASYPYEAKGQQCRYKPDGIGAKCTGYVEVKPSNEDALKEAVATIGP 281
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + + FRFY+SGVY C T + NH V+AVGYG E+G YWLIKNSWG W
Sbjct: 282 ISVGIDASHNSFRFYQSGVYDEPDCSKTVL--NHDVLAVGYGTENGHDYWLIKNSWGIRW 339
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GD GY KM K N CGIA+ A+YP+V
Sbjct: 340 GDKGYIKMSRNKSNQCGIASDATYPLV 366
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 109/207 (52%), Positives = 134/207 (64%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++P+K+QG CGSCW+FS TGSLE + GK SLSEQ LVDC+Q N GC GGL
Sbjct: 126 VTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDD 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK N G+DTE +YPY K+G C+F++ NVG +I +E +LQ AV V P
Sbjct: 186 AFQYIKDNNGIDTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGP 245
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
++VA + F+ YKSGVY C T +D H V+AVGYG E G YWL+KNSWGE+W
Sbjct: 246 IAVAIDASHMSFQLYKSGVYHEFFCSETRLD--HGVLAVGYGTESGKDYWLVKNSWGESW 303
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K N CGIAT ASYP V
Sbjct: 304 GQKGYIMMSRNKRNNCGIATSASYPTV 330
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 119/245 (48%), Positives = 153/245 (62%), Gaps = 16/245 (6%)
Query: 78 EMKLRFATFSKN--LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYH 135
E K R + F + L+ RS + + Y ++PVKDQG CGSCW FSTTG+LE +
Sbjct: 116 ERKYRGSQFLEPSFLEAPRSVDWREKGY-----VTPVKDQGQCGSCWAFSTTGALEGQHF 170
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-C 194
+ GK +SLSEQ LVDC++ NQGCNGGL QAF+Y++ NGG+D+EE+YPYT KD C
Sbjct: 171 RKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDC 230
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCG 253
++ +E V+I G E L AV V PVSVA + F+FY+SG+Y C
Sbjct: 231 RYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCS 290
Query: 254 NTPMDVNHAVVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCA 308
+ D++H V+ VGYG E DG YW++KNSWGE WGD GY M KN CGIAT A
Sbjct: 291 SE--DLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAA 348
Query: 309 SYPVV 313
SYP+V
Sbjct: 349 SYPLV 353
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 106/207 (51%), Positives = 134/207 (64%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++P+K+QG CGSCW+FS TGSLE + GK +SLSEQ LVDC++ N GC GGL
Sbjct: 126 VTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDD 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YIK N G+DTE +YPY +DG C+F S +VG V+I E+ L+ AV V P
Sbjct: 186 AFTYIKANNGIDTEASYPYKARDGKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGP 245
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y++GVY C T +D H V+AVGYG ED YWL+KNSWGE+W
Sbjct: 246 ISVAIDASHMSFQLYRTGVYHDWFCSQTKLD--HGVLAVGYGTEDSKDYWLVKNSWGESW 303
Query: 288 GDHGYFKMEMG-KNMCGIATCASYPVV 313
G GY +M +N CGIAT ASYP V
Sbjct: 304 GQKGYIQMSRNRRNNCGIATSASYPTV 330
>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 113/221 (51%), Positives = 141/221 (63%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +S VKDQG CG CW FSTTGSLE + GK + LSEQQLVDC++ F N
Sbjct: 116 KSVDWRNSHMVSEVKDQGECGPCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAED 217
QGC GGL QAF+YI NGGLDTEE+YPYT D CKF + +VG ++ ++ G E
Sbjct: 176 QGCGGGLMDQAFQYIPANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEH 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG--- 273
L+ AV V PVSVA + + F+FY SGVY +C +D H V+AVGYG +
Sbjct: 236 ALKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLD--HGVLAVGYGAMNDNSH 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG +WGD GY M K N CGIAT ASYP+V
Sbjct: 294 QAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 119/245 (48%), Positives = 153/245 (62%), Gaps = 16/245 (6%)
Query: 78 EMKLRFATFSKN--LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYH 135
E K R + F + L+ RS + + Y ++PVKDQG CGSCW FSTTG+LE +
Sbjct: 82 ERKYRGSQFLEPSFLEAPRSVDWREKGY-----VTPVKDQGQCGSCWAFSTTGALEGQHF 136
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-C 194
+ GK +SLSEQ LVDC++ NQGCNGGL QAF+Y++ NGG+D+EE+YPYT KD C
Sbjct: 137 RKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDC 196
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCG 253
++ +E V+I G E L AV V PVSVA + F+FY+SG+Y C
Sbjct: 197 RYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDC- 255
Query: 254 NTPMDVNHAVVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCA 308
+ D++H V+ VGYG E DG YW++KNSWGE WGD GY M KN CGIAT A
Sbjct: 256 -SSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAA 314
Query: 309 SYPVV 313
SYP+V
Sbjct: 315 SYPLV 319
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 119/245 (48%), Positives = 153/245 (62%), Gaps = 16/245 (6%)
Query: 78 EMKLRFATFSKN--LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYH 135
E K R + F + L+ RS + + Y ++PVKDQG CGSCW FSTTG+LE +
Sbjct: 206 ERKYRGSQFLEPNFLEAPRSVDWREKGY-----VTPVKDQGQCGSCWAFSTTGALEGQHF 260
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-C 194
+ GK +SLSEQ LVDC++ NQGCNGGL QAF+Y++ NGG+D+EE+YPYT KD C
Sbjct: 261 RKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDC 320
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCG 253
++ +E V+I G E L AV V PVSVA + F+FY+SG+Y C
Sbjct: 321 RYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDC- 379
Query: 254 NTPMDVNHAVVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCA 308
+ D++H V+ VGYG E DG YW++KNSWGE WGD GY M KN CGIAT A
Sbjct: 380 -SSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAA 438
Query: 309 SYPVV 313
SYP+V
Sbjct: 439 SYPLV 443
>gi|395514296|ref|XP_003761355.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 262
Score = 216 bits (550), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 114/225 (50%), Positives = 142/225 (63%), Gaps = 7/225 (3%)
Query: 94 RSTNCKGLSYRLG-LNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDC 152
R+ C G + G + + +KD+G CGSCW FS TGSLE + GK +SLSEQ LVDC
Sbjct: 40 RANGCDGRWDQAGSVRDTSIKDKGQCGSCWAFSATGSLEGQWFHKTGKLVSLSEQNLVDC 99
Query: 153 AQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT 212
+ A N GC GGL AFEY+K NGG+DTEE+YPY GKDG C ++S+ G V V+I
Sbjct: 100 STAQGNSGCQGGLMDNAFEYVKKNGGIDTEESYPYVGKDGTCHYNSQCSGANVTGYVDIP 159
Query: 213 LGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE 271
G E L AV V P+SVA + F+FY+SGVY +C + +D H V+ VG+GVE
Sbjct: 160 AGVERALAKAVATVGPISVAIDAGHSSFQFYRSGVYYEPECSSEELD--HGVLVVGFGVE 217
Query: 272 --DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+G YW++KNSWGE WGD GY M N CGIAT ASYP V
Sbjct: 218 GKNGKKYWIVKNSWGEEWGDRGYVLMTRDHNNHCGIATAASYPEV 262
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 108/228 (47%), Positives = 149/228 (65%), Gaps = 8/228 (3%)
Query: 92 LIRSTNC---KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
++S N K + +R ++PVK+QG CGSCW+FS TGSLE + + G +SLSEQ
Sbjct: 116 FLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQN 175
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS 208
L+DC++ + N GC GGL AF+YIK N GLDTE++YPY +D C+++ EN G
Sbjct: 176 LIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGF 235
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
V+I G ED L HA+ V PVS+A + + F+FYK GV+ + +C +T +D H V+AVG
Sbjct: 236 VDIPEGDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELD--HGVLAVG 293
Query: 268 YGVE-DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YG + G YW++KNSWG+ WGD GY M KN CG+A+ ASYP+V
Sbjct: 294 YGTDHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 115/265 (43%), Positives = 155/265 (58%), Gaps = 15/265 (5%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIR---------STNCKGLSYRLGLNIS 110
+S+ +G + V E K F + D R S K + +R ++
Sbjct: 70 VSYKMMMNHFGDLM--VHEFKALMNGFKMSPDTKRNGELYFPSNSNLPKTVDWRQKGAVT 127
Query: 111 PVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAF 170
PVKDQG CGSCW+FS TGSLE GK +SLSEQ LVDC+ ++ N GC GGL QAF
Sbjct: 128 PVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAF 187
Query: 171 EYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVS 230
+Y+ N G+DTE +YPY ++ C+F VG V+I G E LQ+A+ V P+S
Sbjct: 188 QYVSDNKGIDTEASYPYEARENTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPIS 247
Query: 231 VAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGD 289
VA + G F+FY GVY+ C + D++H V+AVGYG E+G YWL+KNSWG +WG+
Sbjct: 248 VAIDANHGSFQFYSKGVYNEPNC--SSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGE 305
Query: 290 HGYFKMEMGK-NMCGIATCASYPVV 313
+GY K+ N CGIA+ ASYP+V
Sbjct: 306 NGYIKIARNHSNHCGIASMASYPLV 330
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 140/216 (64%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQGHCGSCW+FS TG+LE + + GK +SLSEQ LVDC+Q + N G
Sbjct: 131 MDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYGNNG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG+ AF+YIK N G+DTE++YPY D C ++ + VG V+I G E L
Sbjct: 191 CNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDECHYNPKAVGATDKGFVDIPQGNEKALM 250
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWL 278
A+ V PVSVA + + F+FY GVY +C + +D H V+AVGYG EDG YWL
Sbjct: 251 KALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLD--HGVLAVGYGTTEDGEDYWL 308
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG WGD GY KM + N CGIAT ASYP+V
Sbjct: 309 VKNSWGTTWGDQGYVKMARNRDNHCGIATTASYPLV 344
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/215 (49%), Positives = 137/215 (63%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW+FSTTGSLE + GK +SLSEQQLVDC+ F N+G
Sbjct: 135 VDWRKKGYVTAVKNQGQCGSCWSFSTTGSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEG 194
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL QAFEYI NGG++TEE YPY + C F V V++ G E +L+
Sbjct: 195 CNGGLMDQAFEYIITNGGIETEEEYPYDARQERCHFKKSEVAATASGCVDVKSGDETDLK 254
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
++V V PVS+A + F+ Y GVY KC +T +D H V+ VGYG +DG YWL+
Sbjct: 255 NSVAEVGPVSIAIDASHQSFQLYSGGVYDEPKCSSTELD--HGVLVVGYGTDDGQDYWLV 312
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG WG GY KM + N CG+AT ASYP+V
Sbjct: 313 KNSWGTTWGLEGYVKMSRNQDNQCGVATQASYPLV 347
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/215 (49%), Positives = 139/215 (64%), Gaps = 2/215 (0%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW FSTTGSLE + + + SLSEQ L+DC+ + N G
Sbjct: 127 VDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGKYGNNG 186
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL AF YIK N G+DTE++YPY G D C++ + G V+I G E++L+
Sbjct: 187 CSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKCRYKPQESGATDKGFVDIPQGDEEKLK 246
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + F+FYK GVY CGN D++H V+AVGYG E+G YWL+
Sbjct: 247 LAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGTENGKDYWLV 306
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG+ WG GY KM K N CGIAT ASYP+V
Sbjct: 307 KNSWGKRWGLDGYIKMARNKHNHCGIATSASYPLV 341
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 112/218 (51%), Positives = 141/218 (64%), Gaps = 7/218 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FS TGSLE + + GK +SLSEQ LVDC+ N
Sbjct: 120 KTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSD--KN 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL +AF+YI GG+DTEE+YPY DG C F + NVG V ++T G+E
Sbjct: 178 YGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNCHFKTANVGATVTGYTDVTSGSEKA 237
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
LQ AV + P+SVA + F+ Y+SGVY+ C +T +D H V+AVGYG DG Y
Sbjct: 238 LQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLD--HGVLAVGYGTTIDGTDY 295
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W++KNSW E WG +GY M K N CGIAT ASYP+V
Sbjct: 296 WIVKNSWAETWGMNGYIWMSRNKDNQCGIATQASYPLV 333
>gi|348531515|ref|XP_003453254.1| PREDICTED: cathepsin L2-like [Oreochromis niloticus]
Length = 333
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 112/238 (47%), Positives = 152/238 (63%), Gaps = 9/238 (3%)
Query: 79 MKLRFATFSKNLDLIRSTNC-KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQA 137
+ R +TF++ L + T K + +R ++ VK Q CGSCW FS TG+LE + +
Sbjct: 102 LHRRGSTFNR---LPKGTKLPKTVDWRKQGYVTKVKHQKECGSCWAFSATGALEGQHFRK 158
Query: 138 FGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFS 197
K +SLSEQQLVDC+++F N GCNGG + AF+YI+YNGGLDTE++YPY KDG+C ++
Sbjct: 159 TRKLVSLSEQQLVDCSRSFGNHGCNGGWMNPAFQYIRYNGGLDTEDSYPYKAKDGICHYN 218
Query: 198 SENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTP 256
+VG V+++ E L+ AV + P+S+A + + F+ Y+SGVY +C
Sbjct: 219 PNSVGAICSGHVDVSPD-EAALKQAVATIGPISIAVDASHESFQLYQSGVYDEHRCNKK- 276
Query: 257 MDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
V HA++ VGYG E G YWLIKNSWG WGD GY KM K N CGIAT ASYP+V
Sbjct: 277 -HVTHAMLVVGYGTEGGHDYWLIKNSWGLQWGDKGYIKMTRNKGNQCGIATAASYPLV 333
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 106/208 (50%), Positives = 140/208 (67%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQGHCGSCW+FS TGSLE + + GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 133 VTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDN 192
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YIK NGG+DTE++YPY +D C + ++N G V+I ED+L+ AV V P
Sbjct: 193 AFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGP 252
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGEN 286
VS+A + + F+ Y GVYS +C + +D H V+ VGYG +DG YWL+KNSWG +
Sbjct: 253 VSIAIDASHETFQLYSDGVYSDPECSSQELD--HGVLVVGYGTSDDGQDYWLVKNSWGPS 310
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG +GY KM + NMCG+A+ ASYP+V
Sbjct: 311 WGLNGYIKMARNQDNMCGVASQASYPLV 338
>gi|3929819|emb|CAA77182.1| cathepsin H [Mus musculus]
Length = 166
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 103/166 (62%), Positives = 122/166 (73%)
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNG 177
CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQAFNN GC GGLPSQAFEYI YN
Sbjct: 1 CGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNK 60
Query: 178 GLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD 237
G+ E++YPY GKD C+F+ + V + VNITL E + AV L PVS AFEV +
Sbjct: 61 GIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTE 120
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
F YKSGVYSS C TP VNHAV+AVGYG ++G+ YW++KNSW
Sbjct: 121 DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSW 166
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 143/218 (65%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS TGSLE + + G +SLSEQ L+DC+ ++ N
Sbjct: 124 KMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL QAF YIK N GLDTE+ YPY G+D C++ + G + V+I +G E +
Sbjct: 184 NGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASDVGFVDIPVGDEQK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L+ AV V PVSVA + F+FY G+Y +C +T +D H V+ VGYG E+G Y
Sbjct: 244 LKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLD--HGVLVVGYGTDEEGRDY 301
Query: 277 WLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
W++KNSWGE+WG+ GY KM N CGIA+ ASYP+V
Sbjct: 302 WIVKNSWGESWGEKGYIKMARNIDNHCGIASSASYPIV 339
>gi|3929735|emb|CAA77179.1| cathepsin H [Homo sapiens]
Length = 166
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 103/166 (62%), Positives = 120/166 (72%)
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNG 177
CGSCWTFSTTG+LE+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN
Sbjct: 1 CGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNK 60
Query: 178 GLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD 237
G+ E+ YPY GKDG CKF V D NIT+ E+ + AV L PVS AFEV
Sbjct: 61 GIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQ 120
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
F Y++G+YSST C TP VNHAV+AVGYG E+G+PYW++KNSW
Sbjct: 121 DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSW 166
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 103/209 (49%), Positives = 136/209 (65%), Gaps = 4/209 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQGHCGSCW FS TG+LE + + +SLSEQ L+DC+ N GCNGGL Q
Sbjct: 137 VTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQ 196
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y++ NGG+DTE +YPY G + VC++ EN G ++ LG ED L+ AV V P
Sbjct: 197 AFQYVRINGGIDTERSYPYEGNNDVCRYEPENSGAIDTGYTDVPLGDEDALKSAVATVGP 256
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP--YWLIKNSWGE 285
VSVA + + F+ Y SGVY C N P ++H V+ VGYG ++ YWL+KNSWG+
Sbjct: 257 VSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSWGD 316
Query: 286 NWGDHGYFKM-EMGKNMCGIATCASYPVV 313
+WG++GY KM N CGIAT S+P V
Sbjct: 317 SWGENGYIKMARNADNQCGIATQPSFPQV 345
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 151/239 (63%), Gaps = 9/239 (3%)
Query: 84 ATFSKNLDLIRSTNCK---GLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGK 140
A K IRS + K + +R ++ VK+QG CGSCW FSTTG++E +++ +
Sbjct: 132 AIRHKGSTFIRSEHTKLPSKVDWRREGAVTDVKNQGQCGSCWAFSTTGAIEGQHYRKTNR 191
Query: 141 GISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV----CKF 196
++LSEQQLVDC++++ N GC+GGL + AFEY++ N G+D+E +YPY DG C F
Sbjct: 192 LVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEGIDSEISYPYVSGDGTENNRCLF 251
Query: 197 SSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNT 255
++ N+ QV VNI G E L AV PVSVA + F YKSG+YS T C T
Sbjct: 252 NASNILAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFSMYKSGIYSDTDCEGT 311
Query: 256 PMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
++H V+ VGYG E+G YWLIKNSWGE WG+ GY K+ G NMCG+A+ ASYP+V
Sbjct: 312 LDALDHGVLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKGSHNMCGVASAASYPLV 370
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 109/238 (45%), Positives = 153/238 (64%), Gaps = 8/238 (3%)
Query: 82 RFATFSKNLDLIRSTNC---KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
R T + + ++S N K + +R ++PVK+QG CGSCW+FS TGSLE + +
Sbjct: 106 RNFTNDEAVTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKT 165
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSS 198
G +SLSEQ L+DC++ + N GC GGL AF+YIK N GLDTE++YPY +D C+++
Sbjct: 166 GVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNP 225
Query: 199 ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPM 257
EN G V+I G ED L HA+ V PVS+A + + F+FYK GV+ + +C +T +
Sbjct: 226 ENSGATDKGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTEL 285
Query: 258 DVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
D H V+AVG+G + G YW++KNSWG+ WGD GY M KN CG+A+ ASYP+V
Sbjct: 286 D--HGVLAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 109/238 (45%), Positives = 153/238 (64%), Gaps = 8/238 (3%)
Query: 82 RFATFSKNLDLIRSTNC---KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
R T + + ++S N K + +R ++PVK+QG CGSCW+FS TGSLE + +
Sbjct: 106 RNFTNDEAVTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKT 165
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSS 198
G +SLSEQ L+DC++ + N GC GGL AF+YIK N GLDTE++YPY +D C+++
Sbjct: 166 GVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNP 225
Query: 199 ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPM 257
EN G V+I G ED L HA+ V PVS+A + + F+FYK GV+ + +C +T +
Sbjct: 226 ENSGATDKGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTEL 285
Query: 258 DVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
D H V+AVG+G + G YW++KNSWG+ WGD GY M KN CG+A+ ASYP+V
Sbjct: 286 D--HGVLAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 108/216 (50%), Positives = 144/216 (66%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FSTTG+LE + + G +SLSEQ LVDC+ A+ N G
Sbjct: 131 VDWRKKGAVTDVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF+YIK NGG+DTE++YPY D C+++ +N G + V+I G E++L
Sbjct: 191 CNGGLMDNAFKYIKDNGGIDTEKSYPYEAVDDKCRYNPKNSGADDVGFVDIPQGDEEKLM 250
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWL 278
AV V P+SVA + + F+FY GVY C +T D++H V+ VGYG E+G YWL
Sbjct: 251 QAVATVGPISVAIDASQETFQFYSKGVYYDENCSST--DLDHGVMVVGYGTEEEGGDYWL 308
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG +WG+ GY KM K N CGIA+ ASYP+V
Sbjct: 309 VKNSWGRSWGELGYIKMAHNKNNHCGIASSASYPLV 344
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 143/218 (65%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW+FS TG+LE + + G +SLSEQ L+DC+ + N
Sbjct: 130 KKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGN 189
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL QAF+YIK N GLDTE +YPY ++ C+++ N G + ++I G E
Sbjct: 190 NGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDVGYIDIPTGDEKL 249
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L+ AV + PVSVA + F+FY GVY +C + +D H V+ +GYG E+G Y
Sbjct: 250 LKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELD--HGVLVIGYGTNENGQDY 307
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE WG++GY KM K N CGIA+ ASYP+V
Sbjct: 308 WLVKNSWGETWGNNGYIKMARNKLNHCGIASSASYPLV 345
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 143/218 (65%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW+FS TG+LE + + G +SLSEQ L+DC+ + N
Sbjct: 124 KKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL QAF+YIK N GLDTE +YPY ++ C+++ N G + ++I G E
Sbjct: 184 NGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDVGYIDIPTGNEKL 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L+ AV + PVSVA + F+FY GVY +C + +D H V+ +GYG E+G Y
Sbjct: 244 LKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELD--HGVLVIGYGTNENGEDY 301
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE WG++GY KM K N CGIA+ ASYP+V
Sbjct: 302 WLVKNSWGETWGNNGYIKMARNKLNHCGIASSASYPLV 339
>gi|345493482|ref|XP_001602523.2| PREDICTED: cathepsin L-like [Nasonia vitripennis]
Length = 514
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/209 (50%), Positives = 140/209 (66%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG+CGSCW FS TGSLE + + G ISLSEQ LVDC+ F N GC+GGL +
Sbjct: 307 VTPVKNQGNCGSCWAFSATGSLEGQHFRHNGSLISLSEQNLVDCSGRFGNDGCDGGLMNN 366
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF Y+K N GLD+E++YPY +D C+++ +N VNI G+E +LQ AV V P
Sbjct: 367 AFTYVKVNRGLDSEKSYPYEAEDDRCRYNPKNSAADDAGYVNIPTGSESKLQAAVATVGP 426
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED--GVPYWLIKNSWGE 285
+SVA + D F FY SGVY C T D++H V+A+GYG + G +WL+KNSWGE
Sbjct: 427 ISVAIDADSDSFMFYHSGVYYEPDCSRT--DLDHGVLAIGYGTDSKTGKQFWLVKNSWGE 484
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+WG+ GY +M + N CGIAT ASYP+V
Sbjct: 485 DWGEKGYIRMSRNRHNNCGIATAASYPLV 513
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 84/181 (46%), Positives = 119/181 (65%), Gaps = 4/181 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++P+KDQGHCGSCW+FS TG+LE + + GK +SLSEQ L+DC+ + N
Sbjct: 124 KSVDWRQEGAVTPIKDQGHCGSCWSFSATGALEGQHFRQTGKLVSLSEQNLIDCSGKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF+YI+ N GLDTE YPY +D C++++ N G + + V+I G E++
Sbjct: 184 NGCNGGLMDNAFKYIRDNKGLDTESTYPYEAEDDECRYNARNSGAEDVGFVDIPEGDEEK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L+ A+ + PVSVA + F+FY +GVY +C +T +D H V+ VGYG EDG Y
Sbjct: 244 LKAAIATIGPVSVAIDASHQTFQFYSTGVYYEPECSSTELD--HGVLVVGYGTSEDGQDY 301
Query: 277 W 277
W
Sbjct: 302 W 302
>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
Length = 334
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 105/207 (50%), Positives = 136/207 (65%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ +KDQ CGSCW FS TGSLE + GK +SLSEQQLVDC+ ++ N GC+GGL Q
Sbjct: 130 VTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQ 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YI+ N GLDTE++YPY +DG C+F+ VG V+I G E LQ AV + P
Sbjct: 190 AFQYIEANKGLDTEDSYPYEAQDGECRFNPSTVGASCTGYVDIASGDESALQEAVATIGP 249
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y SGVY+ C ++ +D H V+AVGYG +G YW++KNSWG +W
Sbjct: 250 ISVAIDAGHSSFQLYSSGVYNEPDCSSSELD--HGVLAVGYGSSNGDDYWIVKNSWGLDW 307
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K N CGIAT ASYP+V
Sbjct: 308 GVQGYILMSRNKSNQCGIATAASYPLV 334
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 105/208 (50%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQGHCGSCW FS+TG+LE + + G ISLSEQ LVDC+ + N GCNGGL
Sbjct: 135 VTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDN 194
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YIK NGG+DTE++YPY G D C F+ +G +I G E +L AV + P
Sbjct: 195 AFRYIKDNGGIDTEKSYPYEGIDDSCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGP 254
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGEN 286
VSVA + + F+FY +GVY +C P +++H V+ VGYG E+G YWL+KNSWG
Sbjct: 255 VSVAIDASHESFQFYSTGVYDEPQC--DPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTT 312
Query: 287 WGDHGYFKMEMG-KNMCGIATCASYPVV 313
WGD G+ KM N CGIAT +SYP+V
Sbjct: 313 WGDKGFIKMARNDDNQCGIATASSYPLV 340
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 109/215 (50%), Positives = 136/215 (63%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW FSTTGSLE + GK +SLSEQ LVDC+ N+G
Sbjct: 111 VDWRTKGAVTGVKNQGQCGSCWAFSTTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEG 170
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL QAFEYIK NGG+DTE +YPY D C+F + +VG V+I E+ L
Sbjct: 171 CNGGLMDQAFEYIKKNGGIDTEASYPYQAHDERCRFKASDVGATCTGYVDIKREDENALM 230
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV + PVSVA + F+ Y+SGVY +C T +D H V+A+GYG E G YWL+
Sbjct: 231 QAVEKIGPVSVAIDASHSSFQLYRSGVYYERECSQTALD--HGVLAIGYGTEGGSDYWLV 288
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG +WG GY M + N CGIAT ASYP V
Sbjct: 289 KNSWGTDWGMEGYIMMSRNRNNNCGIATEASYPTV 323
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 107/207 (51%), Positives = 136/207 (65%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW+FS TGS+E + A G +SLSEQ LVDC+ A N GCNGGL
Sbjct: 120 VTPVKNQGQCGSCWSFSATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDD 179
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEY+ N G+DTE +YPY D CKF++ +VG + V++T +E +LQ AV + P
Sbjct: 180 AFEYVIKNNGIDTEASYPYRAVDSTCKFNTADVGATISGYVDVTKDSESDLQVAVATIGP 239
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA + F+FY SGVY C +T +D H V+AVGYG + YWL+KNSWG +W
Sbjct: 240 VSVAIDASHISFQFYSSGVYDPLICSSTNLD--HGVLAVGYGTDGSKDYWLVKNSWGASW 297
Query: 288 GDHGYFKM-EMGKNMCGIATCASYPVV 313
G GY +M N CGIAT ASYPVV
Sbjct: 298 GMSGYIEMVRNHNNKCGIATSASYPVV 324
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 106/207 (51%), Positives = 135/207 (65%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FSTTGSLE + +A + +SLSE LVDC++ + NQGCNGGL
Sbjct: 120 VTEVKDQGQCGSCWAFSTTGSLEGQHFKATKQLVSLSESNLVDCSKKWGNQGCNGGLMDN 179
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YI N G+DTE++YPY +D C F NVG +IT G+ED LQ AV + P
Sbjct: 180 AFKYIADNKGIDTEKSYPYKPEDRKCNFKKANVGATDKLYKDITSGSEDALQEAVATIGP 239
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + D F+ Y GVY+ C +D H V+AVGY ++G YW++KNSWG++W
Sbjct: 240 ISVAIDASHDSFQLYSGGVYNEKACSTKTLD--HGVLAVGYDSKNGDDYWIVKNSWGKSW 297
Query: 288 GDHGYFKMEMG-KNMCGIATCASYPVV 313
G GY M KN CGIAT ASYPVV
Sbjct: 298 GIDGYIWMSRNKKNQCGIATMASYPVV 324
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 110/232 (47%), Positives = 146/232 (62%), Gaps = 5/232 (2%)
Query: 83 FATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI 142
F T++ +L + +R ++PVK+Q +CGSCW+FS TG+LEA + + K I
Sbjct: 121 FVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKLI 180
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVG 202
SLSEQQLVDC+ + N GC+GG AF YIK NGG+DTE++YPYT KDG C + N
Sbjct: 181 SLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKENGGIDTEQSYPYTAKDGRCAYKPGNKA 240
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
V + + G E++L V V P+S+A EV F+FY SGVY +CG++ +NHA
Sbjct: 241 ATVSQVIMVPRG-ENQLAAKVSSVGPISIAAEVSHKFQFYHSGVYDEPQCGHS---LNHA 296
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
++AVGYG G +WL+KNSWG WGD GY +M K N CGIA ASYP V
Sbjct: 297 MLAVGYGSMGGKNFWLVKNSWGTGWGDQGYIRMAKDKNNQCGIALMASYPGV 348
>gi|340505335|gb|EGR31675.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 229
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 114/217 (52%), Positives = 146/217 (67%), Gaps = 6/217 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGIS---LSEQQLVDCAQAFN 157
L +R ++ VK+Q CGSCW+FSTTG++E+ H A G LSEQQL+DCAQ FN
Sbjct: 8 LDWRQYGIVTSVKNQRSCGSCWSFSTTGAVES--HWALKNGNPPPILSEQQLIDCAQDFN 65
Query: 158 NQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAED 217
N GC GGLPSQAFEYI YNGGL++E+ YPY C F + V ++ NIT E+
Sbjct: 66 NFGCKGGLPSQAFEYIFYNGGLESEKDYPYMAATRNCTFDASKVSAKLEGQYNITFQDEN 125
Query: 218 ELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
EL + + P+S+A++V + F Y+SGVYSS C P DVNHAV+AVGYGV G Y
Sbjct: 126 ELLYKLANEGPISIAYQVNNDFFQYRSGVYSSPSCSQQPSDVNHAVLAVGYGVSISGQLY 185
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
+++KNSWG WG +GYF +E G NMCG+A CASYP+V
Sbjct: 186 YIVKNSWGPEWGINGYFLIERGTNMCGLADCASYPIV 222
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 132/344 (38%), Positives = 185/344 (53%), Gaps = 47/344 (13%)
Query: 12 ILLLCCAAAASASASSF-----DDSNPIRLVSSDGLRDFETS---VLQVIGQARHALSFA 63
ILL+ CA A+ +A SF ++ N +L D ET +++ + +H + A
Sbjct: 3 ILLVLCAVVAAGTAVSFFDLVREEWNTFKLEHKKQY-DSETEEKFRMKIYAENKHKV--A 59
Query: 64 RFARRYGK----------------IYESVEEMKLRFATFSKNLDLI-RSTNCKG------ 100
+ +RY K +E V M T N L + + +G
Sbjct: 60 KHNQRYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVSP 119
Query: 101 --------LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDC 152
+ +R ++PVKDQG CGSCW+FSTTG+LE + + G +SLSEQ L+DC
Sbjct: 120 ANVAAPPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDC 179
Query: 153 AQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT 212
+ A+ N GCNGGL AF+YIK N G+DTE+ YPY D C+++ +N G + + V+I
Sbjct: 180 SSAYGNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEAVDDKCRYNPKNSGAEDVGFVDIP 239
Query: 213 LGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV- 270
G E +L A+ V PVSVA + + F+ Y GVY C + +D H V+ VGYG
Sbjct: 240 AGDEHKLMLALATVGPVSVAIDASQESFQLYSDGVYYDENCSSENLD--HGVLVVGYGTD 297
Query: 271 EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
EDG YWL+KNSWG +WGD GY KM + N CGIA+ ASYP+V
Sbjct: 298 EDGGDYWLVKNSWGPSWGDEGYIKMARNRDNHCGIASSASYPLV 341
>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
Length = 335
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 109/222 (49%), Positives = 145/222 (65%), Gaps = 9/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FSTTG+LE +++ K ISLSEQ LVDC++A N
Sbjct: 116 KSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+Y+K NGG+D+E++YPYT KD C + N V++ G E
Sbjct: 176 EGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEK 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
+L AV V PVSVA + F+FY+SG+Y +C + D++H V+ VGYG E D
Sbjct: 236 DLMKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPEC--SSEDLDHGVLVVGYGFESEDVD 293
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G YW++KNSW E WGD+GY + + N CGIAT ASYP+V
Sbjct: 294 GKKYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASYPLV 335
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 99/196 (50%), Positives = 130/196 (66%), Gaps = 3/196 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++P+K+QG CGSCW FSTTGSLE + GK +SLSEQ+LVDC+ A N G
Sbjct: 117 VDWRTKGAVTPIKNQGQCGSCWAFSTTGSLEGQHALKKGKLVSLSEQELVDCSAAEGNDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL AF YIK N G+DTE++YPYTG+DG C F +V V V++T G+E LQ
Sbjct: 177 CDGGLMDDAFTYIKKNNGIDTEQSYPYTGEDGTCSFKKSDVAATVTGFVDVTSGSESGLQ 236
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
A + P+SVA + F+ Y+SGVY + C T +D H V+ VGYG +DG YWL+
Sbjct: 237 DASATIGPISVAIDASSWDFQLYESGVYDVSDCSTTELD--HGVLVVGYGTDDGTAYWLV 294
Query: 280 KNSWGENWGDHGYFKM 295
KNSWG +WG HGY +M
Sbjct: 295 KNSWGTDWGHHGYIQM 310
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 110/222 (49%), Positives = 147/222 (66%), Gaps = 9/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FSTTG+LE +++ GK ISLSEQ LVDC++A N
Sbjct: 116 KTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGCNGGL QAF+Y+K NGG+D+E++YPYT KD C + V++ G+E
Sbjct: 176 QGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSGSEK 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
+L AV V PVSVA + F+FY+SG+Y +C + D++H V+ VGYG E D
Sbjct: 236 DLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPEC--SSEDLDHGVLVVGYGFEGEDVD 293
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G YW++KNSW E WG++GY K+ + N CGIAT ASYP+V
Sbjct: 294 GKRYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASYPLV 335
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/335 (39%), Positives = 181/335 (54%), Gaps = 35/335 (10%)
Query: 11 VILLLCCAAAASASA--SSFDDSNPI------------------RLVSSDGLRDFETSVL 50
V+L+LC AA +A + FD+ + R+V L+ E L
Sbjct: 6 VVLVLCTGAALAAPRFDAQFDEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKIEMHNL 65
Query: 51 Q-VIGQARHALSFARFARRYGKIYESVEE-MKLRFATFSKNLDLIRSTNC---KGLSYRL 105
+ +G+ ++L F + + V KL+ F +L + N K + +R
Sbjct: 66 EHSLGKHSYSLGMNHFGDMTNEEFRQVMNGYKLQQRKFKGSL-FLEPNNMEAPKQVDWRE 124
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++PVKDQG CGSCW FSTTG++E + K +SLSEQ LVDC++ N+GCNGGL
Sbjct: 125 EGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPEGNEGCNGGL 184
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVG 224
QAF+YI+ N GLD+EEAYPY G D C + +E ++I G E L A+
Sbjct: 185 MDQAFQYIQDNSGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSGKEHALMKAIA 244
Query: 225 LVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLI 279
V PVSVA + + F+FY+SG+Y +C + +D H V+AVGYG E DG YW++
Sbjct: 245 SVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HGVLAVGYGFEGEDVDGKKYWIV 302
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
KNSW E WGD GY M KN CGIAT ASYP+V
Sbjct: 303 KNSWSEKWGDKGYILMAKDRKNHCGIATAASYPLV 337
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 140/218 (64%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 124 KSVDWREKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY G D C F+ ++VG +I G E +
Sbjct: 184 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKDSVGATDRGFADIPQGNEKK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV + PVSVA + + F+FY G+Y+ +C + +D H V+ VGYG E G Y
Sbjct: 244 MAEAVATIGPVSVAIDASHESFQFYSEGIYNEPECNSQNLD--HGVLVVGYGTDESGKDY 301
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM + N CGIA+ +SYP+V
Sbjct: 302 WLVKNSWGTTWGDKGFIKMARNEDNQCGIASASSYPLV 339
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 124/294 (42%), Positives = 164/294 (55%), Gaps = 20/294 (6%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V L+ E L+ +G+ H L F R+ Y+ E K + + F
Sbjct: 50 RMVWEKNLKKIEIHNLEHTMGKHSHRLGMNHFGDMTNEEFRQTMNGYKQTTERKFKGSLF 109
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
+ L K + +R ++PVKDQG CGSCW FSTTG++E + GK +SLSE
Sbjct: 110 MEPNYLQAP---KAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSE 166
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQV 205
Q LVDC++ N+GCNGGL QAF+YI+ N GLDTEE+YPY G D C + E
Sbjct: 167 QNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANE 226
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V+I G E + AV V PVSVA + + F+FY+SG+Y +C + +D H V+
Sbjct: 227 TGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELD--HGVL 284
Query: 265 AVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
VGYG E DG YW++KNSW E WGD GY M KN CGIAT +SYP+V
Sbjct: 285 VVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPLV 338
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 124/294 (42%), Positives = 165/294 (56%), Gaps = 20/294 (6%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V L+ E L+ +G+ + L F R+ Y+ E K + + F
Sbjct: 50 RMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQTTERKFKGSLF 109
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
+ L K + +R ++PVKDQG CGSCW FSTTG++E + GK +SLSE
Sbjct: 110 MEPNYLQAP---KAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSE 166
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQV 205
Q LVDC++ N+GCNGGL QAF+YI+ N GLDTEE+YPY G D C + E G
Sbjct: 167 QNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANE 226
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V+I G E + AV V PVSVA + + F+FY+SG+Y +C + +D H V+
Sbjct: 227 TGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELD--HGVL 284
Query: 265 AVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
VGYG E DG YW++KNSW E WGD GY M KN CGIAT +SYP+V
Sbjct: 285 VVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPLV 338
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 104/217 (47%), Positives = 135/217 (62%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TGSLE + + G +SLSEQ LV C+ F N
Sbjct: 121 KTVDWRTKGAVTPVKNQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFGN 180
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YI+ N G+DTE++YPY G DG C F VG V+I G+E +
Sbjct: 181 NGCEGGLMDDAFKYIRANKGIDTEKSYPYNGTDGTCHFKKSTVGATDSGFVDIKEGSETQ 240
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V P+SVA + + F+FY GVY +C + +D H V+ VGYG +G YW
Sbjct: 241 LKKAVATVGPISVAIDASHESFQFYSDGVYDEPECDSESLD--HGVLVVGYGTLNGTDYW 298
Query: 278 LIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+KNSWG WGD GY +M KN CGIA+ AS P+V
Sbjct: 299 FVKNSWGTTWGDEGYIRMSRNKKNQCGIASSASIPLV 335
>gi|195995651|ref|XP_002107694.1| hypothetical protein TRIADDRAFT_36902 [Trichoplax adhaerens]
gi|190588470|gb|EDV28492.1| hypothetical protein TRIADDRAFT_36902 [Trichoplax adhaerens]
Length = 544
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 113/301 (37%), Positives = 161/301 (53%), Gaps = 52/301 (17%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F FA ++ K Y+ E + R TF +NL I STN + L + + +N
Sbjct: 240 FHHFASKHQKNYKDERERRFRENTFRQNLRFIHSTNRQRLGFTVKVNHLADLTDNEIKVM 299
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVKDQG CGSCW+F TTG++E
Sbjct: 300 NGRKTSLKKSKTYQMPFNLTGLERYVAPTIDWRKLGAVTPVKDQGVCGSCWSFGTTGTIE 359
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGK 190
+ + GK +SLS+Q ++DC F N GC+GG +AFE+I +GG+ TE++Y Y +
Sbjct: 360 GSLYLKSGKLVSLSQQNMIDCTWGFGNNGCDGGEEFRAFEWIAKHGGIATEKSYGQYLAQ 419
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSS 249
DG CK + +G ++ V + G + L+ AV V PV+V + + F FY SG+Y
Sbjct: 420 DGKCKLNKTKIGAKIRGWVQVPHGNQSALKLAVSAVGPVAVGMDAALKSFSFYSSGIYYD 479
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCAS 309
+CGN D++HAV+AVGYG E+G YW+IKNSW +WGD GY K+ M N CGIAT AS
Sbjct: 480 KQCGNKEQDLDHAVLAVGYGNENGQDYWIIKNSWSTHWGDDGYVKLSMKNNNCGIATDAS 539
Query: 310 Y 310
+
Sbjct: 540 F 540
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW+FS+TGSLE + + G +SLSEQ LVDC+ + N
Sbjct: 123 KAVDWRQHGAVTSVKDQGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGN 182
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY G D C F+ VG V+I G E+
Sbjct: 183 NGCNGGLMDNAFRYIKDNGGVDTEKSYPYEGIDDSCHFNKATVGATDTGFVDIPQGDEEA 242
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
+ AV + PV+VA + + F+ Y GVY+ C + +D H V+ VGYG + DG Y
Sbjct: 243 MMKAVATMGPVAVAIDASNESFQLYSEGVYNDPNCSSDNLD--HGVLVVGYGTDKDGQDY 300
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD GY KM + N CGIAT +S+P V
Sbjct: 301 WLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSFPTV 338
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 112/250 (44%), Positives = 150/250 (60%), Gaps = 21/250 (8%)
Query: 66 ARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFS 125
+R GKIY F N L +S + +R ++PVKDQG CGSCW+FS
Sbjct: 100 TKREGKIY------------FPSNDKLPKSVD-----WRQKGAVTPVKDQGQCGSCWSFS 142
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
TGSLE GK +SLSEQ L+DC++ + N GC GGL +AF+Y+ N G+DTE +Y
Sbjct: 143 ATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSY 202
Query: 186 PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKS 244
PY +D C+F + VG V+I G E LQ+A+ V P+SVA + + F FY
Sbjct: 203 PYEARDYACRFKKDKVGGTDKGYVDIPEGDEKALQNALATVGPISVAIDASHESFHFYSE 262
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCG 303
GVY+ C + D++H V+AVGYG E+G YWL+KNSWG +WG+ GY K+ N CG
Sbjct: 263 GVYNEPYC--SSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGESGYIKIARNHSNHCG 320
Query: 304 IATCASYPVV 313
IA+ ASYP+V
Sbjct: 321 IASMASYPIV 330
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 106/208 (50%), Positives = 141/208 (67%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FSTTG+LE + + G +SLSEQ L+DC+ + N GCNGGL
Sbjct: 137 VTEVKDQGKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDN 196
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK NGG+DTE+ YPY G D C+++ +N G + + V+I G E++L AV V P
Sbjct: 197 AFKYIKDNGGIDTEKTYPYEGVDDKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGP 256
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGEN 286
VSVA + + F+FY GVY T+C +T D++H V+ VGYG ++ G YWL+KNSW
Sbjct: 257 VSVAIDASQNSFQFYSGGVYYDTECSST--DLDHGVLVVGYGTDEAGGDYWLVKNSWSRT 314
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY KM + N CGIAT ASYP+V
Sbjct: 315 WGELGYIKMARNRDNHCGIATDASYPLV 342
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 108/209 (51%), Positives = 138/209 (66%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW+FS TG+LE + GK ISLSEQ LVDC++ F N GC GGL
Sbjct: 130 VTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDF 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YI+ N G+DTE +YPY G DG C ++ +N G + V+I G+E +L+ AV V P
Sbjct: 190 AFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDIGFVDIKKGSEKDLKKAVAGVGP 249
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE--DGVPYWLIKNSWGE 285
+SVA + F+FY GVY +KC + +D H V+ VG+G + G YWL+KNSW E
Sbjct: 250 ISVAIDASHMSFQFYSHGVYVESKCSSEELD--HGVLVVGFGTDSVSGEDYWLVKNSWSE 307
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WGD GY KM K NMCGIA+ ASYPVV
Sbjct: 308 KWGDQGYIKMARNKENMCGIASSASYPVV 336
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 161/308 (52%), Gaps = 57/308 (18%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTN----CKGLSYRLGLN-------- 108
++ F + K Y+++EE RF F +N+ I N SY LG+N
Sbjct: 55 AWKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHE 114
Query: 109 ----------------------------------------ISPVKDQGHCGSCWTFSTTG 128
++ VK+QG CGSCW+FSTTG
Sbjct: 115 EFVKYNGLKKTSLKDGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSFSTTG 174
Query: 129 SLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
SLE + + GK +SLSE QLVDC+Q+F N+GCNGGL AF+YIK GGL++EE YPY
Sbjct: 175 SLEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYK 234
Query: 189 GKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVY 247
K G CKF V V++ G+E L+ AV V PVSVA + F+ Y GVY
Sbjct: 235 PKQGTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVY 294
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIA 305
+C + +D H V+ VGYG +D G YW++KNSWG WG+ GY KM KN CGIA
Sbjct: 295 DEPECSSEQLD--HGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRNKKNQCGIA 352
Query: 306 TCASYPVV 313
T ASYP+V
Sbjct: 353 TQASYPLV 360
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/220 (50%), Positives = 140/220 (63%), Gaps = 9/220 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
L +R ++PVKDQG CGSCW FSTTG+LE + GK +SLSEQ LVDC++ N+G
Sbjct: 120 LDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEG 179
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDEL 219
CNGGL QAF+Y+K GLD+EE+YPY G D C F +N V+I G E L
Sbjct: 180 CNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERAL 239
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGV 274
A+ V PVSVA + + F+FY+SG+Y +C + +D H V+AVGYG E DG
Sbjct: 240 MKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HGVLAVGYGFEGEDVDGK 297
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSW ENWGD GY M + N CGIAT ASYP+V
Sbjct: 298 KYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPLV 337
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 139/218 (63%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TGSLE + + GK +SLSEQ LVDC+ A N
Sbjct: 150 KSVDWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGN 209
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AFEY+K NGG+DTEE+YPY D C++ + G + V+I E
Sbjct: 210 SGCQGGLMDNAFEYVKENGGIDTEESYPYIAADDTCQYKPQYSGANITGYVDIPSRMEKA 269
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L+ AV V P+SVA + F+FY+SGVY +C + D++H V+AVGYGV+ Y
Sbjct: 270 LEKAVATVGPISVAIDAGHSSFQFYRSGVYYEPECSSE--DLDHGVLAVGYGVQGKNGKY 327
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W++KNSWGE WGD GY M + N CGIAT ASYP V
Sbjct: 328 WIVKNSWGEEWGDSGYILMARDRNNHCGIATAASYPEV 365
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/245 (45%), Positives = 157/245 (64%), Gaps = 13/245 (5%)
Query: 75 SVEEMKLRFATFSK--NLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEA 132
+V E +L ATF + N++L +S + +R ++ +KDQG CGSCW FS+TG+LE
Sbjct: 135 TVSEEQLIGATFIEPANVELPKSVD-----WRKKGAVTAIKDQGQCGSCWAFSSTGALEG 189
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
+ + G +SLSEQ L+DC+ + N GCNGGL AF YIK N GLDTE++YPY ++
Sbjct: 190 QHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAEND 249
Query: 193 VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTK 251
C+++ +N G + V+I G ED+L+ AV + P+SVA + + F FY GVY +
Sbjct: 250 QCRYNPKNSGASDVGFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVYYEPE 309
Query: 252 CGNTPMDVNHAVVAVGYGVEDGV--PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCA 308
C +P +++H V+ VGYG + G YWL+KNSWGE WG+ GY KM K N CGIA+ A
Sbjct: 310 C--SPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARNKENHCGIASSA 367
Query: 309 SYPVV 313
SYP+V
Sbjct: 368 SYPLV 372
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 113/229 (49%), Positives = 144/229 (62%), Gaps = 9/229 (3%)
Query: 92 LIRSTNCK---GLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
I S N K + +R ++PVK+QG CGSCW FS+TGSLE + GK I LSEQ
Sbjct: 106 FIPSKNVKLPTHVDWREDGAVTPVKNQGQCGSCWAFSSTGSLEGQTFRKTGKLIPLSEQN 165
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS 208
LVDC++ + N GC GGL AF YI+ N G+DTE +YPY G G C + G +
Sbjct: 166 LVDCSRKYGNNGCEGGLMDFAFTYIRDNKGIDTEGSYPYEGVGGRCHYDPSKKGSSDIGF 225
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
V++ G+E+EL AV V PVSVA + F+FY GVY +KC +P +++H V+ VG
Sbjct: 226 VDVKKGSEEELLKAVASVGPVSVAIDASHMSFQFYSHGVYFESKC--SPENLDHGVLVVG 283
Query: 268 YGVED--GVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YG ++ G YWL+KNSW ENWGD GY KM KNMCGIA+ ASYPVV
Sbjct: 284 YGTDENSGEDYWLVKNSWSENWGDQGYIKMARNKKNMCGIASSASYPVV 332
>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
Length = 310
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/220 (50%), Positives = 140/220 (63%), Gaps = 9/220 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
L +R ++PVKDQG CGSCW FSTTG+LE + GK +SLSEQ LVDC++ N+G
Sbjct: 93 LDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEG 152
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDEL 219
CNGGL QAF+Y+K GLD+EE+YPY G D C F +N V+I G E L
Sbjct: 153 CNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERAL 212
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGV 274
A+ V PVSVA + + F+FY+SG+Y +C + +D H V+AVGYG E DG
Sbjct: 213 MKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HGVLAVGYGFEGEDVDGK 270
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSW ENWGD GY M + N CGIAT ASYP+V
Sbjct: 271 KYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPLV 310
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 124/294 (42%), Positives = 167/294 (56%), Gaps = 20/294 (6%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V L+ E L+ +G + L RF R+ Y+ +E + R + F
Sbjct: 49 RMVWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYKHKKERRFRGSLF 108
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
+ + + N L +R ++PVKDQG CGSCW FSTTG++E + GK +SLSE
Sbjct: 109 MEP-NFLEVPNS--LDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSE 165
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQV 205
Q LVDC++ N+GCNGGL QAF+YIK GLD+EE+YPY G D C + +
Sbjct: 166 QNLVDCSRPEGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAAND 225
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V+I G E L A+ V PVSVA + + F+FY+SG+Y +C + +D H V+
Sbjct: 226 TGFVDIPSGKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HGVL 283
Query: 265 AVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
AVGYG E DG YW++KNSW ENWGD GY M + N CGIAT ASYP+V
Sbjct: 284 AVGYGFEGEDVDGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPLV 337
>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
Length = 331
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/208 (50%), Positives = 138/208 (66%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QGHCGSCW+FS TGSLE + ++ GK +SLSEQ L+DC++ N GC GGL
Sbjct: 126 VTPVKNQGHCGSCWSFSATGSLEGQHFKSTGKLVSLSEQNLIDCSKKEGNHGCKGGLMDF 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AFEYI+ N G+DTE++YPYT KDG+ C+F +VG V++ +E LQ AV V
Sbjct: 186 AFEYIQKNDGIDTEQSYPYTAKDGIECRFKKADVGATDKGKVDLPRQSEKALQEAVATVG 245
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
P+SVA + F+ YK G+Y+ C +T +D H V+AVGYG E YWL+KNSWG
Sbjct: 246 PISVAMDAGHRSFQLYKRGIYTEPMCSSTKLD--HGVLAVGYGSEGEGDYWLVKNSWGAT 303
Query: 287 WGDHGYFKMEMG-KNMCGIATCASYPVV 313
WG G+F + +N CGIAT ASYP V
Sbjct: 304 WGMEGFFMLARNHRNECGIATQASYPKV 331
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 139/218 (63%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 124 KSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY G D C F+ +G V+I G E++
Sbjct: 184 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
++ AV + PVSVA + + F+ Y GVY+ +C +D H V+ VGYG E G+ Y
Sbjct: 244 MKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLD--HGVLVVGYGTDESGMDY 301
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WG+ GY KM + N CGIAT +SYP V
Sbjct: 302 WLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339
>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 102/207 (49%), Positives = 134/207 (64%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FSTTG LE + + GK +SLSEQQL+DC+ +F N GCNGG +
Sbjct: 130 VTEVKDQKQCGSCWAFSTTGVLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNGCNGGSVKR 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A +YI+ NGG+DTE +YPY K C++ + +G + V++ E+ L+ AV + P
Sbjct: 190 ALQYIQANGGIDTETSYPYKAKGQRCRYKPDGIGAKCTGYVHVKPSNEETLKKAVATLGP 249
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + F+FY+SGVY C T +D H +AVGYG E+G YWLIKNSWG W
Sbjct: 250 ISVGIDASRHSFQFYQSGVYDDPDCSKTVLD--HGALAVGYGTENGHDYWLIKNSWGLRW 307
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GD GY KM K N CGIA+ ASYP+V
Sbjct: 308 GDKGYIKMSRNKSNQCGIASEASYPLV 334
>gi|23306947|dbj|BAC16538.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 166/294 (56%), Gaps = 20/294 (6%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V LR E L+ +G + L F R+ Y+ E +++ + F
Sbjct: 48 RVVWEKNLRKIEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHKAERRVKGSLF 107
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
+ + I + K + YR +PVKDQG CGSCW FSTTG++E + GK +SLSE
Sbjct: 108 MEP-NFIEAP--KKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSE 164
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQV 205
Q LVDC++ N+GCNGGL QAF+YIK NGGLDTE+AYPY G D C + +
Sbjct: 165 QNLVDCSRPEGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAAND 224
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V+I G E L AV V PVSVA + + F+FY SG+Y +C +T +D H V+
Sbjct: 225 TGFVDIPEGKERALMKAVAAVGPVSVAIDAGHESFQFYHSGIYFEKECSSTELD--HGVL 282
Query: 265 AVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
VGYG E DG YW++KNSW E WGD GY M KN CGIAT ASYP++
Sbjct: 283 VVGYGFEGEDVDGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPLM 336
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 123/294 (41%), Positives = 164/294 (55%), Gaps = 20/294 (6%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V L+ E L+ +G+ + L F R+ Y+ E K + + F
Sbjct: 50 RMVWEKNLKKIEIHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQTTERKFKGSLF 109
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
+ L K + +R ++PVKDQG CGSCW FSTTG++E + GK +SLSE
Sbjct: 110 MEPNYLQAP---KAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSE 166
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQV 205
Q LVDC++ N+GCNGGL QAF+YI+ N GLDTEE+YPY G D C + E
Sbjct: 167 QNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANE 226
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V+I G E + AV V PVSVA + + F+FY+SG+Y +C + +D H V+
Sbjct: 227 TGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELD--HGVL 284
Query: 265 AVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
VGYG E DG YW++KNSW E WGD GY M KN CGIAT +SYP+V
Sbjct: 285 VVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPLV 338
>gi|2239109|emb|CAA70694.1| cathepsin S-like cysteine proteinase [Heterodera glycines]
Length = 353
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 112/248 (45%), Positives = 155/248 (62%), Gaps = 9/248 (3%)
Query: 73 YESVEEMKLRFATFSKNLDLI---RSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGS 129
Y + +++R N+ + ST + L +R ++ VKDQG CGSCW FS TG+
Sbjct: 108 YNRIRGLQMRSNRQRHNMATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAFSATGA 167
Query: 130 LEAAYHQAFG-KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
+E A Q K ISLSEQ LVDC+ + N+GC+GGL AFEY++ N GLDTEE+YPY
Sbjct: 168 IEGALAQKKASKIISLSEQNLVDCSSKYGNEGCDGGLMDSAFEYVRDNNGLDTEESYPYE 227
Query: 189 GKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVY 247
G C+F +E VG V+ ++ G E++L+ AV + P+SVA + + F+FYK+GVY
Sbjct: 228 AVTGKCQFKNETVGGTVVSFKDLKKGDEEQLKIAVATIGPISVALDASNLSFQFYKTGVY 287
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGV-PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIA 305
C N +D H V+ VGYG ++ YWL+KNSWG +WG++GY ++ K N CGIA
Sbjct: 288 YERWCSNRYLD--HGVLLVGYGTDETHGDYWLVKNSWGPHWGENGYIRIARNKQNHCGIA 345
Query: 306 TCASYPVV 313
T ASYPVV
Sbjct: 346 TMASYPVV 353
>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 107/233 (45%), Positives = 145/233 (62%), Gaps = 9/233 (3%)
Query: 83 FATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI 142
F + +DL + + + Y ++ VKDQ CGSCW FS TG+LE + + G +
Sbjct: 109 FLRLPEGIDLPDAVDWREQGY-----VTGVKDQKQCGSCWAFSATGALEGQHFRKTGILV 163
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVG 202
SLSEQQLVDC+ A+ N+GCNGG AF YI+ NGG+DTE +YPY +D +C+++ +VG
Sbjct: 164 SLSEQQLVDCSGAYGNEGCNGGWMDSAFRYIEANGGIDTEASYPYEAEDWLCRYNPASVG 223
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNH 261
V++ E+ L+ AV + PVSVA + F+FY SGVY C + +D H
Sbjct: 224 ATCSGYVDVNKYDEEALKEAVATIGPVSVAIDASHASFQFYTSGVYDEPGCSSIELD--H 281
Query: 262 AVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
V+AVGYG E+G YWL+KNSWG WG+ GY KM K N CGIA+ ASYP+V
Sbjct: 282 GVLAVGYGTENGHDYWLVKNSWGRGWGEMGYIKMSRNKHNQCGIASAASYPLV 334
>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
Length = 530
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 122/304 (40%), Positives = 163/304 (53%), Gaps = 52/304 (17%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F +F Y K+Y EE RFAT+ +N ++I + N + SY+L +N
Sbjct: 227 FEQFKTTYDKVYAHDEEHSERFATYKQNREMIIAHNTQESSYKLAMNHFGDMTAEEFELK 286
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++ VKDQG CGSCWTF +TGSLE
Sbjct: 287 IKPRVPRPDTNGAHDVHDNDRTINLPATVDWRQQGCVTRVKDQGVCGSCWTFGSTGSLEG 346
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
A GK +SLSEQQLVDCA +QGCNGG S AF+YI GG+ E YPY ++G
Sbjct: 347 VSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYIMNFGGIAYESTYPYLMQNG 406
Query: 193 VCKFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSST 250
CK SS + ++V VN+T +E LQ+AV V PV++A + FRFY SGVY S+
Sbjct: 407 YCKDSSSQLSNIKVKSYVNVTSFSEPALQNAVATVGPVAIAIDASAPDFRFYSSGVYYSS 466
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCAS 309
C N D++H V+AVGYG +G YW++KNSW ++G GY M + N CG+A+ +
Sbjct: 467 VCKNGLDDLDHEVLAVGYGTLNGADYWIVKNSWSTHYGAEGYILMSRNRGNNCGVASQPT 526
Query: 310 YPVV 313
YPVV
Sbjct: 527 YPVV 530
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 133/348 (38%), Positives = 193/348 (55%), Gaps = 49/348 (14%)
Query: 8 VSSVILLLCCAAAASASASSFD----DSNPIRLVSSDGLRDFETSV-----LQVIGQARH 58
+ S+++LLC AAASA S FD + N ++ + + +++ V +++ + +H
Sbjct: 1 MRSLVILLCVVAAASA-VSFFDLVKEEWNAFKM---EHQKQYDSEVEDKFRMKIYAENKH 56
Query: 59 AL------------SFARFARRYGKI--YESVEEMKLRFATFSKNLDLI--RSTNCKGLS 102
+ SF +YG + +E V M F +KN + +S +G +
Sbjct: 57 NIAKHNQKYARGEVSFRLKQNKYGDMLHHEFVHTMN-GFNKTTKNSKGLFGKSAGERGAT 115
Query: 103 YRLGLNI--------------SPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
+ N+ + VKDQG CGSCW+FS+TG+LE +++ +SLSEQ
Sbjct: 116 FITPANVHLPDHVDWRKHGAVTEVKDQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQN 175
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS 208
L+DC+ A+ N GCNGGL AF+YIK N G+DTE++YPY G D C+++ +N G
Sbjct: 176 LIDCSAAYGNNGCNGGLMDNAFKYIKDNRGIDTEKSYPYEGIDDKCRYNPKNTGADDNGF 235
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
V+I G E +L AV V PVSVA + F+FY GVY C ++ +D H V+ VG
Sbjct: 236 VDIPSGDEGKLMAAVATVGPVSVAIDASQSSFQFYSDGVYFDENCSSSSLD--HGVLVVG 293
Query: 268 YGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YG E+G YWL+KNSWG +WGD GY KM + N CGIAT ASYP+V
Sbjct: 294 YGTDENGGDYWLVKNSWGRSWGDLGYIKMARNRDNHCGIATAASYPLV 341
>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
Length = 344
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 106/216 (49%), Positives = 142/216 (65%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FSTTG+LE + + G +SLSEQ L+DC+ A+ N G
Sbjct: 131 VDWRKKGAVTDVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF+YIK NGG+DTE++YPY D C+++ + G + V+I G E++L
Sbjct: 191 CNGGLMDNAFKYIKDNGGIDTEKSYPYEAVDDKCRYNPKESGADDVGFVDIPQGDEEKLM 250
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWL 278
AV V P+SVA + + F+FY GVY C +T D++H V+ VGYG EDG WL
Sbjct: 251 QAVATVGPISVAIDASQETFQFYSKGVYYDENCSST--DLDHGVMVVGYGTEEDGSDDWL 308
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG +WG+ GY KM K N CGIA+ ASYP+V
Sbjct: 309 VKNSWGRSWGELGYIKMARNKNNHCGIASSASYPLV 344
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 123/294 (41%), Positives = 164/294 (55%), Gaps = 20/294 (6%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V L+ E L+ +G+ + L F R+ Y+ E K + + F
Sbjct: 50 RMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQTTERKFKGSLF 109
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
+ L K + +R ++PVKDQG CGSCW FSTTG++E + GK +SLSE
Sbjct: 110 MEPNYLQAP---KAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSE 166
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQV 205
Q LVDC++ N+GCNGGL QAF+YI+ N GLDTEE+YPY G D C + E G
Sbjct: 167 QNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANE 226
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V+I G E + AV V PVSVA + + F+FY+ G+Y +C + +D H V+
Sbjct: 227 TGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELD--HGVL 284
Query: 265 AVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
VGYG E DG YW++KNSW E WGD GY M KN CGIAT +SYP+V
Sbjct: 285 VVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPLV 338
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+T +LE + + G +SLSEQ LVDC+ + N
Sbjct: 122 KSVDWRQHGAVTAVKDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGN 181
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY G D C F+ VG V+I G E+
Sbjct: 182 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEA 241
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPY 276
L AV + PVSVA + + F+ Y GVY+ +C +D H V+ VGYG + G+ Y
Sbjct: 242 LMKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDAQNLD--HGVLVVGYGTDKTGLDY 299
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD GY KM + N CGIAT +SYP V
Sbjct: 300 WLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSYPTV 337
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 111/218 (50%), Positives = 140/218 (64%), Gaps = 7/218 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FSTTGS+E + +A GK +SLSEQ LVDC+ +
Sbjct: 120 KTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSG--RD 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GG +AF+YI GG+DTE +YPY DG C F NVG V ++T G+E
Sbjct: 178 AGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKCHFKKANVGATVTGYTDVTSGSEKA 237
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
LQ AV V P+SVA + F+ YKSGVY+ C +T +D H V+AVGYG DG Y
Sbjct: 238 LQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLD--HGVLAVGYGTSSDGTDY 295
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W++KNSW E WG +GY M K N CGIAT ASYP+V
Sbjct: 296 WIVKNSWAETWGMNGYVWMSRNKDNQCGIATNASYPLV 333
>gi|55740404|gb|AAV63978.1| cathepsin L2 precursor [Artemia franciscana]
Length = 226
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 105/215 (48%), Positives = 137/215 (63%), Gaps = 2/215 (0%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK QG C SCW FS+TG+LE+ + GK ISLSEQ L+DC+ + N G
Sbjct: 12 VDWREKGAVTPVKYQGQCASCWAFSSTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLG 71
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG SQAFEYIK N G+DTE Y Y K+ C+ + N G L VNI G ED+L+
Sbjct: 72 CKGGWISQAFEYIKDNKGIDTENKYHYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLK 131
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVS +V +GF+FY GVY C + +NHAV+ +GYG ++G YWL+
Sbjct: 132 AAVATVGPVSAVIDVSHEGFQFYSKGVYYEPSCKTSFEHLNHAVLVIGYGSDNGEDYWLV 191
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
KNSW ++WGD GY K+ KN CG+AT A YP+V
Sbjct: 192 KNSWSKHWGDEGYLKIARNRKNHCGVATAALYPIV 226
>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 118/274 (43%), Positives = 152/274 (55%), Gaps = 22/274 (8%)
Query: 55 QARHALSFARFA------RRYGKIYESVEEMKLRFATFSKN---LDLIRSTNCKGLSYRL 105
Q +H + A A + ++ KLR + LDL +S + + Y
Sbjct: 68 QGKHGFAMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFREPLFLDLPKSVDWRKKGY-- 125
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++PVK+Q CGSCW FS TG+LE + GK +SLSEQ LVDC++ NQGCNGG
Sbjct: 126 ---VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGF 182
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
+ AF Y+K NGGLD+EE+YPY DG+CK+ SEN + G E L AV
Sbjct: 183 MNSAFRYVKENGGLDSEESYPYVAMDGICKYRSENSVANDTGFEVVPAGKEKALMKAVAT 242
Query: 226 VRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIK 280
V P+SVA + F+FYKSG+Y C + +D H V+ VGYG E D YWL+K
Sbjct: 243 VGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGYGFEGANSDNNKYWLVK 300
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
NSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 301 NSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|109112413|ref|XP_001106814.1| PREDICTED: cathepsin L2 isoform 3 [Macaca mulatta]
gi|297271422|ref|XP_002800251.1| PREDICTED: cathepsin L2 [Macaca mulatta]
Length = 334
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 118/274 (43%), Positives = 151/274 (55%), Gaps = 22/274 (8%)
Query: 55 QARHALSFARFA------RRYGKIYESVEEMKLRFATFSKN---LDLIRSTNCKGLSYRL 105
Q +H + A A + ++ KLR + LDL +S + + Y
Sbjct: 68 QGKHGFTMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFREPLFLDLPKSVDWRKKGY-- 125
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++PVK+Q CGSCW FS TG+LE + GK +SLSEQ LVDC+ NQGCNGG
Sbjct: 126 ---VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGF 182
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
+ AF Y+K NGGLD+EE+YPY DG+CK+ SEN + G E L AV
Sbjct: 183 MNSAFRYVKENGGLDSEESYPYVAMDGICKYRSENSVANDTGFKVVPAGKEKALMKAVAT 242
Query: 226 VRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIK 280
V P+SVA + F+FYKSG+Y C + +D H V+ VGYG E D YWL+K
Sbjct: 243 VGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGYGFEGANSDNNKYWLVK 300
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
NSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 301 NSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 105/235 (44%), Positives = 151/235 (64%), Gaps = 8/235 (3%)
Query: 85 TFSKNLDLIRSTNC---KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG 141
T + + ++S N K + +R ++PVK+QG CGSCW+FS TGSLE + + G
Sbjct: 109 TNDEGVTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVL 168
Query: 142 ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV 201
+SLSEQ L+DC++ + N GC GGL AF+YIK N GLDTE++YPY +D C+++ +N
Sbjct: 169 VSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPDNS 228
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVN 260
G V+I G E+ L HA+ V PVS+A + + F+FYK GV+ + +C +T +D
Sbjct: 229 GATDNGFVDIPEGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELD-- 286
Query: 261 HAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
H V+AVG+ + G YW++KNSWG+ WGD GY M KN CG+A+ ASYP+V
Sbjct: 287 HGVLAVGFRTDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/265 (41%), Positives = 161/265 (60%), Gaps = 8/265 (3%)
Query: 54 GQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GLSYRLGLNIS 110
G+ + L F ++++ ++K R A + ++ R+T K + +R ++
Sbjct: 67 GEVSYKLKMNHFGDLMQHEFKALNKLK-RSAKQQNSGEVFRATGGKLPAKVDWRQKGAVT 125
Query: 111 PVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAF 170
PVKD G CGSCW FS+TGSL K +SLSEQQLVDC+ + N GC+GG+ QAF
Sbjct: 126 PVKDPGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAF 185
Query: 171 EYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVS 230
+YIK NGG+DTE +YPY +D C++ +++V V+I G E+ L+ AV + P+S
Sbjct: 186 QYIKGNGGIDTEGSYPYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPIS 245
Query: 231 VAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGD 289
VA + + F+FY G+Y C NT +D H V+ VGYG E+G YWL+KNSWG +WG+
Sbjct: 246 VAIDAGNLSFQFYSEGIYDEPFCSNTELD--HGVLVVGYGTENGQDYWLVKNSWGPSWGE 303
Query: 290 HGYFKMEMG-KNMCGIATCASYPVV 313
+GY K+ N CGIA+ ASYP+V
Sbjct: 304 NGYIKIARNHNNHCGIASMASYPIV 328
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 121/315 (38%), Positives = 159/315 (50%), Gaps = 62/315 (19%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGL----SYRLGLN------- 108
+++ +F + K+Y +EE LR F+ N I+ N S+ +G+N
Sbjct: 39 VAWHKFKLEHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGEKSFTVGVNEFADMTV 98
Query: 109 ------------------------------------------ISPVKDQGHCGSCWTFST 126
+S VK+QG CGSCW FST
Sbjct: 99 HEFAQMMNGLKPDSTRVSGSTYLSPNIDAPLPVEVDWRTKGLVSEVKNQGSCGSCWAFST 158
Query: 127 TGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYP 186
TGSLE + + G + LSEQ LVDC+ ++ N GCNGGL + AF+YIK N G+DTEEAYP
Sbjct: 159 TGSLEGQHMRKTGTMVDLSEQNLVDCSTSYGNDGCNGGLMTNAFKYIKDNKGIDTEEAYP 218
Query: 187 YTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
Y G+DG CKF VG V V I G E +LQ A+ V PVSVA + F YKSG
Sbjct: 219 YAGRDGDCKFKKNKVGATVTGFVEIPAGNEKKLQEALATVGPVSVAIDANHQSFMLYKSG 278
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK------ 299
VY +C + +D H V+AVGYG G Y+++KNSWG WG+ GY +
Sbjct: 279 VYDEPECDSAQLD--HGVLAVGYGSIHGKDYYIVKNSWGTTWGEQGYIRFSTTAVPDAIG 336
Query: 300 NMCGIATCASYPVVA 314
+CGI ASYPV+A
Sbjct: 337 GICGILLDASYPVIA 351
>gi|34850847|dbj|BAC87861.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 166/294 (56%), Gaps = 20/294 (6%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V LR E L+ +G + L F R+ Y+ E +++ + F
Sbjct: 48 RVVWEKNLRKIEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHKAERRVKGSLF 107
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
+ + I + K + YR +PVKDQG CGSCW FSTTG++E + GK +SLSE
Sbjct: 108 MEP-NFIEAP--KKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSE 164
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQV 205
Q LVDC++ N+GCNGGL QAF+YIK NGGLDTE+AYPY G D C + +
Sbjct: 165 QNLVDCSRPEGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAAND 224
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
V+I G E L AV V PVSVA + + F+FY SG+Y +C +T +D H V+
Sbjct: 225 TGFVDIPEGKERALMKAVAAVGPVSVAIDAGHECFQFYHSGIYFEKECSSTELD--HGVL 282
Query: 265 AVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
VGYG E DG YW++KNSW E WGD GY M KN CGIAT ASYP++
Sbjct: 283 VVGYGFEGEDVDGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPLM 336
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 114/263 (43%), Positives = 155/263 (58%), Gaps = 12/263 (4%)
Query: 56 ARHALSFARFA----RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISP 111
++ L +FA Y KIY K+ A N ++I T + +R +S
Sbjct: 72 SKTVLGLTQFADLTNEEYRKIYLGT---KVNVAPEKHNFNMIHFTGPDSIDWRTKGAVSH 128
Query: 112 VKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFE 171
VKDQG CGSCW+FSTTGS+E A+ G ++LSEQ LVDC+ F N GC+GGL AF+
Sbjct: 129 VKDQGQCGSCWSFSTTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFK 188
Query: 172 YIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSV 231
+I GG+ TE++YPY G CKF+ VG + IT G+E ELQ A+ +PVS+
Sbjct: 189 FIMSQGGVATEDSYPYNAVQGKCKFTKSMVGANISGYKEITQGSELELQAAL-TKQPVSI 247
Query: 232 AFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDH 290
A + F+ YKSGVY +C + +D H V+AVGYG E+G Y+++KNSW ++WG
Sbjct: 248 AIDASQQSFQLYKSGVYDEPECSSYQLD--HGVLAVGYGTENGKDYYIVKNSWADSWGQD 305
Query: 291 GY-FKMEMGKNMCGIATCASYPV 312
GY F KN CG+AT ASYP+
Sbjct: 306 GYIFMSRNAKNQCGVATMASYPI 328
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 167/296 (56%), Gaps = 24/296 (8%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V L+ E L+ +G+ + L F R+ Y+ E K + + F
Sbjct: 48 RMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRKSERKFKGSLF 107
Query: 87 SKN--LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
+ L+ RS + + Y ++PVKDQG CGSCW FSTTG++E + + GK +SL
Sbjct: 108 MEPNFLEAPRSVDWRDNGY-----VTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSL 162
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGV 203
SEQ LVDC++ N+GCNGGL QAF+YIK N GLD+E++YPY G D C + +
Sbjct: 163 SEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSA 222
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHA 262
++I G E L AV V PVSVA + + F+FY+SG+Y +C + +D H
Sbjct: 223 NDTGFIDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HG 280
Query: 263 VVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
V+ VGYG E DG YW++KNSW E WGD GY M KN CGIAT ASYP+V
Sbjct: 281 VLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 140/218 (64%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++ VKDQGHCGSCW+FS TG+LE + + K +SLSEQ LVDC+ F N
Sbjct: 124 ENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCSTKFGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF+Y+KYN G+DTE +YPY D C ++ + G V+I G E++
Sbjct: 184 DGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKCHYNPKTSGATDRGFVDIPTGDEEK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L AV V PVSVA + + F+ Y GVY +C + +D H V+ VGYG E+G Y
Sbjct: 244 LMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELD--HGVLVVGYGTDENGQDY 301
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W++KNSWGE+WG+ GY KM + N CGIAT ASYP+V
Sbjct: 302 WIVKNSWGESWGEQGYIKMARNRDNNCGIATQASYPLV 339
>gi|449679414|ref|XP_002161570.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 353
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 161/305 (52%), Gaps = 53/305 (17%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
+ RF ++GK Y + +E ++ + KN + I + N + S+ +G+N
Sbjct: 49 WRRFKIKFGKFYSNQDEETSKYLNWKKNNENIINHNSENHSFEIGINQFSDLTHEEFMKI 108
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVK+QG C SCW FSTTG+LE
Sbjct: 109 HGGCLKLSKSIVNFTKEFSLPNKVNIPDKVDWRTEGYVTPVKNQGLCRSCWAFSTTGALE 168
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+ G +LSEQ LVDC++++ NQGC+GG + AFEYIK N GLD+E YPY K+
Sbjct: 169 GQTFRKTGILPTLSEQNLVDCSKSYGNQGCDGGWTNNAFEYIKDNDGLDSENGYPYDAKE 228
Query: 192 -GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSS 249
G C + + V I G ED L+ AV V P++V + F+ YKSGVY+
Sbjct: 229 LGYCYYDEKYKEASDSGFVEIPYGDEDALKEAVATVGPIAVNIDASKPSFQSYKSGVYNE 288
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCA 308
CGN ++ HAV+ VGYG E G +WL+KNSWG+ WGDHGY KM K N CGIAT A
Sbjct: 289 PTCGNGITNLTHAVLVVGYGTEKGHKFWLVKNSWGKTWGDHGYIKMSRNKSNQCGIATRA 348
Query: 309 SYPVV 313
S+P+V
Sbjct: 349 SFPLV 353
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/341 (36%), Positives = 177/341 (51%), Gaps = 38/341 (11%)
Query: 7 LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDF--ETSVLQVIGQARHALSFAR 64
L++ +I L+ A S S ++ N +L D ET +++ + +H + A+
Sbjct: 5 LITLLIALVAMTQAVSYSELVREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHI--AK 62
Query: 65 FARRY--GKI--------YESVEEMKLRFATFSKNLDL---IRSTN-------------- 97
+RY G++ Y + + R N L +RST+
Sbjct: 63 HNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPEHV 122
Query: 98 --CKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQA 155
+ +R ++ VKDQGHCGSCW FS+TG++E + + G +SLSEQ LVDC+
Sbjct: 123 KLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCSTK 182
Query: 156 FNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGA 215
+ N GCNGGL AF Y+K NGG+DTE++Y Y G D C F ++G +I G
Sbjct: 183 YGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRGFADIPQGN 242
Query: 216 EDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DG 273
E +L AV + PVSVA + F+FY GVY C +D H V+ VGYG E DG
Sbjct: 243 EKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLD--HGVLVVGYGTEKDG 300
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 301 SDYWLVKNSWGTTWGDKGFIKMSRNKENQCGIASASSYPLV 341
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 121/309 (39%), Positives = 158/309 (51%), Gaps = 60/309 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN--------- 108
+A F +GK Y + EE+ R A + N+ +IR N + GL +Y LGLN
Sbjct: 28 WALFKTTFGKQYSTAEEITRRLA-WEANVAIIRQHNLEHDLGLHTYTLGLNNYADLTNAE 86
Query: 109 ------------------------------------------ISPVKDQGHCGSCWTFST 126
++P+KDQG CGSCW FS+
Sbjct: 87 FNQVMNGLRVNASQTKSANRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWAFSS 146
Query: 127 TGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYP 186
TGSLE + G+ +SLSEQ L DC+Q N GCNGGL QAF YIK N G+DTE +YP
Sbjct: 147 TGSLEGQHFAKTGQLVSLSEQNLTDCSQKQGNMGCNGGLMDQAFTYIKENNGIDTESSYP 206
Query: 187 YTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSG 245
Y D C F + +VG +I E+ LQ A+ V P+SVA + F+ Y+SG
Sbjct: 207 YKAVDEKCHFKAADVGATDTGYTDIAQQDENALQSAIATVGPISVAIDASHSSFQLYRSG 266
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGI 304
Y+ C T +D H V+AVGY EDG Y+++KNSWG +WG GY M K N CGI
Sbjct: 267 AYNERACSATQLD--HGVLAVGYDSEDGKDYYIVKNSWGTSWGQKGYIWMTRNKNNQCGI 324
Query: 305 ATCASYPVV 313
AT ++YP V
Sbjct: 325 ATMSTYPTV 333
>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
Length = 295
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/215 (48%), Positives = 136/215 (63%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS G+LE + + GK +SLSEQ LVDC++++ N G
Sbjct: 83 VDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHFRKTGKLVSLSEQNLVDCSKSYGNNG 142
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG+ AF+YIK N G DTE YPY DG+C+F E VG ++ G E +++
Sbjct: 143 CNGGVMDYAFKYIKDNDGDDTEACYPYEAVDGMCRFKRECVGATCRGYTDLPWGNEVKMK 202
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV LV PVSVA + F YK GVY +C +P ++H V+ VGYG E G+ YWL+
Sbjct: 203 EAVALVGPVSVAIDASHSSFMSYKGGVYVEKEC--SPYQLDHGVLVVGYGTEQGLDYWLV 260
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
KNSWG WGD GY KM N CGIA+ A YP+V
Sbjct: 261 KNSWGTTWGDQGYIKMARNMHNHCGIASMACYPLV 295
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 141/218 (64%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QGHCGSCW+FSTTG+LE + G+ +SLSEQ L+DC+ ++ N
Sbjct: 121 KSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGN 180
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF YIK N G+DTEE+YPY GK G C++ E+ + V+I G E
Sbjct: 181 NGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGNERA 240
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPY 276
L A+ + PVSVA + + F+FY GVY+ C + +D H V+AVGYG +DG Y
Sbjct: 241 LAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLD--HGVLAVGYGTTDDGQDY 298
Query: 277 WLIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
++IKNSWGE WG GY M KN CG+AT ASYP+V
Sbjct: 299 YIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPLV 336
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 141/218 (64%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QGHCGSCW+FSTTG+LE + G+ +SLSEQ L+DC+ ++ N
Sbjct: 116 KSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF YIK N G+DTEE+YPY GK G C++ E+ + V+I G E
Sbjct: 176 NGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGNERA 235
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPY 276
L A+ + PVSVA + + F+FY GVY+ C + +D H V+AVGYG +DG Y
Sbjct: 236 LAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLD--HGVLAVGYGTTDDGQDY 293
Query: 277 WLIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
++IKNSWGE WG GY M KN CG+AT ASYP+V
Sbjct: 294 YIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPLV 331
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon queenslandica]
Length = 327
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 103/206 (50%), Positives = 136/206 (66%), Gaps = 4/206 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++P+K+QG CGSCW+FS+TGSLE + G +SLSEQQL+DC+ + N GCNGGL
Sbjct: 121 VTPIKNQGQCGSCWSFSSTGSLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDN 180
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
+F Y+K G +TE+ YPYT ++GVC++ S V V+I G ED L+ AV V P
Sbjct: 181 SFRYLKSVAGDETEDNYPYTAENGVCRYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGP 240
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y SGVY ++ C +T +D H V+A+GYG EDG YWL+KNSWG +W
Sbjct: 241 ISVAIDASHSSFQLYNSGVYYASTCSSTQLD--HGVLAIGYGTEDGKDYWLVKNSWGTSW 298
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPV 312
G GY KM + N CGIAT ASYP
Sbjct: 299 GMEGYIKMSRNRNNNCGIATQASYPT 324
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 167/296 (56%), Gaps = 24/296 (8%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V L+ E L+ +G+ + L F R+ Y+ E K + + F
Sbjct: 48 RMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRKSERKFKGSLF 107
Query: 87 SKN--LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
+ L+ RS + + Y ++PVKDQG CGSCW FSTTG++E + + GK +SL
Sbjct: 108 MEPNFLEAPRSVDWRDNGY-----VTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSL 162
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGV 203
SEQ LVDC++ N+GCNGGL QAF+YIK N GLD+E++YPY G D C + +
Sbjct: 163 SEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSA 222
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHA 262
++I G E L AV V PVSVA + + F+FY+SG+Y +C + +D H
Sbjct: 223 NDTGFIDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HG 280
Query: 263 VVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
V+ VGYG E DG YW++KNSW E WGD GY M KN CGIAT ASYP+V
Sbjct: 281 VLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/306 (39%), Positives = 166/306 (54%), Gaps = 61/306 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KG-LSYRLGLN------------ 108
F +Y ++Y+S E + R F++N I N KG +SY +G+N
Sbjct: 70 FLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDKTNSELDV 129
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
++PVK+QG CGSCW FS TG +E
Sbjct: 130 LRGFRHSSKASRSGSQYIPFDAAPPAEVDWRTKGAVTPVKNQGDCGSCWAFSATGGIEGQ 189
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY----TG 189
++ A GK +SLSEQQLVDC+ + N GC+GGL AFEY+K + G+DTE YPY TG
Sbjct: 190 HYLATGKLVSLSEQQLVDCSSS--NDGCDGGLMDLAFEYVKEHKGIDTEVHYPYVSGNTG 247
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYS 248
C F + V V V+I G E LQ AVG P+SV + F Y+SG+YS
Sbjct: 248 YARQCSFDPKYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGLPSFMAYESGIYS 307
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFK-MEMGKNMCGIATC 307
+C P D++H V+ VGYGV++GVPYWLIKNSWGE+WG++GY + + N+CG+AT
Sbjct: 308 DHRC--NPHDLDHGVLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRILRNHNNLCGVATM 365
Query: 308 ASYPVV 313
ASYP++
Sbjct: 366 ASYPLM 371
>gi|2706547|emb|CAA75862.1| putative cathepsin L [Xenopus laevis]
Length = 231
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/222 (48%), Positives = 144/222 (64%), Gaps = 9/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW STTG+LE +++ K ISLSEQ LVDC++A N
Sbjct: 12 KSVDWRKKGYVTPVKDQGQCGSCWAPSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQGN 71
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+Y+K NGG+D+E++YPYT KD C + N V++ G E
Sbjct: 72 EGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEK 131
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
+L AV V PVSVA + F+FY+SG+Y +C + D++H V+ VGYG E D
Sbjct: 132 DLMKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPEC--SSEDLDHGVLVVGYGFESEDVD 189
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G YW++KNSW E WGD+GY + + N CGIAT ASYP+V
Sbjct: 190 GKKYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASYPLV 231
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/257 (45%), Positives = 155/257 (60%), Gaps = 11/257 (4%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
L+ A F+ Y +++E + FS +L R+ L +R ++ VK+QG CG
Sbjct: 80 LTSAEFSSLYNGYRQNLETSG---SVFSSSL---RNAMPSSLDWRDKKVVTDVKNQGKCG 133
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCW FSTTGSLE + G +SLSEQQL+DC+ + N GC+GG AF+YIK GG
Sbjct: 134 SCWAFSTTGSLEGLHALKTGHLVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYIKDAGGD 193
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDG 238
DTEE+YPYT K+ C+F + VG V I G E L HA+ V P+SVA + +
Sbjct: 194 DTEESYPYTAKNESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGLKT 253
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGENWGDHGYFKM-E 296
F+FYK G+YS C NT + NH V +GYG DG PYWL+KNSWG++WG GYF +
Sbjct: 254 FQFYKKGIYSDYLCSNTHL--NHGVTLIGYGESSDGSPYWLVKNSWGKDWGIDGYFMLAR 311
Query: 297 MGKNMCGIATCASYPVV 313
NMCG+AT ASYP++
Sbjct: 312 YVGNMCGVATDASYPIL 328
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 132/341 (38%), Positives = 179/341 (52%), Gaps = 39/341 (11%)
Query: 7 LVSSVILLLCCAAAASASA--SSFDDSNPI-----------------RLVSSDGLRDFET 47
++ ++ LC +AA SA + DD + R+V L+ E
Sbjct: 1 MLPLAVVALCLSAALSAPSLDPQLDDHWELWKSWHSKKYHEKEEGWRRMVWEKNLKKIEL 60
Query: 48 SVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK 99
L+ +G + L F R+ Y+ E K R + F L+ K
Sbjct: 61 HNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAETKARGSLF---LEPNFLEAPK 117
Query: 100 GLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQ 159
+ +R ++PVKDQG CGSCW FSTTG+LE + + GK +SLSEQ LVDC++ N+
Sbjct: 118 SVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNE 177
Query: 160 GCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL QAF+Y+K N GLD+E++YPY G D C + V V+I G E
Sbjct: 178 GCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDIPSGKERA 237
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V PVSVA + + F+FY+SG+Y +C + +D H V+ VGYG + DG
Sbjct: 238 LMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HGVLVVGYGFQGEDVDG 295
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YW++KNSW E WGD GY M KN CGIAT ASYP+V
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|228245|prf||1801240C Cys protease 3
Length = 321
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 104/217 (47%), Positives = 136/217 (62%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVKDQ CGSCW FS TG+LE + + +SLSEQQLVDC+ + N
Sbjct: 107 RDVDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGN 166
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GG + AF+YIK NGG+DTE +YPY +D C+F + ++G SV I E+
Sbjct: 167 DGCGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEIVQHTEEA 226
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
LQ AV V P+SVA + F+FY SGVY C +P ++H V+AVGYG E YW
Sbjct: 227 LQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYW 284
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG +WGD GY KM + N CGIA+ SYP V
Sbjct: 285 LVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321
>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
Length = 334
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 117/274 (42%), Positives = 151/274 (55%), Gaps = 22/274 (8%)
Query: 55 QARHALSFARFA------RRYGKIYESVEEMKLRFATFSKN---LDLIRSTNCKGLSYRL 105
Q +H + A A + ++ KLR + LDL +S + + Y
Sbjct: 68 QGKHGFTMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFREPLFLDLPKSVDWRKKGY-- 125
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++PVK+Q CGSCW FS TG+LE + GK +SLSEQ LVDC++ NQGCNGG
Sbjct: 126 ---VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGF 182
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
+ AF Y+K NGGLD+EE+YPY DG+CK+ EN + G E L AV
Sbjct: 183 MNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSVANDTGFEVVPAGKEKALMKAVAT 242
Query: 226 VRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIK 280
V P+SVA + F+FYKSG+Y C + +D H V+ VGYG E D YWL+K
Sbjct: 243 VGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGYGFEGANSDNNKYWLVK 300
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
NSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 301 NSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1; Contains: RecName: Full=Cathepsin L heavy chain; Contains: RecName: Full=Cathepsin L light chain; Flags: Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 156 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 215
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ VG +I G E +
Sbjct: 216 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKK 275
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 276 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 333
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 334 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 371
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 160 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 219
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ VG +I G E +
Sbjct: 220 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKK 279
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 280 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 337
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 338 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 375
>gi|74765984|sp|Q24940.1|CATLL_FASHE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
gi|497700|gb|AAA29136.1| cathepsin [Fasciola hepatica]
Length = 326
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 142/215 (66%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL+
Sbjct: 172 CSGGLMENAYQYLK-QFGLETESSYPYTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG RP +VA +V F Y+SG+Y S C +P+ VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGARRPAAVAVDVESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGTYWGERGYIRMARNRGNMCGIASLASLPMVA 323
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 169/296 (57%), Gaps = 24/296 (8%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
RLV LR E L+ +G+ + L F R+ Y+ E+ K + F
Sbjct: 48 RLVWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYKRREQRKYSGSLF 107
Query: 87 SKN--LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
+ L+ R+ + + Y ++PVKDQG CGSCW FSTTG+LE + GK +SL
Sbjct: 108 MEPNFLEAPRAVDWRDKGY-----VTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSL 162
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGV 203
SEQ LVDC++ N+GCNGGL QAF+Y+K N GLD+E+ YPY G D C+++++ V
Sbjct: 163 SEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDFYPYKGTDDQPCQYNAQYSAV 222
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHA 262
V+I G E L AV V PVSVA + + F+FY+SG+Y +C + +D H
Sbjct: 223 NDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSDELD--HG 280
Query: 263 VVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
V+ VGYG E DG YW++KNSW E WGD G+ M + N CGIAT ASYP+V
Sbjct: 281 VLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFIYMAKDRHNHCGIATAASYPLV 336
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 122/338 (36%), Positives = 162/338 (47%), Gaps = 79/338 (23%)
Query: 52 VIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--- 108
+ + ++ F + R+ K Y+ V E K RF+ F N+D + S N K LGLN
Sbjct: 171 LFSEEQYKNEFENWIDRFEKKYD-VSEFKKRFSIFKSNMDFVHSWNSKNSQTVLGLNHLA 229
Query: 109 ------------------------------------------------ISPVKDQGHCGS 120
+SP+KDQG CGS
Sbjct: 230 DLTNLEYRQFYLGTHKKAVLGTPGNHEVSNLQSVFGDSATVDWRQKGAVSPIKDQGQCGS 289
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLD 180
CW+FSTTGS+E A+ G + LSEQ LVDC+ + N GCNGGL AFEYI N G+D
Sbjct: 290 CWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNGID 349
Query: 181 TEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DG 238
TE +YPYT G CK++ N G + NIT G+E +L AV PVSVA + +
Sbjct: 350 TESSYPYTASSGTTCKYNKANSGATISSYKNITAGSESDLADAVKNAGPVSVAIDASHNS 409
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG----------------------VEDGVPY 276
F+ Y G+Y C + +D H V+ VGYG +D Y
Sbjct: 410 FQLYSHGIYYDASCSSVNLD--HGVLVVGYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNY 467
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W++KNSWG +WGD G+ M + N CGIA+CASYP+V
Sbjct: 468 WIVKNSWGTSWGDKGFIYMSKDRDNNCGIASCASYPIV 505
>gi|348531517|ref|XP_003453255.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 330
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 103/217 (47%), Positives = 144/217 (66%), Gaps = 5/217 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VK Q CGSCW FS TG+LE + + G + LSEQQLVDC++ + N
Sbjct: 117 KTVDWREQGYVTDVKHQQQCGSCWAFSATGALEGQHFKKTGTLVPLSEQQLVDCSRKYRN 176
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GG P+ AF+YI+ NGG+DTE++Y Y KDG C++ S ++G + V+++ E+
Sbjct: 177 NGCDGGEPNWAFQYIRDNGGVDTEKSYRYEAKDGQCRYRSNSIGAKCNGYVDVS-PFEEA 235
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L AV + P+SV+ + F+ Y+SGVY C N +++NHAV+AVGYG E+G YW
Sbjct: 236 LMEAVATIGPISVSIDDSRVSFQLYQSGVYDEPWCSN--INLNHAVLAVGYGTENGHDYW 293
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG WG+ GY KM K N CGIAT ASYP+V
Sbjct: 294 LVKNSWGSGWGNKGYIKMTRNKGNQCGIATEASYPLV 330
>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 117/274 (42%), Positives = 150/274 (54%), Gaps = 22/274 (8%)
Query: 55 QARHALSFARFA------RRYGKIYESVEEMKLRFATFSKN---LDLIRSTNCKGLSYRL 105
Q +H + A A + ++ KLR + LDL +S + + Y
Sbjct: 68 QGKHGFAMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFREPLFLDLPKSVDWRKKGY-- 125
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++PVK+Q CGSCW FS TG+LE + GK +SLSEQ LVDC+ NQGCNGG
Sbjct: 126 ---VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGF 182
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
+ AF Y+K NGGLD+EE+YPY DG+CK+ EN + G E L AV
Sbjct: 183 MNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSVANDTGFEVVPAGKEKALMKAVAT 242
Query: 226 VRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIK 280
V P+SVA + F+FYKSG+Y C + +D H V+ VGYG E D YWL+K
Sbjct: 243 VGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGYGFEGANSDNNKYWLVK 300
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
NSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 301 NSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 126 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 185
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ VG +I G E +
Sbjct: 186 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKK 245
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 246 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 303
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 304 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE +++ G +SLSEQ LVDC+ + N
Sbjct: 202 KSVDWRDKGAVTGVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGN 261
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ +G V+I G E +
Sbjct: 262 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEALDDSCHFNKGTIGATDRGFVDIPQGNEKK 321
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L AV + PVSVA + + F+FY GVY C +D H V+ VG+G E G Y
Sbjct: 322 LAEAVATIGPVSVAIDASHESFQFYSEGVYVEPACDAQNLD--HGVLVVGFGTDESGQDY 379
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 380 WLVKNSWGTTWGDKGFIKMLRNKDNQCGIASASSYPLV 417
>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
Length = 325
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 103/209 (49%), Positives = 136/209 (65%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FSTTGS+E Y K +S SEQQLVDC+ F N+GCNGG
Sbjct: 119 VTAVKDQKDCGSCWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDN 178
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y+ N G+ TE+ YPYT DGVC ++ ++ ++ G+ED+L+ AV + P
Sbjct: 179 AFKYLIANKGIATEDTYPYTATDGVCVYNKTMAAGRISSFKDVKHGSEDQLKLAVAQIGP 238
Query: 229 VSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED--GVPYWLIKNSWGE 285
+SVA + G F+FYK GVY +C + +D H V+AVGYG + G+ YWL+KNSW
Sbjct: 239 ISVAIDASSGDFQFYKKGVYVDEECSSKYLD--HGVLAVGYGTDKGTGLDYWLVKNSWSA 296
Query: 286 NWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+WGD GY KM KNMCGIA+ ASYPV+
Sbjct: 297 SWGDQGYIKMARNHKNMCGIASLASYPVI 325
>gi|535600|gb|AAA29137.1| cathepsin [Fasciola hepatica]
Length = 326
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 106/216 (49%), Positives = 142/216 (65%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FSTTG++E Y + IS SEQQLVDC+ F N G
Sbjct: 112 IDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQYMKNEKTSISFSEQQLVDCSGPFGNYG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDEL 219
CNGGL A+EY+K GL+TE +YPY +G C++ +E +GV +V + G E EL
Sbjct: 172 CNGGLMENAYEYLK-RFGLETESSYPYRAVEGQCRY-NEQLGVAKVTGYYTVHSGDEVEL 229
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
Q+ VG RP +VA +V F Y+SG+Y S C +P +NH V+AVGYG++DG YW++
Sbjct: 230 QNLVGCRRPAAVALDVESDFMMYRSGIYQSQTC--SPDRLNHGVLAVGYGIQDGTDYWIV 287
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
KNSWG WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 288 KNSWGTWWGEDGYIRMVRKRGNMCGIASLASVPMVA 323
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 106/215 (49%), Positives = 131/215 (60%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW FS+TGSLE + + K ISLSEQ LVDC+ N G
Sbjct: 114 VDWRTKGYVTEVKNQGQCGSCWAFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL QAF YIK N G+DTE +YPY G C+F+ NVG +I +E +LQ
Sbjct: 174 CGGGLMDQAFTYIKVNDGIDTETSYPYEAASGKCRFNKANVGANDTGYTDIKSKSESDLQ 233
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P++VA + F+ YKSGVY C T +D H V+AVGYG + G YWL+
Sbjct: 234 SAVATVGPIAVAIDASHMSFQLYKSGVYHYIFCSQTRLD--HGVLAVGYGTDSGKDYWLV 291
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG WG GY M + N CGIAT ASYP V
Sbjct: 292 KNSWGATWGQQGYIMMSRNRDNNCGIATQASYPTV 326
>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
Length = 331
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 112/242 (46%), Positives = 149/242 (61%), Gaps = 13/242 (5%)
Query: 77 EEMKLRFATFSKNL--DLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAY 134
++ K+ F + L D+ +S + + Y ++PVKDQG CGSCW FS GSLE
Sbjct: 98 QKTKMMMKVFQEPLLGDVPKSVDWRDHGY-----VTPVKDQGSCGSCWAFSAVGSLEGQM 152
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
+ GK + LS Q LVDC+ + NQGC+GGLP AF+Y+K NGGLDT +YPY +G C
Sbjct: 153 FRKTGKLVPLSVQNLVDCSWSQGNQGCDGGLPDLAFQYVKDNGGLDTSVSYPYEALNGTC 212
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCG 253
+++ +N V VN+ +ED L AV V P+SV + F+FYK G+Y C
Sbjct: 213 RYNPKNSAATVTGFVNVQ-SSEDALMKAVATVGPISVGIDTKHKSFQFYKEGMYYEPDCS 271
Query: 254 NTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
+T +D HAV+ VGYG E DG YWL+KNSWG +WG +GY KM + N CGIA+ ASYP
Sbjct: 272 STVLD--HAVLVVGYGEESDGRKYWLVKNSWGRDWGMNGYIKMAKDRNNNCGIASDASYP 329
Query: 312 VV 313
VV
Sbjct: 330 VV 331
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 114/275 (41%), Positives = 159/275 (57%), Gaps = 9/275 (3%)
Query: 46 ETSVLQVIGQARHALSFARFARRYGKIYESVEE----MKLRFATFSKNLD-LIRSTNCKG 100
+ +VL GQA + L +A Y + + +++ ++ + + ++ L+ T
Sbjct: 52 QHNVLADQGQANYRLGMNTYADLYNEEFMALKGSSGILQAKDQSSTQTFKPLVGVTLPSS 111
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW+FS TGSLE + G +SLSEQQLVDC+ ++ N G
Sbjct: 112 VDWRNQGYVTPVKDQGQCGSCWSFSATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL A++YI+ GG+ E AYPYT ++G C F V I G E L
Sbjct: 172 CSGGLMESAYDYIRDAGGVQLESAYPYTAQNGRCHFDQSKAVATCTGHVAIPSGDEQSLM 231
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AVG V PV+VA + F+ Y+SGVY ++C ++ +D H V+A GYG E G YWL+
Sbjct: 232 QAVGTVGPVAVAIDASGYDFQLYESGVYDRSRCSSSSLD--HGVLAAGYGTEGGNDYWLV 289
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG WG GY KM K N CGIAT A YP+V
Sbjct: 290 KNSWGPGWGAQGYIKMSRNKSNQCGIATMACYPLV 324
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 126 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 185
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ +G +I G E +
Sbjct: 186 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKK 245
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 246 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGDDY 303
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 304 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE +++ G +SLSEQ LVDC+ + N
Sbjct: 126 KQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGN 185
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ ++G V+I G E +
Sbjct: 186 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDRGFVDIPQGNEKK 245
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV + PV+VA + + F+FY GVY+ C +D H V+ VG+G E G Y
Sbjct: 246 MAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLD--HGVLVVGFGTDESGEDY 303
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 304 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 111/220 (50%), Positives = 139/220 (63%), Gaps = 9/220 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
L +R ++PVKDQG CGSCW FSTTG++E + GK +SLSEQ LVDC++ N+G
Sbjct: 120 LDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRPEGNEG 179
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDEL 219
CNGGL QAF+YIK N GLD+EEAYPY G D C + + V+I G E L
Sbjct: 180 CNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSGKEHAL 239
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGV 274
AV V PVSVA + + F+FY+SG+Y +C + +D H V+ VGYG E DG
Sbjct: 240 MKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELD--HGVLVVGYGFEGEDVDGK 297
Query: 275 PYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YW++KNSW E+WGD GY M KN CGIAT ASYP+V
Sbjct: 298 KYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPLV 337
>gi|397499865|ref|XP_003820654.1| PREDICTED: cathepsin L2 isoform 1 [Pan paniscus]
gi|397499867|ref|XP_003820655.1| PREDICTED: cathepsin L2 isoform 2 [Pan paniscus]
Length = 334
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 109/230 (47%), Positives = 141/230 (61%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+T G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VVTPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
G E + YWL+KNSWG WG +GY K+ + KN CGIAT ASYP V
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKKNHCGIATAASYPNV 334
>gi|13774082|gb|AAK38169.1| cathepsin L-like [Fasciola hepatica]
Length = 310
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 100/215 (46%), Positives = 142/215 (66%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 96 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNG 155
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL+
Sbjct: 156 CSGGLMENAYQYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTGYYTVHSGSEVELK 214
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG RP ++A +V F Y+SG+Y S C P +NHAV+AVGYG +DG YW++K
Sbjct: 215 NLVGSRRPAAIAVDVESDFMMYRSGIYQSQTC--LPFALNHAVLAVGYGTQDGTDYWIVK 272
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 273 NSWGLSWGERGYIRMARNRGNMCGIASLASLPMVA 307
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 137/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE +++ G +SLSEQ LVDC+ + N
Sbjct: 126 KQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGN 185
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ +G V+I G E +
Sbjct: 186 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFVDIPQGNEKK 245
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV + PV+VA + + F+FY GVY+ C +D H V+ VG+G E G Y
Sbjct: 246 MAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLD--HGVLVVGFGTDESGQDY 303
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 304 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 126 KSVDWRSKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 185
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ +G +I G E +
Sbjct: 186 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKK 245
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 246 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGDDY 303
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 304 WLVKNSWGTTWGDKGFIKMLRNKDNQCGIASASSYPLV 341
>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
Length = 341
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 130/345 (37%), Positives = 191/345 (55%), Gaps = 49/345 (14%)
Query: 11 VILLLCCAAAASASASSFD----DSNPIRLVSSDGLRDFETSV-----LQVIGQARHALS 61
+++L+C AAASA S FD + N ++ + + +++ V +++ + +H ++
Sbjct: 4 LVVLMCVVAAASA-VSFFDLVKEEWNAFKM---EHQKQYDSEVEDKFRMKIYAENKHKIA 59
Query: 62 F--ARFAR----------RYGKI--YESVEEMKLRFATFSKN-------------LDLIR 94
+FAR +YG + +E V M F +KN I
Sbjct: 60 KHNQKFARGQVPFRVKQNKYGDMLHHEFVHTMN-GFNKTTKNGKGLFGKSAGERGATFIP 118
Query: 95 STNCK---GLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVD 151
N + + +R ++ VKDQG CGSCW+FS TG+LE +++ +SLSEQ L+D
Sbjct: 119 PANVRVPDHVDWRKHGAVTEVKDQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNLID 178
Query: 152 CAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNI 211
C+ A+ N GCNGGL AF+YIK N G+DTE++YPY D C+++ N G + ++I
Sbjct: 179 CSTAYGNNGCNGGLMDNAFKYIKDNKGIDTEKSYPYEAVDDKCRYNPRNSGADDVGFIDI 238
Query: 212 TLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV 270
G E +L AV V PVSVA + + F+FY GVY C +T +D H V+ VGYG
Sbjct: 239 PSGDEGKLMAAVATVGPVSVAIDASQETFQFYSDGVYFDENCSSTSLD--HGVLVVGYGT 296
Query: 271 -EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
E+G YWL+KNSWG +WGD GY KM + N CGIAT AS+P+V
Sbjct: 297 DENGGDYWLVKNSWGRSWGDLGYIKMARNRDNHCGIATAASFPLV 341
>gi|300120790|emb|CBK21032.2| unnamed protein product [Blastocystis hominis]
Length = 516
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 117/303 (38%), Positives = 155/303 (51%), Gaps = 49/303 (16%)
Query: 59 ALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---------- 108
A F +F + K Y VE K R F +N + N + SY+L LN
Sbjct: 214 AAEFKQFVKDNKKCYNDVE-YKERQLNFLRNKARVEKVNSENRSYKLKLNHLADRSESEL 272
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVKDQ CGSCWT+ T G LE
Sbjct: 273 RAMMGLKRSQKKDFAAHRYTPSNGVKPDFVDWREKGAVTPVKDQCMCGSCWTYGTVGVLE 332
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGK 190
Y +GK + SEQ L+DC+ F N GCNGG +A+ ++ +NGGL T+E Y Y G
Sbjct: 333 GQYFLKYGKLVKFSEQNLLDCSWNFGNDGCNGGEDFRAYGWMLHNGGLMTDEDYGHYLGI 392
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSST 250
DG C F+ V++ D V IT G+ +EL+ AV V P+SV V F FY GV+ +
Sbjct: 393 DGWCHFNKSAAAVKITDYVLITPGSVEELEDAVANVGPISVGIAVTTDFLFYAEGVFDNP 452
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
+C + D HAV+AVGYG E+G YWLIKNSW WGD+GY K+ N+CG+AT ASY
Sbjct: 453 ECSSAVEDQAHAVLAVGYGTENGKDYWLIKNSWSTYWGDNGYVKIARKNNICGVATAASY 512
Query: 311 PVV 313
P++
Sbjct: 513 PIL 515
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 126 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 185
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ +G +I G E +
Sbjct: 186 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKK 245
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PV+VA + + F+FY GVY+ +C +D H V+ VGYG E G Y
Sbjct: 246 MAEAVATVGPVAVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGYGTDESGDDY 303
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 304 WLVKNSWGTTWGDKGFIKMLRNKDNQCGIASASSYPLV 341
>gi|114625736|ref|XP_001153919.1| PREDICTED: cathepsin L2 isoform 2 [Pan troglodytes]
gi|114625742|ref|XP_520130.2| PREDICTED: cathepsin L2 isoform 5 [Pan troglodytes]
Length = 334
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 109/230 (47%), Positives = 140/230 (60%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+T G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VVTPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E + YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 126 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 185
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ +G +I G E +
Sbjct: 186 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKK 245
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PV+VA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 246 MAEAVATVGPVAVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 303
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 304 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 109/222 (49%), Positives = 139/222 (62%), Gaps = 9/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVKDQG CGSCW FSTTG+LE + + GK +SLSEQ LVDC++ N
Sbjct: 82 RAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGN 141
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+YIK N GLD+E++YPY G D C + + V+I G E
Sbjct: 142 EGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFVDIPSGKER 201
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L AV V PVSVA + + F+FY+SG+Y C + +D H V+ VGYG E D
Sbjct: 202 ALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELD--HGVLVVGYGFEGEDVD 259
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
G YW++KNSW E WGD GY M KN CGIAT ASYP+V
Sbjct: 260 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 301
>gi|348545637|ref|XP_003460286.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 101/207 (48%), Positives = 134/207 (64%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQ CGSCW FS TG+LE + + G +SLSEQQLVDC+ F N GC GG
Sbjct: 130 VTEVKDQKQCGSCWAFSATGALEGQHFRKTGTLVSLSEQQLVDCSSNFGNSGCMGGWMDF 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIKYN G+DTEE YPY K+G+C++ +++G + + E L+ AV V P
Sbjct: 190 AFKYIKYNRGIDTEEFYPYEAKNGLCRYKRDSIGATCSGYIIVKRFEEQALKEAVATVGP 249
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + F+ Y+SGVY CG+ + +NHAV+AVGYG E+G YWL+KNSWG W
Sbjct: 250 ISVTIDASRPSFQLYESGVYYDDGCGS--IFLNHAVLAVGYGTENGHDYWLVKNSWGLGW 307
Query: 288 GDHGYFKMEMG-KNMCGIATCASYPVV 313
G+ GY +M KN CGIA+ A YP+V
Sbjct: 308 GEKGYIRMSRNKKNQCGIASVARYPLV 334
>gi|50403821|gb|AAT76664.1| cathepsin L1 proteinase [Fasciola hepatica]
Length = 326
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/215 (47%), Positives = 140/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG+ E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTGVKDQGNCGSCWAFSTTGTTEGQYMKNERTSISFSEQQLVDCSGPWGNNG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPYT +G C+ S + +V + G+E EL+
Sbjct: 172 CGGGLMENAYEYLK-QFGLETESSYPYTAVEGQCRHSKQLGVAKVTGYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG RP +VA +V F Y+SG+Y S C +P+ VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGAERPAAVAVDVESDFMMYRSGIYQSQTC--SPLSVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGLSWGERGYIRMVRNRGNMCGIASLASLPMVA 323
>gi|320543907|ref|NP_001188921.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
gi|318068589|gb|ADV37168.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
Length = 249
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 34 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 93
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ VG +I G E +
Sbjct: 94 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKK 153
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 154 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 211
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 212 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 249
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 107/212 (50%), Positives = 138/212 (65%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QGHCGSCW FSTTG+LE + G+ +SLSEQ LVDC+ NQGCNGG+
Sbjct: 179 VTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVSLSEQNLVDCSWQQGNQGCNGGIVDF 238
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YI N G+D+E+ YPYT KD C F E +V V+I +E+ L AV V
Sbjct: 239 AFQYILENRGIDSEDCYPYTAKDTAQCAFKPECATARVTGFVDIPPHSEEALMKAVATVG 298
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + FRFY+SG++ KC + + NHAV+ VGYG E G YW++KNS
Sbjct: 299 PVSVAIDAHPTSFRFYQSGIFYEPKCSSERL--NHAVLVVGYGYEGEDEAGKKYWIVKNS 356
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ WGDHGYF + + N CGIAT ASYP++
Sbjct: 357 WGKQWGDHGYFYLSKDRGNHCGIATTASYPLL 388
>gi|297684914|ref|XP_002820054.1| PREDICTED: cathepsin L2 isoform 2 [Pongo abelii]
Length = 334
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 137/230 (59%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC+ NQGCNGG +AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSHPQGNQGCNGGFMDKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
I G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VILPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E D YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 285 GFEGANSDNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPDV 334
>gi|41152538|gb|AAR99518.1| cathepsin L protein [Fasciola hepatica]
Length = 326
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/271 (42%), Positives = 166/271 (61%), Gaps = 17/271 (6%)
Query: 57 RHALSFARFARRYGKIYE-SVEEMKLRFAT-FSKNLDLIR-----STNCKG----LSYRL 105
RH L + + + + EE K ++ T S+ D++ TN + + +R
Sbjct: 57 RHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMSRASDILSHGVPYETNNRAVPDKIDWRE 116
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N GC+GGL
Sbjct: 117 SGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGL 176
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDELQHAVG 224
A++Y+K GL+TE +YPYT +G C++ +E +GV +V + G+E EL++ VG
Sbjct: 177 MENAYQYLK-QFGLETESSYPYTAVEGQCRY-NEQLGVAKVTGYYTVHSGSEVELKNLVG 234
Query: 225 LVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWG 284
P +VA +V F Y+SG+Y S C +P+ VNHAV+AVGYG + G YW++KNSWG
Sbjct: 235 SEGPAAVAVDVESDFMMYRSGIYQSQTC--SPLSVNHAVLAVGYGTQGGTDYWIVKNSWG 292
Query: 285 ENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
+WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 293 LSWGERGYIRMVRNRGNMCGIASLASLPMVA 323
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 107/217 (49%), Positives = 137/217 (63%), Gaps = 4/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TGSLE + K +SLSEQ LVDC+ + N
Sbjct: 119 KSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGN 178
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GGL AFEY+K NGG+DTE+AYPY G+D CK+ +E G V V+I E
Sbjct: 179 NGCSGGLMDNAFEYVKNNGGIDTEQAYPYLGQDNECKYRAECSGANVTGFVDIPSMNERA 238
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L AV V P+SVA + + F+FY+SGVY +C ++ +D H V+ VGYG YW
Sbjct: 239 LMKAVANVGPISVAIDAGNPSFQFYESGVYYEPQCSSSQLD--HGVLVVGYGSIGKDEYW 296
Query: 278 LIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
++KNSWGE WG GY M + N CGIAT ASYP V
Sbjct: 297 IVKNSWGEEWGKKGYVLMAKFRNNHCGIATAASYPQV 333
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/312 (36%), Positives = 170/312 (54%), Gaps = 51/312 (16%)
Query: 48 SVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGL 107
S ++ Q ++ +F + ++ K Y + +E R++ F N+D++ N KG + LGL
Sbjct: 18 SAARIFSQKQYQTAFQNWMVKHQKSYTN-DEFGSRYSVFQDNMDIVAKWNQKGSNTILGL 76
Query: 108 NI---------------------------------------------SPVKDQGHCGSCW 122
N+ + VK+QG CG C+
Sbjct: 77 NVMADLTNEEFKKLYLGTKANVTYKKKTLVGVSGLPASVDWRANGAVTAVKNQGQCGGCY 136
Query: 123 TFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTE 182
FSTTGS+E + + + LSEQQ++DC+ + N GC+GGL + +FEYI GGLDTE
Sbjct: 137 AFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVGGLDTE 196
Query: 183 EAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRF 241
+YPYTG+ G CKF+ +N+G + N+ G+E +LQ AV +PVSVA + F+
Sbjct: 197 ASYPYTGEVGKCKFNKKNIGATITGYKNVESGSESDLQTAVA-AQPVSVAIDASQSSFQL 255
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-N 300
Y SGVY +C +T +D H V+AVGYG + G YW++KNSWG +WG++G+ M K N
Sbjct: 256 YASGVYYEPECSSTQLD--HGVLAVGYGSQSGQDYWIVKNSWGADWGENGFILMARNKDN 313
Query: 301 MCGIATCASYPV 312
CGIAT AS+P
Sbjct: 314 NCGIATMASFPT 325
>gi|297684916|ref|XP_002820055.1| PREDICTED: cathepsin L2 isoform 3 [Pongo abelii]
Length = 345
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 137/230 (59%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 123 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 177
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC+ NQGCNGG +AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 178 VDCSHPQGNQGCNGGFMDKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFT 237
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
I G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 238 VILPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 295
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E D YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 296 GFEGANSDNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPDV 345
>gi|405963298|gb|EKC28885.1| Cathepsin L [Crassostrea gigas]
Length = 265
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 104/207 (50%), Positives = 131/207 (63%), Gaps = 6/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FSTTG LE +++ GK +SLSEQ L+DC++ N GCNGGLP +
Sbjct: 63 VTPVKNQGQCGSCWAFSTTGGLEGQHYRKTGKLVSLSEQNLLDCSK--ENMGCNGGLPQK 120
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A++YIK NGG+DTEE+YPY GK C F VG V +T G E L+ AV V P
Sbjct: 121 AYKYIKENGGIDTEESYPYLGKKETCSFRPSEVGATCTGFVQVTAGDELALKKAVASVGP 180
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
++V + F+ YK GVY C P+ +HAV+ VGYGV G YWL+KNSWG +W
Sbjct: 181 ITVCIDASQPSFQLYKGGVYDEQSC--NPIVFDHAVLIVGYGVYQGKDYWLVKNSWGTSW 238
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M + N CGIA A YP V
Sbjct: 239 GMDGYIMMSRNQNNQCGIANHAVYPTV 265
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 102/215 (47%), Positives = 135/215 (62%), Gaps = 5/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FS TGSLE + + +SLSEQ+LVDC+ + N G
Sbjct: 102 VDWRTKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDG 161
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+YIK NGG+DTE +YPY +D C+F + ++G V + E+ L
Sbjct: 162 CGGGWMTSAFDYIKDNGGIDTESSYPYEAQDRSCRFDANSIGATCTGFVEVQH-TEEALH 220
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV + P+SVA + F+FY SGVY KC +P +++H V+AVGYG E YWL+
Sbjct: 221 EAVSDIGPISVAIDASHFSFQFYSSGVYYEKKC--SPTNLDHGVLAVGYGTESTEDYWLV 278
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG WGD GY KM + N CGIA+ SYP V
Sbjct: 279 KNSWGSGWGDAGYIKMSRNRDNNCGIASEPSYPTV 313
>gi|15826035|pdb|1FH0|A Chain A, Crystal Structure Of Human Cathepsin V Complexed With An
Irreversible Vinyl Sulfone Inhibitor
gi|15826036|pdb|1FH0|B Chain B, Crystal Structure Of Human Cathepsin V Complexed With An
Irreversible Vinyl Sulfone Inhibitor
Length = 221
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 107/221 (48%), Positives = 136/221 (61%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ LVDC++ N
Sbjct: 3 KSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN Q + G E
Sbjct: 63 QGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVAQDTGFTVVAPGKEKA 122
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGYG E D
Sbjct: 123 LMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGYGFEGANSDN 180
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 181 SKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 221
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/246 (45%), Positives = 147/246 (59%), Gaps = 11/246 (4%)
Query: 71 KIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSL 130
K S E ++F +++K S + + Y ++PVKDQG CGSCW FSTTGSL
Sbjct: 95 KFDASRERQGIKFLSYAK-FQAPDSVDWRDEGY-----VTPVKDQGQCGSCWAFSTTGSL 148
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + ++ G SLSEQ LVDC+ ++ N GC GGL AF+YIK N G+DTE+ YPY +
Sbjct: 149 EGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDTEDKYPYEAE 208
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
D C+FS +NVG V++ G ED L+ A P+SVA + + F+ Y+SGVY
Sbjct: 209 DDTCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHESFQLYESGVYDE 268
Query: 250 TKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
C + +D H V+ VGYG + G YW++KNSWG +WG GY M K N CGIAT
Sbjct: 269 ESCSSIELD--HGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRNKDNQCGIATS 326
Query: 308 ASYPVV 313
ASYP V
Sbjct: 327 ASYPTV 332
>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
Length = 333
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 113/230 (49%), Positives = 140/230 (60%), Gaps = 14/230 (6%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG + AF Y+K NGGLD+E +YPY KDG+CK+ EN V
Sbjct: 167 VDCSRPQGNQGCNGGFMNYAFRYVKENGGLDSEASYPYEAKDGICKYKPENSVANDTGFV 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
I E EL AV V P+SVA + F+FYKSG+Y KC + +D H V+ VGY
Sbjct: 227 VIPT-HEKELMKAVATVGPISVAVDASHSSFQFYKSGIYFEKKCSSKNLD--HGVLVVGY 283
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E YWLIKNSWG WG +GY K+ + N CGIAT ASYPVV
Sbjct: 284 GFEGANSKDNKYWLIKNSWGPEWGLNGYIKIAKDQNNHCGIATAASYPVV 333
>gi|313221001|emb|CBY31833.1| unnamed protein product [Oikopleura dioica]
gi|313229611|emb|CBY18426.1| unnamed protein product [Oikopleura dioica]
Length = 362
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/228 (47%), Positives = 141/228 (61%), Gaps = 5/228 (2%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
+D S K + +R ++ VKDQG CGSCW+FS TG+LE Q FGK LSEQ L
Sbjct: 128 VDADESKLDKSVDWREKGAVTEVKDQGQCGSCWSFSATGALEGQMAQVFGKLPDLSEQNL 187
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDS 208
VDC++ NQGCNGGL AF+Y+K GLD E+ YPY G D C++ +
Sbjct: 188 VDCSRPEGNQGCNGGLMDAAFQYVKDQDGLDGEDWYPYEGVDNKECRYDKSHREADDTGF 247
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
I G E L+HA+ V PVSVA + + F+FY+SGVY C +P +++H V+AVG
Sbjct: 248 KMIPEGNEKALKHALAKVGPVSVAIDASNPSFQFYQSGVYYEPNC--SPENLDHGVLAVG 305
Query: 268 YGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
YG EDG Y+L+KNSW E WGD+GY KM K N CGIA+ A YP+V+
Sbjct: 306 YGTEDGEHYYLVKNSWSEAWGDNGYIKMARNKENHCGIASYAVYPIVS 353
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 159/306 (51%), Gaps = 55/306 (17%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN-------- 108
S+ F ++G+ Y +EE + R F NL I N K ++Y L +N
Sbjct: 19 SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNE 78
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FSTTG +
Sbjct: 79 KFNAVMKGYKKGPRPAAVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGI 138
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQ-AFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
E + G+ +SLSEQQLVDCA ++ NQGCNGG +A Y++ NGG+DTE +YPY
Sbjct: 139 EGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEA 198
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYS 248
+D C+F+S +G V I G+E L+ A + P+SVA + F+ Y +GVY
Sbjct: 199 RDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYY 258
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
C ++ +D HAV+AVGYG E G +WL+KNSW +WG+ GY KM + N CGIAT
Sbjct: 259 EPSCSSSQLD--HAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATD 316
Query: 308 ASYPVV 313
A YP V
Sbjct: 317 ACYPTV 322
>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
Length = 333
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 103/210 (49%), Positives = 132/210 (62%), Gaps = 10/210 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSC+ FS TG++E + + GK +SLSEQ +VDC+ N+GC GGL +
Sbjct: 129 VTPVKNQGGCGSCYAFSATGAVEGQHFRKTGKLVSLSEQNIVDCSFKEGNKGCRGGLMDK 188
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
+F YIK N G+DTEEAYPY +DG C+F VG V V++ E LQHAV + P
Sbjct: 189 SFTYIKDNNGIDTEEAYPYEARDGPCRFRRSEVGATVRGYVDLPENDEIALQHAVTTIGP 248
Query: 229 VSVAFEVVDG----FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWG 284
+SVA +DG FRFY GV+ + C T +NH V+ VGYG DG+ YWL+KNSWG
Sbjct: 249 ISVA---IDGHHFNFRFYHHGVFDNPNCSKTK--INHGVLVVGYGTRDGLDYWLVKNSWG 303
Query: 285 ENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
E WG GY M N C I ASYP+V
Sbjct: 304 ERWGAEGYILMSRNNDNQCCITCAASYPIV 333
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 110/223 (49%), Positives = 138/223 (61%), Gaps = 10/223 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FSTTGSLE + + GK +SLSEQ LVDC++ N
Sbjct: 119 KSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGN 178
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGCNGGL QAFEYI NGG+D+EE+YPY KD C + SE V++ G E
Sbjct: 179 QGCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHER 238
Query: 218 ELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----- 271
L AV V PVSVA + F+FY+SG+Y C + +D H V+ VGYG E
Sbjct: 239 ALMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELD--HGVLVVGYGFEGTDDD 296
Query: 272 DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+ YW++KNSW + WGD GY M + N CGIAT ASYP+V
Sbjct: 297 NKKKYWIVKNSWSDKWGDKGYILMAKDRNNHCGIATAASYPLV 339
>gi|8547325|gb|AAF76330.1|AF271385_1 cathepsin L [Fasciola hepatica]
Length = 326
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 105/216 (48%), Positives = 142/216 (65%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FSTTG++E Y + IS SEQQLVDC++ F N G
Sbjct: 112 IDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQYMKNQRTSISFSEQQLVDCSRDFGNYG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDEL 219
CNGGL A+EY+K GL+TE +YPY +G C++ +E +GV +V + G E EL
Sbjct: 172 CNGGLMENAYEYLK-RFGLETESSYPYRAVEGQCRY-NEQLGVAKVTGYYTVHSGDEVEL 229
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
Q+ VG P +VA +V F Y+SG+Y S C +P +NH V+AVGYG++DG YW++
Sbjct: 230 QNLVGAEGPAAVALDVESDFMMYRSGIYQSQTC--SPDRLNHGVLAVGYGIQDGTDYWIV 287
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
KNSWG WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 288 KNSWGTWWGEDGYIRMVRKRGNMCGIASLASVPMVA 323
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/212 (50%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FSTTG+LZ + GK +SLSEQ LVDC++ N+GC GGL Q
Sbjct: 127 VTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPEGNEGCGGGLMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY G D C + + V V+I G E L AV V
Sbjct: 187 AFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDIPSGKEHALMKAVASVG 246
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNS 282
PVSVA + + F+FY+SG+Y +C + +D H V+AVGYG E DG YW++KNS
Sbjct: 247 PVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HGVLAVGYGFEGEDVDGKKYWIVKNS 304
Query: 283 WGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
W E WGD GY M KN CGIAT ASYP+V
Sbjct: 305 WSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 162/307 (52%), Gaps = 65/307 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG--------------------- 100
F F ++ K+YES EE RF+ FS+N+D I N +
Sbjct: 30 FDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNEE 89
Query: 101 ------------------------------LSYRLGLNISPVKDQGHCGSCWTFSTTGSL 130
+ +R ++P+K+QG CGSCW+FSTTGS+
Sbjct: 90 YRQLYLRPYPTELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSV 149
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E A+ A G +SLSEQQLVDC+ +F NQGCNGGL AF+YI NGGLDTE+ YPYT +
Sbjct: 150 EGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTAR 209
Query: 191 DGVCKFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYS 248
DGVC S E+ V + ++ ED+L AV PVSVA E F+ Y SGV+S
Sbjct: 210 DGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAV-EKGPVSVAIEADQQSFQMYSSGVFS 268
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG---KNMCGIA 305
CG +++H V+ VGY + YW++KNSWG +WGD GY M+ G +CGIA
Sbjct: 269 G-PCG---TNLDHGVLVVGYTSD----YWIVKNSWGASWGDQGYIMMKRGVSSAGICGIA 320
Query: 306 TCASYPV 312
SYP+
Sbjct: 321 MQPSYPI 327
>gi|377823949|gb|AFB77219.1| cathepsin L1 [Fasciola gigantica]
Length = 326
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 141/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPYT +G C+++ + +V D + G+E EL+
Sbjct: 172 CMGGLMENAYEYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y+ G+Y S C +P+ VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGAEGPAAVAVDVESDFMMYRGGIYQSQTC--SPLGVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGSSWGERGYIRMVRNRGNMCGIASLASLPMVA 323
>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
Length = 322
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 106/215 (49%), Positives = 133/215 (61%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW FS TGSLE + A GK +SLSEQ LVDC+ A N+G
Sbjct: 107 VDWRTKGYVTGVKNQGQCGSCWAFSATGSLEGQHFNATGKLVSLSEQNLVDCSSAEGNEG 166
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGLP AF+Y+ NGG+DTE +YPY +D C +SS N+G V+I +E +LQ
Sbjct: 167 CNGGLPDDAFKYVIKNGGIDTEASYPYVARDEKCHYSSANIGSTCSSYVDIESKSEAQLQ 226
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
A V P+ V + GF+ Y GVY S C T +D H V+ VGYGV YW++
Sbjct: 227 VASATVGPIPVGIDASHLGFQLYDGGVYHSDLCSQTRLD--HGVLVVGYGVYKEKDYWMV 284
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG NWG G M + N CGIAT ASYPVV
Sbjct: 285 KNSWGTNWGISGDMMMSRNRDNNCGIATMASYPVV 319
>gi|116488416|gb|AAB41670.2| secreted cathepsin L 1 [Fasciola hepatica]
Length = 326
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 100/215 (46%), Positives = 142/215 (66%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC++ + N G
Sbjct: 112 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL+
Sbjct: 172 CGGGLMENAYQYLK-QFGLETESSYPYTAVEGQCRYNKQLGVAKVTGFYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y+SG+Y S C +P+ VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGAEGPAAVAVDVESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGLSWGERGYIRMVRNRGNMCGIASLASLPMVA 323
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 106/221 (47%), Positives = 143/221 (64%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FSTTG+LE + + G+ +SLSEQ LV+C++ N
Sbjct: 119 KHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGN 178
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+Y+K NGG+D+E++YPY G D C ++ + V+I G E
Sbjct: 179 EGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKER 238
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L A+ V PVSVA + F+FY+SG+Y +C +T D++H V+ VGYGVE D
Sbjct: 239 ALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSST--DLDHGVLVVGYGVEKRDTD 296
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
G YW++KNSW E WG +GY M K N CGIAT ASYP+
Sbjct: 297 GKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 106/221 (47%), Positives = 143/221 (64%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FSTTG+LE + + G+ +SLSEQ LV+C++ N
Sbjct: 119 KHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGN 178
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+Y+K NGG+D+E++YPY G D C ++ + V+I G E
Sbjct: 179 EGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKER 238
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L A+ V PVSVA + F+FY+SG+Y +C +T D++H V+ VGYGVE D
Sbjct: 239 ALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSST--DLDHGVLVVGYGVEKRDTD 296
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
G YW++KNSW E WG +GY M K N CGIAT ASYP+
Sbjct: 297 GKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 110/221 (49%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TG+LE Q GK ISLSEQ LVDC+ N
Sbjct: 116 KSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSHPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+Y+K N GLD+EE+YPY G DG CK+ E V+I G E
Sbjct: 176 QGCNGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYKPECSVANDTGFVDIP-GHEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+S A + F+FYKSG+Y C + D++H ++ VGYG E +
Sbjct: 235 LLRAVATVGPISAAIDAGHMSFQFYKSGIYYDPDC--SSKDLDHGILVVGYGFEGTNSNA 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WGD GY K+ K N CGIAT ASYP V
Sbjct: 293 TKYWLVKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYPTV 333
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 114/296 (38%), Positives = 176/296 (59%), Gaps = 25/296 (8%)
Query: 34 IRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLI 93
++ +++ +R + +V + GQ + + F+ + + EE+K R F +L+
Sbjct: 87 FKIFANNFVRISKHNVRFIQGQVSYTMGINEFSDK------TDEELK-RLRCFRGSLNAS 139
Query: 94 R---------STNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
R + + +R ++PVK+QG+CGSCW FS TG++E A G +SL
Sbjct: 140 RDGSKYITIAAPPPSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSL 199
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY-TGKDG----VCKFSSE 199
SEQQLVDC+ + N CNGGL AF+Y+K + G+DTE +YPY +G+ G C+F+ +
Sbjct: 200 SEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLK 259
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMD 258
V+V +++ G EL+ AVG P+SVA + F YKSGVYS +C + D
Sbjct: 260 EAVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSD--D 317
Query: 259 VNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFK-MEMGKNMCGIATCASYPVV 313
++H V+ VGYG E+G+PYWLIKNSWG +WG++GY K + N+CG+A+ ASYP++
Sbjct: 318 LDHGVLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMASYPLI 373
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 105/208 (50%), Positives = 136/208 (65%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FS TGSLE +++ GK +SLSEQ LVDC +++GCNGG
Sbjct: 151 VTKVKDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDG 210
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y++ N G+DTE +YPY G+DG C+F SE+VG V+I G E L+ A+ V P
Sbjct: 211 AFQYVETNKGIDTEASYPYKGRDGRCRFKSEDVGATDTGFVDIPEGNETLLEAAIATVGP 270
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGEN 286
VSVA + F+FY GVY C +P ++H V+AVGY +DG Y+++KNSW E+
Sbjct: 271 VSVAIDAASFKFQFYSHGVYYDRSC--SPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSED 328
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WGD GY M K N CGIAT ASYP V
Sbjct: 329 WGDDGYILMSRRKNNNCGIATMASYPFV 356
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 120/308 (38%), Positives = 158/308 (51%), Gaps = 57/308 (18%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN-------- 108
S+ F ++G+ Y +EE + R F NL I N K ++Y L +N
Sbjct: 19 SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQFSDLTND 78
Query: 109 ----------------------------------------ISPVKDQGHCGSCWTFSTTG 128
++ VKDQG CGSCW FS TG
Sbjct: 79 EFNSMMKGYKTSLRPKPVAVFTSTDAAPETTEVDWRTKGCVTHVKDQGQCGSCWAFSATG 138
Query: 129 SLEAAYHQAFGKGISLSEQQLVDCAQA-FNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
SLE + +G+ +SL+EQQLVDCA + NQGCNGG +QAF+YIK NGG+DTE +YPY
Sbjct: 139 SLEGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGIDTESSYPY 198
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGV 246
+D C+F+S +V V+I G+E P+SVA + F+ Y SGV
Sbjct: 199 EARDNTCRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQSYSSGV 258
Query: 247 YSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIA 305
Y C ++ +D HAV+AVGYG E G +WL+KNSWG +WG GY M + N CGIA
Sbjct: 259 YYEPSCSSSQLD--HAVLAVGYGSEGGQDFWLVKNSWGTSWGSAGYINMARNRNNNCGIA 316
Query: 306 TCASYPVV 313
T ASYP V
Sbjct: 317 TDASYPTV 324
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 106/221 (47%), Positives = 143/221 (64%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FSTTG+LE + + G+ +SLSEQ LV+C++ N
Sbjct: 119 KHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGN 178
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+Y+K NGG+D+E++YPY G D C ++ + V+I G E
Sbjct: 179 EGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKER 238
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L A+ V PVSVA + F+FY+SG+Y +C +T D++H V+ VGYGVE D
Sbjct: 239 ALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSST--DLDHGVLVVGYGVEKRDTD 296
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
G YW++KNSW E WG +GY M K N CGIAT ASYP+
Sbjct: 297 GKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|1093503|prf||2104214A Cys protease
Length = 255
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 40 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 99
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ VG +I G E +
Sbjct: 100 NGCNGGLMDNAFPYIKDNGGIDTEKSYPYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKK 159
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 160 MPEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 217
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 218 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSYPLV 255
>gi|310975575|gb|ADP55136.1| truncated cathepsin L-like protein [Miichthys miiuy]
Length = 246
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 107/222 (48%), Positives = 138/222 (62%), Gaps = 9/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVKDQG CGSCW FSTTG+LE + + GK +SLSEQ LVDC++ N
Sbjct: 27 RAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGN 86
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK-DGVCKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+Y+K N GLD+E+AYPY G D C + +++ G E
Sbjct: 87 EGCNGGLMDQAFQYVKDNQGLDSEDAYPYLGTGDQPCHYDPNYNSANDTGFIDVPSGKEH 146
Query: 218 ELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L AV V PVSVA + + F+FY+SG+Y C + +D H V+ VGYG E D
Sbjct: 147 ALMKAVAAVGPVSVAIDASHESFQFYQSGIYYEKDCSSEELD--HGVLVVGYGFEGEDVD 204
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
G YW++KNSW E WGD GY M KN CGIAT ASYP+V
Sbjct: 205 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 246
>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 157/314 (50%), Gaps = 70/314 (22%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L+F F ++GK YES EE R A F NL I N K LSY+LG+N
Sbjct: 26 LAFMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEQVNAKDLSYKLGVNEHADLTHEEFA 85
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVKDQG CGSCW FSTTG+LE
Sbjct: 86 ALKLGTLKMSTRRDDKFVIEADTTQLPTSVDWRNKNVLTPVKDQGSCGSCWAFSTTGALE 145
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
A Y A GK +SLSEQQLVDC+ + N GC GGL A+EYIK + GLD E Y Y G D
Sbjct: 146 AQYAIATGKLLSLSEQQLVDCSSGYGNNGCEGGLMDDAYEYIK-SAGLDQESTYSYNGTD 204
Query: 192 GVCKFS----------SENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFR 240
VC+ S E G +LD E L A+ PVSVA D FR
Sbjct: 205 DVCQGSLAKRSDGIPAGEVTGFHMLDKT------EQSLMKALADA-PVSVAMYAADPDFR 257
Query: 241 FYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKN 300
FYKSGVYSS C N +D H VVAVGYG E+G Y++I+NSWG +WG GYF ++ G +
Sbjct: 258 FYKSGVYSSATC-NGKLD--HGVVAVGYGTENGSDYFIIRNSWGSSWGQAGYFYLKRGVS 314
Query: 301 MCGIATCASYPVVA 314
G Y VA
Sbjct: 315 GYGECNILEYMCVA 328
>gi|3087790|emb|CAA75029.1| cathepsin L2 [Homo sapiens]
Length = 334
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 108/230 (46%), Positives = 139/230 (60%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+ G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E + YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 282
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 111/252 (44%), Positives = 148/252 (58%), Gaps = 13/252 (5%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTF 124
F R +G S K R N ++ + + + Y ++PVK+QG CGSCW F
Sbjct: 41 FRRTFGDNIASRNATKWRAPL---NFEVPDAVDWRDEGY-----VTPVKNQGMCGSCWAF 92
Query: 125 STTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEA 184
S TGSLE + +A GK +SLSEQ LVDC+ F N GCNGGL AFEY+K N G+DTEE+
Sbjct: 93 SATGSLEGQHKRATGKLVSLSEQNLVDCSADFGNNGCNGGLMDFAFEYVKQNHGIDTEES 152
Query: 185 YPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYK 243
YPY K C F NVG V++ E++L+ AV PVSVA + FR YK
Sbjct: 153 YPYKAKQKKCHFQKANVGADDTGFVDLPEADEEQLKAAVASQGPVSVAIDAGHRSFRLYK 212
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMGK-NM 301
+GVY C +P ++H V+ VGYG + + YW++KNSWGE WG+ GY ++ + N
Sbjct: 213 TGVYYEKHC--SPEQLDHGVLVVGYGTDPEHGDYWIVKNSWGEEWGEKGYVRIARNRNNH 270
Query: 302 CGIATCASYPVV 313
CGIA+ ASYP+
Sbjct: 271 CGIASKASYPLA 282
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 126/328 (38%), Positives = 171/328 (52%), Gaps = 31/328 (9%)
Query: 13 LLLCCAAAASASASSFDD-------------SNPIRLVSSDGLRDFETSVLQVIGQ---- 55
+L CC AA AS FD+ S + D R L +I Q
Sbjct: 1 MLACCIAATLASPLVFDEALDEMWTLFKTTHSKTYATEAEDMRRFIWERHLNMINQHNIE 60
Query: 56 ---ARHALSFARFARRYGKI--YESVEEMKLRFATFSKNLDLIRSTNC---KGLSYRLGL 107
+H S YG + +E + A S + N K + +R
Sbjct: 61 ADLGKHTFSLG--MNEYGDLTQHEYAAMSGYKMAKSSVGSSFLEPENLQVPKTVDWREKG 118
Query: 108 NISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPS 167
++PVK+QG CGSCW FS+TGSLE + G+ S+SEQ LVDC++ N GC+GGL
Sbjct: 119 YVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMD 178
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF YIK N G+D+E++YPY DG C++ + V+I G E L+ AV V
Sbjct: 179 NAFTYIKKNMGIDSEKSYPYEAVDGECRYKKSDSVTTDSGFVDIPHGDETALRTAVASVG 238
Query: 228 PVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
PVSVA + F+FYK+GVY+ C +T +D H V+ VGYGVE+G YWL+KNSWG +
Sbjct: 239 PVSVAIDASHTSFQFYKTGVYTEANCSSTQLD--HGVLVVGYGVENGQDYWLVKNSWGAS 296
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY K+ N CGIA+ ASYP++
Sbjct: 297 WGEAGYIKLARNHGNQCGIASQASYPLL 324
>gi|426362423|ref|XP_004048364.1| PREDICTED: cathepsin L2 isoform 1 [Gorilla gorilla gorilla]
gi|426362425|ref|XP_004048365.1| PREDICTED: cathepsin L2 isoform 2 [Gorilla gorilla gorilla]
Length = 334
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 108/230 (46%), Positives = 139/230 (60%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+ G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VVAPGKEKALMKAVATVGPISVAVDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E + YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>gi|23110960|ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]
gi|320118898|ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]
gi|12644075|sp|O60911.2|CATL2_HUMAN RecName: Full=Cathepsin L2; AltName: Full=Cathepsin U; AltName:
Full=Cathepsin V; Flags: Precursor
gi|3107915|dbj|BAA25909.1| cathepsin V [Homo sapiens]
gi|3228672|gb|AAC23598.1| cathepsin U [Homo sapiens]
gi|3869129|dbj|BAA34365.1| cathepsin L2 [Homo sapiens]
gi|23958123|gb|AAH23504.1| CTSL2 protein [Homo sapiens]
gi|37182404|gb|AAQ89004.1| cathepsin L2 [Homo sapiens]
gi|83405150|gb|AAI10513.1| Cathepsin L2 [Homo sapiens]
gi|119579235|gb|EAW58831.1| cathepsin L2, isoform CRA_a [Homo sapiens]
gi|119579236|gb|EAW58832.1| cathepsin L2, isoform CRA_a [Homo sapiens]
Length = 334
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 108/230 (46%), Positives = 139/230 (60%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+ G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E + YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 165/314 (52%), Gaps = 53/314 (16%)
Query: 48 SVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGL 107
S +V Q ++ +F + ++ K Y + +E R+ F N+D + N KG LGL
Sbjct: 18 SAARVFSQKQYQTAFQNWMVKHQKSYTN-DEFGSRYTIFQDNMDFVTKWNQKGSDTILGL 76
Query: 108 N-----------------------------------------------ISPVKDQGHCGS 120
N ++ VK+QG CG
Sbjct: 77 NSMADLTNQEYQRIYLGTKTTVKKPNLIIGVTDVSKAPASVDWRANGAVTAVKNQGQCGG 136
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLD 180
C++FSTTGS+E + + +SLSEQQ++DC+ + N GC+GGL + +FEYI GGLD
Sbjct: 137 CYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVGGLD 196
Query: 181 TEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGF 239
TE +YPY G G CKF+ N+G + N+ G+E +LQ AV +PVSVA + + F
Sbjct: 197 TEASYPYEGVVGKCKFNKANIGATITGYKNVKSGSESDLQTAVA-AQPVSVAIDASQNSF 255
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK 299
+ Y SGVY C +T +D H V+AVGYG + G YW++KNSWG +WG+ G+ M K
Sbjct: 256 QLYSSGVYYEPACSSTQLD--HGVLAVGYGSQSGQDYWIVKNSWGADWGEKGFILMARNK 313
Query: 300 -NMCGIATCASYPV 312
N CGIAT ASYP
Sbjct: 314 HNNCGIATMASYPT 327
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 108/222 (48%), Positives = 137/222 (61%), Gaps = 9/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ L +R ++PVKDQG CGSCW FSTTG+LE + GK +SLSEQ LVDC++ N
Sbjct: 118 RALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGN 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+Y+K N GLD+E++YPY G D C + V++ G E
Sbjct: 178 EGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDVPSGKER 237
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L AV V PVSVA + + F+FY+SG+Y C + +D H V+ VGYG E D
Sbjct: 238 ALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELD--HGVLVVGYGYEGEDVD 295
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
G YW++KNSW E WGD GY M KN CGIAT ASYP+V
Sbjct: 296 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 337
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 103/213 (48%), Positives = 136/213 (63%), Gaps = 5/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+R ++ VK+QG CGSCW+FSTTGS E A G+ SLSEQ LVDC+ ++ N G
Sbjct: 115 FDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHG 174
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AFEYI N G+DTEE+YPY G C+++ ++ G +++ N+ G E L
Sbjct: 175 CNGGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVSYTNVPSGNEGALL 234
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
+AV +P SVA + F+FYK GVY C ++ +D H V+AVG+GV DG YWL+
Sbjct: 235 NAVA-TQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLD--HGVLAVGWGVRDGKDYWLV 291
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWG +WG GY +M K N CGIAT AS+P
Sbjct: 292 KNSWGADWGLSGYIEMSRNKHNQCGIATAASHP 324
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 140/230 (60%), Gaps = 11/230 (4%)
Query: 87 SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSE 146
S N+D + T + +R ++PVKDQG CGSCW FS TGSLE + GK +SLSE
Sbjct: 112 SNNVDKLPKT----VDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSE 167
Query: 147 QQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVL 206
Q LVDC+ + N GC+GG +AF+YI GG+DTE Y Y DG C F NVG V
Sbjct: 168 QNLVDCS--YRNYGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKANVGATVT 225
Query: 207 DSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVA 265
++T G+E LQ AV + P+SVA + F+FYKSGVY+ C T + HAV+
Sbjct: 226 GYTDVTSGSEKALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRL--GHAVLV 283
Query: 266 VGYG-VEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
VGYG DG YW++KNSW + WG +GY M K N CGIA+ ASYP+V
Sbjct: 284 VGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASYPMV 333
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 109/220 (49%), Positives = 137/220 (62%), Gaps = 9/220 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
L +R ++PVKDQG CGSCW FSTTG++E + GK +SLSEQ LVDC++ N+G
Sbjct: 119 LDWREKGYVTPVKDQGQCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDEL 219
CNGGL QAF+YIK NGGLDTE+ YPY G D C + V+I G E L
Sbjct: 179 CNGGLMDQAFQYIKDNGGLDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSGKEHAL 238
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGV 274
AV V PVSVA + + F+FY+SG+Y C + D++H V+ VGYG E DG
Sbjct: 239 MKAVTAVGPVSVAIDAGHESFQFYQSGIYYEADCSSE--DLDHGVLVVGYGYEGENVDGK 296
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSW E WG+ GY M + N CGIAT ASYP+V
Sbjct: 297 KYWIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAASYPLV 336
>gi|31558997|gb|AAP49831.1| cathepsin L [Fasciola hepatica]
Length = 326
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 109/268 (40%), Positives = 161/268 (60%), Gaps = 15/268 (5%)
Query: 57 RHALSFARFARRYGKIYE-SVEEMKLRFAT-FSKNLDLIR-----STNCKG----LSYRL 105
RH L + + + + EE K ++ T S+ D++ TN + + +R
Sbjct: 57 RHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMSRASDILSHGVPYETNNRAVPDKIDWRE 116
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N GC+GGL
Sbjct: 117 SGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGL 176
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL++ VG
Sbjct: 177 MENAYQYLK-QFGLETESSYPYTAVEGQCRYNKQLGVAKVTGYYTVPSGSEVELKNLVGA 235
Query: 226 VRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGE 285
P +VA +V F Y+SG+Y S C +P+ VNHAV+AVGYG + G YW++KNSWG
Sbjct: 236 EGPAAVAVDVESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYGTQGGTDYWIVKNSWGL 293
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPV 312
+WG+ GY +M + NMCGIA+ AS P+
Sbjct: 294 SWGERGYIRMARNRGNMCGIASLASLPI 321
>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
Length = 347
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + +A GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 142 VTPVKNQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDL 201
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTEE YPY GK+ C F ++G + V++ G ED L+ AV P
Sbjct: 202 AFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGP 261
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YW+IKNSWG
Sbjct: 262 ISIAIDAGHRSFQLYKKGVYFDEECSSEELD--HGVLLVGYGTDPEAGDYWIIKNSWGTK 319
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 320 WGEKGYVRIARNRNNHCGVATKASYPLV 347
>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
Length = 385
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 114/302 (37%), Positives = 179/302 (59%), Gaps = 25/302 (8%)
Query: 34 IRLVSSDGLRDFETSVLQVIGQARHALSFARFARR------YGKIYESVEEMKLRFATFS 87
++ +++ +R + +V + GQ + + F+ + + +++ EE+K R F
Sbjct: 87 FKIFANNFVRISKHNVRFIQGQVSYTMGINEFSDKVIGLIIHTICFQTDEELK-RLRCFR 145
Query: 88 KNLD---------LIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
+L+ I + + +R ++PVK+QG+CGSCW FS TG++E A
Sbjct: 146 GSLNASRDGSKYITIAAPPPSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLAT 205
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY-TGKDG----V 193
G +SLSEQQLVDC+ + N CNGGL AF+Y+K + G+DTE +YPY +G+ G
Sbjct: 206 GNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPT 265
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKC 252
C+F+ + V+V +++ G EL+ AVG P+SVA + F YKSGVYS +C
Sbjct: 266 CRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQC 325
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFK-MEMGKNMCGIATCASYP 311
+ D++H V+ VGYG E+G+PYWLIKNSWG +WG++GY K + N+CG+A+ ASYP
Sbjct: 326 SSD--DLDHGVLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMASYP 383
Query: 312 VV 313
++
Sbjct: 384 LM 385
>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
Length = 347
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + +A GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 142 VTPVKNQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDL 201
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTEE YPY GK+ C F ++G + V++ G ED L+ AV P
Sbjct: 202 AFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGP 261
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YW+IKNSWG
Sbjct: 262 ISIAIDAGHRSFQLYKKGVYFDEECSSEELD--HGVLLVGYGTDPEAGDYWIIKNSWGTK 319
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 320 WGEKGYVRIARNRNNHCGVATKASYPLV 347
>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
Length = 398
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 100/217 (46%), Positives = 141/217 (64%), Gaps = 6/217 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FS TG+LE + + + +SLSEQ LVDC++ + N G
Sbjct: 184 VDWRNSSYVTVVKDQGQCGSCWAFSATGALEGQHMRKTHQLVSLSEQNLVDCSRKYGNNG 243
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDEL 219
CNGGL AFEYIK N G+DTEE+YPY G +G C F + VG + ++ G E+ L
Sbjct: 244 CNGGLMDNAFEYIKDNHGIDTEESYPYKGVEGKKCHFRRKFVGAEDYGYTDLPEGDEEAL 303
Query: 220 QHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV-PYW 277
+ AV + P+SVA + F+ Y+ G+Y+ +C +P D++H V+ VGYG ++ YW
Sbjct: 304 KVAVATIGPISVAIDAGHISFQNYRKGIYTENEC--SPEDLDHGVLVVGYGTDENAGDYW 361
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
++KNSWG WG+HGY +M K N CGIA+ ASYP+V
Sbjct: 362 IVKNSWGTRWGEHGYIRMARNKRNQCGIASKASYPIV 398
>gi|20136379|gb|AAM11647.1|AF490984_1 cathepsin L, partial [Fasciola hepatica]
Length = 311
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 100/215 (46%), Positives = 141/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 97 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNG 156
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL+
Sbjct: 157 CSGGLMENAYQYLK-QFGLETESSYPYTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELK 215
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y+SG+Y S C +P+ VNHAV+AVGYG +DG YW++K
Sbjct: 216 NLVGAEGPAAVAVDVESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYGTQDGTDYWIVK 273
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG WG+ GY +M + NMCGIA+ AS +VA
Sbjct: 274 NSWGSYWGERGYIRMARNRGNMCGIASLASVAMVA 308
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 114/262 (43%), Positives = 147/262 (56%), Gaps = 10/262 (3%)
Query: 60 LSFARFA----RRYGKIYESVE-EMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKD 114
L +FA Y K Y ++ +K K L + T + +R +S VKD
Sbjct: 75 LGLTKFADLTNEEYKKHYLGIKVNVKKNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKD 134
Query: 115 QGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIK 174
QG CGSCW+FSTTG++E A+ G +SLSEQ LVDC+ + NQGC GGL AFEYI
Sbjct: 135 QGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYII 194
Query: 175 YNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFE 234
NGG+ TE +YPYT G CKF+ G ++ I G ED L A+ +PVSVA +
Sbjct: 195 DNGGIATESSYPYTAAQGRCKFTKSMNGANIIGYKEIPQGEEDSLTAALA-KQPVSVAID 253
Query: 235 VVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGY- 292
F+ Y SGVY C + +D H V+AVGYG +G Y++IKNSWG WG GY
Sbjct: 254 ASHMSFQLYSSGVYDEPACSSEALD--HGVLAVGYGTLEGKDYYIIKNSWGPTWGQDGYI 311
Query: 293 FKMEMGKNMCGIATCASYPVVA 314
F +N CG+AT ASYP+ A
Sbjct: 312 FMSRNAQNQCGVATMASYPISA 333
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 139/230 (60%), Gaps = 14/230 (6%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
L+L +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LNLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC+ NQGCNGG + AF+Y+K NGGLD+E +YPY KDG CK+ EN V
Sbjct: 167 VDCSHPQGNQGCNGGFMNNAFQYVKENGGLDSEASYPYVAKDGSCKYKPENSVANDTGFV 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
I E EL AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VIP-AHEKELMKAVATVGPISVAVDASHSSFQFYKSGIYFEQDCSSKNLD--HGVLVVGY 283
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E + YWLIKNSWG WG +GY K+ + N CGIAT ASYP+V
Sbjct: 284 GFEGTNSNNNNYWLIKNSWGPEWGSNGYIKIAKDRNNHCGIATAASYPIV 333
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 121/296 (40%), Positives = 165/296 (55%), Gaps = 24/296 (8%)
Query: 35 RLVSSDGLRDFETSVL-QVIGQARHALSFARFA-------RRYGKIYESVEEMKLRFATF 86
R+V L+ E L +G+ + L F R+ Y+ E K++ + F
Sbjct: 50 RMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMTNEEFRQLMNGYKHKAERKVKGSLF 109
Query: 87 SKN--LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
+ L+ RS + + Y ++PVKDQG CGSCW FS TG+LE + GK + L
Sbjct: 110 LEPNFLEAPRSLDWRDKGY-----VTPVKDQGQCGSCWAFSATGALEGQQFRKTGKMVQL 164
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGV 203
SEQ LV+C++ N+GCNGGL QAF+Y+K N GLD+EE+YPY G D C + V
Sbjct: 165 SEQNLVECSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEESYPYLGTDDQKCHYDPRYNAV 224
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHA 262
V+I G+E L AV V P+SVA + + F+FY+SG+Y +C + +D H
Sbjct: 225 NDTGFVDIKSGSEHALMKAVTAVGPISVAIDAGHESFQFYQSGIYYEPECSSEELD--HG 282
Query: 263 VVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
V+ VGYG E DG YW++KNSW E WGD GY M + N CGIAT ASYP+V
Sbjct: 283 VLLVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYVYMAKDRQNHCGIATAASYPLV 338
>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
Length = 352
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + +A GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 147 VTPVKNQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDL 206
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTEE YPY GK+ C F ++G + V++ G ED L+ AV P
Sbjct: 207 AFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGP 266
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YW+IKNSWG
Sbjct: 267 ISIAIDAGHRSFQLYKKGVYFDEECSSEELD--HGVLLVGYGTDPEAGDYWIIKNSWGTK 324
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 325 WGEKGYVRIARNRNNHCGVATKASYPLV 352
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 108/245 (44%), Positives = 146/245 (59%), Gaps = 11/245 (4%)
Query: 73 YESVEEMKLRFATF--SKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSL 130
Y+S K++ +TF N+ + + + + Y ++PVK+QG CGSCW FSTTGSL
Sbjct: 99 YKSSNVTKVQGSTFLTPSNIQVPDTVDWRTKGY-----VTPVKNQGQCGSCWAFSTTGSL 153
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + K +SLSEQ LVDC++ N GC GGL Q F+Y+ N G+D+E+ YPY +
Sbjct: 154 EGQTFKKTSKLVSLSEQNLVDCSRTEGNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAE 213
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
D C + + +V ++T G E L AV V PVSVA + F+ Y+SGVY
Sbjct: 214 DETCHYKASCDSAEVTGFTDVTSGDEQALMEAVASVGPVSVAIDASHQSFQLYESGVYDE 273
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCA 308
+C ++ +D H V+ VGYG + G YWL+KNSWGE WG GY KM K N CGIAT A
Sbjct: 274 PECSSSELD--HGVLVVGYGTDGGKDYWLVKNSWGETWGLSGYIKMSRNKSNQCGIATSA 331
Query: 309 SYPVV 313
SYP+V
Sbjct: 332 SYPLV 336
>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
Length = 330
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 136/218 (62%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW FS GSLE + GK + LSEQ L+DC+ ++ N
Sbjct: 116 KSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF+Y+K N GLDT E+Y Y DG C++ + V + V + L +ED
Sbjct: 176 VGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL-SEDA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L +AV V PVSV + FRFY+ G Y C +T +D HAV+ VGYG E DG Y
Sbjct: 235 LMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLD--HAVLVVGYGEESDGRKY 292
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE+WG GY KM + N CGIAT A YP V
Sbjct: 293 WLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 330
>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
Length = 330
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 136/218 (62%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW FS GSLE + GK + LSEQ L+DC+ ++ N
Sbjct: 116 KSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF+Y+K N GLDT E+Y Y DG C++ + V + V + L +ED
Sbjct: 176 VGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL-SEDA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L +AV V PVSV + FRFY+ G Y C +T +D HAV+ VGYG E DG Y
Sbjct: 235 LMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLD--HAVLVVGYGEESDGRKY 292
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE+WG GY KM + N CGIAT A YP V
Sbjct: 293 WLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 330
>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
Length = 338
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 136/218 (62%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW FS GSLE + GK + LSEQ L+DC+ ++ N
Sbjct: 124 KSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF+Y+K N GLDT E+Y Y DG C++ + V + V + L +ED
Sbjct: 184 VGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL-SEDA 242
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L +AV V PVSV + FRFY+ G Y C +T +D HAV+ VGYG E DG Y
Sbjct: 243 LMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLD--HAVLVVGYGEESDGRKY 300
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE+WG GY KM + N CGIAT A YP V
Sbjct: 301 WLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 338
>gi|28932708|gb|AAO60048.1| midgut cysteine proteinase 5 [Rhipicephalus appendiculatus]
Length = 329
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 102/202 (50%), Positives = 125/202 (61%), Gaps = 4/202 (1%)
Query: 113 KDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEY 172
+DQG CGSCW FS TGSLE + G+ +SLSEQ LVDC+Q+F N GC GGL AF Y
Sbjct: 131 QDQGQCGSCWAFSATGSLEGQHLLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFNY 190
Query: 173 IKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVA 232
IK N G+DTEE YPY DG C+F E+VG V+I G ED+L+ A P
Sbjct: 191 IKANDGIDTEEGYPYEAVDGECRFKKEDVGATDTGFVDIPGGIEDDLKKA-SFCWPPPWL 249
Query: 233 FEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGY 292
+ F+ Y GVY + C + +D H V+ VGYGV+ G YWL+KNSW E+WGD GY
Sbjct: 250 WRSPSSFQLYSEGVYDESDCSSEQLD--HGVLVVGYGVKGGKKYWLVKNSWAESWGDQGY 307
Query: 293 FKMEMGK-NMCGIATCASYPVV 313
M K N CGIA+ ASYP+V
Sbjct: 308 ILMSRDKNNQCGIASAASYPLV 329
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 124/329 (37%), Positives = 168/329 (51%), Gaps = 66/329 (20%)
Query: 48 SVLQVIGQARHALS--------FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK 99
+VL VIG A ALS + F + K YES E +R F +N I N K
Sbjct: 60 AVLAVIGLAS-ALSPNPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSK 118
Query: 100 G-LSYRLGLN-------------------------------------------------- 108
+ LG+N
Sbjct: 119 KEFDFYLGMNHFGDLTNKEYRERYLGYRRPENTPSKASYIFSRAEKIEDVPDQIDWRDQG 178
Query: 109 -ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPS 167
++PVK+QG CGSCW FS GSLE + ++ GK +SLSEQ LVDC+ N GCNGG
Sbjct: 179 FVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMD 238
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
QAFEY+K N G+DTE++YPY G DG C F ++++G + +++ G E+ L+ AVG+
Sbjct: 239 QAFEYVKDNHGIDTEDSYPYVGTDGSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAG 298
Query: 228 PVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGE 285
PVSVA + F+FY+ GVY+ C + +D H V+ VGYG + G +W++KNSWG
Sbjct: 299 PVSVAIDASSMLFQFYRGGVYNVPWCSTSELD--HGVLVVGYGKQFQGKDFWMVKNSWGV 356
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG +GY +M K N CGIA+ AS P V
Sbjct: 357 GWGIYGYIEMSRNKGNQCGIASKASIPTV 385
>gi|50657029|emb|CAH04632.1| cathepsin L [Suberites domuncula]
Length = 324
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 99/207 (47%), Positives = 130/207 (62%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+S VK+QG CGSCW+FS TGSLE + G+ +SLSEQ L+DC+ F N GC GG+
Sbjct: 120 VSEVKNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCSSRFGNHGCKGGIMDD 179
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF Y+ N G+DTE +YPYT KDG C+F+ NVG +I G+E L A + P
Sbjct: 180 AFRYVISNHGVDTESSYPYTAKDGYCRFNQNNVGATETSYRDIARGSESSLTQASAQIGP 239
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+FYK+GVY C ++ +D H V+ VGYG E G Y+++KNSWG W
Sbjct: 240 ISVAIDASHRSFQFYKNGVYYEPSCSSSRLD--HGVLVVGYGTEGGQDYFIVKNSWGTRW 297
Query: 288 GDHGYFKMEMG-KNMCGIATCASYPVV 313
G GY M +N CGIA+ ASYP+V
Sbjct: 298 GMDGYIMMSRNRRNNCGIASQASYPIV 324
>gi|391333246|ref|XP_003741030.1| PREDICTED: digestive cysteine proteinase 2-like [Metaseiulus
occidentalis]
Length = 327
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 107/273 (39%), Positives = 162/273 (59%), Gaps = 11/273 (4%)
Query: 48 SVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS-----KNLDLIRSTN-CKGL 101
++L +GQ + + +RF + S+ + + +T + + D I T + +
Sbjct: 59 NLLHDLGQVSYRMGLSRFTDATPEEIRSLTCLNISDSTSTGKSNGNSFDTIDITELSEAV 118
Query: 102 SYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGC 161
+R ++PVKDQG CGSCW F+ TG++E Y + G+ +SLSEQ LVDC ++ + GC
Sbjct: 119 DWRQNGYVTPVKDQGKCGSCWAFAATGAVEGQYFKKTGQLVSLSEQNLVDCDRS--SDGC 176
Query: 162 NGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQH 221
GG ++FEYI+ NGG+ TE +Y Y G C+F+++++G V ++ G E+ L
Sbjct: 177 EGGYFYESFEYIRSNGGIATESSYGYEATAGSCRFTADSIGATVSGRDSVASGDEEALLK 236
Query: 222 AVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKN 281
AV + P+SV +V+D FR Y SGVY +C ++ NHAV+ VGYG E G YWL+KN
Sbjct: 237 AVASIGPISVTIDVIDTFRHYSSGVYYDAECSSSSR--NHAVLVVGYGTEAGGDYWLVKN 294
Query: 282 SWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
SWG ++G+ GY KM K N CGIA+ A YP+
Sbjct: 295 SWGTSFGEQGYIKMARNKGNNCGIASEAGYPIA 327
>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
Length = 349
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 136/218 (62%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQGHCGSCW FS GSLE + GK + LSEQ L+DC+ ++ N
Sbjct: 135 KSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGN 194
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF+Y+K N GLDT E+Y Y DG C++ + V + V + L +ED
Sbjct: 195 VGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL-SEDA 253
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L +AV V PVSV + FRFY+ G Y C +T +D HAV+ VGYG E DG Y
Sbjct: 254 LMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLD--HAVLVVGYGEESDGRKY 311
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE+WG GY KM + N CGIAT A YP V
Sbjct: 312 WLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 349
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/215 (47%), Positives = 135/215 (62%), Gaps = 5/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQ CGSCW FS TG+LE + + +SLSEQQLVDC+ + N G
Sbjct: 109 VDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDG 168
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+YIK NGG+DTE +YPY +D C+F + ++G SV + E+ LQ
Sbjct: 169 CGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQ 227
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + F+FY SGVY C +P ++H V+AVGYG E YWL+
Sbjct: 228 EAVSGVGPISVAIDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLV 285
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG +WGD GY KM + N CGIA+ SYP V
Sbjct: 286 KNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 320
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/215 (47%), Positives = 135/215 (62%), Gaps = 5/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQ CGSCW FS TG+LE + + +SLSEQQLVDC+ + N G
Sbjct: 110 VDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDG 169
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+YIK NGG+DTE +YPY +D C+F + ++G SV + E+ LQ
Sbjct: 170 CGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQ 228
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + F+FY SGVY C +P ++H V+AVGYG E YWL+
Sbjct: 229 EAVSGVGPISVAIDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLV 286
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG +WGD GY KM + N CGIA+ SYP V
Sbjct: 287 KNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 165/321 (51%), Gaps = 60/321 (18%)
Query: 48 SVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGL 107
S ++ + + F + R + Y+ V E + R+ F NLDLI N +G S LG+
Sbjct: 15 SANRLFSEQHYQNQFTNWMVRLDRAYD-VFEFQDRYNAFKNNLDLIHKWNSQGHSTVLGV 73
Query: 108 N---------------------------------------------------ISPVKDQG 116
N + VKDQG
Sbjct: 74 NHLADLSNEEYRNLYLGVKVDASRLPQQAASIKLNKVFAPVAASLDWRSSGAVGRVKDQG 133
Query: 117 HCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYN 176
CGSCW+FSTTGS+E A A G SLSEQQL+DC++ + N+GCNGGL A +Y+
Sbjct: 134 QCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNEGCNGGLMDAAMKYVIAQ 193
Query: 177 GGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR-PVSVAFE 234
GGLDTEE+YPYT D CKF+ N+G ++ +++ G+E +L A L + PVSVA +
Sbjct: 194 GGLDTEESYPYTMSDSYTCKFNPANIGAKISSYIDVQRGSETDL--AAKLNKGPVSVAID 251
Query: 235 VV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYF 293
F+ YKSGVY C + +D H V+AVGYG E YW++KNSWG NWG GY
Sbjct: 252 ASHSSFQLYKSGVYYEPACSSYNLD--HGVLAVGYGTEGSSNYWIVKNSWGPNWGLSGYI 309
Query: 294 KMEMGK-NMCGIATCASYPVV 313
M K N CGI++ AS PVV
Sbjct: 310 WMAKDKSNHCGISSMASIPVV 330
>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
Length = 333
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 137/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++P+KDQG CGSCW+FS TGSLE +SLSEQ LVDC+ F N
Sbjct: 118 KKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGN 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AFEY+K NGG+DTEE+YPYT +DG C + + N ++ +E
Sbjct: 178 EGCNGGLMDSAFEYVKSNGGIDTEESYPYTAEDGTCLYKAANNAGVNTGYKDVQAKSESA 237
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L+ AV V PVSVA + + F+ Y SG+Y C + +D H V+AVGYG E +
Sbjct: 238 LRDAVEKVGPVSVAIDASNWSFQMYTSGIYYEPACSSDSLD--HGVLAVGYGSEWPNKEF 295
Query: 277 WLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
W++KNSWG +WG+ GY KM KN CGIAT ASYP+V
Sbjct: 296 WIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASYPLV 333
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 138/218 (63%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K L +R ++PVK+QG CGSCW FS GSLE + GK +SLSEQ LVDC+ ++ N
Sbjct: 116 KSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF+Y+K N GLDT E+Y Y +DG+C+++ + V V + L +ED+
Sbjct: 176 LGCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLCRYNPKYSAANVTGFVKVPL-SEDD 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L AV V PVSV + FRFY G+Y C +T MD HAV+ VGYG E DG Y
Sbjct: 235 LMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMD--HAVLVVGYGEESDGGKY 292
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE+WG GY KM + N CGIAT A YP V
Sbjct: 293 WLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPTV 330
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 119/256 (46%), Positives = 150/256 (58%), Gaps = 12/256 (4%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
LSF F +Y Y+ VE R S NL + +R ++P+KDQG CG
Sbjct: 93 LSFEEFKGKYFG-YKHVEREFAR----SNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCG 147
Query: 120 SCWTFSTTGSLEAAY-HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
SCW FS TGS+E A+ Q SLSEQQLVDC+ ++ N GCNGGL AFEYI N G
Sbjct: 148 SCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKG 207
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD- 237
+ E AYPY G G+C+ S V V + ++ G E L +AVG V PVSVA E
Sbjct: 208 ICAESAYPYKGVGGLCQKSCTKV-VTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQA 266
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM 297
GF+FY SGV+S T CG+ +++H V+AVGYG YW++KNSWG +WG+ GY +M
Sbjct: 267 GFQFYSSGVFSGT-CGH---NLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRMIR 322
Query: 298 GKNMCGIATCASYPVV 313
KN CGIA SYP V
Sbjct: 323 NKNQCGIAIQPSYPTV 338
>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
Length = 327
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 105/216 (48%), Positives = 132/216 (61%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+Q CGSCW FSTTGSLE + G +SLSEQ LVDC++ N+G
Sbjct: 114 VDWRQKGAVTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGK-DGVCKFSSENVGVQVLDSVNITLGAEDEL 219
C GGL QAF+YIK NGG+DTEE YPY GK + C++ S G + V+I G ED L
Sbjct: 174 CQGGLMDQAFKYIKTNGGIDTEECYPYKGKNERKCEYKSSCSGATLSSYVDIKTGDEDAL 233
Query: 220 QHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
A + P+SV + F+ Y GVY +C + +D H V+ VGYG + YWL
Sbjct: 234 MQASATIGPISVGIDASHPSFQLYDHGVYHEKRCSSKKLD--HGVLVVGYGTDGEKDYWL 291
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWGE WG GY KM K N CGIAT ASYPVV
Sbjct: 292 VKNSWGEEWGMEGYIKMSRNKDNQCGIATQASYPVV 327
>gi|66812702|ref|XP_640530.1| counting factor associated protein [Dictyostelium discoideum AX4]
gi|74897159|sp|Q54TR1.1|CFAD_DICDI RecName: Full=Counting factor associated protein D; Flags:
Precursor
gi|60468561|gb|EAL66564.1| counting factor associated protein [Dictyostelium discoideum AX4]
Length = 531
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 158/303 (52%), Gaps = 52/303 (17%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F + +Y K Y S +E RF F +I + N K SY+LG+N
Sbjct: 225 FKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYADLSNKEFNTL 284
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQG CGSCWTF +TGSLE
Sbjct: 285 VKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEG 344
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
G+ +SLSEQQLVDCA +QGC GG S AF+Y+ G L TE YPY ++G
Sbjct: 345 TNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYLMQNG 404
Query: 193 VCKFSS-ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSST 250
+C+ + GV + VN+T G+E LQ+A+ PV++A + VD FR+Y SGVY++
Sbjct: 405 LCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNP 464
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCAS 309
C N D++H V+A+GYG G Y+L+KNSW NWG GY M N+CG+++ A+
Sbjct: 465 ACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYVYMARNDNNLCGVSSQAT 524
Query: 310 YPV 312
YP+
Sbjct: 525 YPI 527
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 167/296 (56%), Gaps = 24/296 (8%)
Query: 35 RLVSSDGLRDFETSVLQ-VIGQARHALSFARFA----RRYGKI---YESVEEMKLRFATF 86
RLV L+ E L+ +G+ + L F + +I Y+ E K + + F
Sbjct: 49 RLVWEKNLKKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFKQIMNGYKHKAERKFKGSLF 108
Query: 87 SKN--LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
+ L+ RS + + Y ++PVKDQG CGSCW FSTTG+LE GK +SL
Sbjct: 109 LEPNFLEAPRSVDWREKGY-----VTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSL 163
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGV 203
S Q LV+C++ N+GCNGGL QAF+Y+K N GLD+E++YPY G D C + +
Sbjct: 164 SGQNLVECSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKFSAA 223
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHA 262
V+I G E L AV V PVSVA + + F+FY+SG+Y +C + +D H
Sbjct: 224 NDTGFVDIPSGNERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HG 281
Query: 263 VVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
V+AVGYG + DG +W++KNSW ENWGD GY M KN CGIAT ASYP+V
Sbjct: 282 VLAVGYGFQGEDVDGKKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASYPLV 337
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 100/209 (47%), Positives = 135/209 (64%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQGHCGSCW FS+TG+LE + ++ G +SLSEQ L+DC+ + N GCNGGL
Sbjct: 131 VTAVKDQGHCGSCWAFSSTGALEGQHFRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDY 190
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK N GLDTE+ YPY ++ C+++ N G V+I G E++L+ AV + P
Sbjct: 191 AFQYIKDNKGLDTEKTYPYEAENDRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGP 250
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED--GVPYWLIKNSWGE 285
+SVA + + F+ Y GVY C +D H V+ VGYG ++ G YWL+KNSWG+
Sbjct: 251 ISVAIDASHESFQLYSEGVYYDPDCSAENLD--HGVLIVGYGTDETSGHDYWLVKNSWGK 308
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG GY KM K N CGIA+ ASYP+V
Sbjct: 309 TWGQKGYIKMARNKNNHCGIASSASYPLV 337
>gi|391333248|ref|XP_003741031.1| PREDICTED: uncharacterized protein LOC100898636 [Metaseiulus
occidentalis]
Length = 642
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 113/304 (37%), Positives = 163/304 (53%), Gaps = 59/304 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC----KGLSYRLGLN------------ 108
+ R +GK Y+ VEE +R F KN+ +I + N K +SYR+GL+
Sbjct: 22 YKRIHGKSYD-VEEESMRRRIFEKNVAMINAHNLLHDLKQVSYRMGLSRLTDATPAEVQA 80
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVKDQG CG+CWTF+ TG++E
Sbjct: 81 LKCLNFTLPNKTSRKSTLGTLQRQDLPEAVDWTQQGYVTPVKDQGKCGACWTFAATGAIE 140
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+ +A G +SLSEQ ++DC + + GC+GGL +AF+Y+K +GG+D EE+YPY
Sbjct: 141 GQHFKATGNLVSLSEQNILDCVKTATSNGCSGGLFVEAFDYLKNSGGIDAEESYPYEASG 200
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSST 250
G C+F ++V V I+ G E ELQ AV + P+SV + GF+ Y G+Y
Sbjct: 201 GTCRFRQDSVAATVSGYQAISAGNEAELQEAVATIGPISVGIDSGHPGFQHYTGGIYYEP 260
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCAS 309
+C ++HAV+ VGYG E+G YWL+KNSWG ++G GY KM + N CGIAT A+
Sbjct: 261 ECTE---HLSHAVLVVGYGTENGEDYWLVKNSWGASYGLQGYIKMARNRNNNCGIATGAA 317
Query: 310 YPVV 313
YP+
Sbjct: 318 YPIT 321
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 102/226 (45%), Positives = 149/226 (65%), Gaps = 7/226 (3%)
Query: 90 LDLIRSTN-CKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
D I S++ + + +R ++PVK+QG+CGSCW FS TG++E + +A G+ SLSEQ
Sbjct: 420 FDAIESSDLSEAIDWRQQGYVTPVKNQGNCGSCWAFSATGAVEGQHFKATGRLESLSEQN 479
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS 208
LVDC + ++GC+GG QAF+YIK NGG++TE++YPY DG C+F +++G V
Sbjct: 480 LVDCVK--ESKGCDGGFFEQAFQYIKDNGGINTEDSYPYEAFDGSCRFREDSIGATVSGY 537
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
I G+E +LQ AV + P+SVA +V + F+ Y+ GVY C ++ +D HAV+ VG
Sbjct: 538 QTIPKGSEADLQKAVSTIGPISVAIDVSNPSFQNYREGVYYEPSCSSSNLD--HAVLVVG 595
Query: 268 YGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
YG + G YWL+KNSWG ++G+ GY +M K N CGIA+ A+YP
Sbjct: 596 YGSDGGEDYWLVKNSWGTSFGEQGYVRMARNKGNNCGIASAAAYPT 641
>gi|146147376|gb|ABQ01982.1| cathepsin [Fasciola gigantica]
Length = 326
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 111/271 (40%), Positives = 160/271 (59%), Gaps = 17/271 (6%)
Query: 57 RHALSFARFARRYGKIYE-SVEEMKLRFAT-FSKNLDLI----------RSTNCKGLSYR 104
RH L F + + + + EE K ++ T + D++ R+ K + +R
Sbjct: 57 RHYLGFVTYTLGLNQFTDMTFEEFKAKYLTEMPRASDILSHGIPYEANNRAVPDK-IDWR 115
Query: 105 LGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGG 164
++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N GC GG
Sbjct: 116 ESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNMGCMGG 175
Query: 165 LPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVG 224
L A+EY+K GL+TE +YPYT +G C+++ + +V D + G+E EL++ VG
Sbjct: 176 LMENAYEYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVG 234
Query: 225 LVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWG 284
P +VA +V F Y G+Y S C + + VNHAV+AVGYG + G YW++KNSWG
Sbjct: 235 AEGPAAVAVDVESDFMMYSGGIYQSRTC--SSLRVNHAVLAVGYGTQSGTDYWIVKNSWG 292
Query: 285 ENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
+WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 293 SSWGERGYIRMVRNRGNMCGIASLASLPMVA 323
>gi|10798511|emb|CAC12806.1| cathepsin L1 [Fasciola hepatica]
Length = 311
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 102/216 (47%), Positives = 142/216 (65%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 97 IDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNEKTSISFSEQQLVDCSGPWGNNG 156
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDEL 219
C+GGL A+EY+K GL+TE +YPY +G C++ +E +GV +V + G+E EL
Sbjct: 157 CSGGLMENAYEYLK-RFGLETESSYPYRAVEGQCRY-NEQLGVAKVTGYYTVHSGSEVEL 214
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
++ VG P ++A E F Y+SG+Y S C P +NHAV+AVGYG +DG YW++
Sbjct: 215 KNLVGSEGPAAIAVEAESDFMMYRSGIYQSQTC--LPFALNHAVLAVGYGTQDGTDYWIV 272
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
KNSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 273 KNSWGLSWGERGYIRMARNRGNMCGIASLASLPMVA 308
>gi|1498185|dbj|BAA06738.1| cysteine proteinase-1 precursor [Drosophila melanogaster]
Length = 254
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 135/218 (61%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 39 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 98
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ VG +I G E +
Sbjct: 99 NGCNGGLMDNAFPYIKDNGGIDTEKSYPYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKK 158
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ V V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 159 MPEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 216
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 217 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSYPLV 254
>gi|124487918|gb|ABN12042.1| putative cathepsin L precursor [Maconellicoccus hirsutus]
Length = 211
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/207 (46%), Positives = 131/207 (63%), Gaps = 2/207 (0%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSC+ FSTTGS+E + G SLSEQQ++DC+ + N GC GG+
Sbjct: 5 VTEVKDQGDCGSCYAFSTTGSIEGQQFRKSGTLKSLSEQQIIDCSVKYGNGGCEGGVMEN 64
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF Y+ NGG+D+E +YPY ++ C + EN + D + +G E+ L+ AV V P
Sbjct: 65 AFNYVIDNGGIDSEGSYPYIDRETQCAYKPENSAANIKDFATLPVGDEEMLKLAVAKVGP 124
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+S+A F+ YKSGVY C + P D+ HAV+ VGYG EDG YWL+KNSW +W
Sbjct: 125 ISIAINTSPRSFKLYKSGVYYDKDCKSDPDDLTHAVLVVGYGTEDGKDYWLVKNSWNTDW 184
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G++GY KM K N CGIA+ A+YP V
Sbjct: 185 GENGYIKMARNKNNHCGIASYATYPTV 211
>gi|261824891|pdb|3H6S|A Chain A, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824892|pdb|3H6S|B Chain B, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824893|pdb|3H6S|C Chain C, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824894|pdb|3H6S|D Chain D, Strucure Of Clitocypin - Cathepsin V Complex
gi|310942696|pdb|3KFQ|A Chain A, Unreduced Cathepsin V In Complex With Stefin A
gi|310942697|pdb|3KFQ|B Chain B, Unreduced Cathepsin V In Complex With Stefin A
Length = 221
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 106/221 (47%), Positives = 135/221 (61%), Gaps = 8/221 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+Q CGS W FS TG+LE + GK +SLSEQ LVDC++ N
Sbjct: 3 KSVDWRKKGYVTPVKNQKQCGSXWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN Q + G E
Sbjct: 63 QGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVAQDTGFTVVAPGKEKA 122
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGYG E D
Sbjct: 123 LMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGYGFEGANSDN 180
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 181 SKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 221
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 131/220 (59%), Gaps = 9/220 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+Q CGSCW FSTTGSLE G SLSEQQLVDC+ + N G
Sbjct: 112 VDWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYGNHG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL AF+YI+ NGG+D+E +YPY K+G C+F V +I D LQ
Sbjct: 172 CQGGLMDNAFKYIEANGGIDSEASYPYEAKNGKCRFQQSAVAATCTGYKDIPHDDIDGLQ 231
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE------DG 273
AV V P+SVA + F+ Y +GVY C +T +D H V+AVGYG E +
Sbjct: 232 DAVANVGPISVAMDASHSSFQLYAAGVYDPLLCSSTRLD--HGVLAVGYGTEPSGLFHEE 289
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
PYWL+KNSWG +WG GYFK+ N CGIAT ASYP V
Sbjct: 290 KPYWLVKNSWGPDWGQQGYFKIVRKDNKCGIATDASYPTV 329
>gi|2146900|pir||S67481 cathepsin L-like cysteine proteinase (EC 3.4.22.-) CP1 [similarity]
- fruit fly (Drosophila melanogaster) (fragment)
Length = 218
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 135/218 (61%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 3 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ VG +I G E +
Sbjct: 63 NGCNGGLMDNAFPYIKDNGGIDTEKSYPYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKK 122
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ V V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 123 MPEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 180
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 181 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSYPLV 218
>gi|21263041|gb|AAM44832.1|AF510856_1 cathepsin L2 [Fasciola gigantica]
Length = 326
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 100/215 (46%), Positives = 139/215 (64%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNMG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPYT +G C+++ + +V D + G+E EL+
Sbjct: 172 CMGGLMENAYEYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y G+Y S C + + VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGAEGPAAVAVDVESDFMMYSGGIYQSRTC--SSLHVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGSSWGERGYIRMVRNRGNMCGIASLASLPMVA 323
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 109/229 (47%), Positives = 139/229 (60%), Gaps = 8/229 (3%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LD + + +R ++ VK+QGHCGSCW FS TG+LE + K ISLSEQ L
Sbjct: 107 LDAGSALTPHSVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC+ N+GCNGGL AF+YIK NGGLD+EE+YPY GKDG CK+ ++ V
Sbjct: 167 VDCSWPEGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSCKYKPQSSAANDTGYV 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+I E L AV V P+SV + + F+FY +G+Y +C + D++H V+ VGY
Sbjct: 227 DIPK-QEKALMKAVATVGPISVGIDASHESFQFYSTGIYFEPQC--SSEDLDHGVLVVGY 283
Query: 269 GVE---DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
GVE YWL+KNSWG WG GY KM + N CGIAT ASYPVV
Sbjct: 284 GVEGAHSNNKYWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASYPVV 332
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 99/207 (47%), Positives = 131/207 (63%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ +KDQG CGSCW FSTTGSLE + +A G +SLSEQ LVDC++ N+GC GG Q
Sbjct: 126 VTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQ 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
F+YI N G+DTE+ YPY K+ CKF + +G + ++T G ED L+ A + P
Sbjct: 186 GFQYIIQNKGIDTEQCYPYKAKNHRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGP 245
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + F+FY SGVY+ +C +T +D H V+ VGYG YWL+KNSWG W
Sbjct: 246 ISVGIDASHQSFQFYSSGVYNEFECSSTKLD--HGVLVVGYGTYGSKDYWLVKNSWGTVW 303
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY M K N CG+AT AS+PVV
Sbjct: 304 GNEGYIMMSRNKDNQCGVATDASFPVV 330
>gi|4574304|gb|AAD23996.1|AF112566_1 cathepsin [Fasciola gigantica]
Length = 326
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 140/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ +KDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTELKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNMG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL A+EY+K GL+TE +YPYT +G C+++ + +V D + G+E EL+
Sbjct: 172 CSGGLMENAYEYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y G+Y S C + + VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGAEGPAAVAVDVESDFMMYSGGIYQSRTC--SSLRVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGSSWGERGYIRMVRNRGNMCGIASLASLPMVA 323
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 99/208 (47%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS+TG+LE + +A GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 150 VTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDL 209
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTE++YPY G++ C F VG V++ G E+ L+ AV P
Sbjct: 210 AFEYIKENHGVDTEDSYPYVGRETKCHFKRNTVGADDKGFVDLPEGDEEALKKAVATQGP 269
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YWL+KNSWG
Sbjct: 270 ISIAIDAGHRSFQLYKKGVYFDEECSSEELD--HGVLLVGYGTDPEAGDYWLVKNSWGPT 327
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 328 WGEKGYIRIARNRNNHCGVATKASYPLV 355
>gi|198432221|ref|XP_002130541.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 330
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 101/220 (45%), Positives = 137/220 (62%), Gaps = 3/220 (1%)
Query: 97 NCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
N + +R ++PVK+Q CGSCW FS TGSLE + K +SLSEQQL+DC+
Sbjct: 111 NPTTVDWRTQGYVTPVKNQLQCGSCWAFSATGSLEGQHFAKTKKLVSLSEQQLIDCSTKQ 170
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
+ GC GG P AF YI GG+++E YPY K+ VC+F+ V + V+IT +E
Sbjct: 171 GDLGCGGGYPDWAFAYINQVGGIESETNYPYEAKNDVCRFNVSEVAATLTGCVDITPDSE 230
Query: 217 DELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP 275
+L+ AVG + PVSV + F+ Y SG+Y +C ++P ++H V+AVGYG ++G
Sbjct: 231 TQLEKAVGSIGPVSVLIDASHISFQLYGSGIYYEQQCSSSPASLDHGVLAVGYGADNGQE 290
Query: 276 YWLIKNSWGENWGD-HGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSWGE WG GY KM K N CGIAT ASYP+V
Sbjct: 291 YWMVKNSWGEGWGKLGGYIKMAKNKNNNCGIATQASYPIV 330
>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
Length = 330
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 108/226 (47%), Positives = 140/226 (61%), Gaps = 11/226 (4%)
Query: 91 DLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLV 150
D+ +S + + LSY ++PVKDQG C SCW FS GSLE + G+ ISLSEQ LV
Sbjct: 113 DVPKSVDWRNLSY-----VTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLV 167
Query: 151 DCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVN 210
DC+ ++ N GC GGL AF Y+K N GLDT +YPY ++G C++ +N V D V
Sbjct: 168 DCSWSYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPCRYDPKNSAANVTDFVK 227
Query: 211 ITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG 269
I + +ED L AV V P+SV + FRFYK G+Y C ++ +D HAV+ VGYG
Sbjct: 228 IPI-SEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLD--HAVLVVGYG 284
Query: 270 VE-DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
E DG YW++KNSWG+ WG +GY KM + N CGIAT A YP V
Sbjct: 285 EESDGNKYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYPTV 330
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 107/223 (47%), Positives = 137/223 (61%), Gaps = 10/223 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FS TG+LE + + GK +SLSEQ L+DC+ N
Sbjct: 118 KSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGPEGN 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGCNGGL QAF+YIK N G+D+EE+YPY GKD C + E V+I G E
Sbjct: 178 QGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDCLYKPEYNSANDTGFVDIPEGRER 237
Query: 218 ELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----- 271
L AV V P+SVA + F+FY+SGVY +C + +D H V+ VGYG E
Sbjct: 238 ALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELD--HGVLVVGYGYEGTDDD 295
Query: 272 DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+ YW++KNSW E WGD GY M + N CGIA+ ASYP+V
Sbjct: 296 NKKRYWIVKNSWSEKWGDQGYIHMAKDRSNNCGIASAASYPMV 338
>gi|344257452|gb|EGW13556.1| Cathepsin L1 [Cricetulus griseus]
Length = 290
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 108/226 (47%), Positives = 140/226 (61%), Gaps = 11/226 (4%)
Query: 91 DLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLV 150
D+ +S + + LSY ++PVKDQG C SCW FS GSLE + G+ ISLSEQ LV
Sbjct: 73 DVPKSVDWRNLSY-----VTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLV 127
Query: 151 DCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVN 210
DC+ ++ N GC GGL AF Y+K N GLDT +YPY ++G C++ +N V D V
Sbjct: 128 DCSWSYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPCRYDPKNSAANVTDFVK 187
Query: 211 ITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG 269
I + +ED L AV V P+SV + FRFYK G+Y C ++ +D HAV+ VGYG
Sbjct: 188 IPI-SEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLD--HAVLVVGYG 244
Query: 270 VE-DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
E DG YW++KNSWG+ WG +GY KM + N CGIAT A YP V
Sbjct: 245 EESDGNKYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYPTV 290
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 122/304 (40%), Positives = 158/304 (51%), Gaps = 65/304 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------- 108
++GK Y ++ E RF F NL I N +Y+LGLN
Sbjct: 58 KHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKT 117
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
++ VKDQG CGSCW FSTTGS+E
Sbjct: 118 IDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVN 177
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
G IS+SEQ+LV+C ++N QGCNGGL AFE+I NGG+DTEE YPYTGKDG C
Sbjct: 178 KIVTGDLISVSEQELVNCDTSYN-QGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKC 236
Query: 195 KFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKC 252
+ +N V +DS ++ + E L+ AV +PV+VA E F+FY SG+++ + C
Sbjct: 237 DKNKKNAKVVTIDSYEDVPVNDESSLKKAVS-NQPVAVAIEAGGRDFQFYTSGIFTGS-C 294
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCA 308
G ++H V+A GYG EDG YWL+KNSWG WG+ GY KME CGIA A
Sbjct: 295 GTA---LDHGVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEA 351
Query: 309 SYPV 312
SYP+
Sbjct: 352 SYPI 355
>gi|291224868|ref|XP_002732424.1| PREDICTED: cathepsin L-like [Saccoglossus kowalevskii]
Length = 823
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 103/201 (51%), Positives = 123/201 (61%), Gaps = 4/201 (1%)
Query: 115 QGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIK 174
+G CGSCW FSTTGSLE + GK LSEQQLVDC+ F N GCNGGL AFEYIK
Sbjct: 625 KGQCGSCWAFSTTGSLEGQTFKKTGKLPDLSEQQLVDCSTQFGNHGCNGGLMDLAFEYIK 684
Query: 175 YNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFE 234
G++ E YPY KDG C F V V+I E+ L+ AV + P+SVA +
Sbjct: 685 AAPGIEGEMDYPYLAKDGRCMFDQSKVVATDTGYVDIPSMDENALKEAVATIGPISVAID 744
Query: 235 V-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYF 293
F+ YKSGVY+ C + +D H V+AVGYG EDG YWL+KNSWG++WG GY
Sbjct: 745 AGHPSFQMYKSGVYNEPGCSSERLD--HGVLAVGYGTEDGQDYWLVKNSWGDSWGQAGYI 802
Query: 294 KMEMG-KNMCGIATCASYPVV 313
M N CGIAT ASYP+V
Sbjct: 803 MMSRNMNNQCGIATQASYPLV 823
>gi|308474437|ref|XP_003099440.1| CRE-CPL-1 protein [Caenorhabditis remanei]
gi|308266846|gb|EFP10799.1| CRE-CPL-1 protein [Caenorhabditis remanei]
Length = 337
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 136/208 (65%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + + GK +SLSEQ LVDC+ + N GCNGGL Q
Sbjct: 132 VTDVKNQGMCGSCWAFSATGALEGQHARKLGKLVSLSEQNLVDCSTKYGNHGCNGGLMDQ 191
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI+ N G+DTE++YPY G+D C FS ++VG ++ G E++L+ AV P
Sbjct: 192 AFEYIRDNHGVDTEDSYPYKGRDMKCHFSKKDVGADDKGYTDLPEGDEEQLKIAVATQGP 251
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YWL+KNSWG
Sbjct: 252 ISIAIDAGHRSFQLYKKGVYYDEECSSEELD--HGVLLVGYGTDPEHGDYWLVKNSWGTG 309
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 310 WGEKGYIRIARNRNNHCGVATKASYPLV 337
>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
Length = 377
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 111/314 (35%), Positives = 155/314 (49%), Gaps = 60/314 (19%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
+F F ++ K YE+VEE R F++N ++ + K + LGL+
Sbjct: 64 AFMTFMTKFEKTYETVEEWAHRLTVFAQNAKIVLEHDAKAEGFALGLDNQFADWTAEEFA 123
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++ +K+QG CGSCWTFST S+E
Sbjct: 124 SYQKLHSRPKPSQAGATHEVSDKAAPTAVDWRTEGVVADIKNQGSCGSCWTFSTVVSIEG 183
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQG-------CNGGLPSQAFEYIKYN--GGLDTEE 183
A + GK ++LSEQ LVDC + G C+GGL AF+YI N GG+DTE
Sbjct: 184 AAARKTGKLVTLSEQNLVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIKNQDGGIDTEA 243
Query: 184 AYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYK 243
+Y YTGKDG C F NVG + + ++ +G E L A+ PVS+A + ++ Y
Sbjct: 244 SYGYTGKDGTCAFDKANVGATISNWTDVAVGDEVALADALANAGPVSIALDASKQWQLYS 303
Query: 244 SGVY---SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKN 300
G+ S C + P +H V VGYG +DGV YW I+NSWG WG+ GY ++E G N
Sbjct: 304 GGILKPRSILGCSSDPTHADHGVAIVGYGTDDGVDYWWIRNSWGTTWGESGYMRLERGVN 363
Query: 301 MCGIATCASYPVVA 314
CG+A ASYP+ A
Sbjct: 364 ACGVANFASYPIAA 377
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 118/256 (46%), Positives = 150/256 (58%), Gaps = 12/256 (4%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
LSF F +Y Y+ VE R S NL + +R ++P+KDQG CG
Sbjct: 93 LSFEEFKGKYFG-YKHVEREFAR----SNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCG 147
Query: 120 SCWTFSTTGSLEAAY-HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
SCW FS TGS+E A+ Q SLSEQQLVDC+ ++ + GCNGGL AFEYI N G
Sbjct: 148 SCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKG 207
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD- 237
+ E AYPY G G+C+ S V V + ++ G E L +AVG V PVSVA E
Sbjct: 208 ICAESAYPYKGVGGLCQKSCTKV-VTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQA 266
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM 297
GF+FY SGV+S T CG+ +++H V+AVGYG YW++KNSWG +WG+ GY +M
Sbjct: 267 GFQFYSSGVFSGT-CGH---NLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRMIR 322
Query: 298 GKNMCGIATCASYPVV 313
KN CGIA SYP V
Sbjct: 323 NKNQCGIAIQPSYPTV 338
>gi|45550334|gb|AAS67923.1| cathepsin L [Artemia franciscana]
Length = 226
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 103/215 (47%), Positives = 134/215 (62%), Gaps = 2/215 (0%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK QG C SC FS TG+LE+ + GK ISLSEQ L+DC+ + N G
Sbjct: 12 VDWREKGAVTPVKYQGQCASCLAFSPTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLG 71
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG SQAFEYIK N G+DTE Y Y K+ C+ + N G L VNI G ED+L+
Sbjct: 72 CKGGWISQAFEYIKDNKGIDTENKYHYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLK 131
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVS +V +GF+FY GVY C + +NHAV+ +G G ++G YWL+
Sbjct: 132 AAVATVGPVSAVIDVSHEGFQFYSKGVYYEPSCKTSFEHLNHAVLVIGCGSDNGEDYWLV 191
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
KNSW ++WGD GY K+ KN CG+AT A YP+V
Sbjct: 192 KNSWSKHWGDEGYLKIARNRKNHCGVATAALYPIV 226
>gi|189053498|dbj|BAG35664.1| unnamed protein product [Homo sapiens]
Length = 334
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 138/230 (60%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q C SCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCVSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+ G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E + YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 99/208 (47%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS+TG+LE + +A GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 149 VTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDL 208
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTE++YPY G++ C F VG V++ G E+ L+ AV P
Sbjct: 209 AFEYIKENHGVDTEDSYPYVGRETKCHFKRNAVGADDKGFVDLPEGDEEALKKAVATQGP 268
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YWL+KNSWG
Sbjct: 269 ISIAIDAGHRSFQLYKKGVYFDEECSSEELD--HGVLLVGYGTDPEAGDYWLVKNSWGPT 326
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 327 WGEKGYIRIARNRNNHCGVATKASYPLV 354
>gi|7271889|gb|AAF44675.1|AF239264_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 100/215 (46%), Positives = 139/215 (64%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPYT +G C+++ + +V D + G+E EL+
Sbjct: 172 CMGGLMENAYEYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y G+Y S C + + VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGAEGPAAVAVDVESDFTMYSGGIYQSRTC--SSLRVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGSSWGERGYIRMVRNRGNMCGIASLASLPMVA 323
>gi|1841466|emb|CAA71892.1| putative pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 106
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 92/106 (86%), Positives = 101/106 (95%)
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
VNITLGAEDEL++AV LVRPVS+AFEV+ GF+ YKSGVYSST+CGNTPMDVNHAV+AVGY
Sbjct: 1 VNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYSSTECGNTPMDVNHAVLAVGY 60
Query: 269 GVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
GVE+GVPYWLIKNSWG +WGD GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 61 GVENGVPYWLIKNSWGADWGDDGYFKMEMGKNMCGIATCASYPVVA 106
>gi|221117518|ref|XP_002157675.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 340
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 110/249 (44%), Positives = 143/249 (57%), Gaps = 6/249 (2%)
Query: 71 KIYESVEEMKLRFATFSKNLDLIRSTNCK---GLSYRLGLNISPVKDQGHCGSCWTFSTT 127
KIY ++ F +K + +N + +R ++PVK+QG CGSCW FSTT
Sbjct: 92 KIYGGCFKLPKSFINITKGSTFLPPSNVNIPDEVDWRTKGYVNPVKNQGQCGSCWAFSTT 151
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE + G LSEQ LVDC Q++ N+ CNGG AF+YI N G+D+E YPY
Sbjct: 152 GALEGQTFRKTGVLPDLSEQNLVDCTQSYGNEACNGGWMDNAFKYISDNKGIDSEAGYPY 211
Query: 188 TGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSG 245
K G C ++ + V+I G ED L+ AV V P+SVA + D F Y+SG
Sbjct: 212 YAKALGYCYYNQQFNVASDTGFVDIASGDEDALKVAVATVGPISVAIDATKDSFMRYQSG 271
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGI 304
VY CGN +++HAV+ VGYG EDG +WL+KNSW WGD GY KM N CGI
Sbjct: 272 VYYEPTCGNGLENLDHAVLVVGYGTEDGRDFWLVKNSWDITWGDQGYIKMSRNMSNQCGI 331
Query: 305 ATCASYPVV 313
AT ASYP+V
Sbjct: 332 ATKASYPLV 340
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 156/307 (50%), Gaps = 59/307 (19%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN----------- 108
++ +GK Y S EE R + KNLD++ N K +Y LG+N
Sbjct: 30 QWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGMNQFADLKNEEFV 89
Query: 109 ---------------------------------------ISPVKDQGHCGSCWTFSTTGS 129
++PVK+Q CGSCW FS TGS
Sbjct: 90 SLMNGFRGNSSKATRGSTFLPPSNVFDMPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGS 149
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
LE + + GK +SLSEQ LVDC+ N GC GGL QAF+YI GG+DTE +YPYT
Sbjct: 150 LEGQHFKKTGKLVSLSEQNLVDCSGKEGNMGCEGGLMDQAFQYILDVGGIDTEMSYPYTA 209
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYS 248
DG C F+ N+G ++T G+E LQ AV V P+SVA + F+ YKSGVY+
Sbjct: 210 MDGQCHFNKANIGATDTGYTDVTTGSESALQMAVASVGPISVAIDASHQSFQLYKSGVYN 269
Query: 249 STKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIAT 306
C +T +D H V+AVGYG DG Y+ +SWG WG +GY M K N CGIAT
Sbjct: 270 EPACSSTLLD--HGVLAVGYGTSSDGTDYFFFFHSWGAAWGMNGYLWMSRNKDNQCGIAT 327
Query: 307 CASYPVV 313
ASYP+V
Sbjct: 328 KASYPLV 334
>gi|163310848|pdb|2O6X|A Chain A, Crystal Structure Of Procathepsin L1 From Fasciola
Hepatica
Length = 310
Score = 200 bits (509), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 141/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGS W FSTTG++E Y + IS SEQQLVDC++ + N G
Sbjct: 96 IDWRESGYVTEVKDQGNCGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNG 155
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL+
Sbjct: 156 CGGGLMENAYQYLK-QFGLETESSYPYTAVEGQCRYNKQLGVAKVTGFYTVHSGSEVELK 214
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y+SG+Y S C +P+ VNHAV+AVGYG + G YW++K
Sbjct: 215 NLVGAEGPAAVAVDVESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYGTQGGTDYWIVK 272
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 273 NSWGLSWGERGYIRMVRNRGNMCGIASLASLPMVA 307
>gi|167427529|gb|ABZ80401.1| cathepsin L4, partial [Fasciola hepatica]
Length = 303
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 131/206 (63%), Gaps = 4/206 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FSTTG++E Y + IS SEQQLVDC+ + N GCNGG
Sbjct: 97 VTEVKDQGDCGSCWAFSTTGAVEGQYMKNPKANISFSEQQLVDCSGDYGNHGCNGGFMEN 156
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A+EY++ GL+TE +YPY ++G CK+ S V+V G E +L H VG P
Sbjct: 157 AYEYLERR-GLETESSYPYKAEEGPCKYDSRLGVVEVFGYFIEHSGIESKLAHLVGDKGP 215
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
+VA +V F Y+ G+Y+S C + + NHA++ VGYG +DG YW++KNSWG WG
Sbjct: 216 AAVAVDVESDFLMYRGGIYASRNCSSEKL--NHAMLVVGYGTQDGTDYWIVKNSWGSLWG 273
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVV 313
DHGY +M + NMCGIA+ AS PVV
Sbjct: 274 DHGYIRMARNRDNMCGIASAASVPVV 299
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 99/208 (47%), Positives = 136/208 (65%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + +A GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 149 VTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDL 208
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTEE+YPY G++ C F +++G + V++ G E+ L+ AV P
Sbjct: 209 AFEYIKDNHGIDTEESYPYVGRETKCHFKKKDIGAEDKGFVDLPEGDEEALKVAVATQGP 268
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YWLIKNSWG
Sbjct: 269 ISIAIDAGHRTFQLYKKGVYYDEECSSEELD--HGVLLVGYGTDPEAGDYWLIKNSWGPG 326
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 327 WGEKGYIRIARNRSNHCGVATKASYPLV 354
>gi|441593109|ref|XP_003260582.2| PREDICTED: cathepsin L2 isoform 1 [Nomascus leucogenys]
Length = 334
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 106/230 (46%), Positives = 136/230 (59%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG +AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSRPQGNQGCNGGFMGKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+ G E L AV V P+SVA + F+FY G+Y C + +D H V+ VGY
Sbjct: 227 VVPPGKEKALMKAVATVGPISVAMDAGHSSFQFYNQGIYFEPDCSSENLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E + YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>gi|379991182|emb|CCA61803.1| cathepsin protein CatL1-MM3p, partial [Fasciola hepatica]
Length = 326
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 109/271 (40%), Positives = 160/271 (59%), Gaps = 17/271 (6%)
Query: 57 RHALSFARFARRYGKIYE-SVEEMKLRFAT-FSKNLDLI----------RSTNCKGLSYR 104
RH L + + + + EE K ++ T S+ D++ R+ K + +R
Sbjct: 57 RHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAVPDK-IDWR 115
Query: 105 LGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGG 164
++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N GC+GG
Sbjct: 116 ESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGG 175
Query: 165 LPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVG 224
L A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL++ VG
Sbjct: 176 LMENAYQYLK-QFGLETESSYPYTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVG 234
Query: 225 LVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWG 284
P +VA +V F Y G+Y S C +P+ +NHAV+AVGYG + G YW++KNSWG
Sbjct: 235 AEGPAAVAVDVESDFMMYSGGIYQSQTC--SPLGLNHAVLAVGYGTQGGTDYWIVKNSWG 292
Query: 285 ENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 293 SYWGERGYIRMARNRGNMCGIASLASLPMVA 323
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 101/207 (48%), Positives = 132/207 (63%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW+FS+TG+LE + G+ +SLSEQ+LVDC+ + N GCNGG
Sbjct: 130 VTPVKNQGSCGSCWSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDN 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YI GG+ TE++YPY G+ G C+ + +G +I G E L+ AV P
Sbjct: 190 AFRYIVNKGGIHTEDSYPYEGQVGQCRANYGEIGATCTGYYDIPSGNEHALKEAVATFGP 249
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSVA D F+ Y SGVY++ C T +D HAV+ VGYG E G YWL+KNSWG W
Sbjct: 250 VSVAIHASDQSFQLYHSGVYNNPYCSGTALD--HAVLIVGYGTEYGQDYWLVKNSWGPAW 307
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GD GY KM + N CGIA+ AS+P+V
Sbjct: 308 GDQGYIKMSRNRYNQCGIASAASFPLV 334
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 99/208 (47%), Positives = 136/208 (65%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + +A GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 149 VTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDL 208
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTEE+YPY G++ C F +++G + V++ G E+ L+ AV P
Sbjct: 209 AFEYIKDNHGIDTEESYPYVGRETKCHFKKKDIGAEDKGFVDLPEGDEEALKVAVATQGP 268
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YWLIKNSWG
Sbjct: 269 ISIAIDAGHRTFQLYKKGVYYDEECSSEELD--HGVLLVGYGTDPEAGDYWLIKNSWGPG 326
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 327 WGEKGYIRIARNRSNHCGVATKASYPLV 354
>gi|313235127|emb|CBY24999.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 103/205 (50%), Positives = 129/205 (62%), Gaps = 4/205 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FST SLE+ + A SLSEQQLVDC+ + N GC+GGL +Q
Sbjct: 122 VTPVKDQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQ 181
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
F YI N G+DTE +YPYT +DG C F+ NVG + NI G E L +AV +V P
Sbjct: 182 GFTYIHDNNGVDTEASYPYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGP 241
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y SGVY C + +D H V AVGYG +G ++++KNSW W
Sbjct: 242 MSVAIDASHMSFQLYTSGVYYEPNCSSQFLD--HGVTAVGYGSSNGNDFFIVKNSWAATW 299
Query: 288 GDHGYFKMEMGK-NMCGIATCASYP 311
GD+GY M K N CGIAT ASYP
Sbjct: 300 GDNGYIMMSRNKSNNCGIATSASYP 324
>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 103/208 (49%), Positives = 129/208 (62%), Gaps = 6/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++P+KDQG CGSCW FS TG+LE + GK ISLSEQQLVDC+ N+GCNGG +
Sbjct: 134 VTPIKDQGDCGSCWAFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMND 193
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF Y NG ++E YPYT DG CKF+S V +V V + ED+L+ +V V P
Sbjct: 194 AFRYWMRNGA-ESESDYPYTAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGP 252
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG-VPYWLIKNSWGEN 286
VSVA + GF YK G+Y C +D HAV+ VGY + YW++KNSWGE+
Sbjct: 253 VSVAIDATSSGFMLYKKGIYQDNTCSQQYLD--HAVLVVGYDADKTRQKYWIVKNSWGED 310
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG GY M K NMCGIAT ASYP++
Sbjct: 311 WGQRGYIWMARDKGNMCGIATMASYPLI 338
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 135/212 (63%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QGHCGSCW FSTTG+LE + G+ ISLSEQ LVDC+ NQGC+GG+
Sbjct: 129 VTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQQGNQGCHGGIVDL 188
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YI N G+D+E+ YPYT KD C F E V V+I +E+ L AV V
Sbjct: 189 AFQYILQNQGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPPHSEEALMKAVATVG 248
Query: 228 PVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSV + FRFY+SG++ KC + +D HAV+ VGYG E G YW++KNS
Sbjct: 249 PVSVGIDASSTSFRFYQSGIFYDPKCSSESLD--HAVLVVGYGYEREDEAGKKYWIVKNS 306
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG++WGD GY M + N CGIAT ASYP++
Sbjct: 307 WGKHWGDRGYVYMSKDRGNHCGIATVASYPLL 338
>gi|268560858|ref|XP_002638172.1| C. briggsae CBR-CPL-1 protein [Caenorhabditis briggsae]
Length = 336
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + + G+ +SLSEQ LVDC+ + N GCNGGL Q
Sbjct: 131 VTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQ 190
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI+ N G+DTEE+YPY G+D C F+ + VG V+ G E++L+ AV P
Sbjct: 191 AFEYIRDNHGVDTEESYPYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGP 250
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YWL+KNSWG
Sbjct: 251 ISIAIDAGHRSFQLYKKGVYYDEECSSEELD--HGVLLVGYGTDPEHGDYWLVKNSWGTG 308
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 309 WGEKGYIRIARNRNNHCGVATKASYPLV 336
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 119/318 (37%), Positives = 166/318 (52%), Gaps = 60/318 (18%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
V ++ SF + R K Y E M R+ F KN+D + + N KG LGLN
Sbjct: 23 NVFSHKQYQDSFIDWMRSNNKAYTHKEFMP-RYEEFKKNMDYVHNWNSKGSKTVLGLNQH 81
Query: 109 ---------------------------------------------------ISPVKDQGH 117
++PVKDQG
Sbjct: 82 ADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQ 141
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNG 177
CGSC++FSTTGS+E GK +SLSEQ ++DC+ +F N+GCNGGL + AFEYI N
Sbjct: 142 CGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNN 201
Query: 178 GLDTEEAYPYTGK-DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV 236
GL++EE YPY K + CKF +V ++ I G E++LQ+A+ L+ PVSVA +
Sbjct: 202 GLNSEEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQNAL-LLNPVSVAIDAS 260
Query: 237 -DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKM 295
+ F+ Y +GVY C + D++H V+AVG G ++G Y+++KNSWG +WG +GY M
Sbjct: 261 HNSFQLYTAGVYYEPAC--SSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHM 318
Query: 296 EMGK-NMCGIATCASYPV 312
K N CGI+T ASYP+
Sbjct: 319 ARNKDNNCGISTMASYPI 336
>gi|253796148|gb|ACT35690.1| cathepsin L-like cysteine proteinase [Ditylenchus destructor]
Length = 376
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 110/260 (42%), Positives = 154/260 (59%), Gaps = 14/260 (5%)
Query: 60 LSFARFARR--YGKIY-ESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQG 116
L F+ + + Y +IY + + RF N+++ S + + Y ++ VK+QG
Sbjct: 125 LPFSEYQKLNGYRRIYGDPLRRNSSRFLA-PHNVEVPESMDWRDHGY-----VTEVKNQG 178
Query: 117 HCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYN 176
CGSCW FS TGSLE + ++ G +SLSEQ LVDC+ A+ N GCNGGL AF+YIK N
Sbjct: 179 MCGSCWAFSATGSLEGQHKRSKGTLVSLSEQNLVDCSAAYGNNGCNGGLMDFAFQYIKEN 238
Query: 177 GGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV- 235
G+DTE +YPY + C F +VG +++ G ED+L+ AV P+SVA +
Sbjct: 239 HGIDTETSYPYKARQKKCHFQRSSVGADDTGFMDLPEGDEDQLKIAVATQGPISVAIDAG 298
Query: 236 VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFK 294
F+ YK+GVY +C + +D H V+ VGYG + D YW++KNSWG WG+ GY +
Sbjct: 299 HRSFQLYKTGVYYEKECSSEQLD--HGVLVVGYGTDPDHGDYWIVKNSWGTTWGEQGYVR 356
Query: 295 MEMGK-NMCGIATCASYPVV 313
M K N CGIAT ASYP+V
Sbjct: 357 MARNKNNHCGIATKASYPLV 376
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 105/221 (47%), Positives = 142/221 (64%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CGSCW FSTTG+LE + + G+ +SLSEQ LV+C++ N
Sbjct: 119 KHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGN 178
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL QAF+Y+K NGG+D+E++YPY G D C ++ + V+I G E
Sbjct: 179 EGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKER 238
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L A+ V PVSVA + F+FY+SG+Y +C +T D++H V+ VGYGVE D
Sbjct: 239 ALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSST--DLDHGVLVVGYGVEKRDTD 296
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
G YW++KNSW E G +GY M K N CGIAT ASYP+
Sbjct: 297 GKKYWIVKNSWSEKLGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
Length = 450
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 108/220 (49%), Positives = 136/220 (61%), Gaps = 14/220 (6%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TG+LE + GK ISLSEQ LVDC++ N
Sbjct: 240 KSVDWREKGFVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRRQGN 299
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YIK NGGLD+EE+YPY G DG C++ +E +V G E
Sbjct: 300 LGCQGGLMDNAFQYIKDNGGLDSEESYPYKGMDGTCQYKAEW-------AVANDTGFEKA 352
Query: 219 LQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED---GV 274
L AV V P+SVA + F+FYK G+Y C + +D H V+ VGYGVE
Sbjct: 353 LMKAVASVGPISVAIDAGHASFQFYKDGIYYEPDCSSENLD--HGVLVVGYGVEKRNSND 410
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWLIKNSWGE WG +GY K+ + N CG+A+ ASYPVV
Sbjct: 411 KYWLIKNSWGEQWGANGYVKIAKDRNNHCGVASAASYPVV 450
>gi|91092022|ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium castaneum]
gi|270001246|gb|EEZ97693.1| cathepsin L precursor [Tribolium castaneum]
Length = 343
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 113/295 (38%), Positives = 160/295 (54%), Gaps = 38/295 (12%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFA-----TFSKNLDLIRSTNCKGLSYRL 105
++ + RH + ARF + YG+ S + FA F + L+ T LS R+
Sbjct: 55 EIFIENRHKI--ARFNQEYGRGQWSFVQQLNNFADMLHHEFHRTLNGFNRT----LSARV 108
Query: 106 GLN------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG 141
G+ ++PVK+QG C CW FS G+LE + G+
Sbjct: 109 GIPQSSTFIPSANVIFPDYVDWREVGAVTPVKNQGSCAGCWAFSAAGALEGHNFRKTGRL 168
Query: 142 ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV 201
+ LS Q L+DC+ + N GC+GGL + A+EY++ N G+DTE++YPY ++G C+F E V
Sbjct: 169 VELSPQNLIDCSTNYGNDGCSGGLMNPAYEYVRTNPGIDTEDSYPYEARNGPCRFRPETV 228
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVN 260
G V+I G E L+ A+ + PVS A + F+FY G+Y +CGN P DVN
Sbjct: 229 GAYCTGYVDIAEGDEQGLEAAIATLGPVSAAMDAGRQSFQFYSDGIYYDPQCGNRPDDVN 288
Query: 261 HAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
HAV+ VGYG E +G YWL+KNS+G WG GY K+ + N CGIA ASYP+V
Sbjct: 289 HAVLVVGYGTEPNGQKYWLVKNSYGPQWGIGGYVKLAKDANNHCGIAIQASYPLV 343
>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
Length = 333
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++P+KDQG CGSCW+FS TGSLE +SLSEQ LVDC+ F N
Sbjct: 118 KKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGN 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AFEY+K GG+DTEE+YPYT +DG C + + N ++ +E
Sbjct: 178 EGCNGGLMDSAFEYVKSYGGIDTEESYPYTAEDGTCLYKAANNAGVNTGYKDVQAKSESA 237
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L+ AV V PVSVA + + F+ Y SG+Y C + +D H V+AVGYG E +
Sbjct: 238 LRDAVEKVGPVSVAIDASNWSFQMYTSGIYYEPACSSDSLD--HGVLAVGYGSEWPNKEF 295
Query: 277 WLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
W++KNSWG +WG+ GY KM KN CGIAT ASYP+V
Sbjct: 296 WIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASYPLV 333
>gi|41152540|gb|AAR99519.1| cathepsin L protein [Fasciola hepatica]
Length = 239
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 98/215 (45%), Positives = 140/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 25 IDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNG 84
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL+
Sbjct: 85 CSGGLMENAYQYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTGYYTVHSGSEVELK 143
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P ++A +V F Y+SG+Y S C P +NHAV+AVGYG + G YW++K
Sbjct: 144 NLVGSEGPAAIAVDVESDFMMYRSGIYQSQTC--LPFALNHAVLAVGYGTQGGTDYWIVK 201
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 202 NSWGLSWGERGYIRMARNRGNMCGIASLASLPMVA 236
>gi|167427527|gb|ABZ80400.1| cathepsin L4, partial [Fasciola hepatica]
Length = 303
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 98/206 (47%), Positives = 130/206 (63%), Gaps = 4/206 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FSTTG++E Y + IS SEQQLVDC+ + N GCNGG
Sbjct: 97 VTEVKDQGDCGSCWAFSTTGAVEGQYTKNQKANISFSEQQLVDCSGDYGNHGCNGGFMEN 156
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A+EY++ GL+TE +YPY ++G CK+ S V+V G E +L H VG P
Sbjct: 157 AYEYLERR-GLETESSYPYKAEEGPCKYDSRLGVVEVFGYFIEHSGIESKLAHLVGDKGP 215
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
+VA +V F Y+ G+Y+S C + + NH ++ VGYG +DG YW++KNSWG WG
Sbjct: 216 AAVAVDVESDFLMYRGGIYASRNCSSESL--NHGILVVGYGTQDGTDYWIVKNSWGSLWG 273
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVV 313
DHGY +M + NMCGIA+ AS PVV
Sbjct: 274 DHGYIRMARNRDNMCGIASAASVPVV 299
>gi|157862759|gb|ABV90502.1| cathepsin L, partial [Fasciola gigantica]
Length = 280
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 140/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 66 IDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNQRTSISFSEQQLVDCSGPWGNMG 125
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL A+EY+K GL+TE +YPY +G C+++ + V+V + G+E L+
Sbjct: 126 CSGGLMENAYEYLK-QFGLETESSYPYRAVEGQCRYNRQLGVVKVTGYYTVHSGSEVGLK 184
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y+SG+Y S C +P +NHAV+AVGYG + G YW++K
Sbjct: 185 NLVGAEGPAAVAVDVESDFMMYRSGIYQSQTC--SPFGLNHAVLAVGYGTQGGTDYWIVK 242
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 243 NSWGSSWGERGYIRMVRNRGNMCGIASMASLPMVA 277
>gi|341878328|gb|EGT34263.1| CBN-CPL-1 protein [Caenorhabditis brenneri]
Length = 336
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 97/208 (46%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + + G+ +SLSEQ LVDC+ + N GCNGGL Q
Sbjct: 131 VTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQ 190
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI+ N G+DTEE+YPY G+D C F+ + +G V+ G E++L+ AV P
Sbjct: 191 AFEYIRDNHGVDTEESYPYKGRDMKCHFNKKTIGADDKGYVDTPEGDEEQLKIAVATQGP 250
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YWL+KNSWG
Sbjct: 251 ISIAIDAGHRSFQLYKKGVYYDEECSSEELD--HGVLLVGYGTDPEHGDYWLVKNSWGTG 308
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 309 WGEKGYIRIARNRNNHCGVATKASYPLV 336
>gi|348531521|ref|XP_003453257.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 333
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 109/282 (38%), Positives = 160/282 (56%), Gaps = 14/282 (4%)
Query: 36 LVSSDGLRDFETSVLQVIGQARHALSFARFARR--YGKIYESVEEMKLRFATFSKNLDLI 93
+++ GL+ + + Q + R R G S+ F + DL
Sbjct: 62 ILADQGLKSYRLGMTQFADMENE--EYKRLVSRGCLGSFNTSLHHRGSTFLRLPEGTDLP 119
Query: 94 RSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA 153
+ + + Y ++ V++Q CGSCW FS G+LE + GK +SLS+QQLVDC+
Sbjct: 120 DTVDWRDKGY-----VTDVQNQMQCGSCWAFSAIGALEGQNFRKTGKLVSLSKQQLVDCS 174
Query: 154 QAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITL 213
Q+F N GCNGG AF+YI+ GG+DTE +YPY ++G C ++ E VG V+++
Sbjct: 175 QSFGNHGCNGGWMDWAFKYIQATGGIDTEASYPYEAEEGNCHYNPETVGATCTGYVDVSP 234
Query: 214 GAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED 272
ED L+ AV + P+S+A + + F+FY+SGVY C + +HA++AVGYG E+
Sbjct: 235 N-EDALKEAVATIGPISIAMDASHESFQFYQSGVYDEPSCITSRF--SHAMLAVGYGTEN 291
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G YWL+KNS+G WG+ GY KM K N CGIA+ ASYP+V
Sbjct: 292 GHDYWLVKNSFGLGWGEKGYIKMSRNKSNQCGIASKASYPLV 333
>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
Length = 327
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 101/216 (46%), Positives = 132/216 (61%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+Q CGSCW FSTTGSLE + G +SLSEQ LVDC++ N+G
Sbjct: 114 VDWRQKGAVTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDEL 219
C GGL QAF+YIK NGG+DTEE YPY G+D C++ + G + V++ G ED L
Sbjct: 174 CKGGLMDQAFKYIKTNGGIDTEECYPYKGRDERKCEYKASCSGATLSSFVDVKTGDEDAL 233
Query: 220 QHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ A + P+SV + F+ Y GVY +C + +D H V+ VGYG + YWL
Sbjct: 234 KQASATIGPISVGIDASHPSFQLYDHGVYHEKRCSSKKLD--HGVLVVGYGTQSTKDYWL 291
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG +WG GY M K N CGIAT ASYPVV
Sbjct: 292 VKNSWGADWGMEGYIMMSRNKDNQCGIATQASYPVV 327
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 98/207 (47%), Positives = 128/207 (61%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++P+K+Q CGSCW FS S+E + GK +SLSEQ LVDC+ A + GC+GG
Sbjct: 133 VTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDY 192
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y+ N G+DTE +YPY D C+F ++G + V++ G E LQ+AV + P
Sbjct: 193 AFKYVIQNRGIDTEASYPYKAIDESCEFKRNSIGATIHSFVDVKTGDESALQNAVASIGP 252
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+FY SGVY+ C +D H V AVGYG +GVPYW +KNSWG +W
Sbjct: 253 ISVAIDASQPSFQFYSSGVYNEPDCSTEILD--HGVTAVGYGTLNGVPYWKVKNSWGTSW 310
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K N CGIAT ASYPVV
Sbjct: 311 GQKGYIFMSRNKQNQCGIATKASYPVV 337
>gi|313213752|emb|CBY40632.1| unnamed protein product [Oikopleura dioica]
Length = 440
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 103/205 (50%), Positives = 128/205 (62%), Gaps = 4/205 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FST SLE+ + A SLSEQQLVDC+ + N GC+GGL +Q
Sbjct: 236 VTPVKDQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQ 295
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
F YI N G+DTE +YPYT +DG C F+ NVG + NI G E L +AV +V P
Sbjct: 296 GFTYIHDNNGVDTEASYPYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGP 355
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y SGVY C + +D H V AVGYG G ++++KNSW W
Sbjct: 356 MSVAIDASHMSFQLYTSGVYYEPNCSSQFLD--HGVTAVGYGSSSGNDFFIVKNSWAATW 413
Query: 288 GDHGYFKMEMGK-NMCGIATCASYP 311
GD+GY M K N CGIAT ASYP
Sbjct: 414 GDNGYIMMSRNKNNNCGIATSASYP 438
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 116/277 (41%), Positives = 161/277 (58%), Gaps = 21/277 (7%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKN--------LDLIRSTNCKGLS 102
Q Q +H+ S A A +G + + EE + F + + I ++ +
Sbjct: 64 QEYSQGKHSFSMAMNA--FGDL--TSEEFRQMMNGFQRQENKKGKVFHETIFASIPPSVD 119
Query: 103 YRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCN 162
+R ++PVK+QG CGSCW FSTTG+LE + GK +SLSEQ LVDC+Q N+GC+
Sbjct: 120 WREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQPEGNRGCH 179
Query: 163 GGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHA 222
GGL AF+Y+ GGLD+EE+YPYTG G C ++ +N V++ E+ L A
Sbjct: 180 GGLMDNAFQYVLDVGGLDSEESYPYTGLVGTCNYNPKNSAANETGFVDLP-KQENALMKA 238
Query: 223 VGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYW 277
V + P+SVA + + F+FYKSG+Y KC + +D H V+ VGYG E D YW
Sbjct: 239 VATLGPISVAVDASNPSFQFYKSGIYYEPKCKSESVD--HGVLVVGYGFEGADSDDNKYW 296
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG++WG +GY KM + N CGIAT ASYP V
Sbjct: 297 LVKNSWGKHWGINGYIKMAKDQNNHCGIATMASYPTV 333
>gi|313246319|emb|CBY35240.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 103/205 (50%), Positives = 128/205 (62%), Gaps = 4/205 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FST SLE+ + A SLSEQQLVDC+ + N GC+GGL +Q
Sbjct: 122 VTPVKDQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQ 181
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
F YI N G+DTE +YPYT +DG C F+ NVG + NI G E L +AV +V P
Sbjct: 182 GFTYIHDNNGVDTEASYPYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGP 241
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y SGVY C + +D H V AVGYG G ++++KNSW W
Sbjct: 242 MSVAIDASHMSFQLYTSGVYYEPNCSSQFLD--HGVTAVGYGSSSGNDFFIVKNSWAATW 299
Query: 288 GDHGYFKMEMGK-NMCGIATCASYP 311
GD+GY M K N CGIAT ASYP
Sbjct: 300 GDNGYIMMSRNKNNNCGIATSASYP 324
>gi|327285051|ref|XP_003227248.1| PREDICTED: counting factor associated protein D-like [Anolis
carolinensis]
Length = 547
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 111/304 (36%), Positives = 162/304 (53%), Gaps = 51/304 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F + +R+GK Y+ +EM+ R TF+ N+ + S N L ++L LN
Sbjct: 244 FHHYRKRFGKSYDDEKEMEHRKHTFTHNMRFVHSKNRANLPFKLALNHLADLTQDEMAAM 303
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+FS+TG+LE
Sbjct: 304 RGKLKSTKPNNGLPFPHEQFVGLILPESLDWRLYGAVTPVKDQAVCGSCWSFSSTGALEG 363
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
+ G+ I LS+Q L+DC+ F N C+GG QAFE++ +GG+ + E+Y PY G++
Sbjct: 364 SLFLKTGQLIPLSQQILIDCSWGFGNYACDGGEEWQAFEWVLKHGGIASTESYGPYKGQN 423
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSST 250
G C + ++ ++ VN+T G L+ A+ PVSV+ + F FY +GVY
Sbjct: 424 GYCHSNKTHLVGKLSGYVNVTSGNITALKAAIYKHGPVSVSIDASHRTFSFYSNGVYYEP 483
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
KCGN +++HAV+AVGYGV G YWL+KNSW WG+ GY M M N CG+AT A+Y
Sbjct: 484 KCGNKKGELDHAVLAVGYGVLQGELYWLVKNSWSTYWGNDGYILMSMKDNNCGVATDATY 543
Query: 311 PVVA 314
P++A
Sbjct: 544 PLMA 547
>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
Length = 333
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 117/324 (36%), Positives = 155/324 (47%), Gaps = 57/324 (17%)
Query: 45 FETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRS--------- 95
F + ++I + F F R+G+ Y + EE R F+ NL+ I +
Sbjct: 12 FAPTASELISEGELEAHFNLFKTRFGRSYANFEEEIFRKRVFASNLEFIFNHNREFFAGN 71
Query: 96 ----------TNCKGLSYRLGLN---------------------------------ISPV 112
T+ +R N ++P+
Sbjct: 72 KNFNVAVNNFTDMSNTEFRARFNGLRHSGVQSAPAIHSASAEGLPATVDWTKVKNVVTPI 131
Query: 113 KDQGHCGSCWTF-STTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFE 171
K+Q CGSCW F S S+E + GK +SLSEQ LVDC+ A N GC GGL QAF+
Sbjct: 132 KNQEQCGSCWAFFSAVASMEGQHGLKTGKLVSLSEQNLVDCSAAEGNMGCEGGLMDQAFQ 191
Query: 172 YIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSV 231
Y+ N G+DTE +YPY D +F +VG + V++ G+E LQ AV V P+SV
Sbjct: 192 YVIANKGIDTEMSYPYKAIDESWEFKKNSVGATIKSYVDVKTGSESSLQSAVATVGPISV 251
Query: 232 AFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDH 290
+ F+FY SGVY C T +D H V AVGYG +G PYW +KNSWG +WG
Sbjct: 252 GIDASQLSFQFYSSGVYEEPACSTTILD--HGVTAVGYGALNGTPYWKVKNSWGTSWGMS 309
Query: 291 GYFKMEMGK-NMCGIATCASYPVV 313
GY M K N CGIAT AS+PVV
Sbjct: 310 GYIFMSRNKQNQCGIATAASWPVV 333
>gi|108735840|gb|ABG00259.1| cathepsin L2 [Fasciola hepatica]
Length = 219
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 97/207 (46%), Positives = 131/207 (63%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FSTTG++E + + S SEQQLVDC + F N GC GG
Sbjct: 13 VTEVKDQGQCGSCWAFSTTGAVEGQFRKNERASASFSEQQLVDCTRDFGNYGCGGGYMEN 72
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A+EY+K+N GL+TE YPY +G C++ +V + G E EL++ VG P
Sbjct: 73 AYEYLKHN-GLETESYYPYQAVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEGP 131
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
++A +V F Y+SG+Y S C P +NHAV+AVGYG +DG YW++KNSWG +WG
Sbjct: 132 AAIAVDVESDFMMYRSGIYQSQTC--LPFALNHAVLAVGYGTQDGTDYWIVKNSWGLSWG 189
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVVA 314
+ GY +M + NMCGIA+ AS P+VA
Sbjct: 190 ERGYIRMARNRGNMCGIASLASLPMVA 216
>gi|431897851|gb|ELK06685.1| Cathepsin L1 [Pteropus alecto]
Length = 331
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 104/219 (47%), Positives = 136/219 (62%), Gaps = 9/219 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FS TG+LE + GK ISLSEQ LVDC+Q+ N+G
Sbjct: 116 VDWRQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSQSQGNEG 175
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL AF+Y+K NGGLD+EE+YPY +D CK+ E V+I E L
Sbjct: 176 CDGGLMDNAFQYVKDNGGLDSEESYPYLARDESCKYKPEFSAANDSGFVDIH-KQERSLM 234
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVP 275
AV V P+SV + F+FY+ G+Y +C + D+NH V+ VGYG E +
Sbjct: 235 KAVASVGPISVGIDASYSSFQFYEKGIYYEPECSSE--DLNHGVLVVGYGFERAESNKNK 292
Query: 276 YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSWG NWG +GY M + N CGIAT ASYP+V
Sbjct: 293 YWIVKNSWGTNWGMNGYINMAKDQNNHCGIATAASYPIV 331
>gi|294938848|ref|XP_002782226.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239893730|gb|EER14021.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 334
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 123/315 (39%), Positives = 158/315 (50%), Gaps = 71/315 (22%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L+F F ++GK YES EE R A F +L I N K LSY+LG+N
Sbjct: 26 LAFMGFQHKFGKNYESKEEEIKRNAIFRAHLHYIEQVNAKNLSYKLGVNEHADLTHEEFA 85
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++P+KDQG CGSCW FS TG+L
Sbjct: 86 ALKLGTSSKMSMKRDDKLVVKADTTQLLTSVDWRSKGVLTPIKDQGPCGSCWAFSATGAL 145
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
EA Y A GK +SLSEQQL+DC+ ++ N+GC+GGL A+ YIK + GLD E YPY K
Sbjct: 146 EAQYAIATGKLLSLSEQQLIDCSSSYGNEGCSGGLMENAYTYIK-SAGLDQESTYPYIAK 204
Query: 191 DGVCKFSSEN----------VGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GF 239
+ C+ S E G +LD E L A+ PVS+A D F
Sbjct: 205 NNACQVSLEKRSDGIPAGEVTGFHMLDQT------EQGLMKALADA-PVSIAMYASDPDF 257
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK 299
RFY+SGVYSS C T ++H VVAVGYG E+G Y++I+NSWG +WG GYF ++ G
Sbjct: 258 RFYQSGVYSSKTCHGT---IDHGVVAVGYGTENGEDYFVIRNSWGSSWGQDGYFYLKRGV 314
Query: 300 NMCGIATCASYPVVA 314
+ G Y VA
Sbjct: 315 SGYGECNILEYMCVA 329
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 121/307 (39%), Positives = 164/307 (53%), Gaps = 60/307 (19%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KGL-SYRLGLN------- 108
+ F F ++GK Y++ E RF F NL I N +GL SY+ G+N
Sbjct: 23 VKFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQ 82
Query: 109 ----------------------------------------ISPVKDQGHCGSCWTFSTTG 128
++ VKDQG+CGSCW FS TG
Sbjct: 83 EEFRAFLTLSSSKKPHFNTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTG 142
Query: 129 SLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
S EAAY++ GK +SLSEQQLVDC+ N GCNGG + F Y+K + GL+ E YPY
Sbjct: 143 STEAAYYRKAGKLVSLSEQQLVDCSTDIN-AGCNGGYLDETFTYVK-SKGLEAESTYPYK 200
Query: 189 GKDGVCKFSSENVGVQVLDSVNITLGAEDE--LQHAVGLVRPVSVAFEVVDGFRFYKSGV 246
G DG CK+S+ V +V S + +L +EDE L AVG V PVSVA + Y+SG+
Sbjct: 201 GTDGSCKYSASKVVTKV--SGHKSLKSEDENALLDAVGNVGPVSVAIDATY-LSSYESGI 257
Query: 247 YSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIAT 306
Y C +P ++NH V+ VGYG +G YW++KNSWG ++G+ GYF++ GKN CG+A
Sbjct: 258 YEDDWC--SPSELNHGVLVVGYGTSNGKKYWIVKNSWGGSFGESGYFRLLRGKNECGVAE 315
Query: 307 CASYPVV 313
YP++
Sbjct: 316 DTVYPII 322
>gi|291383486|ref|XP_002708337.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 101/211 (47%), Positives = 130/211 (61%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + G+ +SLSEQ L+DC+ N GC GGLP
Sbjct: 126 VTPVKNQGRCGSCWAFSATGALEGQMFRKTGRLVSLSEQNLIDCSWPAGNYGCRGGLPDH 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y+K NGGLD+E++YPY +DG+C++S + V I E+ L AV V P
Sbjct: 186 AFQYVKDNGGLDSEDSYPYEARDGLCRYSPQESVANDTGFVQIPE-QEEALMEAVATVGP 244
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNSW 283
++VA + F FYK G+Y C +D HAV+ VGYG E D YWL+KNSW
Sbjct: 245 IAVAIDASHSSFLFYKEGIYYEPNCSRENLD--HAVLVVGYGFEGAESDNQKYWLVKNSW 302
Query: 284 GENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ WG GY KM + N CGIAT ASYP V
Sbjct: 303 GKGWGMDGYMKMAKDRNNHCGIATAASYPTV 333
>gi|198432215|ref|XP_002130162.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 331
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 133/218 (61%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQ CGSCW FSTTGSLE + + GK +SLSEQ LVDC+ N
Sbjct: 116 KNVDWRKEGYVTPVKDQKQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSMKEGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK-DGVCKFSSENVGVQVLDSVNITLGAED 217
GC GGL FEYI NGG+DTE +YPY K + C + N G + V+I G+E
Sbjct: 176 HGCQGGLMDLGFEYIFDNGGIDTESSYPYMAKNEPQCMYKRSNSGATLTGCVDIKRGSES 235
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
L AV V P+SVA + F+ YKSGVY C + +D H V+AVG+G ++G +
Sbjct: 236 ALMKAVADVGPISVAIDAGHKSFQMYKSGVYYEPSCSSVKLD--HGVLAVGFGADNGEDF 293
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WG GY M + N CGIAT ASYP+V
Sbjct: 294 WLVKNSWGPIWGMEGYIMMSRNRDNNCGIATQASYPLV 331
>gi|7271891|gb|AAF44676.1|AF239265_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 137/215 (63%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FSTTG++E Y + IS SEQQLVDC+ F N G
Sbjct: 112 IDWRESGYVTEVKDQGQCGSCWAFSTTGAMEGQYMKNQRTSISFSEQQLVDCSDDFGNFG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL A EY+K GL+TE +YPY +G C+++ + +V + G E ELQ
Sbjct: 172 CNGGLMENACEYLK-RFGLETESSYPYRAVEGPCRYNKQLGVAKVTGYYMVHSGDEVELQ 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG+ P +VA +V F Y+SG+Y S C +P +NH V+AVGYG + G YW++K
Sbjct: 231 NLVGIEGPAAVALDVDSDFMMYRSGIYQSQTC--SPEFLNHGVLAVGYGTQSGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG WG++GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGPWWGENGYIRMVRNRGNMCGIASLASVPMVA 323
>gi|209738038|gb|ACI69888.1| Digestive cysteine proteinase 2 precursor [Salmo salar]
Length = 367
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 136/212 (64%), Gaps = 12/212 (5%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
+SPVKDQG+CGSCW+FSTTG++E+ Y +GK SEQQLVDC + +QGCNGG P
Sbjct: 130 VSPVKDQGNCGSCWSFSTTGAMESQYRLKYGKMKLFSEQQLVDCDRQNIDQGCNGGFPVA 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT-----LGAEDELQHAV 223
AFEYI+ GL TEE YPY+ C+F + G L+S +T E+ L A+
Sbjct: 190 AFEYIR-EFGLLTEEEYPYSAHSNQCRFKPDENG--HLNSTKVTGYTVIEMNENALTEAI 246
Query: 224 GLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED--GVPYWLIK 280
P+SVA + F+FY SGVY + CG+ +++HAV+AVG+GV+ PY+++K
Sbjct: 247 YKRGPISVAIDASSSDFQFYHSGVYQNPSCGSAVSELDHAVLAVGFGVDKVHKTPYYIVK 306
Query: 281 NSWGENWGDHGYFKM-EMGKNMCGIATCASYP 311
NSW WGDHGY KM GKN CGIAT A+YP
Sbjct: 307 NSWSSGWGDHGYIKMIRNGKNNCGIATFATYP 338
>gi|392922426|ref|NP_001256718.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
gi|3879367|emb|CAB07275.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
Length = 337
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 97/208 (46%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + + G+ +SLSEQ LVDC+ + N GCNGGL Q
Sbjct: 132 VTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQ 191
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYI+ N G+DTEE+YPY G+D C F+ + VG V+ G E++L+ AV P
Sbjct: 192 AFEYIRDNHGVDTEESYPYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGP 251
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YW++KNSWG
Sbjct: 252 ISIAIDAGHRSFQLYKKGVYYDEECSSEELD--HGVLLVGYGTDPEHGDYWIVKNSWGAG 309
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 310 WGEKGYIRIARNRNNHCGVATKASYPLV 337
>gi|157862755|gb|ABV90500.1| cathepsin L, partial [Fasciola gigantica]
Length = 251
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 138/214 (64%), Gaps = 4/214 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FSTTG++E Y ++ IS SEQQLVDC+ F N G
Sbjct: 37 IDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQYMKSQRINISFSEQQLVDCSGDFGNHG 96
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL +A+EY+++ GL+TE +YPY +G C++ + Q+ D + E L+
Sbjct: 97 CSGGLMEKAYEYLRHF-GLETESSYPYRADEGPCQYDKQLGVAQLSDYYIVHSQDEVALK 155
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ +G+ P +VA +V F YKSG+Y C + + NHA++AVGYG EDG YW++K
Sbjct: 156 NLIGVEGPAAVALDVNIDFMMYKSGIYQDEICSSRYL--NHALLAVGYGTEDGTEYWIVK 213
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
NSWG WG+HGY ++ + NMCGIAT AS P+V
Sbjct: 214 NSWGSRWGEHGYIRLARNRDNMCGIATLASLPIV 247
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 134/212 (63%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FSTTG+LE + GK +SLSEQ LVDC++ N+GC GGL Q
Sbjct: 127 VTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCGGGLMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+ N GLD+E++YPYTG D C + V++ G E L AV V
Sbjct: 187 AFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVDVPSGKEHALMKAVASVG 246
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + + F+FY+SG+Y +C + +D H V+AVGYG E G +W++KNS
Sbjct: 247 PVSVAIDAGHESFQFYQSGIYYEKECSSEELD--HGVLAVGYGFEGEDKMGKKFWIVKNS 304
Query: 283 WGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
WGE WGD GY M KN CGIAT ASYP+V
Sbjct: 305 WGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 109/221 (49%), Positives = 136/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TG+LE + GK ISLSEQ LVDC++ N
Sbjct: 116 KSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GC+GGL AF+YIK NGGLD+EE+YPY D CK+ E V+I E
Sbjct: 176 EGCDGGLMDYAFQYIKENGGLDSEESYPYDAMDESCKYRPEYSVANDTGFVDIP-KEEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F+FYK GVY +C + +V+H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSD--NVDHGVLVVGYGYEETESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+WL+KNSWGE WG GY KM KN CGIAT ASYP V
Sbjct: 293 NKFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYPTV 333
>gi|440893559|gb|ELR46281.1| Cathepsin L1 [Bos grunniens mutus]
Length = 330
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 117/274 (42%), Positives = 156/274 (56%), Gaps = 18/274 (6%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATF-----SKNLDLIRSTNCKGLSYRL 105
Q Q +H+ S A A +G + + EE + F K + I ++ + +R
Sbjct: 64 QEYSQGKHSFSMAMNA--FGDM--TNEEFRHTMNGFQRQKNKKGKETIFASIPPSMDWRE 119
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++PVK+QG CGSCW FS TG+LE Q GK +SLSEQ LVDC+Q N+GC+GG
Sbjct: 120 KGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGF 179
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
AF+Y+ GGLD+EE+YPYTG G C ++ N V++ E L AV
Sbjct: 180 IDNAFQYVLDVGGLDSEESYPYTGLVGTCLYNPNNSAANETGFVDLP-KQEKALMKAVAT 238
Query: 226 VRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIK 280
+ P+SVA + + F+FYKSG+Y C + +D HAV+ VGYG E D YWL+K
Sbjct: 239 LGPISVAVDAHNPSFQFYKSGIYYEPNCSSESVD--HAVLVVGYGFEGADSDDNKYWLVK 296
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
NSWGE+WG GY KM + N CGIAT ASYP V
Sbjct: 297 NSWGEHWGMDGYIKMAKDRNNHCGIATMASYPTV 330
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 157/305 (51%), Gaps = 57/305 (18%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
FA + R + K Y S EE R+ + +N + I+ N K SY L +N
Sbjct: 30 FADWMRTHTKSY-SNEEFVFRWNVWRENYNFIQEENRKNNSYYLTMNKFGDLTNAEFNKV 88
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++ VK+QG CGSCW+FSTTGS
Sbjct: 89 YKGLAFDYSAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCGSCWSFSTTGST 148
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E A G +SLSEQ L+DC+ ++ N GCNGGL AFEYI N G+DTE +YPY
Sbjct: 149 EGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYETA 208
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
C+++ N G + +++ G E+ L +AV + P SVA + + F+FY GVY
Sbjct: 209 QYNCRYNPANSGGSLTSYTDVSSGDENALLNAVA-IEPTSVAIDASHNSFQFYSGGVYYE 267
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCA 308
+ C +T +D H V+AVG+G E+G YWL+KNSWG +WG GY KM + N CGIAT A
Sbjct: 268 SSCSSTQLD--HGVLAVGWGTENGQDYWLVKNSWGADWGLQGYIKMARNRHNNCGIATAA 325
Query: 309 SYPVV 313
SYP
Sbjct: 326 SYPTA 330
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 98/207 (47%), Positives = 127/207 (61%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++P+K+Q CGSCW FS S+E + GK +SLSEQ LVDC+ A + GC+GG
Sbjct: 133 VTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDY 192
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y+ N G+DTE +YPY D C+F +VG + V++ G E LQ+AV + P
Sbjct: 193 AFKYVIQNRGIDTEASYPYKAIDESCEFKRNSVGATIHSFVDVKTGDESALQNAVASIGP 252
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+FY SGVY+ C +D H V AVGYG +G PYW +KNSWG +W
Sbjct: 253 ISVAIDAAQPSFQFYSSGVYNEPDCSTEILD--HGVTAVGYGTLNGAPYWKVKNSWGTSW 310
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY M K N CGIAT ASYPVV
Sbjct: 311 GRKGYIFMSRNKQNQCGIATKASYPVV 337
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 141/218 (64%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS GSLE + GK + LSEQ LVDC+ + N
Sbjct: 116 KTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GC+GGLP AF+Y+K NGGLDT +YPY +G C+++ + +V+ ++I +E+
Sbjct: 176 KGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKVVGFMSIP-PSENA 234
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L AV V P+SV ++ F+FYK G+Y C +T ++NHAV+ VGYG E DG Y
Sbjct: 235 LMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSST--NLNHAVLVVGYGEESDGRKY 292
Query: 277 WLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
WL+KNSWG +WG GY KM N CGIA+ ASYP+V
Sbjct: 293 WLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPIV 330
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 133/215 (61%), Gaps = 5/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+R ++ VK+QG CGSCW+FSTTGS E A G+ SLSEQ L+DC+ ++ N G
Sbjct: 118 FDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AFEYI N G+DTE +YPY C+++ N G + +++ G E+ L
Sbjct: 178 CNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTCQYNPANSGGSLTSYTDVSSGDENALL 237
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
+AV P SVA + + F+FY GVY + C +T +D H V+AVG+G EDG YWL+
Sbjct: 238 NAVA-TEPTSVAIDASHNSFQFYSGGVYYESACSSTQLD--HGVLAVGWGTEDGQDYWLV 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG +WG GY KM + N CGIAT ASYP
Sbjct: 295 KNSWGADWGLAGYIKMARNRSNNCGIATSASYPTA 329
>gi|348531513|ref|XP_003453253.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 333
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 97/207 (46%), Positives = 137/207 (66%), Gaps = 5/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+Q CGSCW FS TG+LE + + G+ + LSEQQLVDC++ F N+GC+GG +
Sbjct: 130 VTKVKNQQQCGSCWAFSATGALEGQHFKKTGRLVYLSEQQLVDCSRNFGNRGCDGGWMNN 189
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK NGG+ TE +YPY DG+C ++ +VG V+++ E+ L+ AV + P
Sbjct: 190 AFKYIKDNGGIQTEASYPYQAMDGLCHYNPNSVGAICNGYVDVS-PDEEALKEAVATIGP 248
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+S+A + + F+ Y+SGVY +C + + +H ++ VGYG E G+ YWLIKNSWG W
Sbjct: 249 ISIAMDASHESFQLYQSGVYDEHRCNDYYL--SHGMLVVGYGTEGGLDYWLIKNSWGLGW 306
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY KM K N CGIAT ASYP+V
Sbjct: 307 GKMGYIKMVRNKRNQCGIATAASYPLV 333
>gi|356984263|gb|AET43955.1| cathepsin L2, partial [Reishia clavigera]
Length = 278
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 113/281 (40%), Positives = 147/281 (52%), Gaps = 57/281 (20%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KGL-SYRLGLN------------ 108
F + Y K+Y S E+ +R + +NL I N +GL +YRLG+N
Sbjct: 1 FKKTYNKLY-SAEDESIRRMIWERNLKKIEEHNLEADRGLHTYRLGMNPLGDLTAKDFSW 59
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVK+Q CGSCW FS TGSLE
Sbjct: 60 MLNGYKMSANRTAGATYLPPSNVGDLPSEVDWRTKGYVTPVKNQKQCGSCWAFSATGSLE 119
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+ + G +SLSEQ LVDC++ N+GC GGL QAFEYIK N G+DTE++YPY D
Sbjct: 120 GQHFKKTGTLVSLSEQNLVDCSKKEGNEGCEGGLMDQAFEYIKRNKGIDTEQSYPYRAVD 179
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C+FS +VG +I G+E +LQ AV V P+SVA + D F+ YKSGVY
Sbjct: 180 EKCRFSRADVGATDTGYTDIHKGSEKDLQSAVATVGPISVAIDASRDSFQLYKSGVYYEP 239
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHG 291
KC +T +D H V+AVGYG D YW++KNSWG WG G
Sbjct: 240 KCSSTMLD--HGVLAVGYGTTDSKDYWIVKNSWGTQWGMKG 278
>gi|7271895|gb|AAF44678.1|AF239267_1 cathepsin L, partial [Fasciola gigantica]
Length = 219
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 138/215 (64%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 5 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYG 64
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPYT + C+++ + +V D + G+E EL+
Sbjct: 65 CMGGLMENAYEYLK-QFGLETESSYPYTAVEDQCRYNRQLGVAKVTDYYTVHSGSEVELK 123
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y G+Y S C + + VNHAV+AVGYG + G YW++K
Sbjct: 124 NLVGAEGPAAVAVDVESDFMMYSGGIYQSRTC--SSLRVNHAVLAVGYGTQGGTDYWIVK 181
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 182 NSWGSSWGERGYIRMVRNRGNMCGIASLASLPMVA 216
>gi|211953221|gb|ACJ13772.1| aleurain-like protease [Helianthus petiolaris]
gi|211953223|gb|ACJ13773.1| aleurain-like protease [Helianthus petiolaris]
Length = 114
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 91/112 (81%), Positives = 101/112 (90%)
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
VQVLDSVNIT GAEDEL+HAVG+VRPVSVAFEV+ FR Y GV++S CG+ PMDVNHA
Sbjct: 3 VQVLDSVNITSGAEDELKHAVGVVRPVSVAFEVIANFRLYTGGVFTSDDCGSGPMDVNHA 62
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VVAVGYGVEDGVPYWLIKNSWG +WG +GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 63 VVAVGYGVEDGVPYWLIKNSWGADWGLNGYFKMEMGKNMCGVATCASYPIVA 114
>gi|195729975|gb|ACG50798.1| cathepsin L1 [Fascioloides magna]
Length = 327
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 100/206 (48%), Positives = 130/206 (63%), Gaps = 4/206 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FS+TG++E Y + F +S SEQQLVDC + + N GCNGG +
Sbjct: 121 VTEVKDQGQCGSCWAFSSTGAMEGQYIKKFRTTVSFSEQQLVDCTRNYGNSGCNGGWMER 180
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEY++ N GL+TE +YPY D C++ S+ +V G E L + VG P
Sbjct: 181 AFEYLRRN-GLETESSYPYRAVDDHCRYESQLGVAKVTGYYTEHSGNEVSLMNMVGGEGP 239
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
V+VA +V F YKSG+Y S C + VNHAV+AVGYG E G YW++KNSWG WG
Sbjct: 240 VAVAVDVQSDFSMYKSGIYQSETC--STYYVNHAVLAVGYGTESGTDYWILKNSWGSWWG 297
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVV 313
D GY + + NMCGIA+ AS P+V
Sbjct: 298 DQGYIRFARNRNNMCGIASYASVPMV 323
>gi|19909509|dbj|BAB86959.1| cathepsin L [Fasciola gigantica]
Length = 324
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 136/207 (65%), Gaps = 6/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N GC+GGL
Sbjct: 120 VTTVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYGCSGGLMEN 179
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A+EY+K GL+TE +YPYT +G C+++ + +V D + G+E EL++ VG P
Sbjct: 180 AYEYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGP 238
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
++A +V F Y G+Y S C + +NHAV+AVGYG + G YW++KNSWG +WG
Sbjct: 239 AAIAVDVESDFMMYSGGIYQSQTC----LRLNHAVLAVGYGTQGGTDYWIVKNSWGLSWG 294
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVVA 314
+ GY +M + NMCGI++ AS P+VA
Sbjct: 295 ERGYIRMARNRGNMCGISSLASLPMVA 321
>gi|301612003|ref|XP_002935514.1| PREDICTED: cathepsin K-like [Xenopus (Silurana) tropicalis]
Length = 331
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 133/215 (61%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++P++DQG CGSCW FST G+LE + GK + +S Q LVDC + +N G
Sbjct: 121 IDYRKKGYVTPIRDQGECGSCWAFSTVGALEGQLMKKTGKLVGISPQNLVDCVK--DNFG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y+K N G+D+EEAYPY G D CK++ ++ + G+E L+
Sbjct: 179 CGGGYMTTAFKYVKKNKGIDSEEAYPYVGMDQKCKYNVSGRAAEIKGFKEVKKGSETALK 238
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AVGLV P+SV + +D F YK G+Y C +NHAV+AVGYG + YW+I
Sbjct: 239 KAVGLVGPISVGIDAGLDTFFLYKKGIYYDKSCDGDS--INHAVLAVGYGKQKKGKYWII 296
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWGE+WG+ GY M K N CGIA ASYPV+
Sbjct: 297 KNSWGEDWGNKGYILMAREKGNACGIANLASYPVM 331
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 101/211 (47%), Positives = 131/211 (62%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FS TG+LE + GK +SLSEQ LVDC++A N GCNGGL
Sbjct: 126 VTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF Y+K NGGLD+EE+YPY +DG CK+ E +I E+ L +V V P
Sbjct: 186 AFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQSAANDTGFADIHQD-EESLMLSVATVGP 244
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNSW 283
+SVA + +D FRFY G+Y C + D++H V+ VGYG + + YW++KNSW
Sbjct: 245 ISVAIDASLDTFRFYYKGIYYDPNCSSE--DLDHGVLVVGYGSDEREAENKNYWIVKNSW 302
Query: 284 GENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G WG GY M + N CGIAT AS+P+V
Sbjct: 303 GTQWGMQGYILMAKDRGNHCGIATSASFPIV 333
>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 133/201 (66%), Gaps = 5/201 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQGHCGSCW+FS +GSLE + + GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 133 VTKVKDQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNTGCNGGLMDN 192
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YIK NGG+DTE++YPY +D C + ++N G V+I G ED+L+ AV V P
Sbjct: 193 AFRYIKDNGGIDTEQSYPYLAEDEKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGP 252
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGEN 286
VS+A + + F+ Y GVYS +C + +D H V+ VGYG +DG YWL+KNSW +
Sbjct: 253 VSIAIDASYETFQLYSDGVYSDPECSSQELD--HGVLVVGYGTSDDGQDYWLVKNSWRPS 310
Query: 287 WGDHGYFKMEMGK-NMCGIAT 306
G +GY KM + NMCG+A+
Sbjct: 311 CGLNGYIKMARNQDNMCGVAS 331
>gi|197258086|gb|ACH56227.1| cathepsin S-like cysteine proteinase [Radopholus similis]
Length = 314
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 103/209 (49%), Positives = 134/209 (64%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FSTTGSL A+ +A GK +SLSEQ LVDC+ N GL
Sbjct: 108 VTEVKDQGQCGSCWAFSTTGSLGGAHAKATGKLVSLSEQNLVDCSS--ENSVHEHGLMDV 165
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YI+ NGG+DTE +YPY G + CK+S NVG + V++ G E EL+ AV
Sbjct: 166 AFDYIEENGGIDTERSYPYRGYEQYRCKYSKRNVGATMASYVDLPSGDEQELKIAVATQG 225
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV-PYWLIKNSWGE 285
P+SVA + D F+ Y+SGVY +CGN +++H V+ VGYG + YW++KNSW
Sbjct: 226 PISVAIDASSDSFQLYESGVYKDKQCGNRRSNLDHGVLLVGYGTDPKHGDYWIVKNSWSA 285
Query: 286 NWGDHGYFKM-EMGKNMCGIATCASYPVV 313
WG+ GY +M +NMCGIAT ASYP V
Sbjct: 286 AWGEKGYIRMARNNRNMCGIATMASYPQV 314
>gi|224081608|ref|XP_002191568.1| PREDICTED: counting factor associated protein D-like [Taeniopygia
guttata]
Length = 546
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 110/304 (36%), Positives = 159/304 (52%), Gaps = 51/304 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F + R+ G+ Y SV E++ R + F N+ + S N LSY L LN
Sbjct: 243 FHDYRRQMGRHYGSVRELEHRQSIFVHNMRFVHSRNRAALSYTLSLNQLADRTPQELAAL 302
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F+TTG++E
Sbjct: 303 RGRRRSGTPNHGLPFPTDLYAGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEG 362
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
A G LS+Q L+DC+ F N C+GG +A+E+IK +GG+ + E+Y Y G++
Sbjct: 363 ALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGTYKGQN 422
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
G+C ++ + ++ VN+T G ++ A+ PV+V+ + F FY +GVY
Sbjct: 423 GLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKSFSFYSNGVYYEP 482
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
KC NTP ++HAV+AVGYGV G YWLIKNSW WG+ GY M M N CG+AT A+Y
Sbjct: 483 KCDNTPGSLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDGYILMAMKDNNCGVATEATY 542
Query: 311 PVVA 314
P++A
Sbjct: 543 PILA 546
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 104/216 (48%), Positives = 134/216 (62%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FS+TGSLE + GK + LSEQQLVDC+ + N G
Sbjct: 114 IDWRTQGYVTDVKDQGACGSCWAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCSGDYGNMG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG QAF YIK + G ++E+ YPYTG D C + + V +I E+ LQ
Sbjct: 174 CGGGWMDQAFSYIK-DKGEESEDGYPYTGTDDTCVYDASKVVATDTGYTDIPEMDENALQ 232
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWL 278
AV V P+SVA + F+FY+SGVY +C T +D HAV+AVGYG E+G+ YW+
Sbjct: 233 QAVATVGPISVAIDATHSSFQFYESGVYDEPECSQTNLD--HAVLAVGYGTSEEGLDYWI 290
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSW WG GY +M K N CGIA+ ASYPVV
Sbjct: 291 VKNSWSTGWGMQGYIEMSRNKDNQCGIASKASYPVV 326
>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 107/213 (50%), Positives = 135/213 (63%), Gaps = 12/213 (5%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FS TG+LE + GK +SLSEQ LVDC++ N+GCNGGL
Sbjct: 126 VTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK NGGLD+EE+YPYT D C+++ + V+I E L AV V
Sbjct: 186 AFQYIKDNGGLDSEESYPYTAMDKQDCRYNPKYSAANDTGFVDIPP-QEKALMKAVATVG 244
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP-----YWLIKN 281
P+SVA + + F+FYKSG+Y + C + D+NH V+ VGYG E G+ YWL+KN
Sbjct: 245 PISVAVDAGHESFQFYKSGIYYDSNC--SSKDLNHGVLVVGYGFE-GIDSANNRYWLVKN 301
Query: 282 SWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
SWG WG GY KM + N CGIAT ASYP V
Sbjct: 302 SWGTGWGTDGYIKMAKDRNNHCGIATAASYPTV 334
>gi|38045864|gb|AAR08900.1| cathepsin L [Fasciola gigantica]
Length = 326
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 98/207 (47%), Positives = 133/207 (64%), Gaps = 6/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FS TG++E Y + IS SEQQLVDC+ + N+GC+GG
Sbjct: 120 VTEVKDQGDCGSCWAFSATGAMEGQYMKNQKANISFSEQQLVDCSGDYGNRGCSGGFMEH 179
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT-LGAEDELQHAVGLVR 227
A+EY+ Y GL+TE +YPY ++G CK+ S +GV ++ G E +L H VG
Sbjct: 180 AYEYL-YEVGLETESSYPYKAEEGPCKYDSR-LGVAKVNGFYFDHFGVESKLAHLVGDKG 237
Query: 228 PVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
P +VA +V F Y+ G+Y+S C + + NHA++ VGYG +DG YW++KNSWG W
Sbjct: 238 PAAVAVDVESDFLMYRGGIYASRNCSSEKL--NHAMLVVGYGTQDGTDYWIVKNSWGSLW 295
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GDHGY +M + NMCGIA+ AS PVV
Sbjct: 296 GDHGYIRMARNRDNMCGIASFASLPVV 322
>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
purpuratus]
gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 334
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 101/216 (46%), Positives = 130/216 (60%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FST G+LE + G +SLSEQ LVDC+QA N G
Sbjct: 121 VDWRTEGYVTPVKDQGQCGSCWAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQAEGNDG 180
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG P+ A EYIK NGG+DTE YPY G D C + + +VG + + +E L+
Sbjct: 181 CNGGWPAWADEYIKSNGGIDTEVGYPYEGVDDSCHYRTSDVGATITGFAEVEADSEKALE 240
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY-GVEDGVPYWL 278
A+ V P+SV + F+ Y+SGVY C +T +D H V AVGY DG Y++
Sbjct: 241 KALAQVGPISVCIDATQPSFQLYESGVYDEPDCSSTALD--HCVTAVGYDSTADGDKYYI 298
Query: 279 IKNSWGENWGDHGYFKMEMGKN-MCGIATCASYPVV 313
+KNSWG WG GY M K CGIAT A+YP+V
Sbjct: 299 VKNSWGTTWGQEGYIWMSRDKQKQCGIATNATYPLV 334
>gi|339765072|gb|AEK01110.1| cathepsin L [Cristaria plicata]
gi|397880684|gb|AFO67888.1| cathepsin L [Cristaria plicata]
Length = 333
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 133/220 (60%), Gaps = 10/220 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++ VKDQG CGSC+ FS TG+LE + + GK +SLSEQ +VDC+ N
Sbjct: 119 RQVDWRRKGYVTRVKDQGGCGSCYAFSATGALEGQHFRKTGKLVSLSEQNIVDCSFKEGN 178
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GC GGL ++F YIK N G+D EEAYPY +DG C+F VG V++ E
Sbjct: 179 KGCKGGLMDKSFTYIKNNNGIDKEEAYPYEARDGPCRFRRSEVGATDRGYVDLPENDETA 238
Query: 219 LQHAVGLVRPVSVAFEVVDG----FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV 274
L+HAV + P+SVA +DG FRFY GV+ + C T +NH V+ VGYG +G+
Sbjct: 239 LRHAVATIGPISVA---IDGHHFNFRFYDHGVFDNPNCSKTK--INHGVLVVGYGTRNGL 293
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSWG WG GY M N C IA ASYP+V
Sbjct: 294 DYWMVKNSWGRGWGAKGYILMSRNNDNQCCIACAASYPIV 333
>gi|167427531|gb|ABZ80402.1| cathepsin L6, partial [Fasciola hepatica]
Length = 306
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 97/206 (47%), Positives = 132/206 (64%), Gaps = 4/206 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW FSTTG++E Y + F +S SEQQLVDC+ N GC GG +
Sbjct: 100 VTEVKDQGGCGSCWAFSTTGAIEGQYVKKFQTRVSFSEQQLVDCSTIPGNHGCRGGGMRR 159
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A+EY+K N GL+ E +YPY +G C++ S+ +V +S + G E +L++ +G P
Sbjct: 160 AYEYLKKN-GLEPESSYPYKAVEGQCQYKSDLALAKVTNSQLVRSGNETQLKNLIGAEGP 218
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
SVA +V F Y+SG+Y S C + M NHAV+AVGYG E G+ YW++KNSWG WG
Sbjct: 219 ASVAVDVKPDFSMYRSGIYQSQTCSSRRM--NHAVLAVGYGTEGGMDYWIVKNSWGPRWG 276
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVV 313
+ GY +M + NMCGIA+ S P V
Sbjct: 277 EAGYIRMARNRNNMCGIASAGSLPTV 302
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 101/212 (47%), Positives = 137/212 (64%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGGL Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C ++ +D HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACSSSRLD--HAVLVVGYGYQGADVAGNRYWIVKNS 304
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CGIAT ASYP++
Sbjct: 305 WSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 336
>gi|211953177|gb|ACJ13750.1| aleurain-like protease [Helianthus annuus]
gi|211953179|gb|ACJ13751.1| aleurain-like protease [Helianthus annuus]
gi|211953181|gb|ACJ13752.1| aleurain-like protease [Helianthus annuus]
gi|211953183|gb|ACJ13753.1| aleurain-like protease [Helianthus annuus]
gi|211953187|gb|ACJ13755.1| aleurain-like protease [Helianthus annuus]
gi|211953189|gb|ACJ13756.1| aleurain-like protease [Helianthus annuus]
gi|211953191|gb|ACJ13757.1| aleurain-like protease [Helianthus annuus]
gi|211953193|gb|ACJ13758.1| aleurain-like protease [Helianthus annuus]
gi|211953195|gb|ACJ13759.1| aleurain-like protease [Helianthus annuus]
gi|211953203|gb|ACJ13763.1| aleurain-like protease [Helianthus annuus]
gi|211953205|gb|ACJ13764.1| aleurain-like protease [Helianthus annuus]
gi|211953207|gb|ACJ13765.1| aleurain-like protease [Helianthus annuus]
gi|211953209|gb|ACJ13766.1| aleurain-like protease [Helianthus annuus]
gi|211953213|gb|ACJ13768.1| aleurain-like protease [Helianthus annuus]
gi|211953215|gb|ACJ13769.1| aleurain-like protease [Helianthus annuus]
gi|211953217|gb|ACJ13770.1| aleurain-like protease [Helianthus annuus]
gi|211953219|gb|ACJ13771.1| aleurain-like protease [Helianthus annuus]
Length = 114
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 90/112 (80%), Positives = 101/112 (90%)
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
VQV+DSVNIT GAEDEL+HAVG+VRPVSVAFEV+ FR Y GV++S CG+ PMDVNHA
Sbjct: 3 VQVIDSVNITSGAEDELKHAVGVVRPVSVAFEVIANFRLYTGGVFTSDDCGSGPMDVNHA 62
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VVAVGYGVEDGVPYWLIKNSWG +WG +GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 63 VVAVGYGVEDGVPYWLIKNSWGADWGLNGYFKMEMGKNMCGVATCASYPIVA 114
>gi|351694995|gb|EHA97913.1| Cathepsin L1 [Heterocephalus glaber]
Length = 278
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 105/219 (47%), Positives = 135/219 (61%), Gaps = 9/219 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TGSLE + G+ +SLSEQ LVDC+Q N
Sbjct: 61 KSVDWRKKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGQLVSLSEQNLVDCSQPQGN 120
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AFEY+K N GL++E++YPY GKDG C++ E V+I E
Sbjct: 121 QGCNGGLMDFAFEYVKENKGLESEKSYPYEGKDGSCRYKPELSAANDTGFVDIPQ-REKA 179
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV P+SVA + + F+FYK G+Y +C + D+NH V+ VGYG E +
Sbjct: 180 LMKAVAEKGPISVAVDAGLMSFQFYKDGIYFDPEC--SSKDLNHGVLVVGYGYEEVDTEK 237
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
YWL+KNSWG WG GY K+ + N CGIAT ASYP
Sbjct: 238 NEYWLVKNSWGPEWGAEGYIKIARNRNNHCGIATAASYP 276
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 102/219 (46%), Positives = 134/219 (61%), Gaps = 9/219 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FS TG+LE + GK +SLSEQ LVDC++A N G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF Y+K NGGLD+EE+YPY +DG CK+ E +I E+ L
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQSAANDTGFADIHQD-EESLM 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVP 275
+V V P+SVA + +D FRFY G+Y C + D++H V+ VGYG + +
Sbjct: 237 LSVATVGPISVAIDASLDTFRFYYKGIYYDPNCSSE--DLDHGVLVVGYGSDEREAENKN 294
Query: 276 YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSWG WG GY M + N CGIAT AS+P+V
Sbjct: 295 YWIVKNSWGTQWGMQGYILMAKDRGNHCGIATSASFPIV 333
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 62/127 (48%), Gaps = 11/127 (8%)
Query: 193 VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTK 251
+ + E V VN+ E+ + AV PVS A G F+F K G+Y
Sbjct: 382 ILRTRPECSAADVTGPVNVPQ-QEEAVMLAVAAGGPVSAAIRASLGSFQFCKEGIYYDPN 440
Query: 252 CGNTPMDVNHAVVAVGYGVED----GVPYWLIKNSWGENWGDHGYFKM-EMGKNMCGIAT 306
C + D++H V+ VGYG ++ YW++KNSWG +WG GY + N C I T
Sbjct: 441 CSSE--DLDHGVLVVGYGSDEREAENKNYWIVKNSWGTDWGLQGYMLLVRDWDNHCEITT 498
Query: 307 CASYPVV 313
S+PVV
Sbjct: 499 --SFPVV 503
>gi|1272388|gb|AAB17051.1| cysteine protease, partial [Spirometra mansonoides]
Length = 216
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 99/207 (47%), Positives = 130/207 (62%), Gaps = 5/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW+FS G++E A G +LSEQQLVDC+ + NQGCNGG S
Sbjct: 13 VTSVKNQGQCGSCWSFSANGAIEGAIQIKMGILPTLSEQQLVDCSWEYGNQGCNGGFMSL 72
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y + G++ E Y YT KDG C++ + V V + G E LQ AV ++ P
Sbjct: 73 AFQYAQ-RYGVEAEVDYRYTAKDGFCRYQQDMVVANVTGYAELPQGDEASLQRAVAVIGP 131
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + D GF Y GV+ S C +P D+NH V+ +GYG E+ PYWL+KNSWG +W
Sbjct: 132 ISVGIDANDPGFMSYSHGVFVSKTC--SPDDINHGVLVIGYGTENDEPYWLVKNSWGRSW 189
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY KM K NMCGIA+ ASYP V
Sbjct: 190 GEQGYVKMARNKNNMCGIASVASYPTV 216
>gi|74219261|dbj|BAE26764.1| unnamed protein product [Mus musculus]
Length = 333
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 136/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R+ ++PVK+QG+C S W FS TGSLE + G+ + LSEQ L+DC +
Sbjct: 116 KYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVT 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
C+GG AF+Y+K NGGL TEE+YPY G D C++ +EN V D V I G E+
Sbjct: 176 HDCSGGFMQNAFQYVKDNGGLATEESYPYIGPDRKCRYHAENSAANVRDFVQIP-GREEA 234
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + D F+FY SG+Y +C + +NHAV+ VGYG E DG
Sbjct: 235 LMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKR--VHLNHAVLVVGYGFEGEESDG 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY K+ N CGIAT A+YP+V
Sbjct: 293 NSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 134/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E+
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEEA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
Length = 333
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/221 (47%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGC+GGL AF+Y++ NGGLD+EE+YPY + CK++ E V+I E
Sbjct: 176 QGCDGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP-KLEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F+FYK G+Y +C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMD--HGVLVVGYGFERTGSDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM KN CGIA+ ASYP V
Sbjct: 293 SKYWLVKNSWGEKWGMDGYIKMAKDRKNHCGIASAASYPTV 333
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 103/213 (48%), Positives = 135/213 (63%), Gaps = 7/213 (3%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
L +R ++ VKDQG CGSCW FS GS E AY+++ GK +SLSEQQL+DC N+ G
Sbjct: 113 LDWRSQGYVTGVKDQGDCGSCWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDCTTNVND-G 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GG + F Y++ GL +E +YPYTG+DG C+ S +V +V S + LG E +L
Sbjct: 172 CDGGYLEETFPYVQ-QTGLVSESSYPYTGRDGNCRISESDVVTKV--SKYVLLGGEADLL 228
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
AVG V PVSVA + + Y SGVY S+ C + +NH V+ VGYG +DG YWLIK
Sbjct: 229 EAVGSVGPVSVAMDATYIYS-YASGVYESSLC--SLYSLNHGVLVVGYGTQDGKDYWLIK 285
Query: 281 NSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NSWG WG+ GY K+ G N CGIA YP++
Sbjct: 286 NSWGNTWGEQGYLKLLRGTNECGIAEDDVYPII 318
>gi|38146075|gb|AAR11477.1| cathepsin L [Litopenaeus vannamei]
Length = 297
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 146/285 (51%), Gaps = 55/285 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN--------- 108
+ F +G+ Y SV+E + R + F +N I N + +++ L +N
Sbjct: 15 WQNFKAEHGRHYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEE 74
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQ CGSCW FSTTGSL
Sbjct: 75 IVATMNGFLGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSL 134
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + GK +SLSEQ LVDC+ F N GC GGL QAF YIK N G+DTE++YPY +
Sbjct: 135 EGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQ 194
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSS 249
DG C+F + NVG V++ G+E L+ AV + P+SV + F FY +GVY
Sbjct: 195 DGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHD 254
Query: 250 TKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYF 293
C +T +D H V+AVGYG E+G +WL+KNSW +WGD GY
Sbjct: 255 DHCSSTMLD--HGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYI 297
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/221 (47%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 QGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KLEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F+FYK G+Y +C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMD--HGVLVVGYGFERTGSDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM KN CGIA+ ASYP V
Sbjct: 293 SKYWLVKNSWGEEWGMDGYIKMAKDRKNHCGIASAASYPTV 333
>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/294 (39%), Positives = 152/294 (51%), Gaps = 57/294 (19%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L+F F +++GK Y++ EE R A F NL+ I N + LSY+LG+N
Sbjct: 25 LAFIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDLTLEEFA 84
Query: 109 ---------------------------------------ISPVKDQGHCGSCWTFSTTGS 129
++PVKDQG+CGSCW FS G+
Sbjct: 85 ALKLSSTDMSEGMGDGFVAGAGPTTTTLPTSVDWRKKGVLNPVKDQGYCGSCWAFSAIGA 144
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
LE Y A GK +SLSEQQLVDCA A+ N+GCNGGL +AFEYIK G+D E YPY G
Sbjct: 145 LEPRYAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEYIKAT-GVDKESTYPYVG 203
Query: 190 KDGVCKFSSEN----VGVQVLDSVNITLGAEDELQHAVGLVRPVSVA-FEVVDGFRFYKS 244
D C+ + EN + V + + E L V PVS+A + + F+ YKS
Sbjct: 204 SDETCQATVENKTDGLPVGEVTGNQMLHQTEKALMEGVA-AAPVSIAMYANLQSFQHYKS 262
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
GVYS C ++H VVAVGYG E+G Y++I+NSWG +WG GY ++ G
Sbjct: 263 GVYSDPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKRG 316
>gi|45550332|gb|AAS67922.1| cathepsin L [Artemia franciscana]
Length = 226
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 132/215 (61%), Gaps = 2/215 (0%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK G C SC FS TG+LE+ + GK ISLSEQ L+DC+ + N G
Sbjct: 12 VDWREKGAVTPVKYPGQCASCLAFSPTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLG 71
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG SQAFEYIK N G+DTE Y Y K+ C+ + N G L VNI G ED+L+
Sbjct: 72 CKGGWISQAFEYIKDNKGIDTENKYHYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLK 131
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVS +V +GF+FY GVY C + +NH V+ +G G ++G YWL+
Sbjct: 132 AAVATVGPVSAVIDVSHEGFQFYSKGVYYEPSCKTSFEHLNHEVLVIGCGSDNGEDYWLV 191
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
KNSW ++WGD GY K+ KN CG+AT A YP+V
Sbjct: 192 KNSWSKHWGDEGYLKIARNRKNHCGVATAALYPIV 226
>gi|310751866|gb|ADP09371.1| cathepsin L-like proteinase [Fasciola hepatica]
Length = 326
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 138/215 (64%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC++ + N G
Sbjct: 112 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPY +G C+++ + +V + G+E EL+
Sbjct: 172 CGGGLMENAYEYLK-QFGLETESSYPYRAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y G+Y S C +P+ +NHAV+AVGYG + G YW++K
Sbjct: 231 NLVGAEGPAAVAVDVESDFMMYSGGIYQSQTC--SPLGLNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS +VA
Sbjct: 289 NSWGLSWGERGYIRMARNRGNMCGIASLASLLMVA 323
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 101/212 (47%), Positives = 137/212 (64%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGGL Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C ++ +D HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACSSSRLD--HAVLVVGYGYQGADVAGNRYWIVKNS 304
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CGIAT ASYP++
Sbjct: 305 WSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 336
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDYAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|211953201|gb|ACJ13762.1| aleurain-like protease [Helianthus annuus]
gi|211953211|gb|ACJ13767.1| aleurain-like protease [Helianthus annuus]
Length = 114
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 90/112 (80%), Positives = 101/112 (90%)
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
VQV+DSVNIT GAEDEL+HAVG+VRPVSVAFEV+ FR Y GV++S CG+ PMDVNHA
Sbjct: 3 VQVVDSVNITSGAEDELKHAVGVVRPVSVAFEVIANFRLYTGGVFTSDDCGSGPMDVNHA 62
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VVAVGYGVEDGVPYWLIKNSWG +WG +GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 63 VVAVGYGVEDGVPYWLIKNSWGADWGLNGYFKMEMGKNMCGVATCASYPIVA 114
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/306 (38%), Positives = 157/306 (51%), Gaps = 61/306 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
+ + ++GK S+ E RF F NL I N K LSYRLGL
Sbjct: 42 YEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSM 101
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++ VKDQG CGSCW FST G++E
Sbjct: 102 YLGSRLKRKATKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEG 161
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
G ISLSEQ+LVDC ++N +GCNGGL AFE+I NGG+DTEE YPY G DG
Sbjct: 162 INKIVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDG 220
Query: 193 VCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C + +N V +DS ++ +E+ L+ A+ +P+SVA E F+ Y SG++
Sbjct: 221 RCDQTRKNAKVVTIDSYEDVPANSEESLKKALSH-QPISVAIEGGGRAFQLYDSGIFDGI 279
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIAT 306
CG D++H VVAVGYG E+G YW++KNSWG +WG+ GY +ME CGIA
Sbjct: 280 -CGT---DLDHGVVAVGYGTENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAV 335
Query: 307 CASYPV 312
SYP+
Sbjct: 336 EPSYPI 341
>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
Length = 333
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/265 (43%), Positives = 154/265 (58%), Gaps = 28/265 (10%)
Query: 57 RHALS-FARFARRYGK-IYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKD 114
RH ++ F R + GK +E++ FA+ ++D +R ++PVK+
Sbjct: 89 RHTMNGFQRQKNKKGKEFHETI------FASIPPSVD-----------WREKGYVTPVKN 131
Query: 115 QGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIK 174
QG CGSCW FS TG+LE Q GK +SLSEQ LVDC+Q N+GC+GG AF+Y+
Sbjct: 132 QGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVL 191
Query: 175 YNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFE 234
GGLD+EE+YPYTG G C ++ N V++ E L AV + P+SVA +
Sbjct: 192 DVGGLDSEESYPYTGLVGTCLYNPNNSAANETGFVDLP-KQEKALMKAVANLGPISVAVD 250
Query: 235 VVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNSWGENWGD 289
+ F+FYKSG+Y C + +D HAV+ VGYG E D YWL+KNSWGE+WG
Sbjct: 251 AHNPSFQFYKSGIYYEPNCSSESVD--HAVLVVGYGFEGADSDDNKYWLVKNSWGEHWGM 308
Query: 290 HGYFKMEMGK-NMCGIATCASYPVV 313
+GY KM + N CGIAT ASYP V
Sbjct: 309 NGYIKMAKDRNNHCGIATMASYPTV 333
>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/305 (37%), Positives = 156/305 (51%), Gaps = 57/305 (18%)
Query: 49 VLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN 108
V + + +L+F F +++GK Y++ +E R A F NL+ I N + LSY+LG+N
Sbjct: 14 VYKAVDLETSSLAFIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVN 73
Query: 109 --------------------------------------------------ISPVKDQGHC 118
++PVKDQG+C
Sbjct: 74 EYTDLTLEEFAALKLSSTDMSEGMGDGFVAGAGPTTTTLPTSVDWRKKGVLNPVKDQGYC 133
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCW FS G+LE Y A GK +SLSEQQLVDCA A+ N+GCNGGL +AFEYIK G
Sbjct: 134 GSCWAFSAIGALEPRYAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEYIKAT-G 192
Query: 179 LDTEEAYPYTGKDGVCKFSSEN----VGVQVLDSVNITLGAEDELQHAVGLVRPVSVA-F 233
+D E YPY G D C+ + EN + V + + E L V PVS+A +
Sbjct: 193 VDKESTYPYVGSDETCQATVENKTDGLPVGEVTGNQMLHQTEKALMEGVA-AAPVSIAMY 251
Query: 234 EVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYF 293
+ F+ YKSGVYS C ++H VVAVGYG E+G Y++I+NSWG +WG GY
Sbjct: 252 ANLQSFQHYKSGVYSDPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYV 311
Query: 294 KMEMG 298
++ G
Sbjct: 312 YLKRG 316
>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 98/201 (48%), Positives = 133/201 (66%), Gaps = 5/201 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQGHCGSCW+FS +GSLE + + GK +SLSEQ LVDC+ + N GCNGGL
Sbjct: 133 VTKVKDQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDN 192
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF YIK NGG+DTE++YPY +D C + ++N G V+I G ED+L+ AV V P
Sbjct: 193 AFRYIKDNGGIDTEQSYPYLAEDEKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGP 252
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGEN 286
+S+A + + F+ Y GVYS +C + +D H V+ VGYG +DG YWL+KNSW +
Sbjct: 253 ISIAIDASYETFQLYSDGVYSDPECISQELD--HGVLVVGYGTSDDGQDYWLVKNSWRPS 310
Query: 287 WGDHGYFKMEMGK-NMCGIAT 306
G +GY KM + NMCG+A+
Sbjct: 311 CGLNGYIKMARNQDNMCGVAS 331
>gi|211909242|gb|ACJ12894.1| cathepsin L1D [Fasciola hepatica]
Length = 326
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 98/215 (45%), Positives = 137/215 (63%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPY +G C+++ + +V + G E L+
Sbjct: 172 CGGGLMENAYEYLK-QFGLETESSYPYRAVEGQCRYNRQLGVAKVTGYYTLHSGNEAGLK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
VG P +VA +V F Y+SG+Y S C +P+ +NHAV+AVGYG + G YW++K
Sbjct: 231 SLVGSEGPAAVAVDVESDFMMYRSGIYQSQTC--SPLGLNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGLSWGERGYIRMARNRGNMCGIASLASLPMVA 323
>gi|211909240|gb|ACJ12893.1| cathepsin L1D [Fasciola hepatica]
Length = 326
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 98/215 (45%), Positives = 137/215 (63%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPY +G C+++ + +V + G E L+
Sbjct: 172 CGGGLMENAYEYLK-QFGLETESSYPYRAVEGQCRYNRQLGVAKVTGYYTLHSGNEAGLK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
VG P +VA +V F Y+SG+Y S C +P+ +NHAV+AVGYG + G YW++K
Sbjct: 231 SLVGSEGPAAVAVDVESDFMMYRSGIYQSQTC--SPLGLNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGLSWGERGYIRMARNRGNMCGIASLASLPMVA 323
>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
Length = 336
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 137/212 (64%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGGL Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C ++ +D HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACSSSRLD--HAVLVVGYGYQGADVAGNRYWIVKNS 304
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CG+AT ASYP++
Sbjct: 305 WSDKWGDKGYIYMAKDKNNHCGVATSASYPLM 336
>gi|7271893|gb|AAF44677.1|AF239266_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/256 (40%), Positives = 146/256 (57%), Gaps = 8/256 (3%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
L+F F +Y E+ R + N + + + +R ++ VKDQG CG
Sbjct: 75 LTFEEFKTKYLIEIPRSSELLSRGIPYKANKPAVPES----IDWRDYYYVTEVKDQGQCG 130
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCW FSTTG++E + + S SEQQLVDC + F N GC GG A+EY+K++ GL
Sbjct: 131 SCWAFSTTGAMEGQFRKNERASASFSEQQLVDCTRNFGNHGCGGGYMENAYEYLKHS-GL 189
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGF 239
+T+ YPY +G C++ +V D + G E EL++ VG P +VA +V F
Sbjct: 190 ETDSYYPYQAVEGPCQYDGRLAYAKVTDYYTVHSGDEVELKNLVGTEGPAAVALDVDYDF 249
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK 299
Y+SG+Y S C P + HAV+AVGYG +DG YW++KNSWG +WG+ GY + +
Sbjct: 250 MMYESGIYHSETC--LPDRLTHAVLAVGYGAQDGTDYWIVKNSWGSSWGEKGYIRFARNR 307
Query: 300 -NMCGIATCASYPVVA 314
NMCGIA+ AS P+VA
Sbjct: 308 GNMCGIASLASVPMVA 323
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 123/315 (39%), Positives = 164/315 (52%), Gaps = 56/315 (17%)
Query: 47 TSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG-LSYRL 105
T + I HA F F +YGK Y + EE R F +NL + N + ++YRL
Sbjct: 30 TQLYTPITAEDHA--FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRL 87
Query: 106 GLN---------------------------------------------ISPVKDQGHCGS 120
GLN ++PVKDQG CGS
Sbjct: 88 GLNKFADYTEAEYKRLLGFGGQKNKNPRNIKVLGAPKNDGVNWVEQGAVTPVKDQGQCGS 147
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLD 180
CW+FS TG++E FG SLSEQQLVDC+QA N+GC GG QAF+Y++ L+
Sbjct: 148 CWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQT-ALE 206
Query: 181 TEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-F 239
TE+ YPY D C+ SS V V+V V++T +EL+ A+ PVSVA E F
Sbjct: 207 TEDQYPYEAVDDTCRASSAGV-VKVDSFVDVTPNNVNELKAALDK-GPVSVAIEADQMVF 264
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG- 298
+FY GV + CG T ++H V+AVGYG E G Y+L+KNSWG +WG+ GY K+
Sbjct: 265 QFYSGGVINDASCGTT---LDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAASP 321
Query: 299 KNMCGIATCASYPVV 313
N+CGI + ASYP++
Sbjct: 322 DNICGILSQASYPIM 336
>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
Length = 334
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 105/222 (47%), Positives = 137/222 (61%), Gaps = 10/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + + L ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+++ N
Sbjct: 116 KSVDWTLKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL AF+Y+K NGGLD+EE+YPY G D CK+ E V+I E
Sbjct: 176 EGCNGGLMDNAFQYVKENGGLDSEESYPYLGTDTDSCKYKPECSAANDTGFVDIPQ-REK 234
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L AV V P+SVA + F+FYKSG+Y C + D++H V+ VGYG E +
Sbjct: 235 ALMKAVATVGPISVAIDAGHQSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSN 292
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG +GY KM + N CGIAT ASYP V
Sbjct: 293 NNKFWIVKNSWGPEWGTNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANGTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
Length = 308
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 90 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 149
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 150 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKA 208
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 209 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 266
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 267 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 307
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 123/303 (40%), Positives = 154/303 (50%), Gaps = 63/303 (20%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KGL-SYRLGLN------------ 108
F Y K YES R A F NL+ I N +GL SY +G+N
Sbjct: 1 FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60
Query: 109 ---------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYH 135
++P+K+QG CGSCW+FSTTGS E A+
Sbjct: 61 LYVPSKFNRTMPYNTVYLPATSEDSVDWRTKGAVTPIKNQGQCGSCWSFSTTGSTEGAHA 120
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCK 195
A G +SLSEQQLVDC+ +F NQGCNGGL AF+YI N GLDTEE YPYT +DG C
Sbjct: 121 IATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTAQDGTCN 180
Query: 196 FSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCG 253
E + S ++ ED+L AV PVSVA E GF+ YKSGV+ G
Sbjct: 181 KEKEAKHAATISSYSDVPKNNEDQLAAAVAK-GPVSVAIEADQSGFQLYKSGVFD----G 235
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG---KNMCGIATCASY 310
N +++H V+ VGY + YW++KNSWG WG GY M+ G +CGIA SY
Sbjct: 236 NCGTNLDHGVLVVGYTDD----YWIVKNSWGTTWGVEGYINMKRGVSASGICGIAMQPSY 291
Query: 311 PVV 313
P+V
Sbjct: 292 PIV 294
>gi|211953185|gb|ACJ13754.1| aleurain-like protease [Helianthus annuus]
Length = 114
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 89/112 (79%), Positives = 101/112 (90%)
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
VQV+DSVNIT GAED+L+HAVG+VRPVSVAFEV+ FR Y GV++S CG+ PMDVNHA
Sbjct: 3 VQVIDSVNITSGAEDKLKHAVGVVRPVSVAFEVIANFRLYTGGVFTSDDCGSGPMDVNHA 62
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VVAVGYGVEDGVPYWLIKNSWG +WG +GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 63 VVAVGYGVEDGVPYWLIKNSWGADWGLNGYFKMEMGKNMCGVATCASYPIVA 114
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 123/315 (39%), Positives = 164/315 (52%), Gaps = 56/315 (17%)
Query: 47 TSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG-LSYRL 105
T + I HA F F +YGK Y + EE R F +NL + N + ++YRL
Sbjct: 30 TQLYTPITPEDHA--FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRL 87
Query: 106 GLN---------------------------------------------ISPVKDQGHCGS 120
GLN ++PVKDQG CGS
Sbjct: 88 GLNKFADYTEAEYKRLLGFGGQKNKNPRNIKVLGAPKNDGVNWVEQGAVTPVKDQGQCGS 147
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLD 180
CW+FS TG++E FG SLSEQQLVDC+QA N+GC GG QAF+Y++ L+
Sbjct: 148 CWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVE-QTALE 206
Query: 181 TEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-F 239
TE+ YPY D C+ SS V V+V V++T +EL+ A+ PVSVA E F
Sbjct: 207 TEDQYPYEAVDDTCRASSAGV-VKVDSFVDVTPNNVNELKAALDK-GPVSVAIEADQMVF 264
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG- 298
+FY GV + CG T ++H V+AVGYG E G Y+L+KNSWG +WG+ GY K+
Sbjct: 265 QFYSGGVINDASCGTT---LDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAASP 321
Query: 299 KNMCGIATCASYPVV 313
N+CGI + ASYP++
Sbjct: 322 DNICGILSQASYPIM 336
>gi|149755226|ref|XP_001494409.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 334
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 106/222 (47%), Positives = 138/222 (62%), Gaps = 10/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++A N
Sbjct: 116 KTVDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAED 217
QGCNGGL AF+Y+K NGGLD+EE+YPY K+G C + E V+I E
Sbjct: 176 QGCNGGLMDNAFQYVKDNGGLDSEESYPYLAKEGNNCNYKPEYSAANDTGYVDIPQ-KEK 234
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L AV V P+SVA + + F+FYKSG+Y C + D++H V+ VGYG E +
Sbjct: 235 ALMKAVATVGPISVAIDAGHESFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGRDSN 292
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG +GY KM + N CGIAT ASYP V
Sbjct: 293 NNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|17224950|gb|AAL37181.1|AF320084_1 cathepsin L-like protease [Ancylostoma caninum]
Length = 214
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 135/208 (64%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + +A G+ +SLSEQ LVDC+ + N GCNGGL
Sbjct: 9 VTEVKNQGMCGSCWAFSATGALEGQHARASGQMVSLSEQNLVDCSTKYGNHGCNGGLMDL 68
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTEE+YPY G+D C F +++G V++ G E+ L+ AV P
Sbjct: 69 AFEYIKDNHGIDTEESYPYVGRDMKCHFKKKDIGAVDNGYVDLPEGDEEALKIAVATQGP 128
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+S+A + F+ YK GVY +C + +D H V+ VGYG + + YWL+KNSWG
Sbjct: 129 ISIAIDAGHRTFQLYKKGVYYDEECSSEELD--HGVLLVGYGTDPEAGDYWLVKNSWGTG 186
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CG+AT ASYP+V
Sbjct: 187 WGEKGYIRIARNRNNHCGVATKASYPLV 214
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 159/303 (52%), Gaps = 64/303 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------- 108
+ GK+Y ++ E + RF F NL I N + +Y+LGLN
Sbjct: 58 KQGKVYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARG 117
Query: 109 ---------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYH 135
++ VKDQG CGSCW FST ++E
Sbjct: 118 GMKRNRLRKTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINK 177
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCK 195
G ISLSEQ+LVDC ++N +GCNGGL AFE+I NGG+DTEE YPY +DG C
Sbjct: 178 IVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCD 236
Query: 196 FSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCG 253
+N V +D ++ + +E LQ AV +PVSVA E F+FY SG++S +CG
Sbjct: 237 TYRKNAKVVTIDDYEDVPVNSETALQKAVA-NQPVSVAIEAGGRDFQFYASGIFSG-RCG 294
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKN----MCGIATCAS 309
++H V AVGYG E+G YW+++NSWG++WG++GY +M N +CGIA AS
Sbjct: 295 TQ---LDHGVAAVGYGTENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEAS 351
Query: 310 YPV 312
YP+
Sbjct: 352 YPI 354
>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
Length = 336
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 137/212 (64%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGGL Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C ++ +D HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACSSSRLD--HAVLVVGYGYQGADVAGNRYWIVKNS 304
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CG+AT ASYP++
Sbjct: 305 WSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|7542602|gb|AAF63517.1|AF242733_1 putative cystein proteinase [Capsicum annuum]
Length = 128
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 89/123 (72%), Positives = 105/123 (85%)
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG 238
LDT+EAYPYT K+G+CKFS + V+DSVNITLG EDEL++AV LVRPVSVAFEV+ G
Sbjct: 6 LDTKEAYPYTAKNGICKFSQAKLVSNVIDSVNITLGPEDELKYAVALVRPVSVAFEVIKG 65
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F+ YKSGVY+S +CGNTPMDVNHAV+AVGYGVE+G+PYWLIKNSWG N GD GYFK G
Sbjct: 66 FKQYKSGVYTSAECGNTPMDVNHAVLAVGYGVENGIPYWLIKNSWGANGGDSGYFKWRWG 125
Query: 299 KNM 301
+N+
Sbjct: 126 RNV 128
>gi|15593255|gb|AAL02223.1|AF410883_1 cysteine protease CP19 precursor [Frankliniella occidentalis]
Length = 334
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 107/220 (48%), Positives = 139/220 (63%), Gaps = 8/220 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R +P+KDQG CGSCW+FS TGSLE +SLSEQ LVDC+ F N
Sbjct: 118 KKVDWRSKGAATPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGN 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKF-SSENVGVQVLDSVNITLGAE 216
+GCNGGL AFEY+K NGG+DTEE+YPYT DG C + ++ N GV ++ +E
Sbjct: 178 EGCNGGLMDSAFEYVKSNGGIDTEESYPYTAVDGDSCLYRAANNAGVNT-GYKDVQAKSE 236
Query: 217 DELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGV 274
L+ AV V PVSVA + + F+ Y SG+Y + C + +D H V+AVGYG E
Sbjct: 237 SALRDAVEKVGPVSVAIDASNWSFQMYSSGIYYESACSSDYLD--HGVLAVGYGSEWPNK 294
Query: 275 PYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+W++KNSWG +WG+ GY KM KN CGIAT ASYP+V
Sbjct: 295 EFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASYPLV 334
>gi|218478060|dbj|BAH03396.1| cathepsin L-like cysteine peptidase [Taenia saginata]
Length = 338
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 136/208 (65%), Gaps = 7/208 (3%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG+CGSCW FS+TG+LE A+ + GK ISLSEQQLVDC+ N GCNGG S
Sbjct: 135 VTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSY 194
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDELQHAVGLVR 227
AF+Y++ + ++ E AYPY DG C++ +E++GV V D +I G E L AV V
Sbjct: 195 AFKYLEEH-SIEPESAYPYRATDGPCRY-NESLGVGTVTDIGDIPEGNETALMEAVATVG 252
Query: 228 PVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
P+S+A + GF FY+ G+Y S C + + NH V+A+GYG +DG PYWL+KNSWG
Sbjct: 253 PISIAIDASSLGFMFYRHGIYKSHWCSSKFL--NHGVLAIGYGKQDGKPYWLVKNSWGTR 310
Query: 287 WGDHGYFKMEMG-KNMCGIATCASYPVV 313
WG GY M NMCG+A+ A +P V
Sbjct: 311 WGMKGYIMMAKDYHNMCGVASLADFPYV 338
>gi|211953197|gb|ACJ13760.1| aleurain-like protease [Helianthus annuus]
Length = 114
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 89/112 (79%), Positives = 101/112 (90%)
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
VQV+DSVNIT GAEDEL+HAVG+VRPVSVAFEV+ FR Y GV++S CG+ PMDVNHA
Sbjct: 3 VQVIDSVNITSGAEDELKHAVGVVRPVSVAFEVIANFRLYTGGVFTSGDCGSGPMDVNHA 62
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VVAVGYGVEDGVPYWLIK+SWG +WG +GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 63 VVAVGYGVEDGVPYWLIKDSWGADWGLNGYFKMEMGKNMCGVATCASYPIVA 114
>gi|281200606|gb|EFA74824.1| cysteine proteinase 5 precursor [Polysphondylium pallidum PN500]
Length = 307
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 117/301 (38%), Positives = 151/301 (50%), Gaps = 68/301 (22%)
Query: 75 SVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------------------------- 108
+ +E RF F KN+D + N KG S LGLN
Sbjct: 9 TAQEFGTRFNIFKKNMDFVHKWNAKGSSTVLGLNSMADISNEEYQRVYLGTHIDASQFRQ 68
Query: 109 -------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGIS 143
++P+K+QG CGSCW+FSTTGS E A+ G +S
Sbjct: 69 QAASHKLGRTFKVQAANVDWRAKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFIKTGNLVS 128
Query: 144 LSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVG 202
LSEQ L+DC++ NQGCNGGL + AFEYI N G+DTE +YPY +DG C ++ N
Sbjct: 129 LSEQNLMDCSKPEGNQGCNGGLMTAAFEYIIKNNGIDTESSYPYKAEDGKKCLYNPANSA 188
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNH 261
+ VN+T G+E +L GL PVSVA + + F+ Y SGVY KC T +D H
Sbjct: 189 ATLSSYVNVTTGSESDLAVKSGL-GPVSVAIDASHNSFQLYSSGVYYEPKCSQTQLD--H 245
Query: 262 AVVAVGYGVEDGVP----------YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASY 310
V+ VGYG D +P +W++KNSWG WG GY M + N CGIAT AS
Sbjct: 246 GVLVVGYG-SDALPSAGVSAGSGDWWIVKNSWGTTWGVEGYIYMSRNRNNNCGIATMASL 304
Query: 311 P 311
P
Sbjct: 305 P 305
>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 324
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 158/315 (50%), Gaps = 60/315 (19%)
Query: 56 ARHALS----FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTN---CKGLS-YRLGL 107
A ALS + F + K Y++V E K RF F NL I N +GLS Y +G+
Sbjct: 13 ATEALSDKEKWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGLSTYEMGV 72
Query: 108 N------------------------------------------------ISPVKDQGHCG 119
N ++ VK QG CG
Sbjct: 73 NKFADLTPEEFMERFRPLRKTKPKFLSEQAKFNFDGDLPAEVDWTKQGAVTEVKSQGSCG 132
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCW FSTTGS+E+ GK ISLSEQQLVDC + NN GC GG A EYI+ +G +
Sbjct: 133 SCWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVK--NNSGCAGGWMDIALEYIEADGIM 190
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGF 239
+E+ YPY ++ C+F++ VQ+ I E +LQ AV L PVSVA EV F
Sbjct: 191 -SEDDYPYEERNTTCRFNNSKAAVQIKSYKAIKKNDEIDLQKAVALEGPVSVAIEVTIAF 249
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-G 298
+ Y G+ + +C NT D+ HAV+ GYG +DG YW++KNSWG +G GY +M
Sbjct: 250 QLYARGILNDPQCKNTEGDLTHAVLVTGYGSQDGKDYWIVKNSWGAEYGMDGYLRMSRNA 309
Query: 299 KNMCGIATCASYPVV 313
N CGIAT ASYPV+
Sbjct: 310 DNQCGIATRASYPVL 324
>gi|432114312|gb|ELK36240.1| Aryl hydrocarbon receptor nuclear translocator [Myotis davidii]
Length = 897
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 687 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLMKKTGKLLNLSPQNLVDCVS--ENDG 744
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 745 CGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKALK 804
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 805 KAVARVGPISVAIDASLSSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGKKHWII 862
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 863 KNSWGENWGNKGYILMARNKNNACGIANLASFP 895
>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
Length = 336
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 137/212 (64%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGGL Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C ++ +D HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACSSSRLD--HAVLVVGYGYQGADVAGNRYWIVKNS 304
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CG+AT ASYP++
Sbjct: 305 WSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|28971813|dbj|BAC65418.1| cathepsin L [Pandalus borealis]
Length = 318
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 102/207 (49%), Positives = 127/207 (61%), Gaps = 6/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FSTTG+LE A+ G +SLSEQ LVDC+ N GCNGG+
Sbjct: 116 VTPVKDQGQCGSCWAFSTTGALEGAHFLKHGDLVSLSEQNLVDCST--ENSGCNGGVVQW 173
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A++YIK N G+DTE +YPY +D C+F + +VG V +I E AV P
Sbjct: 174 AYDYIKSNNGIDTESSYPYEAQDLTCRFDAAHVGATVTGYADIPYADEVTQASAVHDDGP 233
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
VSV + + F+ Y SGVY C P +NHAV+ VGYG E+G YWLIKNSWG W
Sbjct: 234 VSVCIDAGHNSFQLYSSGVYYEPNC--NPSSINHAVLPVGYGTEEGSDYWLIKNSWGTGW 291
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G GY K+ K N CG+AT + YP V
Sbjct: 292 GLSGYMKLTRNKSNHCGVATQSCYPNV 318
>gi|545734|gb|AAB30089.1| cysteine protease [Fasciola sp.]
gi|2662308|dbj|BAA23743.1| cathepsin L [Fasciola hepatica]
Length = 325
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 138/215 (64%), Gaps = 5/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY+K GL+TE +YPYT +G C+++ + +V D + G+E EL+
Sbjct: 172 CMGGLMENAYEYLK-QFGLETESSYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG P +VA +V F Y G+Y S C + + VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGAEGPAAVAVDVESDFMMYSGGIYQSRTC--SSLRVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG +WG+ Y +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGSSWGER-YIRMVRNRGNMCGIASLASLPMVA 322
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 128/369 (34%), Positives = 180/369 (48%), Gaps = 84/369 (22%)
Query: 5 VQLVSSVILLLCCAAAASA---SASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALS 61
V+ S LL C A +SA S S+D ++P + ++ + +E
Sbjct: 4 VRASSVACLLFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYE--------------- 48
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
++ +GK Y ++ E + RF F NL + N SYR+GLN
Sbjct: 49 --KWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSM 106
Query: 109 ---------------------------------------ISPVKDQGHCGSCWTFSTTGS 129
+SPVKDQG CGSCW FST +
Sbjct: 107 FLGGNMEMKERSASTKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISA 166
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
+E G+ ISLSEQ+LVDC +++N GCNGGL F++I NGG+DTEE YPY
Sbjct: 167 VEGINQIVTGELISLSEQELVDCDKSYN-MGCNGGLMDYGFQFIINNGGIDTEEDYPYRA 225
Query: 190 KDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVY 247
DG C +N V ++ ++ E+ L+ AV +PVSVA E F+ Y+SGV+
Sbjct: 226 VDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVA-NQPVSVAIEAGGRAFQLYESGVF 284
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNM----CG 303
+ G+ +++H VVAVGYG E+GV YW ++NSWG WG++GY K+E N CG
Sbjct: 285 T----GHCGTNLDHGVVAVGYGTENGVDYWTVRNSWGPKWGENGYIKLERNINATSGKCG 340
Query: 304 IATCASYPV 312
IA+ ASYP
Sbjct: 341 IASMASYPT 349
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/219 (47%), Positives = 134/219 (61%), Gaps = 9/219 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FS TG+LE + GK +SLSEQ LVDC+QA N+G
Sbjct: 118 VDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQAEGNEG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL + AF+Y+K NGGLD+EE+YPY +D CK+ ++ +I E L
Sbjct: 178 CNGGLMNNAFQYVKDNGGLDSEESYPYHAQDESCKYKPQDSAANDTGFFDIPQ-QEKALM 236
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDG----VP 275
AV P+SV + F+FY G+Y C + D++H V+ +GYG E G
Sbjct: 237 VAVATKGPISVGIDASHFTFQFYHEGIYYDPDCSSE--DLDHGVLVIGYGTEIGQSINKT 294
Query: 276 YWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YW++KNSWG NWG GY KM KN CGIAT AS+PVV
Sbjct: 295 YWIVKNSWGANWGIDGYIKMAKDRKNHCGIATMASFPVV 333
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 118/303 (38%), Positives = 156/303 (51%), Gaps = 65/303 (21%)
Query: 69 YGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------------------- 108
+GK Y ++ E + RF F NL I N SY++GLN
Sbjct: 58 HGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLNRFADLTNEEYKAMFLGTKME 117
Query: 109 --------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQ 136
+ PVKDQG CGSCW FST G++E
Sbjct: 118 RKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTVGAVEGINQI 177
Query: 137 AFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF 196
G+ ISLSEQ+LVDC +++ NQGCNGGL AFE+I NGG+DTEE YPY D +C
Sbjct: 178 VTGELISLSEQELVDCDKSY-NQGCNGGLMDYAFEFIINNGGIDTEEDYPYKASDNICDP 236
Query: 197 SSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGN 254
+ +N V +D ++ E+ L+ AV +PVSVA E F+ YKSGV++ +CG
Sbjct: 237 NRKNAKVVTIDGYEDVPENDENSLKKAVAH-QPVSVAIEAGGRAFQLYKSGVFTG-RCG- 293
Query: 255 TPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-----KNMCGIATCAS 309
+++H VVAVGYG E+GV YW+++NSWG WG+ GY +ME CGIA S
Sbjct: 294 --TELDHGVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERNVANTKTGKCGIAIQPS 351
Query: 310 YPV 312
YP
Sbjct: 352 YPT 354
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 114/272 (41%), Positives = 153/272 (56%), Gaps = 33/272 (12%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNIS----------------- 110
++GK Y ++ E + RF F NL I N +Y++G S
Sbjct: 10 KHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGDRYSFRAGEDLPESVDWREKG 69
Query: 111 ---PVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPS 167
PVKDQG+CGSCW FST ++E A G ISLSEQ+LVDC +++N QGCNGGL
Sbjct: 70 AVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYN-QGCNGGLMD 128
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLV 226
AFE+I NGG+D+EE YPY D C + +N V +D ++ E L+ AV
Sbjct: 129 YAFEFIINNGGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVAN- 187
Query: 227 RPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGE 285
+PVSVA E F+ Y+SGV++ +CG ++H VVAVGYG E+ V YW+++NSWG
Sbjct: 188 QPVSVAIEAGGRAFQLYQSGVFTG-QCG---TQLDHGVVAVGYGTENSVDYWIVRNSWGP 243
Query: 286 NWGDHGYFKMEMG-----KNMCGIATCASYPV 312
NWG+ GY K+E CGIA SYP+
Sbjct: 244 NWGESGYIKLERNLAGTETGKCGIAIEPSYPI 275
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 106/221 (47%), Positives = 134/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ N
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + D++H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG+ WG GY K+ + N CG+AT ASYP+V
Sbjct: 293 DKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 98/209 (46%), Positives = 136/209 (65%), Gaps = 7/209 (3%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + + G +SLSEQ LVDC++ + N GCNGGL
Sbjct: 183 VTEVKNQGMCGSCWAFSATGALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDY 242
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEYIK N G+DTE +YPY GK+ C F+ + VG + V++ G E++L+ AV P
Sbjct: 243 AFEYIKDNHGVDTEASYPYKGKEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGP 302
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE--DGVPYWLIKNSWGE 285
+SVA + F+ Y+ GVY +C + +D H V+ VGYG + DG YW++KNSWG
Sbjct: 303 ISVAIDAGHPSFQMYRKGVYYEPQCSSESLD--HGVLVVGYGTDEIDG-DYWIVKNSWGP 359
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY ++ + N CGIA+ ASYP+V
Sbjct: 360 GWGEKGYVRIARNRDNHCGIASKASYPIV 388
>gi|73946536|ref|XP_541257.2| PREDICTED: cathepsin L1 [Canis lupus familiaris]
Length = 333
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 104/211 (49%), Positives = 128/211 (60%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG C CW FS TG+LE + GK +SLSEQ LVDC+ + N+GCNGGL
Sbjct: 126 VTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRGCNGGLMEY 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y+K NGGLD+EE+YPY ++ CK+ E V I L ED L V V P
Sbjct: 186 AFQYVKDNGGLDSEESYPYLARNEPCKYRPEKSAANVTAFWPI-LNEEDGLMTTVATVGP 244
Query: 229 VSVAFE-VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNSW 283
VS A + F+FYK G+Y KC N + NH V+ VGYG E D YW++KNSW
Sbjct: 245 VSAAVDSSPQSFQFYKKGIYYDPKCSNKLL--NHGVLVVGYGFEGAESDNKKYWIVKNSW 302
Query: 284 GENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G NWG GY + + N CGIAT ASYPVV
Sbjct: 303 GTNWGMQGYMLLAKDRDNHCGIATRASYPVV 333
>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
Length = 334
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 107/221 (48%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY ++ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIEIAKDRDNHCGLATAASYPVV 333
>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
Length = 308
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 103/216 (47%), Positives = 129/216 (59%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+Q CGSCW FS TGSLE + +SLSEQ LVDC++ N+G
Sbjct: 95 VDWRQKGAVTKVKNQEQCGSCWAFSATGSLEGQHFLKTNNLVSLSEQNLVDCSRREGNKG 154
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDEL 219
C GG QAF+YIK NGG+DTEE Y Y G+D +C++ S G + +I G E L
Sbjct: 155 CKGGSMDQAFKYIKMNGGIDTEECYSYRGRDESMCRYKSSCSGATLSSYTDIKTGDEMAL 214
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
AV V P+SVA + F+ Y GVY KC +T +D H V+AVGYG +G YWL
Sbjct: 215 MQAVSTVGPISVAIDAGHKSFQLYHHGVYDEPKCSSTHLD--HGVLAVGYGSSNGSDYWL 272
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG WG GY M K N CGIAT A YPVV
Sbjct: 273 VKNSWGTEWGMEGYIMMSRNKHNQCGIATRAIYPVV 308
>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
Length = 336
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 98/207 (47%), Positives = 130/207 (62%), Gaps = 5/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW+FS G++E A G SLSEQQL+DC+ + NQGCNGGL Q
Sbjct: 133 VTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQ 192
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y + G++ E Y YT +DGVC++ + V V + G E LQ AV + P
Sbjct: 193 AFQYAQ-RYGVEAEVDYRYTERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGP 251
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + D GF Y GV+ S C +P ++H V+ VGYG E+G YWL+KNSWG +W
Sbjct: 252 ISVGIDAADPGFMSYSHGVFVSKTC--SPYAIDHGVLVVGYGAENGDAYWLVKNSWGSSW 309
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY KM + NMCGIA+ ASYP V
Sbjct: 310 GEDGYLKMARNRNNMCGIASMASYPTV 336
>gi|246148|gb|AAB21516.1| Cyclic Protein-2 [Rattus sp.]
Length = 247
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 106/221 (47%), Positives = 134/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ N
Sbjct: 29 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 88
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 89 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKA 147
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + D++H V+ VGYG E +
Sbjct: 148 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNK 205
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG+ WG GY K+ + N CG+AT ASYP+V
Sbjct: 206 DKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 246
>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
Length = 334
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 107/221 (48%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK++G CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNKGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 136/212 (64%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGGL Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C + ++HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACTSQ---LDHAVLVVGYGYQGADVAGNRYWIVKNS 303
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CGIAT ASYP++
Sbjct: 304 WSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|313241067|emb|CBY33367.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 133/219 (60%), Gaps = 5/219 (2%)
Query: 97 NCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
N + +R ++P+KDQG CGSCW FSTTGS E A+ + GK ++LSEQQLVDC+
Sbjct: 111 NPDSVDWRNEGYVTPIKDQGQCGSCWAFSTTGSTEGAHFKKTGKLVTLSEQQLVDCSTKE 170
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
+ GCNGGL F YI N G+ TE AYPY +DG CK S + + ++ G+E
Sbjct: 171 GDHGCNGGLMDFGFTYIIENDGITTESAYPYKAQDGSCK-SGMTAAATLSECYDVAQGSE 229
Query: 217 DELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP 275
+L+ AV V P+SVA + + FR YK G+Y C +T +D H V+AVGY +
Sbjct: 230 ADLETAVATVGPISVAIDAHLLSFRLYKQGIYHDRLCSSTRLD--HGVLAVGYKNDPSGN 287
Query: 276 YWLIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
YW++KNSW WG+ GY M + KN CGIAT ASYPV
Sbjct: 288 YWIVKNSWNTTWGNEGYIWMAKDKKNTCGIATAASYPVA 326
>gi|405977173|gb|EKC41636.1| Cathepsin K [Crassostrea gigas]
Length = 942
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 116/302 (38%), Positives = 152/302 (50%), Gaps = 58/302 (19%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN------------ 108
F R Y K Y +E K+R + + +N+D+I N + SYRLG+N
Sbjct: 646 FKRIYSKTYTEQDE-KIRKSIWIQNIDIINRHNKEADMGHHSYRLGMNEFGDMTTKEVTG 704
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
++PVK+QG+CGSCW F+TTG LE
Sbjct: 705 MLNVPKGYATDNVSTFLPPNNLQLPETVNWTKEGYVTPVKNQGYCGSCWAFATTGGLEGQ 764
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
+ + K +SLSEQ LVDC + N GC GGLP A++YI NGG+DTEE+YPY GK+G
Sbjct: 765 HFRKTKKLVSLSEQNLVDCCK--ENLGCTGGLPVTAYKYIARNGGIDTEESYPYLGKNGN 822
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKC 252
C F +G V + G E LQ AV V PV+V+ + + F YK GVY KC
Sbjct: 823 CTFRPPKIGATCQGFVRVPAGDEVGLQKAVASVGPVTVSIDASLKSFYLYKEGVYDDKKC 882
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
NH V+ VGYG G YWL+KNSWG ++G GY M + N CGI+ YP
Sbjct: 883 SKKMF--NHFVLIVGYGKHLGKEYWLVKNSWGMSFGMDGYIMMARNQDNQCGISNQPVYP 940
Query: 312 VV 313
+V
Sbjct: 941 IV 942
>gi|211953199|gb|ACJ13761.1| aleurain-like protease [Helianthus annuus]
Length = 114
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 89/112 (79%), Positives = 100/112 (89%)
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHA 262
VQV+DSVNIT GAEDEL+HAVG+VRPVSVAFEV+ FR Y GV++S CG+ PMDVN A
Sbjct: 3 VQVIDSVNITSGAEDELKHAVGVVRPVSVAFEVIANFRLYTGGVFTSDDCGSGPMDVNRA 62
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
VVAVGYGVEDGVPYWLIKNSWG +WG +GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 63 VVAVGYGVEDGVPYWLIKNSWGADWGLNGYFKMEMGKNMCGVATCASYPIVA 114
>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
Length = 333
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 117/321 (36%), Positives = 157/321 (48%), Gaps = 69/321 (21%)
Query: 58 HALSFAR---------FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYR 104
HA+ +A+ F Y K+Y+ E +LRF F+ N LI N K +S+
Sbjct: 17 HAVPYAQDILEEEWMAFKLEYNKVYQDETEEQLRFKIFNYNKLLIARHNLKWAAGKVSFN 76
Query: 105 LGLN-------------------------------------------------ISPVKDQ 115
L +N ++PVKDQ
Sbjct: 77 LAVNKFADLLDHEFQDLMLGKMSPSGSNFGSSTFLPPVNLTLPDAVDWRKYGFVTPVKDQ 136
Query: 116 GHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKY 175
G CGSCW FSTTGSLE + + G+ ISLSEQ L+DC+ N GC G AF YI+
Sbjct: 137 GSCGSCWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCSPG--NNGCKNGAVEYAFRYIQS 194
Query: 176 NGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFE- 234
N G+DTE +YPY C+F + +G V + G E EL AV V P+SV
Sbjct: 195 NKGIDTEISYPYEAAQNQCRFRRDTIGATSTGFVKLNPGDEMELAQAVATVGPISVLINS 254
Query: 235 VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYF 293
+D F+FY GVY+ C P + HAV+ VGYG +D G +WL+KNSW +WG+ GY
Sbjct: 255 SLDSFKFYHDGVYNDPSC--NPNKLTHAVLVVGYGTDDRGGDFWLVKNSWSTHWGEQGYV 312
Query: 294 KMEM-GKNMCGIATCASYPVV 313
K++ N+CGIA+ A YP+V
Sbjct: 313 KIKRNANNLCGIASNALYPLV 333
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 90/205 (43%), Positives = 129/205 (62%), Gaps = 4/205 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FS TG++E+ + GK ISLSEQ+L+DC ++GCNGGLP
Sbjct: 260 VTPVKDQGSCGSCWAFSVTGNIESLWAIKTGKLISLSEQELIDCDVI--DKGCNGGLPIN 317
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF IK GGL+ E+ YPY K+G C + V + D+V I E ++ + P
Sbjct: 318 AFREIKRMGGLEPEDQYPYEAKNGTCHLVRAQIAVSIDDAVEIPRN-ETVMKAWIAQRGP 376
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
+SV + + +YKSG+ +K P +NH V+ GYG+E+ +PYW IKNSWGE WG
Sbjct: 377 LSVGIDA-ELLSYYKSGILHPSKSRCPPSKINHGVLITGYGIENNLPYWTIKNSWGEQWG 435
Query: 289 DHGYFKMEMGKNMCGIATCASYPVV 313
++GYF++ GKN+CG++ S ++
Sbjct: 436 ENGYFQLMRGKNICGVSDLVSSAII 460
>gi|348542778|ref|XP_003458861.1| PREDICTED: digestive cysteine proteinase 3-like [Oreochromis
niloticus]
Length = 218
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 99/199 (49%), Positives = 130/199 (65%), Gaps = 5/199 (2%)
Query: 117 HCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYN 176
CGSCW FS TG+LE + + G +SLSEQQLVDC++ F N GC+GG AF+YIK N
Sbjct: 23 QCGSCWAFSATGALEGQHFKKTGNLVSLSEQQLVDCSRNFFNHGCDGGWMIPAFKYIKDN 82
Query: 177 GGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV 236
GG+ TEE+Y Y +DG C +++ VG Q E+ L+ AV + P+S+A +
Sbjct: 83 GGIQTEESYTYEARDGRCHYNANFVGAQC-SGYGTVKQDEEALKQAVAAIGPISIAVDAS 141
Query: 237 -DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKM 295
+ F+ Y+SGVY C N +++NHAV+AVGYG E+G YWL+KNSWG WG+ GY KM
Sbjct: 142 HESFQLYQSGVYDEPWCSN--INLNHAVLAVGYGTENGHDYWLVKNSWGSEWGNKGYIKM 199
Query: 296 EMGK-NMCGIATCASYPVV 313
K N CGIAT ASYP+V
Sbjct: 200 TRNKDNQCGIATEASYPLV 218
>gi|344271939|ref|XP_003407794.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 335
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 133/212 (62%), Gaps = 11/212 (5%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG C SCW FS TG+LE + GK +SLSEQ LVDC++ +N GC+GGL +
Sbjct: 128 VTPVKDQGSCHSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPESNNGCSGGLMDK 187
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K NGGLD+EE+YPYT K+ C + E VNI E L +AV V
Sbjct: 188 AFQYVKNNGGLDSEESYPYTAKESRNCLYKPEFSAANNTGFVNIP-PQEKALMNAVASVG 246
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP----YWLIKNS 282
P+SVA + + FRFYKSG+Y C + VNH V+ VGYG E P YWL+KNS
Sbjct: 247 PISVAVDASLKSFRFYKSGIYFDPACR---LAVNHGVLVVGYGFEGTDPDKNKYWLVKNS 303
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG++WG GY K+ + N CGIA ASYP V
Sbjct: 304 WGKSWGADGYIKIAKDRNNHCGIARAASYPTV 335
>gi|333827692|gb|AEG19548.1| cathepsin L-like cysteine protease [Taenia pisiformis]
Length = 338
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 112/253 (44%), Positives = 151/253 (59%), Gaps = 22/253 (8%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWT 123
R A + G+++++++ FA +D R N ++ VK+QG+CGSCW
Sbjct: 105 RVAGKCGRVWKALKS----FADLPDTVDW-RDKNL----------VTEVKNQGNCGSCWA 149
Query: 124 FSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEE 183
FS+TG+LEAA + GK ISLSEQQLVDC+ N GCNGG S AF+Y++ + ++ E
Sbjct: 150 FSSTGALEAALAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSNAFKYLE-DHSIEPES 208
Query: 184 AYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRF 241
AYPY DG C++ +E++GV V D I G E L AV V P+S+A + GF F
Sbjct: 209 AYPYRATDGPCRY-NESLGVGTVTDIGEIPEGNETALMEAVATVGPISIAIDASSLGFMF 267
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KN 300
Y+ G+Y S C + + NH V+AVGYG DG PYWL+KNSWG WG GY M N
Sbjct: 268 YRHGIYKSHWCSSKFL--NHGVLAVGYGKLDGKPYWLVKNSWGSGWGMKGYIMMAKDYHN 325
Query: 301 MCGIATCASYPVV 313
MCGIA+ A +P V
Sbjct: 326 MCGIASLADFPYV 338
>gi|41323856|gb|AAS00027.1| cathepsin L-like cysteine proteinase [Taenia solium]
Length = 339
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 136/208 (65%), Gaps = 7/208 (3%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG+CGSCW FS+TG+LE A+ + GK ISLSEQQLVDC+ N GCNGG S
Sbjct: 136 VTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSY 195
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDELQHAVGLVR 227
AF+Y++ + ++ E AYPY DG C++ +E++GV V D +I G E L AV V
Sbjct: 196 AFKYLEEH-FIEPESAYPYRATDGPCRY-NESLGVGTVTDIGDIPEGNETALMEAVATVG 253
Query: 228 PVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
P+S+A + GF FY+ G+Y S C + + NH V+A+GYG +DG PYWL+KNSWG
Sbjct: 254 PISIAIDASSLGFMFYRHGIYKSHWCSSKFL--NHGVLAIGYGKQDGKPYWLVKNSWGTR 311
Query: 287 WGDHGYFKMEMG-KNMCGIATCASYPVV 313
WG GY M NMCG+A+ A +P V
Sbjct: 312 WGMKGYIMMAKDYHNMCGVASLADFPYV 339
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 120/330 (36%), Positives = 162/330 (49%), Gaps = 63/330 (19%)
Query: 42 LRDFETSVLQ-VIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG 100
+R + SVL V+ +F F +GK YE +E LR F +NL I N +
Sbjct: 145 MRLYIASVLALVVAVGADLTNFEHFKEHFGKTYEG-DEHALRQGIFQRNLAHIEKFNAEK 203
Query: 101 LSYR---------------------LGLN------------------------------- 108
+ R LGL
Sbjct: 204 AASRGYTLGITQFADMSTAEFRQTYLGLRMNASTIAKLRKLQREVVADDRDLPEAVDWRD 263
Query: 109 ---ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
+SPVKDQG CGSCW FST+G++E + G+ +SLSEQQ+VDC ++ + GCNGG
Sbjct: 264 KGAVSPVKDQGQCGSCWAFSTSGAIEGQHFLKNGELLSLSEQQMVDC--SWLDFGCNGGQ 321
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
P A EY+++NGGL+ E AYPY G G C ++ ++ +E LQ AV
Sbjct: 322 PMLAMEYVRFNGGLELETAYPYKGVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAK 381
Query: 226 VRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWG 284
V P+SV + + F+ YKSG+Y+ C + +D HAV+AVGYG D YWL+KNSW
Sbjct: 382 VGPISVGMDASGEDFQHYKSGIYNPESCSSIGLD--HAVLAVGYGTSDDGDYWLVKNSWN 439
Query: 285 ENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+WG+ GYFK+ K N CGIAT YP V
Sbjct: 440 TSWGEKGYFKLPRNKGNKCGIATTPIYPTV 469
>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
Length = 342
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 100/208 (48%), Positives = 126/208 (60%), Gaps = 6/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS+TGSLE + + G+ +SLSEQ LVDC + + N GCNGG
Sbjct: 136 VTPVKNQGACGSCWAFSSTGSLEGQHFRLTGQLVSLSEQNLVDCTKKYGNAGCNGGWMDN 195
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKF--SSENVGVQVLDSVNITLGAEDELQHAVGLV 226
AF Y+K N G+DTE YPY G D C + S + G V++ G E L+ AV V
Sbjct: 196 AFNYVKANNGIDTEAFYPYEGHDDWCGYDGSPGHKGANCTGHVDVQQGDELALKQAVATV 255
Query: 227 RPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGE 285
PVSV + F+ YKSG+Y C N+ D HAV+ VGYG + G YWL+KNSWG
Sbjct: 256 GPVSVGIDATHRSFQLYKSGIYDEVACSNSSTD--HAVLVVGYGSQGGHDYWLVKNSWGT 313
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPV 312
+WG GY M K N C IA+ ASYP
Sbjct: 314 SWGMDGYIMMSRNKGNQCAIASYASYPT 341
>gi|355567871|gb|EHH24212.1| Cathepsin L1 [Macaca mulatta]
Length = 333
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 105/221 (47%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y+ NGGLD+EEAYPY + CK++ E V+I E
Sbjct: 176 EGCNGGLMDYAFQYVADNGGLDSEEAYPYEATEESCKYNPEYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 SKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 141/218 (64%), Gaps = 12/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R+ ++P+KDQG CGSCW FST ++EA GK +SLSEQ+LVDC +A+ N+G
Sbjct: 132 VDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAY-NEG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AFE+I NGG+DT++ YPY G DG+C + +N V +D ++ E+ L
Sbjct: 191 CNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYDENAL 250
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ AV +PVSVA E + Y+SGV++ KCG + ++H VV VGYG E+GV YWL
Sbjct: 251 KKAVAH-QPVSVAIEASGRALQLYQSGVFTG-KCGTS---LDHGVVVVGYGSENGVDYWL 305
Query: 279 IKNSWGENWGDHGYFKME----MGKNMCGIATCASYPV 312
++NSWG WG+ GYFKM+ CGI ASYPV
Sbjct: 306 VRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343
>gi|313235898|emb|CBY11285.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 132/219 (60%), Gaps = 5/219 (2%)
Query: 97 NCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
N + +R ++P+KDQG CGSCW FSTTGS E A+ + GK + LSEQQLVDC+
Sbjct: 111 NPDAVDWRPQGYVTPIKDQGQCGSCWAFSTTGSTEGAHFKKTGKLVMLSEQQLVDCSTKE 170
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
+ GCNGGL F YI N G+ TE AYPY +DG CK S + + ++ G+E
Sbjct: 171 GDHGCNGGLMDFGFTYIIENDGITTESAYPYKAQDGSCK-SGMTAAATLSECYDVAQGSE 229
Query: 217 DELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP 275
+L+ AV V P+SVA + + FR YK G+Y C +T +D H V+AVGY +
Sbjct: 230 ADLETAVATVGPISVAIDAHLLSFRLYKQGIYHDRLCSSTRLD--HGVLAVGYKNDPSGN 287
Query: 276 YWLIKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
YW++KNSW WG+ GY M + KN CGIAT ASYPV
Sbjct: 288 YWIVKNSWNTTWGNEGYIWMAKDKKNTCGIATAASYPVA 326
>gi|30017423|ref|NP_835199.1| testin-2 precursor [Mus musculus]
gi|81895036|sp|Q80UB0.1|TEST2_MOUSE RecName: Full=Testin-2; Contains: RecName: Full=Testin-1; Flags:
Precursor
gi|29289939|gb|AAN63093.1| testin precursor [Mus musculus]
gi|38173997|gb|AAH61218.1| RIKEN cDNA 4930486L24 gene [Mus musculus]
Length = 333
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R+ ++PVK+QG+C S W FS TGSLE + G+ + LSEQ L+DC +
Sbjct: 116 KYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVT 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
C+GG AF+Y+K NGGL TEE+YPY G C++ +EN V D V I G E+
Sbjct: 176 HDCSGGFMQNAFQYVKDNGGLATEESYPYIGPGRKCRYHAENSAANVRDFVQIP-GREEA 234
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + D F+FY SG+Y +C + +NHAV+ VGYG E DG
Sbjct: 235 LMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKR--VHLNHAVLVVGYGFEGEESDG 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY K+ N CGIAT A+YP+V
Sbjct: 293 NSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333
>gi|332384364|gb|AEE69034.1| cysteine protease [Taenia pisiformis]
Length = 338
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 112/253 (44%), Positives = 151/253 (59%), Gaps = 22/253 (8%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWT 123
R A + G+++++++ FA +D R N ++ VK+QG+CGSCW
Sbjct: 105 RVAGKCGRVWKALKS----FADLPDTVDW-RDKNL----------VTEVKNQGNCGSCWA 149
Query: 124 FSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEE 183
FS+TG+LEAA + GK ISLSEQQLVDC+ N GCNGG S AF+Y++ + ++ E
Sbjct: 150 FSSTGALEAALAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSNAFKYLE-DHSIEPES 208
Query: 184 AYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRF 241
AYPY DG C++ +E++GV V D I G E L AV V P+S+A + GF F
Sbjct: 209 AYPYRATDGPCRY-NESLGVGTVTDIGEIPEGNETALMEAVATVGPISIAIDASSLGFMF 267
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KN 300
Y+ G+Y S C + + NH V+AVGYG DG PYWL+KNSWG WG GY M N
Sbjct: 268 YRHGIYKSHWCSSKFL--NHGVLAVGYGKLDGKPYWLVKNSWGSGWGMKGYIMMAKDYHN 325
Query: 301 MCGIATCASYPVV 313
MCGIA+ A +P V
Sbjct: 326 MCGIASLADFPYV 338
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 116/259 (44%), Positives = 146/259 (56%), Gaps = 11/259 (4%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNL-DLIRSTNCKGLSYRLGLNISPVKDQGHC 118
L + F R Y K + FS + DL S + +R ++ +K+QG C
Sbjct: 75 LESSEFGRLYNGYNNKPSMKKAQSKVFSTKVGDLPTS-----VDWRTKGFVTAIKNQGQC 129
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCW FS LE + A G +SLSEQ LVDC+ A NQGCNGGL AF+Y+ NGG
Sbjct: 130 GSCWAFSAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGG 189
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNI-TLGAEDELQHAVGLVRPVSVAFEVVD 237
+DTE +YPY D CKF++ NVG +I +E LQ AV +V P+SVA +
Sbjct: 190 IDTEASYPYKAVDQKCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASH 249
Query: 238 -GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKME 296
F+ YKSGVYS + C T +D H V AVGY GV YW++KNSWG WG GY M
Sbjct: 250 TSFQLYKSGVYSESACSQTSLD--HGVTAVGYDSSSGVAYWIVKNSWGTTWGQAGYIWMS 307
Query: 297 MGK-NMCGIATCASYPVVA 314
K N CGIAT ASYP+V+
Sbjct: 308 RNKNNQCGIATAASYPIVS 326
>gi|148709357|gb|EDL41303.1| RIKEN cDNA 4930486L24 [Mus musculus]
Length = 334
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R+ ++PVK+QG+C S W FS TGSLE + G+ + LSEQ L+DC +
Sbjct: 117 KYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVT 176
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
C+GG AF+Y+K NGGL TEE+YPY G C++ +EN V D V I G E+
Sbjct: 177 HDCSGGFMQNAFQYVKDNGGLATEESYPYIGPGRKCRYHAENSAANVRDFVQIP-GREEA 235
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + D F+FY SG+Y +C + +NHAV+ VGYG E DG
Sbjct: 236 LMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKR--VHLNHAVLVVGYGFEGEESDG 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY K+ N CGIAT A+YP+V
Sbjct: 294 NSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 334
>gi|149755237|ref|XP_001495795.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 339
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 136/220 (61%), Gaps = 8/220 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ A N
Sbjct: 116 KSVDWRKKGYVTPVKNQGLCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GC+GGL AF+Y+K NGGLD+E++YPY +DG CK+ E ++I E
Sbjct: 176 EGCSGGLMDYAFQYVKDNGGLDSEKSYPYLAEDGFCKYKPEYSAANDTGFLDIQQ-QEKF 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE---DGV 274
L AV V P+S + ++ F+FYK G+Y C + +D H V+ VGYG E
Sbjct: 235 LMEAVATVGPISAGIDASLESFQFYKEGIYYDPDCSSKYLD--HGVLVVGYGFEGKDSRN 292
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWGE+WG +GY KM + N CGIAT ASYP +
Sbjct: 293 KYWLVKNSWGEDWGMNGYIKMAKDRENHCGIATMASYPSL 332
>gi|15593249|gb|AAL02221.1|AF410881_1 cysteine protease CP10 precursor [Frankliniella occidentalis]
Length = 334
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/219 (47%), Positives = 136/219 (62%), Gaps = 6/219 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++P+KDQG CGSCW+FS TGSLE +SLSEQ LVDC+ F N
Sbjct: 118 KKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGN 177
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL AFEY++ NGG+DTEE+YPYT DG C + + N ++ +E
Sbjct: 178 EGCNGGLMDSAFEYVESNGGIDTEESYPYTAVDGDSCLYKAANNAGVNTGYKDVQAKSES 237
Query: 218 ELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVP 275
L+ AV PVSVA + + F+ Y SG+Y + C + +D H V+AVGYG E
Sbjct: 238 ALRDAVEKAGPVSVAIDASNWSFQMYSSGIYYESACSSDYLD--HGVLAVGYGSEWPNKE 295
Query: 276 YWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+W++KNSWG +WG+ GY KM KN CGIAT ASYP+V
Sbjct: 296 FWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASYPLV 334
>gi|307141900|gb|ADN34745.1| putative cysteine peptidase [Echinococcus granulosus]
Length = 218
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/211 (48%), Positives = 128/211 (60%), Gaps = 6/211 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++P+KDQG CGSCW FS TG+LE + GK ISLSEQQLVDC+ N+G
Sbjct: 11 IDWRKKGLVTPIKDQGDCGSCWAFSATGALEGQLKRKKGKLISLSEQQLVDCSTDMGNEG 70
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG + AF Y NG ++E YPYT DG CKF+S V +V V + ED+L+
Sbjct: 71 CNGGYMNDAFRYWMQNGA-ESESDYPYTAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLK 129
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWL 278
+V V PVSVA + GF YK G+Y C +D HAV+ VGY + G YW+
Sbjct: 130 LSVAQVGPVSVAIDAASSGFMLYKKGIYQDNTCSQQYLD--HAVLVVGYDADMAGQKYWI 187
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCA 308
+KNSWGE+WG GY M K NMCGIAT A
Sbjct: 188 VKNSWGEDWGQRGYIWMARDKGNMCGIATMA 218
>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
Length = 334
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 107/221 (48%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY G+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSLGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|47076309|emb|CAD89795.1| putative cathepsin L protease [Meloidogyne incognita]
Length = 383
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 134/211 (63%), Gaps = 8/211 (3%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA--QAFNNQGCNGGLP 166
++ VK+QG CGSCW FS TG+LE + + G +SLSEQ L+DC + + N GCNGGL
Sbjct: 175 VTSVKNQGMCGSCWAFSATGALEGQHSRKLGTLVSLSEQNLIDCTKGEPYGNMGCNGGLM 234
Query: 167 SQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGL 225
AF+YI+ N G+DTE +YPY K+G C F NVG V++ G ED+L+ AV
Sbjct: 235 DNAFQYIEDNKGVDTENSYPYKAKNGKKCLFKRSNVGATDTGYVDLPSGDEDKLKIAVAT 294
Query: 226 VRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPYWLIKNSW 283
P+SVA + F+ Y GVY C +P ++ H V+ VGYG +D YWL+KNSW
Sbjct: 295 QGPISVAIDAGHRSFQLYAHGVYDEEAC--SPDNLGHGVLVVGYGTDDIHGDYWLVKNSW 352
Query: 284 GENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
GE+WG++GY +M K N CGIA+ ASYP+V
Sbjct: 353 GEHWGENGYIRMSRNKDNQCGIASKASYPLV 383
>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
erinaceieuropaei]
gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
erinaceieuropaei]
Length = 336
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 98/207 (47%), Positives = 130/207 (62%), Gaps = 5/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW+FS G++E A G SLSEQQL+DC+ + NQGCNGGL Q
Sbjct: 133 VTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQ 192
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y + G++ E Y YT +DGVC++ + V V + G E LQ AV + P
Sbjct: 193 AFQYAQ-RYGVEAEVDYRYTERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGP 251
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SV + D GF Y GV+ S C +P ++H V+ VGYG E+G YWL+KNSWG +W
Sbjct: 252 ISVGIDAADPGFMSYSHGVFVSKTC--SPYAIDHGVLVVGYGAENGEAYWLVKNSWGSSW 309
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY KM + NMCGIA+ ASYP V
Sbjct: 310 GEGGYVKMARNRNNMCGIASMASYPTV 336
>gi|218478062|dbj|BAH03397.1| cathepsin L-like cysteine peptidase [Taenia asiatica]
Length = 338
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/208 (48%), Positives = 136/208 (65%), Gaps = 7/208 (3%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG+CGSCW FS+TG+LE A+ + GK ISLSEQQLVDC+ N GCNGG S
Sbjct: 135 VTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSY 194
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDELQHAVGLVR 227
AF+Y++ + ++ E AYPY DG C++ +E++GV V D +I G E L AV V
Sbjct: 195 AFKYLEEH-SIEPESAYPYRATDGPCRY-NESLGVGTVTDIGDIPEGNETALMEAVATVG 252
Query: 228 PVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
P+S+A + GF FY+ G+Y S C + + NH V+A+GYG ++G PYWL+KNSWG
Sbjct: 253 PISIAIDASSLGFMFYRHGIYKSHWCSSKFL--NHGVLAIGYGKQEGKPYWLVKNSWGTR 310
Query: 287 WGDHGYFKMEMG-KNMCGIATCASYPVV 313
WG GY M NMCG+A+ A +P V
Sbjct: 311 WGMKGYIMMAKDYHNMCGVASLADFPYV 338
>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
Length = 329
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 101/232 (43%), Positives = 140/232 (60%), Gaps = 11/232 (4%)
Query: 87 SKNLDLIRSTNCKG-----LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG 141
S+N D + + +G + YR ++PVK+QG CGSCW FS+ G+LE + GK
Sbjct: 100 SRNNDTLYIPDWEGRAPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKL 159
Query: 142 ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV 201
++LS Q LVDC N GC GG + AF+Y++ N G+D+E+AYPY G+D C ++
Sbjct: 160 LNLSPQNLVDCVSE--NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGK 217
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVN 260
+ I +G E L+ AV V PVSVA + + F+FY GVY C + ++N
Sbjct: 218 AAKCRGYREIPVGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLN 275
Query: 261 HAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
HAV+AVGYG++ G +W+IKNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 276 HAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|186688053|gb|ACC86112.1| cathepsin K [Paralichthys olivaceus]
Length = 330
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/235 (43%), Positives = 139/235 (59%), Gaps = 7/235 (2%)
Query: 82 RFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG 141
R +F+ LD S K + YR ++PVK+QG CGSCW FS+ G+LE + G+
Sbjct: 100 RQRSFTMALDERVSKLPKFVDYRKEGMVTPVKNQGSCGSCWAFSSAGALEGQLAKKTGQL 159
Query: 142 ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV 201
+ LS Q VDC N GC GG + AF+Y++ NGG+D+EEAYPY G+D C+++S +
Sbjct: 160 MDLSPQNPVDCVT--ENNGCGGGYMTNAFQYVQENGGIDSEEAYPYVGEDQSCRYNSSGM 217
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVN 260
Q + +G E L A+ V PVSV + F+FY+ GVY C D+N
Sbjct: 218 AAQCKGYKEVPVGDEHALAVALFKVGPVSVGIDASQSSFQFYQRGVYYDRNCNKD--DIN 275
Query: 261 HAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
HAV+AVGYG+ G YW+IKNSW ENWG GY M + N+CGIA ASYP++
Sbjct: 276 HAVLAVGYGISSKGKKYWIIKNSWSENWGKKGYILMARNRDNLCGIANLASYPIM 330
>gi|431917800|gb|ELK17041.1| Cathepsin L1 [Pteropus alecto]
Length = 334
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/220 (46%), Positives = 134/220 (60%), Gaps = 10/220 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS TGSLE + GK +SLSEQ LVDC+++ N+G
Sbjct: 118 VDWRQKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQNLVDCSRSQGNEG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDEL 219
CNGGL AF+YIK NGGLD+EE+YPY K+ C + E V+I E L
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYLAKESDTCNYKPEYSAANDTGFVDIPQ-REKSL 236
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP--- 275
AV V P+SVA + F+FY G+Y C + D++H V+ +GYG E G P
Sbjct: 237 MKAVATVGPISVAIDAGHSSFQFYNKGIYYEPDC--SSKDLDHGVLVIGYGSEGGDPKSN 294
Query: 276 -YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG +GY KM + N CGIAT ASYP V
Sbjct: 295 KFWIVKNSWGPEWGMNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
Length = 333
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y+ NGGLD+EE+YPY + CK++ E V+I E
Sbjct: 176 EGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 SKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 136/215 (63%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R +SP+K+QG CGSCW+FS TG+LE+ G SLSEQQLVDC+ ++ N G
Sbjct: 114 VDWRTSGCVSPIKNQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSGSYGNYG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT-LGAEDEL 219
CNGG P QAF+YI+ NGG+D+E YPY + G C ++S ++T +G+E L
Sbjct: 174 CNGGWPDQAFQYIQANGGIDSESYYPYQARVGTCHYNSAYSAATCSGYQDVTPVGSESAL 233
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
Q+ V V P+S+A + G++ Y+SGV++ C T +HAV+ VGYG +G YWL+
Sbjct: 234 QYYVANVGPLSIAID-ASGWQSYQSGVFNDPSCSQT---ADHAVLLVGYGTYNGQDYWLV 289
Query: 280 KNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
KNSWG WG+ GY M N CGIA ASYP+V
Sbjct: 290 KNSWGTWWGEQGYIMMTRNANNQCGIANHASYPLV 324
>gi|440906717|gb|ELR56946.1| Cathepsin K [Bos grunniens mutus]
Length = 338
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 128 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 185
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 186 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALK 245
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY+ GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 246 RAVARVGPISVAIDASLTSFQFYRKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 303
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 304 KNSWGENWGNKGYILMARNKNNACGIANLASFP 336
>gi|410990008|ref|XP_004001242.1| PREDICTED: cathepsin L1 isoform 1 [Felis catus]
Length = 333
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 135/219 (61%), Gaps = 9/219 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG+C CW FS TG+LE + GK +SLSEQ LVDC+Q N+G
Sbjct: 118 VDWREKGYVTPVKDQGYCLCCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQTEGNEG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
+GGL AF+Y+K NGGLD+EE+YPY + CK+ EN V D +I E+EL
Sbjct: 178 YSGGLIDDAFQYVKDNGGLDSEESYPYHAQGDSCKYRPENSVANVTDYWDIP-SKENELM 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVP 275
+ V P+S A + +D FRFYK G+Y C + DV+H V+ VGYG + +
Sbjct: 237 ITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSE--DVDHGVLVVGYGADGTETENKK 294
Query: 276 YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW+IKNSWG +WG GY KM + N CGIA+ AS+P V
Sbjct: 295 YWIIKNSWGTDWGMDGYIKMAKDRDNHCGIASLASFPTV 333
>gi|1705639|sp|Q10991.1|CATL1_SHEEP RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
Length = 217
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/208 (49%), Positives = 129/208 (62%), Gaps = 6/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVD ++ NQGCNGGL
Sbjct: 13 VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQGCNGGLMDN 72
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK NGGLD+EE+YPY D C + E + V+I E L AV V P
Sbjct: 73 AFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDIPQ-REKALMKAVATVGP 131
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+SVA + F+FYKSG+Y C + D++H V+ VGYG E +W++KNSWG
Sbjct: 132 ISVAIDAGHSSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTNNKFWIVKNSWGPE 189
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY KM + N CGIAT ASYP V
Sbjct: 190 WGNKGYVKMAKDQNNHCGIATAASYPTV 217
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 136/212 (64%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGG+ Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C + ++HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAGNRYWIVKNS 303
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CGIAT ASYP++
Sbjct: 304 WSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|340370384|ref|XP_003383726.1| PREDICTED: silicatein-like [Amphimedon queenslandica]
Length = 337
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 133/215 (61%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSC+ FS G+LE A A K + LSEQ +VDC+ + N+G
Sbjct: 125 VDWRTKNAVTGVKDQGQCGSCYAFSAVGALEGAQALAHDKLVHLSEQNIVDCSIPYGNKG 184
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG ++F YI N G+D E+ Y YTG+ G CKF + +G + + ++I G+E ELQ
Sbjct: 185 CNGGNMYESFRYIIDNDGIDREDGYKYTGRQGQCKFDRKAIGGRQVGIIHIPTGSEAELQ 244
Query: 221 HAVGLVRPVSVAFE-VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
A+ PVSVA + + FRFY+ GV+ C T + HA + +GYG + G PYWL+
Sbjct: 245 SALATAGPVSVAIDGSSNAFRFYEKGVFDEPNCSTTKL--THAGLIIGYGKKKGKPYWLV 302
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG +WG GY M K N CGIAT AS+P +
Sbjct: 303 KNSWGPHWGMKGYIMMARNKANQCGIATAASFPTL 337
>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 114/271 (42%), Positives = 151/271 (55%), Gaps = 25/271 (9%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGL------------- 107
SF ++G + S EE K + N R+ KG YR L
Sbjct: 72 SFQLRMNKFGDM--STEEFKQVMNGYKSNGSQKRT---KGSLYRESLLAQLPESVDWREK 126
Query: 108 -NISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLP 166
++PVK+Q C SCW FS G++E + + GK +SLS Q LVDC+ N GC+GGL
Sbjct: 127 GYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTGKLVSLSVQNLVDCSIPEGNNGCDGGLM 186
Query: 167 SQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLV 226
AF+Y++ NGG+DTEE YPY +D CK+ E G V V I E L AV V
Sbjct: 187 GNAFQYVQDNGGIDTEECYPYVAQDNECKYQPECSGANVTGFVKIPSTDERALMKAVANV 246
Query: 227 RPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE--DGVPYWLIKNSW 283
P+SVA + + F+FY+SGVY +C ++ + NH V+ VGYG E +G YW++KNSW
Sbjct: 247 GPISVAIDAGNPSFKFYQSGVYYDPQCSSSQL--NHGVLVVGYGSEGKNGRKYWIVKNSW 304
Query: 284 GENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
GENWGD+GY M + N CGI T ASYP+V
Sbjct: 305 GENWGDNGYVLMAKDEDNHCGIITDASYPIV 335
>gi|118136313|gb|ABK62794.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
gi|118136315|gb|ABK62795.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
Length = 335
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 96/209 (45%), Positives = 129/209 (61%), Gaps = 5/209 (2%)
Query: 108 NISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPS 167
+++ VK+Q CGSCW FS+TGS+E A +A GK IS SEQQLVDC+ AF N GCNGG+
Sbjct: 129 HVTAVKNQAQCGSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMD 188
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
+F Y+ +N GL++E +YPY + C++ + +++ E +L+ AVGLV
Sbjct: 189 NSFNYLIHNKGLESEASYPYEAQKKECRYKKALSKGTISSFTDVSQFDEKDLKRAVGLVG 248
Query: 228 PVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWLIKNSWGE 285
PVS+A + F Y SGVY C T + NH V+AVGYG +G+ YW +KNSW
Sbjct: 249 PVSIAIDASQFSFHLYDSGVYDEEDCSQTML--NHGVLAVGYGTTPEGLDYWKVKNSWTN 306
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG GY M K N CG+AT ASYP+V
Sbjct: 307 TWGMEGYILMSRNKDNQCGVATVASYPIV 335
>gi|326932936|ref|XP_003212567.1| PREDICTED: counting factor associated protein D-like [Meleagris
gallopavo]
Length = 573
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 156/304 (51%), Gaps = 51/304 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F + RR+G+ Y S E++ R F N+ + S N LSY L LN
Sbjct: 270 FHHYRRRFGRHYGSARELEHRQRIFVHNMRFVHSKNRAALSYSLALNHLADRTPQEMAAM 329
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F+TTG++E
Sbjct: 330 RGRRRSGDPNHGLPFPAEHYAGIILPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAMEG 389
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
A G LS+Q L+DC+ F N C+GG +A+E+IK +GG+ + E+Y Y G++
Sbjct: 390 ALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGTYKGQN 449
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSST 250
G+C ++ + ++ VN+T G ++ A+ PV+V+ + F FY +G+Y
Sbjct: 450 GLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGIYYEP 509
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
KC N ++HAV+AVGYGV G YWLIKNSW WG+ GY M M N CG+AT A+Y
Sbjct: 510 KCANKSGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDGYILMAMKDNNCGVATEATY 569
Query: 311 PVVA 314
P++A
Sbjct: 570 PILA 573
>gi|75067394|sp|Q9GKL8.1|CATL1_CERAE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|11493685|gb|AAG35605.1|AF201700_1 cysteine protease [Chlorocebus aethiops]
Length = 333
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y+ NGGLD+EE+YPY + CK++ E V+I E
Sbjct: 176 EGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 SKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|354502591|ref|XP_003513367.1| PREDICTED: cathepsin L1-like isoform 1 [Cricetulus griseus]
Length = 330
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 133/218 (61%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG C SCW FS GSLE + GK + LSEQ LVDC+++ +N
Sbjct: 116 KSVDWRKHGYVTPVKDQGSCVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRSQHN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GGL + AF+YIK NGGLDT E+YPY +DG C++ ++ + V + E+
Sbjct: 176 NGCHGGLFTSAFQYIKDNGGLDTSESYPYEAQDGPCRYDPKHSAANITGFV-VVPSNEEA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L AV V P+S+ V + FYKSG Y C N NH+V+ VGYG E DG Y
Sbjct: 235 LMKAVATVGPISIGISVRLRSLLFYKSGFYYDPDCYN--HYPNHSVLLVGYGEESDGQKY 292
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE WG GY K+ + N C IAT A+YP V
Sbjct: 293 WLVKNSWGEEWGMDGYIKIAKDRNNHCSIATIAAYPTV 330
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 115/308 (37%), Positives = 156/308 (50%), Gaps = 59/308 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTN------------------------ 97
+ ++ ++GK YE+ E+ LR AT+ KNL +I N
Sbjct: 29 WHQWKAQHGKSYEANED-SLRRATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEE 87
Query: 98 ----------------CKGLSYRLGL--------------NISPVKDQGHCGSCWTFSTT 127
KG YR L ++PVK+QG CG+CW+FS
Sbjct: 88 FKQVMNGYKSNGSQRRTKGSLYRESLLAQLPESVDWREKGYVTPVKEQGDCGACWSFSAV 147
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G++E + + GK +SLS Q L+DC N GC+GG AF+Y++ NGG+DTEE YPY
Sbjct: 148 GAIEGQWFRKTGKLVSLSIQNLIDCTIPEGNNGCDGGFMDNAFQYVQDNGGIDTEECYPY 207
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGV 246
+D CK+ E G + V+I E L AV V P+SV + + F+FY+SGV
Sbjct: 208 VAQDTECKYKPECSGANITGFVDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQSGV 267
Query: 247 YSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIA 305
Y C ++ +D H V+ VGYG YW++KNSWGE WGD+GY M K N CGIA
Sbjct: 268 YYEPDCSSSQLD--HGVLVVGYGSIGKDEYWIVKNSWGEAWGDNGYILMAKDKDNHCGIA 325
Query: 306 TCASYPVV 313
T ASYP V
Sbjct: 326 TEASYPKV 333
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 136/212 (64%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGG+ Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C + ++HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAGNRYWIVKNS 303
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CGIAT ASYP++
Sbjct: 304 WSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 136/212 (64%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGG+ Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C + ++HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAGNRYWIVKNS 303
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CGIAT ASYP++
Sbjct: 304 WSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 119/316 (37%), Positives = 151/316 (47%), Gaps = 68/316 (21%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------- 108
ARH ++ RYG++Y V E R F N+ I S N + L N
Sbjct: 31 ARHE----QWMARYGRVYSDVAEKARRLEVFKANVGFIESVNAGNHKFWLEANQFADITK 86
Query: 109 ---------------------------------------------ISPVKDQGHCGSCWT 123
++PVKDQG CG CW
Sbjct: 87 DEFRAMHKGYKMQVIGSKARATGFRYANVSIDDLPASVDWRANGAVTPVKDQGQCGCCWA 146
Query: 124 FSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEE 183
FST S+E + GK ISLSEQ+LVDC N+GC GGL AFE+I NGGLDTE
Sbjct: 147 FSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNNGGLDTEA 206
Query: 184 AYPYTGKDGVCKFSSE-NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRF 241
YPYTG DG C + E N+ + ++ E LQ AV +PVS+A + D FRF
Sbjct: 207 DYPYTGADGTCNSNKESNIAASIKGYEDVPANDEASLQKAVA-AQPVSIAVDGGDDLFRF 265
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMG-- 298
YK GV + CG +++H V AVGYGV DG YWL+KNSWG +WG+ G+ ++E
Sbjct: 266 YKGGVLTGA-CGT---ELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGEDGFIRLERDVA 321
Query: 299 --KNMCGIATCASYPV 312
MCG+A SYP
Sbjct: 322 DEAGMCGLAMKPSYPT 337
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 156/306 (50%), Gaps = 61/306 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
+ + ++GK S+ E RF F NL I N K LSYRLGL
Sbjct: 42 YEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSM 101
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++ VKDQG CGSCW FST G++E
Sbjct: 102 YLGSRLKRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEG 161
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
G I+LSEQ+LVDC ++N +GCNGGL AFE+I NGG+DTEE YPY G DG
Sbjct: 162 INKIVTGDLITLSEQELVDCDTSYN-EGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDG 220
Query: 193 VCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C + +N V +D ++ +E+ L+ A+ +P+SVA E F+ Y SG++
Sbjct: 221 RCDQTRKNAKVVTIDLYEDVPANSEESLKKALSH-QPISVAIEGGGRAFQLYDSGIFDGI 279
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIAT 306
CG D++H VVAVGYG E+G YW++KNSWG +WG+ GY +ME CGIA
Sbjct: 280 -CGT---DLDHGVVAVGYGTENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAV 335
Query: 307 CASYPV 312
SYP+
Sbjct: 336 EPSYPI 341
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 136/212 (64%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGG+ Q
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQ 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C + ++HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAGNRYWIVKNS 303
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CGIAT ASYP++
Sbjct: 304 WSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|328866326|gb|EGG14711.1| hypothetical protein DFA_10969 [Dictyostelium fasciculatum]
Length = 369
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 120/340 (35%), Positives = 172/340 (50%), Gaps = 81/340 (23%)
Query: 52 VIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSY-------- 103
+ ++ F + ++ K YES E + RF F KN+D I++ N K + +
Sbjct: 33 LFSHEQYTTEFKGWVGQFEKNYESHEFLN-RFDIFKKNMDYIKTWNDKSVDHKLELNTLA 91
Query: 104 --------------------RLGLN------------------------------ISPVK 113
R+GLN +S VK
Sbjct: 92 DLTDKEYQRLYLGTKVNGALRVGLNHADERDFGHIKSVFSNVKDNPNVDWRKQGAVSHVK 151
Query: 114 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 173
+QG CGSCW+FS+TG++E A+ G+ ISLSEQQLVDC++ + N GCNGGL + AF+Y+
Sbjct: 152 NQGQCGSCWSFSSTGAIEGAHAIKTGEMISLSEQQLVDCSKRYGNNGCNGGLMTLAFDYV 211
Query: 174 KYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVA 232
GGL++EEAYPYT D C F+S N + D NI G E L+ + V PVSVA
Sbjct: 212 IDAGGLESEEAYPYTTTDTSACMFNSTNAVTSISDHQNIRAGNEKHLETVLRNVGPVSVA 271
Query: 233 FEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG--------------VEDGVP-- 275
+ FRFYKSG++ + +C ++ +D H V+AVG+G + D
Sbjct: 272 IDASPRSFRFYKSGIFYAPECSSSQLD--HGVLAVGFGKGNPESNFENKVSFIHDDTKNN 329
Query: 276 -YWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
Y+++KNSWG +WG +G+ M KN CGIAT A+YP +
Sbjct: 330 EYYIVKNSWGSDWGSNGFIYMSKNRKNNCGIATMATYPTI 369
>gi|380790141|gb|AFE66946.1| cathepsin L1 preproprotein [Macaca mulatta]
gi|384939708|gb|AFI33459.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y+ NGGLD+EE+YPY + CK++ E V+I E
Sbjct: 176 EGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 SKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|109112057|ref|XP_001086247.1| PREDICTED: cathepsin L1-like isoform 5 [Macaca mulatta]
gi|402897797|ref|XP_003911929.1| PREDICTED: cathepsin L1 [Papio anubis]
Length = 333
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y+ NGGLD+EE+YPY + CK++ E V+I E
Sbjct: 176 EGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 SKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|291224872|ref|XP_002732426.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 691
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 102/216 (47%), Positives = 129/216 (59%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FSTTGS+E + GK +S SEQQLVDC+ ++ N G
Sbjct: 479 VDWRTKGYVTEVKDQGACGSCWAFSTTGSMEGQSFKNTGKLVSFSEQQLVDCSGSYGNMG 538
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL QAF YI+ + G++ E YPYT KD C + + +I E LQ
Sbjct: 539 CGGGLMDQAFAYIE-DYGIEPEADYPYTAKDDPCSYDTSKAVATNTGYTDIATMDEKALQ 597
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPYWL 278
AV V P+SVA + FR YKSGVY C T +D H V+AVGYG +DG YW+
Sbjct: 598 QAVATVGPISVAIDASHSSFRLYKSGVYDEPACSQTMLD--HGVLAVGYGTTDDGNDYWI 655
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG WG+ GY M N CGIAT ASYP++
Sbjct: 656 VKNSWGSTWGNQGYIHMSRNNDNQCGIATNASYPLM 691
>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
Length = 334
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 105/221 (47%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ N
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L V V P+SVA + +FY SG+Y C + D++H V+ VGYG E +
Sbjct: 235 LMKPVATVGPISVAMDASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG+ WG GY K+ + N CG+AT ASYP+V
Sbjct: 293 DKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/303 (36%), Positives = 152/303 (50%), Gaps = 56/303 (18%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC----KGLSYRLGLN--------- 108
F F +GK Y + E RF F+ N+ I + N +SY+ G+N
Sbjct: 26 FQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEE 85
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++ VKDQG CGSCW FS TGS
Sbjct: 86 FKTMLTLSASRKPTLETTSYVKTGVEIPSSVDWRKEGRVTGVKDQGDCGSCWAFSITGST 145
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E AY + GK +SLSEQQL+DC + GC+GG F+Y+ +G L +EE+Y Y G+
Sbjct: 146 EGAYARKSGKLVSLSEQQLIDCCTD-TSAGCDGGSLDDNFKYVMKDG-LQSEESYTYKGE 203
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSST 250
DG CK++ +V +V +I ED L AV V PVSV + Y SG+Y
Sbjct: 204 DGACKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGMDA-SYLSSYDSGIYEDQ 262
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
C +P +NHA++AVGYG E+G YW+IKNSWG +WG+ GYF++ GKN CGI+ Y
Sbjct: 263 DC--SPAGLNHAILAVGYGTENGKDYWIIKNSWGASWGEQGYFRLARGKNQCGISEDTVY 320
Query: 311 PVV 313
P +
Sbjct: 321 PTI 323
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 106/232 (45%), Positives = 143/232 (61%), Gaps = 14/232 (6%)
Query: 85 TFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
T +N I + + +R ++ +K+QG CGSCW+FSTTGS+E A+ A GK +SL
Sbjct: 95 TRPRNEVWITEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSL 154
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV-GV 203
SEQQL+DC+ + N GCNGGL AFEY+ NGGLDTEE YPYT +DG C E
Sbjct: 155 SEQQLMDCSTRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAA 214
Query: 204 QVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHA 262
++ N+ ED+L AV + PVSVA E GF+ Y SGV+ KCG + ++H
Sbjct: 215 EIHGFRNVPKEHEDQLAAAVS-IGPVSVAIEADQAGFQHYTSGVFDG-KCGTS---LDHG 269
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG---KNMCGIATCASYP 311
V+ VGY + YW++KNSWG++WG+ GY +++ G K MCGI ASYP
Sbjct: 270 VLVVGYSDD----YWIVKNSWGKSWGEEGYIRLKRGVDKKGMCGITMQASYP 317
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 141/218 (64%), Gaps = 12/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R+ ++P+KDQG CGSCW FST ++EA GK +SLSEQ+LVDC +A+ N+G
Sbjct: 134 VDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAY-NEG 192
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AFE+I NGG+DT++ YPY G DG+C + +N V +D ++ E+ L
Sbjct: 193 CNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVPPYDENAL 252
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ AV +PVS+A E + Y+SGV++ KCG + ++H VV VGYG E+GV YWL
Sbjct: 253 KKAVAH-QPVSIAIEASGRDLQLYQSGVFTG-KCGTS---LDHGVVVVGYGSENGVDYWL 307
Query: 279 IKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
++NSWG WG+ GYFKM+ CGI ASYPV
Sbjct: 308 VRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 156/306 (50%), Gaps = 61/306 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
+ + ++GK S+ E RF F NL I N K LSYRLGL
Sbjct: 48 YEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSM 107
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++ VKDQG CGSCW FST G++E
Sbjct: 108 YLGSRLKRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEG 167
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
G I+LSEQ+LVDC ++N +GCNGGL AFE+I NGG+DTEE YPY G DG
Sbjct: 168 INKIVTGDLITLSEQELVDCDTSYN-EGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDG 226
Query: 193 VCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C + +N V +D ++ +E+ L+ A+ +P+SVA E F+ Y SG++
Sbjct: 227 RCDQTRKNAKVVTIDLYEDVPANSEESLKKALSH-QPISVAIEGGGRAFQLYDSGIFDGI 285
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIAT 306
CG D++H VVAVGYG E+G YW++KNSWG +WG+ GY +ME CGIA
Sbjct: 286 -CGT---DLDHGVVAVGYGTENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAV 341
Query: 307 CASYPV 312
SYP+
Sbjct: 342 EPSYPI 347
>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
Length = 330
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY+ GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 238 RAVARVGPISVAIDASLTSFQFYRKGVYYDENCNSD--NLNHAVLAVGYGIQKGRKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYVLMARNKNNACGIANLASFP 328
>gi|77735825|ref|NP_001029607.1| cathepsin K precursor [Bos taurus]
gi|59858469|gb|AAX09069.1| cathepsin K preproprotein [Bos taurus]
gi|83638771|gb|AAI09854.1| Cathepsin K [Bos taurus]
gi|296489554|tpg|DAA31667.1| TPA: cathepsin K [Bos taurus]
Length = 334
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 124 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 181
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 182 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALK 241
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY+ GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 242 RAVARVGPISVAIDASLTSFQFYRKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 299
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 300 KNSWGENWGNKGYILMARNKNNACGIANLASFP 332
>gi|449676370|ref|XP_002156627.2| PREDICTED: counting factor associated protein D-like [Hydra
magnipapillata]
Length = 551
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 94/212 (44%), Positives = 137/212 (64%), Gaps = 2/212 (0%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+++RL ++PVKDQ CGSCW+F TTG++E A G+ + LSEQ L+DC+ F N G
Sbjct: 336 INWRLFGAVTPVKDQAVCGSCWSFGTTGAIEGALFLKTGRLVRLSEQNLMDCSWGFGNNG 395
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
C+GG +A+EYI +GG+ T+++Y Y G DG C S +G ++ VN+T G D L
Sbjct: 396 CDGGEEFRAYEYIMKHGGIATDDSYGNYLGIDGYCHQKSSVIGAKIASYVNVTSGDMDAL 455
Query: 220 QHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ A+ P++V + F FY GVY + +CGN P +++HAV+AVGYGV++G PY L
Sbjct: 456 KMAIVQHGPIAVGIDAAHLAFVFYSHGVYYNPECGNKPENLDHAVLAVGYGVQNGEPYTL 515
Query: 279 IKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
+KNSW +WG+ GY M N CG+AT A++
Sbjct: 516 VKNSWSTHWGNDGYVLMSQRDNNCGVATDATF 547
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 105/219 (47%), Positives = 141/219 (64%), Gaps = 13/219 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +RL I+ +KDQG CGSCW FST ++EA GK +SLSEQ+LVDC +AFN +G
Sbjct: 132 VDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFN-EG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AFE+I NGG+DT++ YPY G +G C + + + +D ++ E+ L
Sbjct: 191 CNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGYEDVPSNNENAL 250
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ AV +PVSVA E + Y+SGV++ KCG + ++HAVV VGYG E+G+ YWL
Sbjct: 251 KKAVAH-QPVSVAIEASGRALQLYQSGVFTG-KCGTS---LDHAVVIVGYGSENGLDYWL 305
Query: 279 IKNSWGENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
++NSWG NWG+ GYFKME CGIA ASYPV
Sbjct: 306 VRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344
>gi|109940312|sp|Q5E968.2|CATK_BOVIN RecName: Full=Cathepsin K; Flags: Precursor
Length = 329
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY+ GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPISVAIDASLTSFQFYRKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
Length = 329
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPQGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSRGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANMASFP 327
>gi|27465595|ref|NP_775155.1| testin-2 precursor [Rattus norvegicus]
gi|1174639|sp|P15242.2|TEST2_RAT RecName: Full=Testin-2; AltName: Full=CMB-23; Contains: RecName:
Full=Testin-1; AltName: Full=CMB-22; Flags: Precursor
gi|577430|gb|AAC52162.1| testin [Rattus norvegicus]
gi|149039744|gb|EDL93860.1| testin gene [Rattus norvegicus]
Length = 333
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 136/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QGHC S W FS TGSLE + + I LSEQ L+DC +
Sbjct: 116 KRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVT 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GG AF+Y+K NGGL TEE+YPY G+ C++ +EN V D V I G+E+
Sbjct: 176 HGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGRECRYHAENSAANVRDFVQIP-GSEEA 234
Query: 219 LQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + G F+FY SG+Y +C + +NHAV+ VGYG E DG
Sbjct: 235 LMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKR--VHLNHAVLVVGYGFEGEESDG 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+WL+KNSWGE WG GY K+ N CGIAT ++YP+V
Sbjct: 293 NSFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPIV 333
>gi|426216528|ref|XP_004002514.1| PREDICTED: cathepsin K [Ovis aries]
Length = 330
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY+ GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 238 RAVARVGPISVAIDASLTSFQFYRKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 116/274 (42%), Positives = 155/274 (56%), Gaps = 21/274 (7%)
Query: 54 GQARHALSFARFARRYGKIYESVE-------EMKLRFATFSKNLDLIRSTN--CKGLSYR 104
G+ L RFA + Y S + R +T N RS++ + +R
Sbjct: 88 GKYSFRLGLTRFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWR 147
Query: 105 LGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGG 164
+ VKDQG CGSCW FST ++E H G ISLSEQ+LVDC + NQGCNGG
Sbjct: 148 DKGAVVDVKDQGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDC-DTYYNQGCNGG 206
Query: 165 LPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAV 223
L AFE+I NGG+DT+E YPYTG+DG C +N V +DS ++ + E LQ AV
Sbjct: 207 LMDYAFEFIISNGGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAV 266
Query: 224 GLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
+PVSVA E F+ Y+SG+++ CG +++H V A+GYG E+G YW++KNS
Sbjct: 267 A-NQPVSVAIEAGGRAFQLYESGIFTGY-CG---TELDHGVTAIGYGSENGKYYWIVKNS 321
Query: 283 WGENWGDHGYFKMEMGKN----MCGIATCASYPV 312
WG +WG+ GY +ME N CGIA ASYP+
Sbjct: 322 WGSDWGESGYIRMERNINSATGKCGIAMEASYPI 355
>gi|350425511|ref|XP_003494144.1| PREDICTED: counting factor associated protein D-like [Bombus
impatiens]
Length = 549
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 114/360 (31%), Positives = 171/360 (47%), Gaps = 65/360 (18%)
Query: 3 RPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSF 62
+P V V + C NP+R + + +++T V + +F
Sbjct: 200 KPSSEVFEVTTNMTCVGFPGPGDKHVYTFNPMR----EFVHNYDTHVNE---------AF 246
Query: 63 ARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------------- 108
F + + K Y + + +R F +NL I STN Y+L +N
Sbjct: 247 EDFKKTHNKEYVNHVDQLMRKEVFRQNLRFIHSTNRANKGYQLSVNHLVDRTELELKALR 306
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F TTG++E
Sbjct: 307 GKQYTAHYNGGQPFPHNAEKEVTEVPDSLDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEG 366
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
AY+ +GK + LS+Q L+DC+ + N GC+GG +++++I +GGL TE+ Y Y G+D
Sbjct: 367 AYYMKYGKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGGLPTEDDYGGYLGQD 426
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
G C ++ V ++ VN+T G + L+ A+ P+SVA + F FY GVY
Sbjct: 427 GYCHINNATVTAKITGYVNVTSGDANALKVAIAKHGPISVAIDASHKTFSFYSHGVYYDE 486
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
CGNT ++HAV+AVGYG +G YWL+KNSW WG+ GY M KN CG+ T +Y
Sbjct: 487 SCGNTEESLDHAVLAVGYGSLNGKDYWLVKNSWSNYWGNDGYILMSQEKNNCGVLTAPTY 546
>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
Length = 330
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 238 RAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGKKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>gi|108735858|gb|ABG00260.1| cathepsin L1 [Fasciola hepatica]
Length = 219
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 134/207 (64%), Gaps = 4/207 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG+CGSCW FSTTG+++ Y + IS SEQQLVDC++ + N GC GGL
Sbjct: 13 VTEVKDQGNCGSCWAFSTTGTMKGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMEN 72
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A+EY+K GL+TE +YPY+ +G C++ + +V + G E ELQ+ VG P
Sbjct: 73 AYEYLK-QFGLETESSYPYSAVEGPCRYDRKLGVAKVTGYYTVHSGDEVELQNLVGGEGP 131
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
+VA + F Y+SG+Y S C +P ++H V+AVGYG +DG YW++KNSWG WG
Sbjct: 132 PAVALDAELDFMMYRSGIYXSQTC--SPDRLSHGVLAVGYGTQDGTDYWIVKNSWGTWWG 189
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVVA 314
+ GY +M + NMCGIA+ AS P+VA
Sbjct: 190 EDGYIRMVRNRGNMCGIASLASVPMVA 216
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++A N
Sbjct: 116 KSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL AF Y+K NGGLD+EE+YPY G+D C + E V++ E
Sbjct: 176 EGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-REK 234
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE---DG 273
L AV + P+SVA + F+FYKSG+Y C + D++H V+ VGYG E
Sbjct: 235 ALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDC--SSKDLDHGVLVVGYGFEGTDSN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG +GY KM + N CGIAT ASYP V
Sbjct: 293 NKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
Length = 336
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGGL
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDL 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ ++ V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNELALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C ++ +D HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACSSSRLD--HAVLVVGYGYQGADVAGNRYWIVKNS 304
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CG+AT ASYP++
Sbjct: 305 WSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 333
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 131/212 (61%), Gaps = 11/212 (5%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N+GCNGGL
Sbjct: 126 VTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWREGNEGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K NGGLD+EE+YPYT D C+++ + V+I E L AV V
Sbjct: 186 AFQYVKDNGGLDSEESYPYTATDTQDCRYNPKYSAANDTGFVDIPP-QEKALMKAVATVG 244
Query: 228 PVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP----YWLIKNS 282
P+SVA + F+FY SG+Y C + VNH V+AVGYG E P YWL+KNS
Sbjct: 245 PISVAIDAGQVSFQFYSSGIYFDPAC---RLTVNHGVLAVGYGFEGTDPDKNKYWLVKNS 301
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG++WG GY K+ + N CGIA ASYP V
Sbjct: 302 WGKSWGADGYIKIAKDRNNHCGIARAASYPTV 333
>gi|148575301|gb|ABQ95351.1| secreted cathepsin L2 [Fasciola hepatica]
Length = 326
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 105/257 (40%), Positives = 144/257 (56%), Gaps = 10/257 (3%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKN-LDLIRSTNCKGLSYRLGLNISPVKDQGHC 118
L+F F +Y E+ R + N L + S + + Y ++ VKDQG C
Sbjct: 75 LTFEEFKAKYLIEIPRSSELLSRGIPYKANKLAVPESIDWRDYYY-----VTEVKDQGQC 129
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCW FSTTG++E + + S SEQQLVDC + F N GC GG A+EY+K+N G
Sbjct: 130 GSCWAFSTTGAVEGQFRKNERASASFSEQQLVDCTRDFGNYGCGGGYMENAYEYLKHN-G 188
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG 238
L+TE YPY +G C++ +V + G E EL++ VG P +VA +
Sbjct: 189 LETESYYPYQAVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEGPAAVALDADSD 248
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F Y+SG+Y S C P + HAV+AVGYG +DG YW++KNSWG WG+ GY +
Sbjct: 249 FMMYQSGIYQSQTC--LPDRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWGEDGYIRFARN 306
Query: 299 K-NMCGIATCASYPVVA 314
+ NMCGIA+ AS P+VA
Sbjct: 307 RGNMCGIASLASVPMVA 323
>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
Length = 329
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCKGYREIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY GVY C + ++NHAV+AVGYGV+ G +W+I
Sbjct: 238 RAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGVQKGNKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
Length = 330
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 238 RAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 126/208 (60%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FS +LE A+ G +SLSEQ LVDC+ ++ NQGCNGG P Q
Sbjct: 118 VTPVKDQGQCGSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQ 177
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A++YI N G+DTE +YPY D C++ + N+G V V G E LQHAV P
Sbjct: 178 AYQYIIANRGIDTESSYPYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGP 237
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
VSV + F Y GVY C + NHAV AVGYG + +G YW++KNSWG
Sbjct: 238 VSVCIDAGQSSFGSYGGGVYYEPNCDS--WYANHAVTAVGYGTDANGGDYWIVKNSWGAW 295
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY KM + N C IAT + YPVV
Sbjct: 296 WGESGYIKMARNRDNNCAIATYSVYPVV 323
>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
Length = 330
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 238 RAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>gi|395740610|ref|XP_002819972.2| PREDICTED: cathepsin L1 [Pongo abelii]
Length = 333
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 110/303 (36%), Positives = 151/303 (49%), Gaps = 62/303 (20%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG-LSYRLGLN------------------ 108
+YG++Y+ E + RF F N++ I S N G Y+L +N
Sbjct: 44 KYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINEFADLTNEEFKASRNGYK 103
Query: 109 --------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQ 136
++P+KDQG CG CW FS ++E
Sbjct: 104 RSSNVGLSEKSSFRYGNVTAVPTSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKL 163
Query: 137 AFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF 196
+ GK ISLSEQ+LVDC + +QGC GGL AFE+IK NGGL TE YPY G DG C
Sbjct: 164 STGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNT 223
Query: 197 SSE-NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGN 254
+ N ++ ++ +ED L AV +PVSVA + F+FY GV++ G+
Sbjct: 224 NKAGNDAAKITGYEDVPANSEDALLKAVA-SQPVSVAIDASGSAFQFYSGGVFT----GD 278
Query: 255 TPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCASY 310
+++H V AVGYG DG YWL+KNSWG +WG+ GY +ME + +CGIA +SY
Sbjct: 279 CGTELDHGVTAVGYGTSDGTKYWLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSY 338
Query: 311 PVV 313
P
Sbjct: 339 PTA 341
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 119/306 (38%), Positives = 156/306 (50%), Gaps = 67/306 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG-LSYRLGLN------------------ 108
+YGK Y ++ E + RF F NL + N G SY+LGLN
Sbjct: 55 KYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTR 114
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
++PVKDQG CGSCW FST G++E
Sbjct: 115 MDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGI 174
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
G SLSEQ+LVDC + +N QGCNGGL AFE+I NGG+DTEE YPY D +
Sbjct: 175 NQIVTGNLTSLSEQELVDCDKVYN-QGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSM 233
Query: 194 CKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTK 251
C + +N V +D ++ E L+ AV +PVSVA E F+ Y+SGV++ +
Sbjct: 234 CDPNRKNARVVTIDGYEDVPQNDEKSLRKAVA-NQPVSVAIEAGGRAFQLYQSGVFTGS- 291
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-----KNMCGIAT 306
CG ++H VVAVGYG E+GV YW+++NSWG WG++GY +ME CGIA
Sbjct: 292 CGTQ---LDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAM 348
Query: 307 CASYPV 312
ASYP
Sbjct: 349 EASYPT 354
>gi|403302736|ref|XP_003942009.1| PREDICTED: cathepsin K isoform 2 [Saimiri boliviensis boliviensis]
Length = 383
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 99/235 (42%), Positives = 141/235 (60%), Gaps = 11/235 (4%)
Query: 84 ATFSKNLDLIRSTNCKG-----LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
+FS++ D + + +G + YR ++PVK+QG CGSCW FS+ G+LE +
Sbjct: 151 TSFSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKT 210
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSS 198
GK ++LS Q LVDC N GC GG + AF+Y++ N G+D+E+AYPY G++ C ++
Sbjct: 211 GKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNP 268
Query: 199 ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPM 257
+ I G E L+ AV V P+SVA + + F+FY GVY C +
Sbjct: 269 TGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDESCNSD-- 326
Query: 258 DVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
++NHAV+AVGYG++ G +W+IKNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 327 NLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 381
>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 101/249 (40%), Positives = 141/249 (56%), Gaps = 15/249 (6%)
Query: 77 EEMKLRFATFSKNLDLIRSTNC----------KGLSYRLGLNISPVKDQGHCGSCWTFST 126
EE+ FAT S D+ R+ + + +R ++ VK QG CGSCW FS
Sbjct: 92 EEIMQSFATLSPPTDIQRAASPFAGTTGADVPDTMDWREKGCVTSVKMQGSCGSCWAFSA 151
Query: 127 TGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYP 186
G+LE + GK + LS Q LVDC+ + N GCNGG QAF+Y+ N G+D++ +YP
Sbjct: 152 AGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHQAFQYVIDNQGIDSDASYP 211
Query: 187 YTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSG 245
YTG++G C+++S+ + G E L+ A+ + P+SVA + F FY+SG
Sbjct: 212 YTGRNGECRYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAIDATRPTFTFYRSG 271
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKN-MCGI 304
VY+ C VNH V+AVGYG DG YWL+KNSWG+ +GD GY +M KN CGI
Sbjct: 272 VYNDPNCSQ---KVNHGVLAVGYGTLDGQDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGI 328
Query: 305 ATCASYPVV 313
A YP++
Sbjct: 329 ALYGCYPIM 337
>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
Length = 330
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 238 RAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>gi|395856027|ref|XP_003800444.1| PREDICTED: cathepsin K [Otolemur garnettii]
Length = 329
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC +N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--DNDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SV + + F+FY GVY C + +VNHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPISVGIDASLTSFQFYSKGVYYDESCNSD--NVNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
Length = 320
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/213 (46%), Positives = 134/213 (62%), Gaps = 5/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW FS TGS+E A ++ GK +SLSEQQLVDC N G
Sbjct: 113 VDWRTEGYVTGVKNQGDCGSCWAFSLTGSVEGALFKSTGKLVSLSEQQLVDCTYGTVNFG 172
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GG + F YI+ GL+ E +YPY +DG CKF + V ++ D V G E+ L
Sbjct: 173 CDGGYLEETFPYIQ-ETGLEAEASYPYKARDGTCKFDASKVVTKINDYV-YWYGDEEALL 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
A + P+SVA + + Y SGV+SS C + D+NH V+ VGYG E+GV YWL+K
Sbjct: 231 EATATIGPISVAMDA-NYIDSYASGVFSSRLCSSD--DLNHGVLVVGYGSENGVNYWLVK 287
Query: 281 NSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NSW E+WG+ GY K+ G+N CGIA SYP+V
Sbjct: 288 NSWAEDWGESGYLKLLRGQNECGIAEDDSYPIV 320
>gi|354502593|ref|XP_003513368.1| PREDICTED: cathepsin L1-like isoform 2 [Cricetulus griseus]
Length = 330
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 133/218 (61%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG C SCW FS GSLE + GK + LSEQ LVDC+++ +N
Sbjct: 116 KSVDWRKHGYVTPVKDQGSCVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRSQHN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GGL + AF+YIK NGGLDT E+YPY +DG C++ ++ + V + E+
Sbjct: 176 NGCHGGLFTSAFQYIKDNGGLDTSESYPYEAQDGPCRYDPKHSAANITGFV-VVPSNEEA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L AV V P+S+ V + FYKSG Y C N NH+V+ VGYG E DG Y
Sbjct: 235 LMKAVATVGPISIGISVRLRSLLFYKSGFYYDPDCYN--HYPNHSVLLVGYGEESDGQKY 292
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWGE WG GY K+ + N C IAT A+YP V
Sbjct: 293 WLVKNSWGEEWGMDGYIKIAKDRNNHCSIATIAAYPTV 330
>gi|157862757|gb|ABV90501.1| cathepsin L, partial [Fasciola gigantica]
Length = 244
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 132/215 (61%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQ CGSCW FSTTG++E + + G +S SEQQLVDC+ F N G
Sbjct: 30 IDWRDSGYVTKVKDQEDCGSCWAFSTTGTMEGQFMKNIGFNVSFSEQQLVDCSSDFGNNG 89
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY++ GL+ E YPY +G C++ +V + G E ELQ
Sbjct: 90 CRGGLMEIAYEYLR-RFGLEIESTYPYRAVEGPCRYDRRLGVAKVTGYYIVHSGDEVELQ 148
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG+ P +VA +V F Y+SG+Y S C +P +NH V+AVGYG + G YW++K
Sbjct: 149 NLVGIEGPAAVALDVESDFVMYRSGIYQSQTC--SPDRLNHGVLAVGYGTQSGTDYWIVK 206
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 207 NSWGTWWGEGGYIRMVRNRGNMCGIASMASLPMVA 241
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + F+FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
Length = 331
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 132/218 (60%), Gaps = 7/218 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + YR ++PVK+Q CGSCW FS+ G+LE + GK I LS Q LVDC N
Sbjct: 118 RSIDYRKKGMVTPVKNQLSCGSCWAFSSAGALEGQLAKTTGKLIDLSPQNLVDCVT--EN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GG + AFEY++ NGG+DTEEAYPY G+DG C +++ +G Q I G E
Sbjct: 176 NGCGGGYMTNAFEYVEENGGIDTEEAYPYLGQDGQCAYNASGMGAQCRGFKEIPEGDEWA 235
Query: 219 LQHAVGLVRPVSVAFE-VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG-VEDGVPY 276
L AV V PV+V + + F+FY+ GVY C D+NHAV+AVGYG G+ +
Sbjct: 236 LTKAVVKVGPVAVGIDATLSTFQFYQRGVYYDPNCNKD--DINHAVLAVGYGQTAKGMKF 293
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W++KNSW E+WG GY M + N CGIA ASYP++
Sbjct: 294 WIVKNSWSESWGKQGYIMMARNRGNACGIANLASYPIM 331
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 103/211 (48%), Positives = 138/211 (65%), Gaps = 13/211 (6%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ +KDQG CGSCW FST ++EA GK +SLSEQ+LVDC +AFN +GCNGGL
Sbjct: 137 VAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFN-EGCNGGLMDY 195
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVR 227
AFE+I NGG+DTE+ YPY G +G C + +N V +D ++ E+ L+ AV +
Sbjct: 196 AFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGYEDVPAYNENALKKAV-FHQ 254
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
PVSVA E + Y+SGV++ +CG +++H VV VGYG E+GV YWL++NSWG N
Sbjct: 255 PVSVAIEAGGRALQLYQSGVFTG-RCG---TNLDHGVVVVGYGFENGVDYWLVRNSWGTN 310
Query: 287 WGDHGYFKME-----MGKNMCGIATCASYPV 312
WG+ GYFK+E + CGIA ASYPV
Sbjct: 311 WGEDGYFKLERNVKKINTGKCGIAMQASYPV 341
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 137/374 (36%), Positives = 185/374 (49%), Gaps = 82/374 (21%)
Query: 1 MARPVQLVSSVILLLCCAAAASASAS--SFDDSNPIRLVSSDGLRDFETSVLQVIGQARH 58
MARP L + + ++ AAAA+ S ++D +P + GL E V ++
Sbjct: 1 MARPSILFTFLFAVVSAAAAAAEDMSIITYDQQHPAK-----GLVRSEDEVKEM------ 49
Query: 59 ALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC-KGLSYRLGLN--------- 108
F + ++GK Y +V+E RF F NL I N + SY+LGLN
Sbjct: 50 ---FESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEE 106
Query: 109 -------------------------------------------ISPVKDQGHCGSCWTFS 125
++ VKDQG CGSCW FS
Sbjct: 107 YRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFS 166
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
T ++E A G ISLSEQ+LVDC + N QGCNGG AF++I NGG+D+EE Y
Sbjct: 167 TIAAVEGVNQLATGNLISLSEQELVDCDRKIN-QGCNGGDMGYAFQFIIKNGGIDSEEDY 225
Query: 186 PYTGKDGVCK-FSSENVGVQVLDSVN-ITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFY 242
PYTGKDG C + N V +D + + E LQ AV +PVSVA E F+ Y
Sbjct: 226 PYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVA-NQPVSVAIEAGGYDFQLY 284
Query: 243 KSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG---- 298
SG+++ + CG D++H V AVGYG E+GV YW++KNSWG+ WG+ GY +M+
Sbjct: 285 SSGIFTGS-CG---TDLDHGVAAVGYGTENGVDYWIVKNSWGDYWGEKGYVRMQRNVKAK 340
Query: 299 KNMCGIATCASYPV 312
+CGIA ASYP
Sbjct: 341 TGLCGIAMEASYPT 354
>gi|444522624|gb|ELV13407.1| Cathepsin L1 [Tupaia chinensis]
Length = 307
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 140/231 (60%), Gaps = 14/231 (6%)
Query: 89 NLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
+LD+ S + + Y ++PVK+QG CGSCW FS+TG+LE + GK +SLSEQ
Sbjct: 85 HLDVPESVDWREKGY-----VTPVKNQGDCGSCWAFSSTGALEGQMFRKTGKLVSLSEQN 139
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS 208
LVDC+ + N GCNGG+ AF Y+K NGGLD+EE+YPY D CK++ +N
Sbjct: 140 LVDCSISEGNFGCNGGIMDNAFLYVKDNGGLDSEESYPYEAVDDSCKYNPKNSAANDTGF 199
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
V++ + E L+ AV V P+SV + D F+FYK G+Y C + +D HAV+ VG
Sbjct: 200 VHLPV-EEKALEKAVATVGPISVGIDASADSFQFYKEGIYFEPNCSSVELD--HAVLVVG 256
Query: 268 YGVEDGV----PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YGV + +WL+KNSWG+NWG GY M + N CGIA+ A YP V
Sbjct: 257 YGVMEEASTNNKFWLVKNSWGKNWGMDGYIMMAKDRNNNCGIASYAMYPTV 307
>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
Length = 334
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 131/212 (61%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++A NQGCNGGL
Sbjct: 126 VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK NGGLD+EE+YPY D C + E V+I E L AV V
Sbjct: 186 AFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-REKALMKAVATVG 244
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNS 282
P+SVA + F+FYKSG+Y C + D++H V+ VGYG E + +W++KNS
Sbjct: 245 PISVAIDAGHTSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNS 302
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG WG +GY KM + N CGIAT ASYP V
Sbjct: 303 WGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 128/206 (62%), Gaps = 2/206 (0%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCW+FSTTG++E AY GK +SLSEQ LVDCA+ + GC+GG +
Sbjct: 122 VTEVKDQGSCGSCWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKE-DCYGCSGGYMDK 180
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A EYI+ GG+ +E YPY G D C+F S V ++ + I ED+L++AV P
Sbjct: 181 ALEYIETAGGIMSENDYPYEGIDDKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGP 240
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
+SVA + F+ Y SG+ + C + +NH V+ VGYG E YW++KNSWG +WG
Sbjct: 241 ISVAIDASFNFQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTEKEQDYWIVKNSWGADWG 300
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVV 313
GY M K N CGIAT A+YP +
Sbjct: 301 MDGYIWMSRNKNNQCGIATDATYPTI 326
>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 101/249 (40%), Positives = 141/249 (56%), Gaps = 15/249 (6%)
Query: 77 EEMKLRFATFSKNLDLIRSTNC----------KGLSYRLGLNISPVKDQGHCGSCWTFST 126
EE+ FAT S D+ R+ + + +R ++ VK QG CGSCW FS
Sbjct: 92 EEIMQSFATLSPPTDIQRAASPFAGTTGADVPDTMDWREKGCVTSVKMQGSCGSCWAFSA 151
Query: 127 TGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYP 186
G+LE + GK + LS Q LVDC+ + N GCNGGL AF+Y+ N G+D++ +YP
Sbjct: 152 AGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGLMHHAFQYVIDNQGIDSDASYP 211
Query: 187 YTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSG 245
YTG++G C+++S+ + G E L+ A+ + P+SVA + F FY+SG
Sbjct: 212 YTGRNGECRYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAIDATRPTFTFYRSG 271
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKN-MCGI 304
VY+ C VNH V+AVGYG DG YWL+KNSWG+ +GD GY +M KN CGI
Sbjct: 272 VYNDPNCSQ---KVNHGVLAVGYGTLDGQDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGI 328
Query: 305 ATCASYPVV 313
A YP++
Sbjct: 329 ALYGCYPIM 337
>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
Length = 344
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 119/332 (35%), Positives = 165/332 (49%), Gaps = 75/332 (22%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
Q + ++ +F + + K Y S EE R+ F+ N+D ++ N KG LGLN
Sbjct: 19 QQFSELQYRNAFTDWMITHQKSYTS-EEFGARYNIFTANMDYVQQWNSKGSETVLGLNNF 77
Query: 109 ----------------------------------------------ISPVKDQGHCGSCW 122
++PVK+QG CG CW
Sbjct: 78 ADITNEEYRNTYLGTKFDASSLIGTQEEKVHTNSSAASKDWRSEGAVTPVKNQGQCGGCW 137
Query: 123 TFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTE 182
+FSTTGS E A+ Q+ G+ +SLSEQ L+DC+ N GC+GGL + AFEYI N G+DTE
Sbjct: 138 SFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE--NSGCDGGLMTYAFEYIINNNGIDTE 195
Query: 183 EAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRF 241
+YPY ++G C++ SEN G + +T G+E L+ AV V PVSVA + F+
Sbjct: 196 SSYPYKAENGKCEYKSENSGATLSSYKTVTAGSESSLESAVN-VNPVSVAIDASHQSFQL 254
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV-------------------PYWLIKNS 282
Y SG+Y +C + +D H V+AVGYG G YW++KNS
Sbjct: 255 YTSGIYYEPECSSENLD--HGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNS 312
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG +WG GY M + N CGIA+ AS+PVV
Sbjct: 313 WGTSWGIEGYILMSRNRDNNCGIASSASFPVV 344
>gi|344257451|gb|EGW13555.1| Cathepsin L1 [Cricetulus griseus]
Length = 474
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 103/219 (47%), Positives = 134/219 (61%), Gaps = 8/219 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG C SCW FS GSLE + GK + LSEQ LVDC+++ +N
Sbjct: 260 KSVDWRKHGYVTPVKDQGSCVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRSQHN 319
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GGL + AF+YIK NGGLDT E+YPY +DG C++ ++ + V + E+
Sbjct: 320 NGCHGGLFTSAFQYIKDNGGLDTSESYPYEAQDGPCRYDPKHSAANITGFV-VVPSNEEA 378
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNT-PMDVNHAVVAVGYGVE-DGVP 275
L AV V P+S+ V + FYKSG Y C N P NH+V+ VGYG E DG
Sbjct: 379 LMKAVATVGPISIGISVRLRSLLFYKSGFYYDPDCYNHYP---NHSVLLVGYGEESDGQK 435
Query: 276 YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWGE WG GY K+ + N C IAT A+YP V
Sbjct: 436 YWLVKNSWGEEWGMDGYIKIAKDRNNHCSIATIAAYPTV 474
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 50/122 (40%), Positives = 66/122 (54%), Gaps = 7/122 (5%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CG+CW FS GSL GK + LSEQ LVDC+ + N
Sbjct: 81 KSVDWRKHGYVTPVKDQGSCGACWAFSAVGSLVGQMFWKTGKLVPLSEQNLVDCSWSHGN 140
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-------GVCKFSSENVGVQVLDSVNI 211
GC+GGL AF+Y+ NGGLDT + G D + F +E V + L N+
Sbjct: 141 IGCHGGLMQNAFQYVMDNGGLDTTQTLRELGLDLKEKVAHSIYNFQNEEVERRALWEENM 200
Query: 212 TL 213
L
Sbjct: 201 KL 202
>gi|349604730|gb|AEQ00199.1| Cathepsin K-like protein, partial [Equus caballus]
Length = 219
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 9 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 66
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 67 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPQGNEKALK 126
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 127 RAVARVGPVSVAIDASLTSFQFYSRGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 184
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 185 KNSWGENWGNKGYILMARNKNNACGIANMASFP 217
>gi|403302734|ref|XP_003942008.1| PREDICTED: cathepsin K isoform 1 [Saimiri boliviensis boliviensis]
Length = 329
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 104/247 (42%), Positives = 148/247 (59%), Gaps = 13/247 (5%)
Query: 74 ESVEEMK-LRFAT-FSKNLDLIRSTNCKG-----LSYRLGLNISPVKDQGHCGSCWTFST 126
E V++M L+ T FS++ D + + +G + YR ++PVK+QG CGSCW FS+
Sbjct: 85 EVVQKMTGLKVPTSFSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSS 144
Query: 127 TGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYP 186
G+LE + GK ++LS Q LVDC N GC GG + AF+Y++ N G+D+E+AYP
Sbjct: 145 VGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 202
Query: 187 YTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
Y G++ C ++ + I G E L+ AV V P+SVA + + F+FY G
Sbjct: 203 YVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKG 262
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGI 304
VY C + ++NHAV+AVGYG++ G +W+IKNSWGENWG+ GY M K N CGI
Sbjct: 263 VYYDESCNSD--NLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGI 320
Query: 305 ATCASYP 311
A AS+P
Sbjct: 321 ANLASFP 327
>gi|340368360|ref|XP_003382720.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 326
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 104/228 (45%), Positives = 133/228 (58%), Gaps = 8/228 (3%)
Query: 91 DLIRSTNCK---GLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQ 147
D + T K + +R ++ +K+QG CGSCW+FSTTGSLE + G +SLSEQ
Sbjct: 98 DFYKKTGVKVAATVDWREKGAVTAIKNQGKCGSCWSFSTTGSLEGQHFLKTGTLLSLSEQ 157
Query: 148 QLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLD 207
Q VDC+ F N GC GG AF Y++ G +TE YPYT +DG CKF S V+
Sbjct: 158 QFVDCSTKFGNHGCKGGTMDNAFRYLETVSGDETEMMYPYTAEDGFCKFRSTEGKVKCEG 217
Query: 208 SVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAV 266
+I ED L+ AV V P+SVA + F+ YK GVY + C +T +D H V+AV
Sbjct: 218 YKDIPRDDEDALREAVATVGPISVAIDAGHSSFQLYKEGVYYNPTCSSTKLD--HGVLAV 275
Query: 267 GYGVEDGV-PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
GYG +G YWL+KNSWG +WG GY M + N CGIAT ASYP
Sbjct: 276 GYGTYEGSEEYWLVKNSWGPSWGMEGYIMMSRNRENNCGIATMASYPT 323
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 118/311 (37%), Positives = 157/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +G+ Y +V E + RF F NL + + N S+RLGLN
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ VKDQG CGSCW FST
Sbjct: 106 YRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFSTI 165
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N QGCNGGL AFE+I NGG+DTEE YPY
Sbjct: 166 AAVEGINQIVTGDMISLSEQELVDCDTSYN-QGCNGGLMDYAFEFIINNGGIDTEEDYPY 224
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
G DG C + +N V +DS ++ +E LQ AV +P+SVA E F+ Y SG
Sbjct: 225 KGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVA-NQPISVAIEAGGRAFQLYNSG 283
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ T CG ++H V AVGYG E+G YW++KNSWG +WG+ GY +ME
Sbjct: 284 IFTGT-CGTA---LDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGK 339
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 340 CGIAVEPSYPL 350
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 140/218 (64%), Gaps = 12/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R+ ++P+KDQG CGSCW FST ++EA GK +SLSEQ+LVDC +A+ NQG
Sbjct: 134 VDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAY-NQG 192
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AFE+I NGG+DT++ YPY G DG+C + +N +D ++ E+ L
Sbjct: 193 CNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGYEDVPPYDENAL 252
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ AV +PVS+A E + Y+SGV++ +CG + ++H VV VGYG E+GV YWL
Sbjct: 253 KKAVAR-QPVSIAIEASGRALQLYQSGVFTG-ECGTS---LDHGVVVVGYGSENGVDYWL 307
Query: 279 IKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
++NSWG WG+ GYFKM+ CGI ASYPV
Sbjct: 308 VRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|66378018|gb|AAY45870.1| cathepsin L-like cysteine proteinase [Rotylenchulus reniformis]
Length = 369
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/216 (44%), Positives = 131/216 (60%), Gaps = 5/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW FS TG+LE + + G+ +SLSEQ LVDC + + N G
Sbjct: 156 VDWRDKQWVTEVKNQGQCGSCWAFSATGALEGQHARKTGQLVSLSEQNLVDCTKKYGNMG 215
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGL AF+YIK N G+D E YPY K G C F +VG ++ G ED+L+
Sbjct: 216 CNGGLMDNAFQYIKDNEGIDKEMTYPYKAKAGRCHFKRNDVGATDTGFFDVAEGDEDKLK 275
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWL 278
AV PVSVA + F+ YK GVY +C P +++H V+ VGYG + + YW+
Sbjct: 276 LAVATQGPVSVAIDAGHRSFQLYKHGVYFEEEC--NPEELDHGVLVVGYGTDPEHGDYWI 333
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSW +WG+ GY +M + N CGI + ASYP V
Sbjct: 334 VKNSWSTHWGEQGYIRMAPNRNNNCGIPSHASYPTV 369
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 118/312 (37%), Positives = 158/312 (50%), Gaps = 56/312 (17%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTN---CKGL-SYRLGLN--- 108
A + + ++ +GK+Y S +E LRF F +N +I N +G +Y LG+N
Sbjct: 17 AEFSSEWLKWKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFG 76
Query: 109 -------------------------------------------ISPVKDQGHCGSCWTFS 125
++PVKDQG CGSCW FS
Sbjct: 77 DLLHSEFLERSNGFQGGVSGGDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFS 136
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
TGS+E K +SLSEQQLVDC+ N GC GGL AF+Y N G+ E++Y
Sbjct: 137 ATGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSY 196
Query: 186 PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKS 244
PYT KD CK+ + ++ ED+L+ AV V PVSVA + F+FY+S
Sbjct: 197 PYTAKDNDCKYKKSMSVATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYES 256
Query: 245 GVYSSTKCGNTPMDVNHAVVAVGYGVED--GVPYWLIKNSWGENWGDHGYFKMEMGK-NM 301
GVY C + +D H V+AVGYG + G+ +WL+KNSW +WG +GY KM K N
Sbjct: 257 GVYYDENCSSEVLD--HGVLAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMARNKDNN 314
Query: 302 CGIATCASYPVV 313
CGIAT ASYP+V
Sbjct: 315 CGIATMASYPIV 326
>gi|68399197|ref|XP_695425.1| PREDICTED: cathepsin L [Danio rerio]
Length = 349
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/216 (44%), Positives = 138/216 (63%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++ VKDQG+CGSCW+FSTTG++E ++ G+ +SLSEQQLVDC++++ G
Sbjct: 137 IDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYG 196
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQ-VLDSVNITLGAEDEL 219
C+G + A++Y+ N L++ + YPYT D F +N+ + + D + G E L
Sbjct: 197 CSGAWMANAYDYV-INNALESSDTYPYTSVDTQPCFYEKNLAMAGISDYRFVPAGNEQAL 255
Query: 220 QHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
AV V PVSVA + + F FY SG+Y + C P ++NHAV+ VGYG E+G YW+
Sbjct: 256 ADAVATVGPVSVAIDADNPSFLFYSSGIYKESNC--NPNNLNHAVLVVGYGSEEGTDYWI 313
Query: 279 IKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
IKNSWG WG+ GY +M GKN CGIA+ A YP++
Sbjct: 314 IKNSWGTGWGEGGYMRMIRNGKNTCGIASYALYPII 349
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 116/306 (37%), Positives = 161/306 (52%), Gaps = 67/306 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSY------------------------ 103
++GK Y ++E + RF F +NL I N + +Y
Sbjct: 41 KHGKAYNGIDEREKRFQIFKENLKFIDDHNSENRTYKVGLNMFADLTNEEYRALYLGTRS 100
Query: 104 --------------RLGLN----------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
R +N ++PVK+QG CGSCW FST ++E
Sbjct: 101 PPARRVMKAKTASRRYAVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGI 160
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
G+ ISLSEQ+LV C + +N+ GCNGGL AF++I NGGLDTEE YPY DG
Sbjct: 161 NQIVTGELISLSEQELVSCDKKYNS-GCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQ 219
Query: 194 CKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTK 251
C + +N V +D+ ++ E+ L+ AV +PVSVA E + Y+SGV++ K
Sbjct: 220 CDPTRKNAKVVSIDAYEDVPANDEESLKKAVAH-QPVSVAIEASGLALQLYQSGVFTG-K 277
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKME-----MGKNMCGIAT 306
CG+ ++H VVAVGYG E+GV YWL++NSWG +WG+ GYFK+E + + CGIA
Sbjct: 278 CGSA---LDHGVVAVGYGKENGVDYWLVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAM 334
Query: 307 CASYPV 312
ASYPV
Sbjct: 335 QASYPV 340
>gi|351712164|gb|EHB15083.1| Cathepsin L1 [Heterocephalus glaber]
Length = 278
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 105/228 (46%), Positives = 138/228 (60%), Gaps = 14/228 (6%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
L L++S + + Y ++PVK+QG CG+CW FS TGSLE Q G+ +SLSEQ L
Sbjct: 57 LQLLKSVDWREKGY-----VTPVKNQGQCGTCWAFSATGSLEGQMFQKTGQLVSLSEQNL 111
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGGL AFEY+K N GL++E+ YPY GKDG CK+ E V
Sbjct: 112 VDCSRPQGNQGCNGGLMDFAFEYVKENKGLESEKFYPYEGKDGSCKYKPELSAANDTGFV 171
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+I+ E L AV P+SVA + + F+FYK G+Y +C + D+NH V+ +GY
Sbjct: 172 DISQ-REKALMKAVAEEGPISVAVDAGLTSFQFYKDGIYFDPEC--SSKDLNHGVLVLGY 228
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGKNM-CGIATCASYP 311
G E + YWL+KNS G WG GY K+ +N CGIAT ASYP
Sbjct: 229 GYEEVNSEKNEYWLVKNSSGPEWGAKGYMKIAGNRNKHCGIATAASYP 276
>gi|410303012|gb|JAA30106.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|193786743|dbj|BAG52066.1| unnamed protein product [Homo sapiens]
Length = 333
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|15214962|gb|AAH12612.1| Cathepsin L1 [Homo sapiens]
gi|61363426|gb|AAX42388.1| cathepsin L [synthetic construct]
gi|123988681|gb|ABM83856.1| cathepsin L [synthetic construct]
gi|123999196|gb|ABM87178.1| cathepsin L [synthetic construct]
Length = 333
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|94733563|emb|CAK11015.1| novel protein similar to vertebrate cathepsin L (CTSL) [Danio
rerio]
Length = 334
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/216 (44%), Positives = 138/216 (63%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++ VKDQG+CGSCW+FSTTG++E ++ G+ +SLSEQQLVDC++++ G
Sbjct: 122 IDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYG 181
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQ-VLDSVNITLGAEDEL 219
C+G + A++Y+ N L++ + YPYT D F +N+ + + D + G E L
Sbjct: 182 CSGAWMANAYDYV-INNALESSDTYPYTSVDTQPCFYEKNLAMAGISDYRFVPAGNEQAL 240
Query: 220 QHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
AV V PVSVA + + F FY SG+Y + C P ++NHAV+ VGYG E+G YW+
Sbjct: 241 ADAVATVGPVSVAIDADNPSFLFYSSGIYKESNC--NPNNLNHAVLVVGYGSEEGTDYWI 298
Query: 279 IKNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
IKNSWG WG+ GY +M GKN CGIA+ A YP++
Sbjct: 299 IKNSWGTGWGEGGYMRMIRNGKNTCGIASYALYPII 334
>gi|60827856|gb|AAX36816.1| cathepsin L [synthetic construct]
Length = 334
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 157/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +G+ Y +V E + RF F NL + + N S+RLGLN
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ +KDQG CGSCW FST
Sbjct: 106 YRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQGSCGSCWAFSTI 165
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N QGCNGGL AFE+I NGG+DTEE YPY
Sbjct: 166 AAVEGINQIVTGDMISLSEQELVDCDTSYN-QGCNGGLMDYAFEFIINNGGIDTEEDYPY 224
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
G DG C + +N V +DS ++ +E LQ AV +P+SVA E F+ Y SG
Sbjct: 225 KGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVA-NQPISVAIEAGGRAFQLYNSG 283
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ T CG ++H V AVGYG E+G YW++KNSWG +WG+ GY +ME
Sbjct: 284 IFTGT-CGTA---LDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGK 339
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 340 CGIAVEPSYPL 350
>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
Length = 336
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 135/212 (63%), Gaps = 9/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQ CGSCW+FS+TG+LE + GK IS+SEQ LVDC++ NQGCNGGL
Sbjct: 127 VTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDL 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLD+E++YPY +D + C++ + V+I G E L +AV V
Sbjct: 187 AFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKSTGFVDIPSGNEPALMNAVAAVG 246
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVPYWLIKNS 282
PVSVA + +FY+SG+Y C ++ +D HAV+ VGYG + G YW++KNS
Sbjct: 247 PVSVAIDASHQSLQFYQSGIYYERACSSSRLD--HAVLVVGYGYQGADVAGNRYWIVKNS 304
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W + WGD GY M K N CG+AT ASYP++
Sbjct: 305 WSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
Length = 334
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 113/279 (40%), Positives = 151/279 (54%), Gaps = 24/279 (8%)
Query: 51 QVIGQARHALSFARFA------RRYGKIYESVEEMKLRFATFSKN---LDLIRSTNCKGL 101
Q Q +H S A A + ++ + K + + +D+ +S +
Sbjct: 64 QEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKRKKGKLFREPLLIDVPKSVDWTKK 123
Query: 102 SYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGC 161
Y ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++ NQGC
Sbjct: 124 GY-----VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGC 178
Query: 162 NGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQ 220
NGGL AF+YIK NGGLD+EE+YPY D C + E V+I E L
Sbjct: 179 NGGLMDNAFQYIKENGGLDSEESYPYLATDTSSCNYKPECSAANDTGFVDIPQ-REKALM 237
Query: 221 HAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVP 275
AV V P+SVA + F+FYKSG+Y C + D++H V+ VGYG E +
Sbjct: 238 KAVATVGPISVAIDAGHASFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNNK 295
Query: 276 YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG +GY KM + N CGIAT ASYP V
Sbjct: 296 FWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
Length = 333
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 111/304 (36%), Positives = 154/304 (50%), Gaps = 57/304 (18%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLN------------ 108
+ ++Y K Y++ EE +R + KNL + N + GL SY LG+N
Sbjct: 32 WKKKYNKEYQNKEEEGVRRVIWEKNLRFVMLHNLEQSLGLHSYELGMNHLGDMTSEEVTA 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++ VK+QG CGSCW FS G+LE
Sbjct: 92 LMTGLKIPVSQSRNSTLYWARQGASAPDTVDWREKGCVTNVKNQGSCGSCWAFSAVGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
G +SLS Q LVDC+ AF N GCNGG S AF+Y+ YN G+D+E +YPYTG+
Sbjct: 152 CQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYNNGIDSEASYPYTGQS 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
G C+++ + V++ G E L+ AV PVSVA + F ++ GVY
Sbjct: 212 GTCRYNLQGRAATCSRYVDLPSGNEAALKDAVANFGPVSVAIDASRPSFFLFRKGVYDDP 271
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCAS 309
C T +NH V+ VGYG EDG+ YWL+KNSWG ++GD GY K+ N CGIA+ +
Sbjct: 272 SC--TSAHINHGVLVVGYGTEDGIDYWLVKNSWGVSFGDQGYIKIARNHDNRCGIASQCT 329
Query: 310 YPVV 313
YP++
Sbjct: 330 YPLM 333
>gi|4503155|ref|NP_001903.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|22202619|ref|NP_666023.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081592|ref|NP_001244900.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081594|ref|NP_001244901.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|332832229|ref|XP_003312197.1| PREDICTED: cathepsin L1 isoform 2 [Pan troglodytes]
gi|332832233|ref|XP_001137800.2| PREDICTED: cathepsin L1 isoform 1 [Pan troglodytes]
gi|397470218|ref|XP_003806728.1| PREDICTED: cathepsin L1 isoform 1 [Pan paniscus]
gi|397470220|ref|XP_003806729.1| PREDICTED: cathepsin L1 isoform 2 [Pan paniscus]
gi|397470222|ref|XP_003806730.1| PREDICTED: cathepsin L1 isoform 3 [Pan paniscus]
gi|410042824|ref|XP_003951515.1| PREDICTED: cathepsin L1 [Pan troglodytes]
gi|115741|sp|P07711.2|CATL1_HUMAN RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|29715|emb|CAA30981.1| pro-(cathepsin L) [Homo sapiens]
gi|190418|gb|AAA66974.1| preprocathepsin L precursor [Homo sapiens]
gi|31873292|emb|CAD97637.1| hypothetical protein [Homo sapiens]
gi|48146223|emb|CAG33334.1| CTSL [Homo sapiens]
gi|119583135|gb|EAW62731.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583136|gb|EAW62732.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583137|gb|EAW62733.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583138|gb|EAW62734.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583140|gb|EAW62736.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|208965934|dbj|BAG72981.1| cathepsin L1 [synthetic construct]
gi|410303006|gb|JAA30103.1| cathepsin L1 [Pan troglodytes]
gi|410303008|gb|JAA30104.1| cathepsin L1 [Pan troglodytes]
gi|410303010|gb|JAA30105.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|334324659|ref|XP_001371004.2| PREDICTED: cathepsin K-like [Monodelphis domestica]
Length = 332
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 122 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 179
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G+E L+
Sbjct: 180 CGGGYMTNAFQYVQKNRGIDSEDAYPYIGEDESCMYNPTGKAAKCRGYREIPEGSEKALK 239
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 240 RAVARVGPVAVAIDASLSSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQRGTKHWII 297
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE WG+ GY M K N CGIA AS+P
Sbjct: 298 KNSWGEQWGNKGYILMARNKNNACGIANLASFP 330
>gi|114559412|ref|XP_001171151.1| PREDICTED: cathepsin K isoform 4 [Pan troglodytes]
gi|410221358|gb|JAA07898.1| cathepsin K [Pan troglodytes]
gi|410248298|gb|JAA12116.1| cathepsin K [Pan troglodytes]
gi|410301088|gb|JAA29144.1| cathepsin K [Pan troglodytes]
gi|410351445|gb|JAA42326.1| cathepsin K [Pan troglodytes]
Length = 329
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AFEY++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFEYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSRGVYFDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 113/315 (35%), Positives = 162/315 (51%), Gaps = 66/315 (20%)
Query: 58 HALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------- 108
+ + F F +YGK+Y + E +RF F N+D+I +TN + L++ LG+N
Sbjct: 23 YMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEE 82
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++PVK+QG CGSCW+FSTT
Sbjct: 83 LAASYTGLKPASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTT 142
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE A+ + G +SLSEQQ VDC + GCNGG AF + K N + TE +YPY
Sbjct: 143 GALEGAWALSTGNLVSLSEQQFVDCDT--TDSGCNGGWMDNAFSFAKKN-SICTEGSYPY 199
Query: 188 TGKDGVCKFSSENVGVQ---VLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYK 243
T DG C S VG+ V+ +++ +E + AV +PVS+A E F+ Y
Sbjct: 200 TATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQ-QPVSIAIEADQYSFQLYS 258
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCG 303
SGV +++ CG ++H V+AVGYG E G YW +KNSWG +WG+ GY +++ GK G
Sbjct: 259 SGVLTAS-CGTR---LDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGGAG 314
Query: 304 ----IATCASYPVVA 314
+A SYPVV+
Sbjct: 315 ECGLLAGPPSYPVVS 329
>gi|33242884|gb|AAQ01146.1| cathepsin [Petromyzon marinus]
Length = 333
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 95/216 (43%), Positives = 134/216 (62%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +RL ++PVK+QG CGS W FS TGSLE + A G SLSEQQLVDC +++ N G
Sbjct: 121 VDWRLKGYVTPVKEQGLCGSSWAFSATGSLEGQHFAATGNLTSLSEQQLVDCTKSYYNNG 180
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE-L 219
CNGG +A +YI N G+D+E +YPY DG C+F NV + + + +E L
Sbjct: 181 CNGGRSERALQYIIDNNGIDSELSYPYEHADGKCRFKPANVATKCSSYQFVEPSSNEEVL 240
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ AV V P+++A +D F+ YKSG+++ C +P NHA++ VGYG G +W+
Sbjct: 241 RQAVASVGPIAIAMNADLDTFKHYKSGLFNEPSCDKSP---NHAMLVVGYGSLSGNDFWI 297
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWGE+WG+ GY M K N CGIA+ YP++
Sbjct: 298 VKNSWGEDWGEKGYIYMIRNKDNQCGIASIGIYPII 333
>gi|7271897|gb|AAF44679.1|AF239268_1 cathepsin L, partial [Fasciola gigantica]
Length = 219
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 132/215 (61%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQ CGSCW FSTTG++E + + G +S SEQQLVDC+ F N G
Sbjct: 5 IDWRDSGYVTKVKDQEDCGSCWAFSTTGTMEGQFMKNIGFNVSFSEQQLVDCSSDFGNNG 64
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL A+EY++ GL+ E YPY +G C++ +V + G E ELQ
Sbjct: 65 CRGGLMEIAYEYLR-RFGLEIESTYPYRAVEGPCRYDRRLGVAKVTGYYIVHSGDEVELQ 123
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG+ P +VA +V F Y+SG+Y S C +P +NH V+AVGYG + G YW++K
Sbjct: 124 NLVGIEGPAAVALDVESDFVMYRSGIYQSQTC--SPDRLNHGVLAVGYGTQSGTDYWIVK 181
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 182 NSWGTWWGEGGYIRMVRNRGNMCGIASMASLPMVA 216
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 116/334 (34%), Positives = 167/334 (50%), Gaps = 70/334 (20%)
Query: 34 IRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLI 93
I V S L + ETS+++ RH ++ +Y K+Y+ E + RF F N++ I
Sbjct: 22 ISRVISRELHETETSLIE-----RHE----QWMAKYDKVYKDAAEKEKRFLIFKDNVEFI 72
Query: 94 RSTNCKG-LSYRLGLN-------------------------------------------- 108
S N G Y+LG+N
Sbjct: 73 ESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTSFKYENVTAIPASVDW 132
Query: 109 -----ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
++P+KDQG CGSCW FST + E + + GK +SLSEQ+LVDC + +QGC G
Sbjct: 133 RKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEG 192
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
G FE+I NGG+ TE YPY DG CK ++ Q+ + + +E L AV
Sbjct: 193 GYMEDGFEFIIKNGGITTEANYPYKAVDGSCK-NATAPAAQIKGYEKVPVNSEKALLKAV 251
Query: 224 GLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
+PVSV+ + DG F FY SG+++ +CG +++H V AVGYG +G YW++KNS
Sbjct: 252 A-NQPVSVSIDAADGSFMFYSSGIFTG-ECGT---ELDHGVTAVGYGRANGTDYWIVKNS 306
Query: 283 WGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
WG WG+ GY +M+ G + +CGIA +SYP
Sbjct: 307 WGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPT 340
>gi|340370270|ref|XP_003383669.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 326
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 130/214 (60%), Gaps = 4/214 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW+FS+TGSLE + G SLSEQQL+DC+ +F N G
Sbjct: 112 VDWREKGAVTEVKNQGKCGSCWSFSSTGSLEGQHFLKTGTLSSLSEQQLMDCSTSFGNHG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL +F Y++ G +EE YPYT +DG C++ S + +I G ED L+
Sbjct: 172 CKGGLMDNSFRYLETVAGDMSEEMYPYTAEDGFCRYRSSEAIAKDTGYKDIPRGDEDALK 231
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + F+ Y G+Y C +T +D H V+AVGYG +G YWL+
Sbjct: 232 EAVATVGPISVAIDAGHRSFQLYHEGIYYEPACSSTKLD--HGVLAVGYGTGEGEEYWLV 289
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
KNSWG +WG+ GY M + N CGIAT ASYP
Sbjct: 290 KNSWGPSWGNEGYVMMSRNRENNCGIATQASYPT 323
>gi|321478980|gb|EFX89936.1| hypothetical protein DAPPUDRAFT_309603 [Daphnia pulex]
Length = 584
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 106/304 (34%), Positives = 154/304 (50%), Gaps = 51/304 (16%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
+F F R + K Y++ E + R F +N+ I S N GL+Y+L N
Sbjct: 281 TFDSFVRHHKKGYKNTTEHENRKDIFRQNMRFIHSKNRAGLTYKLAPNHMTDRSSDEIRY 340
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVKDQ CGSCW+F T G+LE
Sbjct: 341 MRGKLRSNGFNGGSTFHYTKSDVENLPEQMDWRLYGAVTPVKDQSVCGSCWSFGTVGTLE 400
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGK 190
A GK LS+Q LVDC+ F N GC+GG + ++++ +GG+ +EE+Y PY G
Sbjct: 401 GALFLKTGKLTPLSQQALVDCSWGFGNNGCDGGEDFRVYQWMMKHGGIPSEESYGPYLGA 460
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSS 249
DG C + + + VN+T G D L+ A+ P+SVA + F FY +GVY +
Sbjct: 461 DGYCHVDNATLVASIKGYVNVTSGDVDALRVAIFKYGPISVAIDAAHRAFSFYANGVYYN 520
Query: 250 TKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCAS 309
+CG+ ++HAV+AVGYG+ G PYWL+KNSW WG+ GY M +N CG+AT +
Sbjct: 521 PECGSGEDSLDHAVLAVGYGILKGEPYWLVKNSWSTYWGNSGYVLMSQKENNCGVATSPT 580
Query: 310 YPVV 313
Y ++
Sbjct: 581 YVIM 584
>gi|301769891|ref|XP_002920367.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281346353|gb|EFB21937.1| hypothetical protein PANDA_009084 [Ailuropoda melanoleuca]
Length = 333
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 128/211 (60%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK QGHC SCW FS TG+LE + GK +SLSEQ LVDC+ NN GC GGL
Sbjct: 126 VTPVKYQGHCQSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWPQNNDGCRGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF Y+K NGGLD+ E+YPY G++ CK+ E + +++ ED L V V P
Sbjct: 186 AFRYVKDNGGLDSAESYPYLGRNESCKYRPEKSAANLTTFWSVS-NKEDGLMTTVATVGP 244
Query: 229 VSVAFE-VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNSW 283
VS A + + F+FYK G+Y C + + NHAV+ VGYG E + YW+IKNSW
Sbjct: 245 VSAAVDSSLHSFQFYKKGIYYDPNCRSNRL--NHAVLVVGYGFEGEESENKKYWIIKNSW 302
Query: 284 GENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G NWG GY + + N CGIAT AS+PVV
Sbjct: 303 GTNWGMKGYMLLAKDRDNHCGIATMASFPVV 333
>gi|410042826|ref|XP_003951516.1| PREDICTED: cathepsin L1 [Pan troglodytes]
Length = 278
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 61 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 120
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 121 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKA 179
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 180 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 237
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 238 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 278
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 119/332 (35%), Positives = 164/332 (49%), Gaps = 75/332 (22%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
Q + ++ +F + + K Y S EE R+ F N+D ++ N KG LGLN
Sbjct: 19 QQFSELQYRNAFTDWMITHQKSYTS-EEFGARYNIFKANMDYVQQWNSKGSETVLGLNNF 77
Query: 109 ----------------------------------------------ISPVKDQGHCGSCW 122
++PVK+QG CG CW
Sbjct: 78 ADITNEEYRNTYLGTKFDASSLIGTQEEKVFTTSSAASKDWRSEGAVTPVKNQGQCGGCW 137
Query: 123 TFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTE 182
+FSTTGS E A+ Q+ G+ +SLSEQ L+DC+ N GC+GGL + AFEYI N G+DTE
Sbjct: 138 SFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE--NSGCDGGLMTYAFEYIINNNGIDTE 195
Query: 183 EAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRF 241
+YPY ++G C++ SEN G + +T G+E L+ AV V PVSVA + F+
Sbjct: 196 SSYPYKAENGKCEYKSENSGATLSSYKTVTAGSESSLESAVN-VNPVSVAIDASHQSFQL 254
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV-------------------PYWLIKNS 282
Y SG+Y +C + +D H V+AVGYG G YW++KNS
Sbjct: 255 YTSGIYYEPECSSENLD--HGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNS 312
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG +WG GY M + N CGIA+ AS+PVV
Sbjct: 313 WGTSWGIEGYILMSRNRDNNCGIASSASFPVV 344
>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
Length = 334
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 131/212 (61%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++A NQGCNGGL
Sbjct: 126 VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK NGGLD+EE+YPY D C + E V+I E L AV V
Sbjct: 186 AFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-REKALMKAVATVG 244
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNS 282
P+SVA + F+FYKSG+Y C + D++H V+ VGYG E + +W++KNS
Sbjct: 245 PISVAIDAGHTSFQFYKSGIYYDPDC--SCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNS 302
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG WG +GY KM + N CGIAT ASYP V
Sbjct: 303 WGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 111/310 (35%), Positives = 159/310 (51%), Gaps = 63/310 (20%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC-KGLSYRLGLN----------- 108
++ + ++GK Y ++ E + RF F N I N K S++LGLN
Sbjct: 43 AYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEEYR 102
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ VKDQG CGSCW FST
Sbjct: 103 SKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWAFSTI 162
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E A GK I+LSEQ+LVDC +++N +GCNGGL AF++I NGG+D++ YPY
Sbjct: 163 SAVEGINQIATGKLITLSEQELVDCDRSYN-EGCNGGLMDDAFQFIINNGGIDSDADYPY 221
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGV 246
TG+DG C +N V +DS +++ +P+SVA E F+FY SG+
Sbjct: 222 TGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDSGI 281
Query: 247 YSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMC 302
++ KCG D++H VV VGYG E+G YW+++NSWG +WG+ GY +ME G +C
Sbjct: 282 FTG-KCGT---DLDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGISSKAGIC 337
Query: 303 GIATCASYPV 312
GI + SYPV
Sbjct: 338 GITSEPSYPV 347
>gi|426216526|ref|XP_004002513.1| PREDICTED: cathepsin S isoform 2 [Ovis aries]
Length = 281
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 108/256 (42%), Positives = 144/256 (56%), Gaps = 12/256 (4%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLNISPVKDQGHCGS 120
+ + YGK YE E R + KNL + N + G+ SY LG+N + D G CGS
Sbjct: 31 WKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYELGMN--HLGDMGACGS 88
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQA-FNNQGCNGGLPSQAFEYIKYNGGL 179
CW FS G+LEA GK +SLS Q LVDC+ + N+GCNGG ++AF+YI N G+
Sbjct: 89 CWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYIIDNNGI 148
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-G 238
D+E +YPY DG C++ +N + + G+E+ L+ AV PVSV +
Sbjct: 149 DSEASYPYKAMDGRCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDAKQTS 208
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F YK+GVY C +VNH V+ VGYG +G YWL+KNSWG N+GD GY +M
Sbjct: 209 FFLYKTGVYYDPSC---TQNVNHGVLVVGYGSLNGKDYWLVKNSWGLNFGDQGYIRMARN 265
Query: 299 K-NMCGIATCASYPVV 313
N CGIA SYP +
Sbjct: 266 SGNHCGIANFPSYPEI 281
>gi|355681660|gb|AER96816.1| cathepsin L2 [Mustela putorius furo]
Length = 334
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 113/259 (43%), Positives = 149/259 (57%), Gaps = 26/259 (10%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSC 121
F R GK+++ + FA K++D + KG ++PVK+QG CGSC
Sbjct: 95 FRNQKHRKGKVFQ-----EPLFAEIPKSVDWTQ----KGY-------VTPVKNQGQCGSC 138
Query: 122 WTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDT 181
W FS TG+LE + GK +SLSEQ LVDC+++ NQGCNGGL AF+YIK NGGLD+
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNQGCNGGLMDFAFQYIKDNGGLDS 198
Query: 182 EEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGF 239
EE+YPY +D C + E V+I E L AV V P+SVA + F
Sbjct: 199 EESYPYLARDTDSCNYKPEYSVANDTGFVDIPQ-RERALMKAVATVGPISVAIDAGHQSF 257
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNSWGENWGDHGYFKM 295
+FYKSG+Y C + D++H V+ VGYG E + +W++KNSWG WG +GY KM
Sbjct: 258 QFYKSGIYFDPDC--SSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGCNGYVKM 315
Query: 296 EMGK-NMCGIATCASYPVV 313
+ N CGIAT ASYP V
Sbjct: 316 AKDQNNHCGIATAASYPTV 334
>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 330
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/231 (41%), Positives = 138/231 (59%), Gaps = 5/231 (2%)
Query: 85 TFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
+F+ +D + S K + YR ++ VK+QG CGSCW FS G+LE ++ GK + L
Sbjct: 103 SFTLAMDDMPSKIPKSVDYRKKGMVTSVKNQGACGSCWAFSAAGALEGQLAKSTGKLVDL 162
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQ 204
S Q LVDC+ + N GCNGG ++AF+Y+ N G+D++ +YPYTG+D C+++
Sbjct: 163 SPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHGIDSDASYPYTGRDEQCRYNPATRAAN 222
Query: 205 VLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAV 263
+ G E+ L+ A+ + P+SVA + F FY+SGVY+ C +VNH V
Sbjct: 223 CSSYQFLPEGDENALKQALATIGPISVAIDARRPRFSFYRSGVYNDPSC---TQEVNHGV 279
Query: 264 VAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+AVGYG +G YWL+KNSWG +GD GY +M N CGIA A YPV+
Sbjct: 280 LAVGYGSLNGQDYWLVKNSWGSTFGDQGYIRMARNTGNQCGIALYACYPVM 330
>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
Precursor
gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
Length = 329
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + +VNHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSD--NVNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGESWGNKGYILMARNKNNACGIANLASFP 327
>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 327
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 137/220 (62%), Gaps = 5/220 (2%)
Query: 97 NCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAF 156
N + +R ++PVK+Q CGSCW FSTTGSLE + +SLSEQQL+DC+
Sbjct: 110 NPTSIDWRTRGYVTPVKNQLQCGSCWAFSTTGSLEGQHFAKTKNLVSLSEQQLMDCSFKE 169
Query: 157 NNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE 216
++GC GG+ AF+YI GG+++E YPY ++ C+F + ++ + V++T G+E
Sbjct: 170 GDEGCGGGIMDYAFDYIFLAGGVESEADYPYEARNDHCRFDNSSIAATLTGCVDVTSGSE 229
Query: 217 DELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP 275
+L+ AVG + PVSVA + F+ Y SGV C T +D H V+AVGYG ++G
Sbjct: 230 TQLEKAVGSIGPVSVAIDASHISFQLYGSGVNYEPMCSTTTLD--HGVLAVGYGADNGNE 287
Query: 276 YWLIKNSWGENWGD-HGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSWGE WG +GY KM + N CGIAT ASYP V
Sbjct: 288 YWIVKNSWGEGWGHLNGYIKMSKNRNNNCGIATQASYPTV 327
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 101/247 (40%), Positives = 143/247 (57%), Gaps = 6/247 (2%)
Query: 69 YGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTG 128
Y +Y + FA LD + L +R + VKDQG CGSCW FSTTG
Sbjct: 84 YRAVYLGMNVDASNFAAQPATLDQVYQPVRSTLDWRNNGAVGRVKDQGQCGSCWAFSTTG 143
Query: 129 SLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
++E A+ A G +SLSEQQL+DC++++ N GC GGL A YI GG++TEE+YPY
Sbjct: 144 AVEGAHQIATGNFVSLSEQQLMDCSRSYGNHGCQGGLMDSAMSYIVKQGGINTEESYPYE 203
Query: 189 GKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGV 246
+D CK++ N G ++ NI G+E +L + + PV++A + F+ YKSGV
Sbjct: 204 MRDSYTCKYNPANNGAKLSGYSNIKRGSEADLAAKLN-IGPVAIALDASHSSFQLYKSGV 262
Query: 247 YSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIA 305
+ C +T + +H V+AVGYG E YW++KNSWG WGD GY + + N CG+A
Sbjct: 263 FYDPACSSTSL--SHGVLAVGYGTEGSSAYWIVKNSWGTRWGDAGYIWIAKDRNNHCGVA 320
Query: 306 TCASYPV 312
T +S P+
Sbjct: 321 TMSSIPI 327
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/206 (47%), Positives = 127/206 (61%), Gaps = 3/206 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VKDQG CGSCWTFSTTGS+EAA+ G +SLSEQ LVDCA+ GC GG +
Sbjct: 122 VTEVKDQGMCGSCWTFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKD-TCYGCGGGWMDK 180
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A EYI+ GG+ +E+ YPY G D C+F V ++ + I E++L++AV P
Sbjct: 181 ALEYIE-KGGIMSEKDYPYEGVDDNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGP 239
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
+SVA + F+ Y SG+ T+C N +NH V+ VGYG E+G YW+IKNSWG NWG
Sbjct: 240 ISVAIDASATFQLYVSGILDDTECSNEFDSLNHGVLVVGYGTENGKDYWIIKNSWGVNWG 299
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVV 313
GY +M K N CGI T YP +
Sbjct: 300 MDGYIRMSRNKNNQCGITTDGVYPNI 325
>gi|402856109|ref|XP_003892642.1| PREDICTED: cathepsin K [Papio anubis]
Length = 348
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 138 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 195
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 196 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 255
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 256 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 313
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 314 KNSWGENWGNKGYILMARNKNNACGIANLASFP 346
>gi|258588539|pdb|3HWN|A Chain A, Cathepsin L With Az13010160
gi|258588540|pdb|3HWN|B Chain B, Cathepsin L With Az13010160
gi|258588541|pdb|3HWN|C Chain C, Cathepsin L With Az13010160
gi|258588542|pdb|3HWN|D Chain D, Cathepsin L With Az13010160
Length = 258
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 41 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 100
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 101 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDAGFVDIPK-QEKA 159
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 160 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 217
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 218 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 258
>gi|208972992|dbj|BAG74345.1| silicatein-M4 [Ephydatia fluviatilis]
gi|296168739|emb|CAQ54047.1| silicatein alpha 3 [Ephydatia muelleri]
Length = 327
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/220 (43%), Positives = 139/220 (63%), Gaps = 4/220 (1%)
Query: 96 TNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQA 155
T + +R ++ V+ QG CGS + F+ G+LE A A K ++LSEQ ++DC+ A
Sbjct: 110 TYADSMDWRTRGAVTSVQSQGSCGSSYAFAAAGALEGANALAADKLVALSEQNIIDCSVA 169
Query: 156 FNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGA 215
+ N GC+GG AF+Y+ NGG+DT+ +YPY GK C+++S+N+G V IT G+
Sbjct: 170 YGNHGCSGGDVYTAFKYVVDNGGIDTDSSYPYKGKQYSCQYNSKNLGAVATGVVKITSGS 229
Query: 216 EDELQHAVGLVRPVSVAFE-VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV 274
E +L AV V P++VA + V+ F FY+SGV+ S+ C T + NHA++ GYG +G
Sbjct: 230 ETDLLSAVASVGPIAVAVDATVNSFMFYQSGVFDSSSCSTTKL--NHAMLVTGYGSTNGK 287
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG+ GY KM K N CGIA+ A YP++
Sbjct: 288 DYWLVKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 327
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 113/307 (36%), Positives = 160/307 (52%), Gaps = 63/307 (20%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGL-SYRLGLN-------------- 108
++ +YGKIY+ +E + RF F++N++ + ++N SY+LG+N
Sbjct: 41 QWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLGINQFADLTNEEFVASR 100
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVK+QG CG CW FS + E
Sbjct: 101 NKFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEG 160
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
+ + GK ISLSEQ+LVDC +QGC GGL AF++I N GL TE YPY G DG
Sbjct: 161 IHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDG 220
Query: 193 VCKFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C + +V V + ++ +E LQ AV +P+SVA + F+FYKSGV++ +
Sbjct: 221 TCNANKASVQAVTITGYEDVPANSEQALQKAVA-NQPISVAIDASGSDFQFYKSGVFTGS 279
Query: 251 KCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIA 305
CG +++H V AVGYGV DG YWL+KNSWG +WG+ GY M+ G + +CGIA
Sbjct: 280 -CGT---ELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIA 335
Query: 306 TCASYPV 312
ASYP
Sbjct: 336 MQASYPT 342
>gi|395729888|ref|XP_002810309.2| PREDICTED: cathepsin K [Pongo abelii]
Length = 343
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 133 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 191 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 250
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 251 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 308
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 309 KNSWGENWGNKGYILMARNKNNACGIANLASFP 341
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 160/311 (51%), Gaps = 67/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
+ ++ ++ K+Y + E RF F NL I N + SY++GLN
Sbjct: 4 YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQNYSYKVGLNKFADINNEEYRDM 63
Query: 109 ----------------------------------------ISPVKDQGHCGSCWTFSTTG 128
++ +KDQG CGSCW FST
Sbjct: 64 YLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFSTIA 123
Query: 129 SLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
++EA GK +SLSEQ+LVDC +AFN +GCNGGL AFE+I NGG+DT++ YPY
Sbjct: 124 TVEAINKIVTGKFVSLSEQELVDCDRAFN-EGCNGGLMDYAFEFIIRNGGIDTDQDYPYN 182
Query: 189 GKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVY 247
G + C + +N V +D + L+ AV +PVSVA + + Y+SGV+
Sbjct: 183 GFERKCDPTKKNAKVVSIDGYEDVPSYMNALKKAVAH-QPVSVAIAGLGRALQLYQSGVF 241
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNM------ 301
+ KCG D++H VV VGYG E+GV YWL++NSWG NWG+ GYFK+ +N+
Sbjct: 242 TG-KCGT---DLDHGVVVVGYGSENGVDYWLVRNSWGTNWGEDGYFKI-ASRNVKSLYRK 296
Query: 302 CGIATCASYPV 312
CGIA ASYPV
Sbjct: 297 CGIAMEASYPV 307
>gi|383410403|gb|AFH28415.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 131/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y+ NGGLD+EE+YPY + CK++ E V+I E
Sbjct: 176 EGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 SKYWLGKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|300121328|emb|CBK21708.2| unnamed protein product [Blastocystis hominis]
Length = 318
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 154/304 (50%), Gaps = 61/304 (20%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F + +YGK Y + EE + R F+ NL I+ N K L + LG+N
Sbjct: 22 FTSYMSKYGKTYAAPEEARYRLRVFNDNLLKIKEHNAKNLPWTLGVNKFADVSAEEFAYK 81
Query: 109 ---------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYH 135
++PVK+QG CGSCW FSTTG+ E AY
Sbjct: 82 FCGCAKDPKTRGTRQTTLVGDVPARVDWREQGAVTPVKNQGMCGSCWAFSTTGTTEGAYF 141
Query: 136 QAFGKGISLSEQQLVDCAQ--AFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
G +SLSEQQLVDCA+ + N GC+GG P A +Y+ + GL TEE YPY G D
Sbjct: 142 LKTGNLVSLSEQQLVDCARDPEYENFGCSGGWPWSAVDYVTKH-GLCTEEDYPYKGVDAE 200
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CK SS V VQ +D V + +G ED L AV PVS+ + + Y G+ T+C
Sbjct: 201 CKESSCKVAVQSVDKVQLPVGDEDSLAVAVSKT-PVSIVLDAT-AMQLYDKGII--TRCS 256
Query: 254 NTPMDVNHAVVAVGY--GVEDGVPYWLIKNSWGENWGDHGYFKMEM---GKNMCGIATCA 308
+ +NHAV+AVGY E G+ YW+IKNSWG +WG+ GY ++E G C + +
Sbjct: 257 ES---INHAVLAVGYDKDAETGLKYWIIKNSWGADWGEEGYCRIEKDVGGMGRCALTYSS 313
Query: 309 SYPV 312
YPV
Sbjct: 314 VYPV 317
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 158/303 (52%), Gaps = 63/303 (20%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG-LSYRLGLN------------------ 108
+Y KIY +E + RF F +N++ I ++N +G Y+LG+N
Sbjct: 45 QYAKIYNDHQEWEKRFQIFKENVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFK 104
Query: 109 --------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQ 136
++PVKDQG CG CW FS + E +
Sbjct: 105 GHMCSSIIRTNTYKYENVTTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQL 164
Query: 137 AFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF 196
+ GK ISLSEQ+LVDC +QGC GGL AF++I N GLDTE YPY G DG C
Sbjct: 165 STGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNA 224
Query: 197 SSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGN 254
+ ++ + S ++ E LQ AV +P+SVA + F+FY SGV++ + CG
Sbjct: 225 NEASINAATITSYEDVPTNNEQALQKAVA-NQPISVAIDASGSDFQFYTSGVFTGS-CGT 282
Query: 255 TPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCAS 309
+++H V AVGYGV +DG YWL+KNSWG +WG+ GY +M+ G + +CGIA AS
Sbjct: 283 ---ELDHGVTAVGYGVSDDGTKYWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQAS 339
Query: 310 YPV 312
YP+
Sbjct: 340 YPI 342
>gi|340728972|ref|XP_003402785.1| PREDICTED: counting factor associated protein D-like [Bombus
terrestris]
Length = 549
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 171/360 (47%), Gaps = 65/360 (18%)
Query: 3 RPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSF 62
+P V V + C NP+R + + +++T V + +F
Sbjct: 200 KPSSEVFEVTTNMTCVGFPGPGDKHVYTFNPMR----EFVHNYDTHVNE---------AF 246
Query: 63 ARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------------- 108
F + + K Y + + +R F +NL I STN Y+L +N
Sbjct: 247 EDFKKAHNKEYVNHVDQLMRKEVFRQNLRFIHSTNRANKGYQLSVNHLVDRTELELKALR 306
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F TTG++E
Sbjct: 307 GKQYTAHYNGGQPFPYNAEKEVTEVPDSLDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEG 366
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
AY+ +GK + LS+Q L+DC+ + N GC+GG +++++I +GGL TE+ Y Y G+D
Sbjct: 367 AYYMKYGKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGGLPTEDEYGGYLGQD 426
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
G C ++ + ++ VN+T G + L+ A+ P+SVA + F FY GVY
Sbjct: 427 GYCHVNNVTLTAKITGYVNVTSGDANALKVAIAKHGPISVAIDASHKTFSFYSHGVYYDE 486
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
CGNT ++HAV+AVGYG +G YWL+KNSW WG+ GY M KN CG+ T +Y
Sbjct: 487 SCGNTEESLDHAVLAVGYGSLNGKDYWLVKNSWSNYWGNDGYILMSQEKNNCGVLTAPTY 546
>gi|397492864|ref|XP_003817340.1| PREDICTED: cathepsin K [Pan paniscus]
Length = 343
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 133 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 190
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 191 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 250
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 251 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 308
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 309 KNSWGENWGNKGYILMARNKNNACGIANLASFP 341
>gi|26245875|gb|AAN77413.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 287
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 98/206 (47%), Positives = 127/206 (61%), Gaps = 4/206 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK QG CGSCW FSTTGS+E+ GK ISLSEQQLVDC + NN GC GG
Sbjct: 85 VTEVKSQGSCGSCWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVK--NNSGCAGGWMDI 142
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A EYI+ +G + +E+ YPY ++ C+F++ VQ+ I E +LQ AV L P
Sbjct: 143 ALEYIEADGIM-SEDDYPYEERNTTCRFNNSKAAVQIKSYKAIKKNDEIDLQKAVALEGP 201
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
V VA EV F+ Y G+ + +C NT D+ HAV+ GYG +DG YW++KNSWG +G
Sbjct: 202 VPVAIEVTIAFQLYARGILNDPQCKNTEGDLTHAVLVTGYGSQDGKDYWIVKNSWGAEYG 261
Query: 289 DHGYFKMEM-GKNMCGIATCASYPVV 313
GY +M N CGIAT ASYPV+
Sbjct: 262 MDGYLRMSRNADNQCGIATRASYPVL 287
>gi|6435586|pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435587|pdb|7PCK|B Chain B, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435588|pdb|7PCK|C Chain C, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435589|pdb|7PCK|D Chain D, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435592|pdb|1BY8|A Chain A, The Crystal Structure Of Human Procathepsin K
Length = 314
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 104 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 161
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 162 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 221
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 222 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 279
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 280 KNSWGENWGNKGYILMARNKNNACGIANLASFP 312
>gi|62955235|ref|NP_001017633.1| uncharacterized protein LOC550326 precursor [Danio rerio]
gi|62202194|gb|AAH92817.1| Zgc:110239 [Danio rerio]
Length = 546
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 105/272 (38%), Positives = 153/272 (56%), Gaps = 11/272 (4%)
Query: 45 FETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYR 104
F SV + +++ LS R +R K++ + F + I + N + +R
Sbjct: 284 FSLSVNHLADRSQKELSMMRGCQRTHKVHRKAQ-------PFPSEIRSIATPN--SVDWR 334
Query: 105 LGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGG 164
L ++PVKDQ CGSCW+F+TTG+LE A G+ SLS+Q LVDC F N GC+GG
Sbjct: 335 LYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGG 394
Query: 165 LPSQAFEYIKYNGGLDTEEAY-PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
+AFE+I +GG+ T E+Y Y G +G+C + ++ Q+ N+T G L+ A+
Sbjct: 395 EEWRAFEWIMKHGGISTAESYGAYMGMNGLCHYDKSSMVAQLTGYTNVTSGDILALKAAI 454
Query: 224 GLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
PV+V+ + F FY +GVY +C N D++HAV+AVGYG+ + YWL+KNS
Sbjct: 455 FKFGPVAVSIDAAHRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGIMNNESYWLVKNS 514
Query: 283 WGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
W WG+ GY M M N CG+AT A Y +A
Sbjct: 515 WSSYWGNDGYILMSMKDNNCGVATDAIYATLA 546
>gi|49456399|emb|CAG46520.1| CTSK [Homo sapiens]
Length = 329
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|60654335|gb|AAX29858.1| cathepsin K [synthetic construct]
gi|60654337|gb|AAX29859.1| cathepsin K [synthetic construct]
Length = 330
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|209155876|gb|ACI34170.1| Digestive cysteine proteinase 2 precursor [Salmo salar]
Length = 551
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/304 (35%), Positives = 152/304 (50%), Gaps = 51/304 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F F ++G+ Y E + R F NL + S N GLS+ L +N
Sbjct: 248 FGHFKEQFGRHYGDEREHEKREHAFVHNLRYVHSMNRAGLSFSLAVNSLSDLSMSELSAM 307
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F+TTG++E
Sbjct: 308 RGRNRGKRPNNGLPFPMHLYTGVQVPDQLDWRLYGAVTPVKDQAICGSCWSFATTGAVEG 367
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
A G LS+Q LVDC+ F N GC+GG +A+E+I +GG+ T E Y Y G +
Sbjct: 368 ALFLTSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWIMKHGGIATTETYGSYMGMN 427
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSST 250
G+C F++ + ++ N+T G + L+ A+ PV+V+ + F FY GVY
Sbjct: 428 GLCHFNTSQLTARIQSYTNVTSGDAEALKVALFKHGPVAVSIDAGHRSFVFYSHGVYYEP 487
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
KCGNT ++HAV+AVGYGV + PYWL+KNSW WG+ GY M M N CG+ T A+Y
Sbjct: 488 KCGNTTDSLDHAVLAVGYGVMEAEPYWLVKNSWSTYWGNDGYILMSMKDNNCGVTTDATY 547
Query: 311 PVVA 314
+A
Sbjct: 548 VTLA 551
>gi|198285481|gb|ACH85279.1| cathepsin l-like [Salmo salar]
Length = 444
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/304 (35%), Positives = 152/304 (50%), Gaps = 51/304 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F F ++G+ Y E + R F NL + S N GLS+ L +N
Sbjct: 141 FGHFKEQFGRHYGDEREHEKREHAFVHNLRYVHSMNRAGLSFSLAVNSLSDLSMSELSAM 200
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F+TTG++E
Sbjct: 201 RGRNRGKRPNNGLPFPMHLYTGVQVPDQLDWRLYGAVTPVKDQAICGSCWSFATTGAVEG 260
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
A G LS+Q LVDC+ F N GC+GG +A+E+I +GG+ T E Y Y G +
Sbjct: 261 ALFLTSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWIMKHGGIATTETYGSYMGMN 320
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSST 250
G+C F++ + ++ N+T G + L+ A+ PV+V+ + F FY GVY
Sbjct: 321 GLCHFNTSQLTARIQSYTNVTSGDAEALKVALFKHGPVAVSIDAGHRSFVFYSHGVYYEP 380
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
KCGNT ++HAV+AVGYGV + PYWL+KNSW WG+ GY M M N CG+ T A+Y
Sbjct: 381 KCGNTTDSLDHAVLAVGYGVMEAEPYWLVKNSWSTYWGNDGYILMSMKDNNCGVTTDATY 440
Query: 311 PVVA 314
+A
Sbjct: 441 VTLA 444
>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
Length = 329
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|19698257|dbj|BAB86771.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 134/215 (62%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R +SP+K+QG CGSCW+FS TG+LE+ G SLSEQQLVDC+ + N G
Sbjct: 114 VDWRTSGCVSPIKNQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSGPYGNYG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT-LGAEDEL 219
CNGG P AF+Y++ NGG+D+E YPY + G C ++S ++T +G+E L
Sbjct: 174 CNGGWPDHAFQYVQANGGIDSESYYPYQARVGTCHYNSAYSAATCSGYQDVTPVGSESAL 233
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
Q+ V V P+S+A + G++ Y+SGV++ C T +HAV+ VGYG +G YWL+
Sbjct: 234 QYYVANVGPLSIAID-ASGWQSYQSGVFNDPSCSQT---ADHAVLLVGYGTYNGQDYWLV 289
Query: 280 KNSWGENWGDHGYFKM-EMGKNMCGIATCASYPVV 313
KNSWG WG+ GY M N CGIA ASYP+V
Sbjct: 290 KNSWGTWWGEQGYIMMARNANNQCGIANHASYPLV 324
>gi|74136185|ref|NP_001027984.1| cathepsin K precursor [Macaca mulatta]
gi|47117667|sp|P61276.1|CATK_MACFA RecName: Full=Cathepsin K; Flags: Precursor
gi|47117668|sp|P61277.1|CATK_MACMU RecName: Full=Cathepsin K; Flags: Precursor
gi|3236470|gb|AAC23694.1| cathepsin K [Macaca fascicularis]
gi|4927694|gb|AAD33249.1| cathepsin K [Macaca mulatta]
gi|355558400|gb|EHH15180.1| hypothetical protein EGK_01237 [Macaca mulatta]
gi|355763132|gb|EHH62118.1| hypothetical protein EGM_20317 [Macaca fascicularis]
gi|380809978|gb|AFE76864.1| cathepsin K preproprotein [Macaca mulatta]
gi|383416065|gb|AFH31246.1| cathepsin K preproprotein [Macaca mulatta]
gi|384945478|gb|AFI36344.1| cathepsin K preproprotein [Macaca mulatta]
Length = 329
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VK+QG CGSCW FS TGSLE G +SLSEQ LVDC++ N
Sbjct: 116 KSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCSRPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+Y+K N GL+ E++YPY GKDG CK+ E V++ E
Sbjct: 176 QGCNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGECKYKPELSAANDTGFVDVPQ-REKV 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP-- 275
+Q A+ V P+SVA + + F+FYK G+Y C + D+NH V+ VGYG +
Sbjct: 235 VQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGC--SSRDLNHGVLLVGYGTDASETGK 292
Query: 276 --YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWLIKNSWG WG GY K+ + N CG+AT ASYP+V
Sbjct: 293 GDYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASYPLV 333
>gi|148745204|gb|AAI42984.1| Cathepsin L1 [Homo sapiens]
Length = 333
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|313754424|pdb|3OF8|A Chain A, Structural Basis For Reversible And Irreversible
Inhibition Of Human Cathepsin L By Their Respective
Dipeptidyl Glyoxal And Diazomethylketone Inhibitors
gi|313754425|pdb|3OF9|A Chain A, Structural Basis For Irreversible Inhibition Of Human
Cathepsin L By A Diazomethylketone Inhibitor
Length = 221
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 4 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 63
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 64 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKA 122
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 123 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 180
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 181 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 221
>gi|426331364|ref|XP_004026652.1| PREDICTED: cathepsin K [Gorilla gorilla gorilla]
Length = 329
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|4503151|ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens]
gi|1168793|sp|P43235.1|CATK_HUMAN RecName: Full=Cathepsin K; AltName: Full=Cathepsin O; AltName:
Full=Cathepsin O2; AltName: Full=Cathepsin X; Flags:
Precursor
gi|562757|emb|CAA57649.1| Cathepsin O [Homo sapiens]
gi|606923|gb|AAA65233.1| cathepsin O [Homo sapiens]
gi|1195556|gb|AAB35521.1| cathepsin O2 [Homo sapiens]
gi|16359188|gb|AAH16058.1| Cathepsin K [Homo sapiens]
gi|49456311|emb|CAG46476.1| CTSK [Homo sapiens]
gi|60823594|gb|AAX36649.1| cathepsin K [synthetic construct]
gi|119573901|gb|EAW53516.1| cathepsin K (pycnodysostosis), isoform CRA_b [Homo sapiens]
gi|307685681|dbj|BAJ20771.1| cathepsin K [synthetic construct]
gi|312150424|gb|ADQ31724.1| cathepsin K [synthetic construct]
Length = 329
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|836934|gb|AAA95998.1| cathepsin X [Homo sapiens]
Length = 329
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>gi|344258279|gb|EGW14383.1| Cathepsin L1 [Cricetulus griseus]
Length = 295
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 132/218 (60%), Gaps = 6/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVKDQG CG+CW FS GSL GK + LSEQ LVDC+ + N
Sbjct: 81 KSVDWRKHGYVTPVKDQGSCGACWAFSAVGSLVGQMFWKTGKLVPLSEQNLVDCSWSHGN 140
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GGL AF+Y+ NGGLDT E+YPY ++ C+++ EN V V I E
Sbjct: 141 IGCHGGLMQNAFQYVMDNGGLDTSESYPYESRNTTCRYNPENSAANVTGFVKIP-ANEYS 199
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPY 276
L AV +V P+S A + F+FY+ G+Y +C ++ +D HAV+ VGYG E DG Y
Sbjct: 200 LMKAVAIVGPISAAIDTKHHSFQFYRGGMYYEPECSSSNLD--HAVLVVGYGEESDGRKY 257
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WG +GY KM + N CGIAT A YP V
Sbjct: 258 WLVKNSWGTYWGMNGYIKMARDRNNNCGIATYAMYPTV 295
>gi|62955291|ref|NP_001017661.1| cathepsin S, b.2 precursor [Danio rerio]
gi|62204682|gb|AAH93339.1| Cathepsin S, b.2 [Danio rerio]
gi|182891354|gb|AAI64362.1| Ctssb.2 protein [Danio rerio]
Length = 330
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 128/215 (59%), Gaps = 5/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
L +R ++ VK+QG CGSCW FS+ G+LE + GK + LS Q LVDC+ + N G
Sbjct: 119 LDWRDKGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG SQAF+Y+ NGG+D+E +YPY G G C++ ++ G E L+
Sbjct: 179 CNGGYMSQAFQYVIDNGGIDSESSYPYQGTQGSCRYDPSQRAANCTSYKFVSQGDEQALK 238
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
A+ + PVSVA + F FY+SGVY C VNH V+AVGYG G YWL+
Sbjct: 239 EALANIGPVSVAIDATRPQFIFYRSGVYDDPSC---TQKVNHGVLAVGYGTLSGQDYWLV 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG +GD GY ++ K NMCGIA+ A YP+V
Sbjct: 296 KNSWGAGFGDGGYIRIARNKNNMCGIASEACYPIV 330
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 131/360 (36%), Positives = 175/360 (48%), Gaps = 74/360 (20%)
Query: 11 VILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYG 70
VIL L A ASA S + VS+ G R + V+ + + + ++G
Sbjct: 2 VILFLAMVAVASAVDMSIISYDEKHGVSTTGGRS-DAEVMSI---------YEAWLVKHG 51
Query: 71 KIYE--SVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------------------- 108
K S+ E RF F NL I N K LSYRLGL
Sbjct: 52 KAQNQNSLVEKDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKME 111
Query: 109 ------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
++ VKDQG CGSCW FST G++E
Sbjct: 112 KKGERRTSQRYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVT 171
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSS 198
G I+LSEQ+LVDC ++N +GCNGGL AFE+I NGG+DT++ YPY G DG C
Sbjct: 172 GDLITLSEQELVDCDTSYN-EGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIR 230
Query: 199 ENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTP 256
+N V +DS ++ +E+ L+ AV +PVSVA E F+ Y SG++ T CG
Sbjct: 231 KNAKVVTIDSYEDVPTYSEESLKKAVAH-QPVSVAIEAGGRAFQLYDSGIFDGT-CGTQ- 287
Query: 257 MDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
++H VVAVGYG E+G YW+++NSWG++WG+ GY KM CGIA SYP+
Sbjct: 288 --LDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKCGIAIEPSYPI 345
>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
Length = 332
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 130/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 122 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSK--NDG 179
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G+E L+
Sbjct: 180 CGGGYMTNAFQYVQENRGIDSEDAYPYIGQDESCMYNPTGKAAKCRGYREIPEGSEKALK 239
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + F+FY GVY C ++NHAV+AVGYG++ G +W+I
Sbjct: 240 RAVARVGPVAVAIDASLSSFQFYSKGVYYDENCNGD--NLNHAVLAVGYGIQRGTKHWII 297
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYP 311
KNSWGE WG+ GY M KN CGIA AS+P
Sbjct: 298 KNSWGEEWGNKGYILMARNKKNACGIANLASFP 330
>gi|111036376|dbj|BAF02517.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 132/208 (63%), Gaps = 7/208 (3%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG+CGSCW FS+TG+LE A + GK ISLSEQQLVDC N GCNGG S
Sbjct: 135 VTEVKNQGNCGSCWAFSSTGALEGALAKKTGKLISLSEQQLVDCTLENGNDGCNGGYMSN 194
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDELQHAVGLVR 227
AF+Y++ + ++ E AYPY DG C++ +E++GV V D +I G E L AV V
Sbjct: 195 AFKYLEGH-SIEPESAYPYRATDGPCRY-NESLGVGSVTDIGDIPEGNETALMEAVATVG 252
Query: 228 PVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
P+S+A + GF FY G+Y S C + + NH V+A+GYG DG PYWL+KNSWG
Sbjct: 253 PISIAIDASTLGFMFYHHGIYKSHWCSSKFL--NHGVLAIGYGKLDGKPYWLVKNSWGSR 310
Query: 287 WGDHGYFKMEMG-KNMCGIATCASYPVV 313
WG GY M NMCG+A+ A +P V
Sbjct: 311 WGMKGYIMMAKDYHNMCGVASLADFPYV 338
>gi|67678376|gb|AAH96862.1| Cathepsin S, b.2 [Danio rerio]
Length = 330
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 128/215 (59%), Gaps = 5/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
L +R ++ VK+QG CGSCW FS+ G+LE + GK + LS Q LVDC+ + N G
Sbjct: 119 LDWRDKGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG SQAF+Y+ NGG+D+E +YPY G G C++ ++ G E L+
Sbjct: 179 CNGGYMSQAFQYVIDNGGIDSESSYPYQGTQGSCRYDPSQRAANCTSYKFVSQGDEQALK 238
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
A+ + PVSVA + F FY+SGVY C VNH V+AVGYG G YWL+
Sbjct: 239 EALANIGPVSVAIDATRPQFIFYRSGVYDDPSC---TQKVNHGVLAVGYGTLSGQDYWLV 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG +GD GY ++ K NMCGIA+ A YP+V
Sbjct: 296 KNSWGAGFGDGGYIRIARNKNNMCGIASEACYPIV 330
>gi|119573900|gb|EAW53515.1| cathepsin K (pycnodysostosis), isoform CRA_a [Homo sapiens]
Length = 288
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 78 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 135
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 136 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 195
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 196 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 253
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 254 KNSWGENWGNKGYILMARNKNNACGIANLASFP 286
>gi|253722774|pdb|1CJL|A Chain A, Crystal Structure Of A Cysteine Protease Proform
Length = 312
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGS W FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 95 RSVDWREKGYVTPVKNQGQCGSSWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPEGN 154
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 155 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDAGFVDIPK-QEKA 213
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E DG
Sbjct: 214 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDG 271
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 272 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 312
>gi|261824899|pdb|3H89|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824900|pdb|3H89|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824901|pdb|3H89|C Chain C, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824902|pdb|3H89|D Chain D, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824903|pdb|3H89|E Chain E, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824904|pdb|3H89|F Chain F, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824905|pdb|3H8B|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824906|pdb|3H8B|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824907|pdb|3H8B|C Chain C, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824908|pdb|3H8B|D Chain D, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824909|pdb|3H8B|E Chain E, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824910|pdb|3H8B|F Chain F, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|317455049|pdb|2XU3|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455050|pdb|2XU4|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455051|pdb|2XU5|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009432|pdb|2YJ2|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009433|pdb|2YJ8|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009434|pdb|2YJ9|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009435|pdb|2YJB|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009436|pdb|2YJC|A Chain A, Cathepsin L With A Nitrile Inhibitor
Length = 220
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 3 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 63 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKA 121
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 122 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 179
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 180 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 176/354 (49%), Gaps = 81/354 (22%)
Query: 18 AAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVE 77
AAA S ++D+++ + + D E + L F + +GK Y ++
Sbjct: 17 AAATDMSIITYDETHAVGFKTDD-----EATTL-----------FESWLVTHGKSYNALG 60
Query: 78 EMKLRFATFSKNLDLIRSTN-CKGLSYRLGLN---------------------------- 108
E + RF F NL I N + ++LGLN
Sbjct: 61 EEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVSA 120
Query: 109 ------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
++ VKDQG CGSCW FST ++E A GK I+L
Sbjct: 121 KSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITL 180
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQ 204
SEQ+LVDC +++N +GCNGGL AFE+I NGG+DT+ YPYTG+DG C +N V
Sbjct: 181 SEQELVDCDRSYN-EGCNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVV 239
Query: 205 VLDSVNITLGAEDELQ-HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHA 262
+DS + A DEL +P+SVA E F+FY SG+++ KCG + ++H
Sbjct: 240 TIDSYE-DVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGIFTG-KCG---IALDHG 294
Query: 263 VVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
VV VGYG E+G YW+++NSWG +WG++GY +ME G +CGIA SYPV
Sbjct: 295 VVVVGYGTENGKDYWIVRNSWGADWGENGYLRMERGISSKTGICGIAIEPSYPV 348
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 138/219 (63%), Gaps = 13/219 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +RL + P+KDQG+CGSCW FST ++E + G+ +SLSEQ+LVDC + ++ +G
Sbjct: 129 VDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYD-EG 187
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AF++I NGG+DTEE YPY G DG C + + V +D ++ E+ L
Sbjct: 188 CNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDVPSNNENAL 247
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ AV +PVSVA E + Y+SGV++ KCG ++H VV VGYG E+GV YWL
Sbjct: 248 KKAVSH-QPVSVAIEASGRALQLYQSGVFTG-KCGTA---LDHGVVVVGYGTENGVDYWL 302
Query: 279 IKNSWGENWGDHGYFKME-----MGKNMCGIATCASYPV 312
++NSWG WG+ GYFKME + CGIA SYPV
Sbjct: 303 VRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|47522698|ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa]
gi|2499874|sp|Q28944.1|CATL1_PIG RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|1468964|dbj|BAA07140.1| porcine cathepsin L [Sus scrofa]
gi|15027272|emb|CAC44793.1| cathepsin L [Sus scrofa]
Length = 334
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/222 (46%), Positives = 135/222 (60%), Gaps = 10/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++ N
Sbjct: 116 KSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAED 217
QGCNGGL AF+Y+K NGGLDTEE+YPY G++ C + E V+I E
Sbjct: 176 QGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ-REK 234
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L AV V P+SVA + F+FYKSG+Y C + D++H V+ VGYG E +
Sbjct: 235 ALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSN 292
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG +GY KM + N CGI+T ASYP V
Sbjct: 293 SSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
>gi|149510440|ref|XP_001518002.1| PREDICTED: cathepsin K-like [Ornithorhynchus anatinus]
Length = 618
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 130/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + G+ + LS Q LVDC + N G
Sbjct: 408 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGRLLDLSPQNLVDCVAS--NDG 465
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y+ N G+D+E+AYPY G+D C++S + + +G E L+
Sbjct: 466 CGGGYMTNAFQYVHDNRGIDSEDAYPYVGQDEPCRYSPTGKAAKCRGYREVPVGDEKALK 525
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + F+FY GVY C ++NHA++AVGYG + G +W+I
Sbjct: 526 RAVARVGPVAVAIDASLSSFQFYSKGVYFDENCNGA--NLNHALLAVGYGAQKGAKHWII 583
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE WG+ GY M K N CGIA+ AS+P
Sbjct: 584 KNSWGEEWGNKGYVLMARNKNNACGIASLASFP 616
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 138/219 (63%), Gaps = 13/219 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +RL + P+KDQG+CGSCW FST ++E + G+ +SLSEQ+LVDC + ++ +G
Sbjct: 129 VDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYD-EG 187
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AF++I NGG+DTEE YPY G DG C + + V +D ++ E+ L
Sbjct: 188 CNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDVPSNNENAL 247
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ AV +PVSVA E + Y+SGV++ KCG ++H VV VGYG E+GV YWL
Sbjct: 248 KKAVSH-QPVSVAIEASGRALQLYQSGVFTG-KCGTA---LDHGVVVVGYGTENGVDYWL 302
Query: 279 IKNSWGENWGDHGYFKME-----MGKNMCGIATCASYPV 312
++NSWG WG+ GYFKME + CGIA SYPV
Sbjct: 303 VRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|75765285|pdb|1U9V|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abe854
gi|75765286|pdb|1U9W|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abi491
gi|75765287|pdb|1U9X|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abj688
gi|160286063|pdb|2R6N|A Chain A, Crystal Structure Of A Pyrrolopyrimidine Inhibitor In
Complex With Human Cathepsin K
Length = 217
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 7 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 64
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 65 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 124
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 125 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 182
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 183 KNSWGENWGNKGYILMARNKNNACGIANLASFP 215
>gi|241913450|pdb|3HHA|A Chain A, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913451|pdb|3HHA|B Chain B, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913452|pdb|3HHA|C Chain C, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913453|pdb|3HHA|D Chain D, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|317455045|pdb|2XU1|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455046|pdb|2XU1|B Chain B, Cathepsin L With A Nitrile Inhibitor
gi|317455047|pdb|2XU1|C Chain C, Cathepsin L With A Nitrile Inhibitor
gi|317455048|pdb|2XU1|D Chain D, Cathepsin L With A Nitrile Inhibitor
Length = 220
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 3 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 63 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDAGFVDIPK-QEKA 121
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 122 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 179
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 180 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|93279455|pdb|2F7D|A Chain A, A Mutant Rabbit Cathepsin K With A Nitrile Inhibitor
Length = 215
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 5 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 62
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 63 CGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALK 122
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 123 RAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSD--NLNHAVLAVGYGIQKGNKHWII 180
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY M K N CGIA AS+P
Sbjct: 181 KNSWGESWGNKGYILMARNKNNACGIANLASFP 213
>gi|449681105|ref|XP_002158608.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 339
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/260 (40%), Positives = 145/260 (55%), Gaps = 12/260 (4%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KGLSYRLGLNISPVKDQG 116
+S F + YG + KL +K + +N + +R ++ VK+QG
Sbjct: 86 MSHEEFRKMYGGCF------KLSKKNVTKGSIFLSPSNVVIPDSVDWRTEGYVTRVKNQG 139
Query: 117 HCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYN 176
CGSCW FS+TG+LE + G +SEQ LVDC Q++ N+ CNGG AF YIK N
Sbjct: 140 QCGSCWAFSSTGALEGQTFRKTGVLQEISEQNLVDCTQSYGNEACNGGWMDNAFTYIKDN 199
Query: 177 GGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV 235
G+D+E YPY + G C ++ + V+I G E+ L+ AV V P+SVA +
Sbjct: 200 KGIDSEVGYPYYARALGYCYYNQQYNVASDTGFVDIPSGDENALKVAVATVGPISVAIDA 259
Query: 236 VDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFK 294
F Y+SGVY+ CGN +++HAV+ VGYG E+G +W++KNSW WGD GY K
Sbjct: 260 TKASFMSYQSGVYNEPTCGNGIENLDHAVLVVGYGTEEGRDFWIVKNSWDTTWGDQGYIK 319
Query: 295 MEMG-KNMCGIATCASYPVV 313
M N CGIAT ASYP+V
Sbjct: 320 MSRNMSNQCGIATKASYPIV 339
>gi|432108215|gb|ELK33129.1| Cathepsin L1 [Myotis davidii]
Length = 334
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/212 (47%), Positives = 130/212 (61%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TGSLE + GK +SLSEQ LVDC++A N+GCNGGL
Sbjct: 126 VTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y+K N GLDTEE+YPY ++ C + E V+I E L AV V
Sbjct: 186 AFQYVKDNKGLDTEESYPYLARESNTCNYRPEYSAANDTGFVDIPQ-REKALLKAVATVG 244
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV----PYWLIKNS 282
P+SVA + F+FY +G+Y C + D++H V+ VGYG E G +W++KNS
Sbjct: 245 PISVAIDAGHSSFQFYNAGIYYEPNC--SSKDLDHGVLVVGYGSEGGESKNNKFWIVKNS 302
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG WG +GY KM + N CGIAT ASYP V
Sbjct: 303 WGSGWGMNGYVKMARDQSNHCGIATAASYPTV 334
>gi|28932704|gb|AAO60046.1| midgut cysteine proteinase 3 [Rhipicephalus appendiculatus]
Length = 334
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 107/217 (49%), Positives = 131/217 (60%), Gaps = 5/217 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FSTTGSLE + + K +SLSEQ LVDC Q N
Sbjct: 121 KTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKMRKLVSLSEQNLVDCMQKLGN 180
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GGL AF+YIK N G+DTE +YPY DGVC F VG +I E+
Sbjct: 181 NGCGGGLMDNAFKYIKANKGIDTELSYPYNATDGVCHFKKSGVGATATGFEDIPARDENS 240
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
AV V PVSVA + + F+FY GV +C + +D H V+ VGYG +DG YW
Sbjct: 241 WD-AVAPVGPVSVAIDASHESFQFYSEGVLDEPECSSDQLD--HGVLVVGYGTKDGQDYW 297
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG WGD GY M K N CGIA+ ASYP+V
Sbjct: 298 LVKNSWGTTWGDEGYIYMTRNKDNQCGIASSASYPLV 334
>gi|2914594|pdb|1MEM|A Chain A, Crystal Structure Of Cathepsin K Complexed With A Potent
Vinyl Sulfone Inhibitor
gi|28374044|pdb|1NL6|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374045|pdb|1NL6|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374047|pdb|1NLJ|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374048|pdb|1NLJ|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|47168617|pdb|1Q6K|A Chain A, Cathepsin K Complexed With T-butyl(1s)-1-cyclohexyl-2-
Oxoethylcarbamate
gi|55670045|pdb|1TU6|A Chain A, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|55670046|pdb|1TU6|B Chain B, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|62738654|pdb|1YK7|A Chain A, Cathepsin K Complexed With A Cyanopyrrolidine Inhibitor
gi|73535690|pdb|1YK8|A Chain A, Cathepsin K Complexed With A Cyanamide-Based Inhibitor
gi|73535721|pdb|1YT7|A Chain A, Cathepsin K Complexed With A Constrained Ketoamide
Inhibitor
gi|93278849|pdb|2BDL|A Chain A, Cathepsin K Complexed With A Pyrrolidine Ketoamide-Based
Inhibitor
gi|114793438|pdb|2ATO|A Chain A, Crystal Structure Of Human Cathepsin K In Complex With
Myocrisin
gi|114793448|pdb|2AUX|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|114793451|pdb|2AUZ|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|126030469|pdb|2FTD|A Chain A, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
Substituted Azepan-3-One Compound
gi|126030470|pdb|2FTD|B Chain B, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
Substituted Azepan-3-One Compound
gi|157830076|pdb|1ATK|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor E-64
gi|157830085|pdb|1AU0|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Symmetric Diacylaminomethyl
Ketone Inhibitor
gi|157830086|pdb|1AU2|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Propanone Inhibitor
gi|157830087|pdb|1AU3|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Pyrrolidinone Inhibitor
gi|157830088|pdb|1AU4|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Pyrrolidinone Inhibitor
gi|157830146|pdb|1AYU|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Symmetric Biscarbohydrazide
Inhibitor
gi|157830147|pdb|1AYV|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Thiazolhydrazide Inhibitor
gi|157830148|pdb|1AYW|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent
Benzyloxybenzoylcarbohydrazide Inhibitor
gi|157830300|pdb|1BGO|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Peptidomimetic Inhibitor
gi|197305045|pdb|3C9E|A Chain A, Crystal Structure Of The Cathepsin K : Chondroitin Sulfate
Complex.
gi|290560385|pdb|3KW9|A Chain A, X-Ray Structure Of Cathepsin K Covalently Bound To A
Triazine Ligand
gi|290560386|pdb|3KWZ|A Chain A, Cathepsin K In Complex With A Non-Selective 2-Cyano-
Pyrimidine Inhibitor
gi|290560387|pdb|3KX1|A Chain A, Cathepsin K In Complex With A Selective 2-Cyano-Pyrimidine
Inhibitor
gi|293651910|pdb|3KWB|X Chain X, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|293651911|pdb|3KWB|Y Chain Y, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|308198615|pdb|3O1G|A Chain A, Cathepsin K Covalently Bound To A 2-Cyano Pyrimidine
Inhibitor With A Benzyl P3 Group.
gi|327200584|pdb|3O0U|A Chain A, Cathepsin K Covalently Bound To A Cyano-Pyrimidine
Inhibitor With Improved Selectivity Over Herg
gi|394986262|pdb|4DMX|A Chain A, Cathepsin K Inhibitor
gi|394986263|pdb|4DMY|A Chain A, Cathepsin K Inhibitor
gi|394986264|pdb|4DMY|B Chain B, Cathepsin K Inhibitor
Length = 215
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 5 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 62
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 63 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 122
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 123 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 180
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 181 KNSWGENWGNKGYILMARNKNNACGIANLASFP 213
>gi|315364648|pdb|3OVZ|A Chain A, Cathepsin K In Complex With A Covalent Inhibitor With A
Ketoamide Warhead
Length = 213
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 3 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 60
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 61 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 120
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 121 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 178
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 179 KNSWGENWGNKGYILMARNKNNACGIANLASFP 211
>gi|50513589|pdb|1SNK|A Chain A, Cathepsin K Complexed With Carbamate Derivatized
Norleucine Aldehyde
Length = 214
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 4 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 61
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 62 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 121
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 122 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 179
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 180 KNSWGENWGNKGYILMARNKNNACGIANLASFP 212
>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
Length = 329
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 130/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVS--ENYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ NGG+D+E+AYPY G+D C +++ + I +G E L+
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSV+ + + F+FY GVY C +VNHAV+ VGYG + G YW+I
Sbjct: 237 RAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDRD--NVNHAVLVVGYGTQKGNKYWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY + K N CGI AS+P
Sbjct: 295 KNSWGESWGNKGYVLLARNKNNACGITNLASFP 327
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/315 (37%), Positives = 160/315 (50%), Gaps = 67/315 (21%)
Query: 59 ALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---------- 108
A FA +A ++GK+Y + EE RF + NL+ I+ + K LSY LGL
Sbjct: 42 AGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEKNLSYWLGLTKFADLTNEEF 101
Query: 109 -------------------------------------------ISPVKDQGHCGSCWTFS 125
++ VKDQG CGSCW FS
Sbjct: 102 RRQYTGTRIDRSRRLKKGRNATGSFRYANSEAPKSIDWREKGAVTSVKDQGSCGSCWAFS 161
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
GS+E G ISLS Q+LVDC + +N QGCNGGL AF+++ NGG+DTE+ Y
Sbjct: 162 AVGSVEGINAIRTGDAISLSVQELVDCDKKYN-QGCNGGLMDYAFDFVIQNGGIDTEKDY 220
Query: 186 PYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYK 243
PY G DG C + N V +DS ++ E+ L+ AV +PVSVA E F+ Y
Sbjct: 221 PYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVA-GQPVSVAIEAGGRDFQLYS 279
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM------ 297
GV++ +CG D++H V+AVGYG E G+ YW++KNSWGE WG+ GY +M+
Sbjct: 280 GGVFTG-RCG---TDLDHGVLAVGYGSEKGLDYWIVKNSWGEYWGESGYLRMQRNLKDDN 335
Query: 298 GKNMCGIATCASYPV 312
G +CGI SY V
Sbjct: 336 GYGLCGINIEPSYAV 350
>gi|346574377|gb|AEO36960.1| silicatein-alpha 3 [Baikalospongia fungiformis]
Length = 324
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 103/246 (41%), Positives = 148/246 (60%), Gaps = 12/246 (4%)
Query: 78 EMKLRFATF--SKNLDLIRSTNCKGLSYRLGLN------ISPVKDQGHCGSCWTFSTTGS 129
E RF T S+ L + KG++Y L+ ++ V+ QG CGS + F+ G+
Sbjct: 81 EFTERFLTHKHSQRSGLQTFESPKGVTYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGA 140
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
LE A A K ++LSEQ ++DC+ + N GC+GG AF+Y+ NGG+DTE +YPY G
Sbjct: 141 LEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTESSYPYKG 200
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYS 248
K C+++S+NVG V I G+E +L AV V P++VA + V+ F FY+SGV+
Sbjct: 201 KQSSCQYNSKNVGAISTGVVKIASGSETDLLSAVASVGPIAVAVDASVNAFMFYQSGVFD 260
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
S+ C + + NHA++ GYG +G YWL+KNSWG WG+ GY KM K N CGIA+
Sbjct: 261 SSTCSTSKL--NHAMLVTGYGSTNGKDYWLVKNSWGTGWGESGYIKMVRNKYNQCGIASD 318
Query: 308 ASYPVV 313
A YP++
Sbjct: 319 ALYPML 324
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 107/262 (40%), Positives = 147/262 (56%), Gaps = 11/262 (4%)
Query: 60 LSFARFA----RRYGKIYESVE-EMKLRFATFSKN-LDLIRSTNCKGLSYRLGLNISPVK 113
L RFA Y K Y + + LR N L+ R T + +R ++ VK
Sbjct: 76 LGLNRFADLTNEEYKKTYLGMSINVNLRANQVPMNGLNFERFTGPSSIDWRQNGAVAYVK 135
Query: 114 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 173
DQGHCGSCW F+TTG++E A+ G ++ SEQ LVDC+ + N GC+GGL + AF+YI
Sbjct: 136 DQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYI 195
Query: 174 KYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAF 233
N G+ TEEAYPYT C +++ +G + ++ G+E L A+ +PV+VA
Sbjct: 196 IDNDGIATEEAYPYTATQNRCVYNTTMLGTAISGYKDVPRGSESALTAAIS-KQPVAVAI 254
Query: 234 EVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGY 292
+ F+ YKSGVY C + +NH V+AVGYG +G Y+++KNSW E WG+ GY
Sbjct: 255 DASPITFQLYKSGVYQEATC--SSYRLNHGVLAVGYGTLEGKDYYIVKNSWAETWGNQGY 312
Query: 293 FKM-EMGKNMCGIATCASYPVV 313
M N CGIAT ASY V
Sbjct: 313 ILMARNANNHCGIATMASYASV 334
>gi|195123219|ref|XP_002006105.1| GI20850 [Drosophila mojavensis]
gi|193911173|gb|EDW10040.1| GI20850 [Drosophila mojavensis]
Length = 329
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 132/208 (63%), Gaps = 7/208 (3%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQ-AFNNQGCNGGLPS 167
++PVK+QG CG+CW+F+ TG+LE + GK +SLSEQ LVDC+ + N+GCNGG+P
Sbjct: 126 VTPVKNQGKCGACWSFAATGTLEGMHFLKTGKLVSLSEQNLVDCSTIRYFNRGCNGGMPF 185
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
+A +Y++ NGG+DTE +Y Y K C++ ++G QV D V + G E L AV
Sbjct: 186 RALKYVRDNGGIDTEYSYTYEAKQLSCRYDPLHIGAQVTDVVRVAAG-EPHLAVAVASKG 244
Query: 228 PVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
P+SV + FR Y+ GV + +C NHAV+ VG+G + G +WL+KNSWG +
Sbjct: 245 PISVGIHASNNFRNYRDGVLNDRQCNKA---ANHAVLVVGFGRDPQGGDFWLVKNSWGAS 301
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WGD GY +M + N CGIA+ A YP+V
Sbjct: 302 WGDGGYIRMSRNRSNQCGIASNAVYPLV 329
>gi|312386083|gb|ADQ74586.1| silicatein alpha 3 [Lubomirskia baicalensis]
Length = 330
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 96/220 (43%), Positives = 137/220 (62%), Gaps = 4/220 (1%)
Query: 96 TNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQA 155
T L +R ++ V+ QG CGS + F+ G+LE A A K ++LSEQ ++DC+
Sbjct: 113 TYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVP 172
Query: 156 FNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGA 215
+ N GC+GG AF+Y+ NGG+DTE +YPY GK C+++S+NVG V I G+
Sbjct: 173 YGNHGCSGGDVYTAFKYVVDNGGIDTESSYPYKGKKSSCQYNSKNVGAISTGVVKIASGS 232
Query: 216 EDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV 274
E +L AV V P++VA + V+ F FY+SGV+ S+ C + + NHA++ GYG +G
Sbjct: 233 ETDLLSAVASVGPIAVAVDASVNAFMFYQSGVFDSSTCSTSKL--NHAMLVTGYGSTNGK 290
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG+ GY KM K N CGIA+ A YP++
Sbjct: 291 DYWLVKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 330
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 150/272 (55%), Gaps = 33/272 (12%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------- 108
++GK Y ++ E + RF F NL I N + +Y++
Sbjct: 10 KHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKISDRYAFRVGDSLPESVDWRKKG 69
Query: 109 -ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPS 167
+ VKDQG CGSCW FST ++E G ISLSEQ+LVDC ++N +GCNGGL
Sbjct: 70 AVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYN-EGCNGGLMD 128
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLV 226
AFE+I NGG+D+EE YPY DG C +N V +D ++ E L+ AV
Sbjct: 129 YAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVA-N 187
Query: 227 RPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGE 285
+PVSVA E F+ Y+SG+++ +CG ++H V AVGYG E+GV YW++KNSWG
Sbjct: 188 QPVSVAIEAGGREFQLYQSGIFTG-RCGTA---LDHGVTAVGYGTENGVDYWIVKNSWGA 243
Query: 286 NWGDHGYFKMEM-----GKNMCGIATCASYPV 312
+WG+ GY +ME CGIA ASYP+
Sbjct: 244 SWGEEGYIRMERDLATSATGKCGIAMEASYPI 275
>gi|1809288|gb|AAC47721.1| secreted cathepsin L 2 [Fasciola hepatica]
Length = 326
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 104/257 (40%), Positives = 143/257 (55%), Gaps = 10/257 (3%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKN-LDLIRSTNCKGLSYRLGLNISPVKDQGHC 118
L+F F +Y E+ R F N L + S + + Y ++ VK+QG C
Sbjct: 75 LTFEEFKAKYLIEIPRSSELLSRGIPFKANKLAVPESIDWRDYYY-----VTEVKNQGQC 129
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCW FSTTG++E + + S SEQQLVDC + N GC GG A+EY+K+N G
Sbjct: 130 GSCWAFSTTGAVEGQFRKNERASASFSEQQLVDCPRDLGNYGCGGGYMENAYEYLKHN-G 188
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG 238
L+TE YPY +G C++ +V + G E EL++ VG P +VA +
Sbjct: 189 LETESYYPYQAVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEGPAAVALDADSD 248
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F Y+SG+Y S C P + HAV+AVGYG +DG YW++KNSWG WG+ GY +
Sbjct: 249 FMMYQSGIYQSQTC--LPDRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWGEDGYIRFARN 306
Query: 299 K-NMCGIATCASYPVVA 314
+ NMCGIA+ AS P+VA
Sbjct: 307 RGNMCGIASLASVPMVA 323
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 93/207 (44%), Positives = 127/207 (61%), Gaps = 5/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS G+LE + G+ SLS Q LVDC+ + N+GCNGG +Q
Sbjct: 126 VTEVKNQGSCGSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQ 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y+ +GG+D++EAYPYT DG C++ ++ G E+ L+ AV + P
Sbjct: 186 AFQYVIDDGGIDSDEAYPYTAMDGQCRYDQSQRAANCSSYNYVSEGDEEALKQAVATIGP 245
Query: 229 VSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F Y SGVYS C +VNH V+ VGYG +G YWL+KNSWG +
Sbjct: 246 ISVAIDATRPMFILYHSGVYSDPTC---TQNVNHGVLVVGYGSLNGEDYWLVKNSWGTRF 302
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
GD GY ++ K NMCGIA A YP++
Sbjct: 303 GDGGYIRIARNKGNMCGIANYACYPLM 329
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 136/216 (62%), Gaps = 6/216 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+R ++ VK+QG CGSCW+FSTTGS E A G+ +SLSEQ L+DC+ ++ N G
Sbjct: 118 FDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPY-TGKDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
CNGGL AFEYI N G+DTE +YPY T C++++ N G + ++T G E+ L
Sbjct: 178 CNGGLMDYAFEYIINNRGIDTEASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENAL 237
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+A + PVSVA + + F+FY GVY + C +T +D H V+ VG+G E+G +W
Sbjct: 238 LNAA-VKEPVSVAIDASHNSFQFYSGGVYYESACSSTQLD--HGVLVVGWGSENGQDFWW 294
Query: 279 IKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+KNSWG +WG +GY KM + N CGIAT ASYP
Sbjct: 295 VKNSWGASWGLNGYIKMSRNQNNNCGIATAASYPTA 330
>gi|94448668|emb|CAI91572.1| silicatein a3 [Lubomirskia baicalensis]
Length = 344
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 96/220 (43%), Positives = 137/220 (62%), Gaps = 4/220 (1%)
Query: 96 TNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQA 155
T L +R ++ V+ QG CGS + F+ G+LE A A K ++LSEQ ++DC+
Sbjct: 127 TYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVP 186
Query: 156 FNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGA 215
+ N GC+GG AF+Y+ NGG+DTE +YPY GK C+++S+NVG V I G+
Sbjct: 187 YGNHGCSGGDVYTAFKYVVDNGGIDTESSYPYKGKKSSCQYNSKNVGAISTGVVKIASGS 246
Query: 216 EDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV 274
E +L AV V P++VA + V+ F FY+SGV+ S+ C + + NHA++ GYG +G
Sbjct: 247 ETDLLSAVASVGPIAVAVDASVNAFMFYQSGVFDSSTCSTSKL--NHAMLVTGYGSTNGK 304
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG+ GY KM K N CGIA+ A YP++
Sbjct: 305 DYWLVKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 344
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 115/305 (37%), Positives = 155/305 (50%), Gaps = 66/305 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------- 108
++GK Y ++ E + RF F NL I N +Y++GLN
Sbjct: 60 KHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGLNRFADLTNEEYRSRYLGRRD 119
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
+ PVKDQG+CGSCW FST ++E
Sbjct: 120 ETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIAAVEGIN 179
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
A G ISLSEQ+LVDC +++N QGCNGGL AFE+I NGG+D+EE YPY D C
Sbjct: 180 QIATGDLISLSEQELVDCDKSYN-QGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADTTC 238
Query: 195 KFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKC 252
+ +N V +D ++ E L+ AV +PVSVA E F+ Y+SGV++ +C
Sbjct: 239 DPNRKNARVVSIDGYEDVPQNDERSLKKAVA-NQPVSVAIEAGGRAFQLYQSGVFTG-QC 296
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-----KNMCGIATC 307
G ++H VVAVGYG E+ V YW+++NSWG NWG+ GY K+E CGIA
Sbjct: 297 GTQ---LDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGIAIE 353
Query: 308 ASYPV 312
SYP+
Sbjct: 354 PSYPI 358
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 111/274 (40%), Positives = 154/274 (56%), Gaps = 23/274 (8%)
Query: 56 ARHALSFARFA----RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG------LSYRL 105
A + L +F Y +Y +R +KN++ S G + +RL
Sbjct: 94 ATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAKAKNVNQKYSAAVDGKEVPETVDWRL 153
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++P+KDQG CGSCW FST ++E G+ ISLSEQ+LVDC ++N QGCNGGL
Sbjct: 154 KGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYN-QGCNGGL 212
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVG 224
AF++I NGGL TE+ YPY G G C +N V +D ++ E L+ A+
Sbjct: 213 MDYAFQFIMKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAIS 272
Query: 225 LVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
L +PVSVA E F+ Y++G+++ GN +++HAVVAVGYG E+GV YW+++NSW
Sbjct: 273 L-QPVSVAIEAGGRIFQHYQTGIFT----GNCGTNLDHAVVAVGYGSENGVDYWIVRNSW 327
Query: 284 GENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
G WG+ GY +ME CGIA ASYPV
Sbjct: 328 GPRWGEEGYIRMERNLASSKSGKCGIAVEASYPV 361
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 112/315 (35%), Positives = 161/315 (51%), Gaps = 66/315 (20%)
Query: 58 HALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------- 108
+ + F F +YGK+Y + E +RF F N+D+I +TN + L++ LG+N
Sbjct: 23 YMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEE 82
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++PVK+QG CGSCW+FSTT
Sbjct: 83 FAASYTGLKPASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTT 142
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE A+ + G +SLSEQQ DC + GCNGG AF + K N + TE +YPY
Sbjct: 143 GALEGAWALSTGNLVSLSEQQFEDCDT--TDSGCNGGWMDNAFSFAKKN-SICTEGSYPY 199
Query: 188 TGKDGVCKFSSENVGVQ---VLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYK 243
T DG C S VG+ V+ +++ +E + AV +PVS+A E F+ Y
Sbjct: 200 TATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQ-QPVSIAIEADQYSFQLYS 258
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCG 303
SGV +++ CG ++H V+AVGYG E G YW +KNSWG +WG+ GY +++ GK G
Sbjct: 259 SGVLTAS-CGT---RLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGGAG 314
Query: 304 ----IATCASYPVVA 314
+A SYPVV+
Sbjct: 315 ECGLLAGPPSYPVVS 329
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 156/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +G+ Y +V E + R+ F NL I + N S+RLGLN
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ VKDQG CGSCW FST
Sbjct: 106 YRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFSTI 165
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N QGCNGGL AFE+I NGG+DTE+ YPY
Sbjct: 166 AAVEGINQIVTGDLISLSEQELVDCDTSYN-QGCNGGLMDYAFEFIINNGGIDTEKDYPY 224
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSG 245
G DG C + +N V +DS ++ E LQ AV +PVSVA E F+ Y SG
Sbjct: 225 KGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVA-NQPVSVAIEAAGTAFQLYSSG 283
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ + CG ++H V AVGYG E+G YW++KNSWG +WG+ GY +ME
Sbjct: 284 IFTGS-CGTA---LDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGK 339
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 340 CGIAVEPSYPL 350
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 113/315 (35%), Positives = 155/315 (49%), Gaps = 67/315 (21%)
Query: 57 RHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG-LSYRLGLN------- 108
RH + A+ YG++Y+ E + RF F N++ I S N G Y+L +N
Sbjct: 37 RHEMWMAK----YGRVYKDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDINEFADLTN 92
Query: 109 -------------------------------------------ISPVKDQGHCGSCWTFS 125
++P+KDQG CG CW FS
Sbjct: 93 EEFKVSKNGYKRSSGVGLTEKSSFRYANVTAVPTSMDWRQNGAVTPIKDQGQCGCCWAFS 152
Query: 126 TTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY 185
++E + GK ISLSEQ+LVDC + +QGC GGL AFE+IK NGGL TE Y
Sbjct: 153 AVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANY 212
Query: 186 PYTGKDGVCKFSSE-NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYK 243
PY G DG C + N ++ ++ +ED L AV +PVSVA + F+FY
Sbjct: 213 PYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVA-SQPVSVAIDASGSAFQFYS 271
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG---- 298
GV++ G+ +++H V AVGYG +DG YWL+KNSWG +WG+ GY +ME
Sbjct: 272 GGVFT----GDCGTELDHGVTAVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERDIEAK 327
Query: 299 KNMCGIATCASYPVV 313
+ +CGIA SYP
Sbjct: 328 EGLCGIAMQPSYPTA 342
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 118/324 (36%), Positives = 159/324 (49%), Gaps = 60/324 (18%)
Query: 43 RDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLI-RSTNCKGL 101
RD TS G+ + ++ +G+ Y+ E RF F N D + RS G
Sbjct: 31 RDLSTST-GGYGEEAMKVRHQQWMAEHGRTYKDEAEKARRFQVFKANADFVDRSNAAGGK 89
Query: 102 SYRLGLN----------------------------------------------------I 109
SY L +N +
Sbjct: 90 SYELAINEFADMTNDEFVAMYTGLKPVPAGPKKMAGFKYENLTLSDVDQQAVDWRQKGAV 149
Query: 110 SPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQA 169
+ +K+QG CG CW F+ ++E+ + G +SLSEQQ++DC NN GCNGG A
Sbjct: 150 TGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQVLDCDTDGNN-GCNGGYIDNA 208
Query: 170 FEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPV 229
F+YI NGGL TE+AYPY G C+ SS V + ++ G E L AV +PV
Sbjct: 209 FQYIISNGGLATEDAYPYAAAQGTCQ-SSVQPAVTISSYQDVPSGDEAALAAAV-ANQPV 266
Query: 230 SVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWG 288
+VA + + F+FY SGV ++ CG TP +NHAV AVGY EDG PYWL+KN WG+NWG
Sbjct: 267 AVAIDAHNNFQFYSSGVLTADTCG-TP-SLNHAVTAVGYSTAEDGTPYWLLKNQWGQNWG 324
Query: 289 DHGYFKMEMGKNMCGIATCASYPV 312
+ GY ++E G N CG+A ASYPV
Sbjct: 325 EGGYLRVERGTNACGVAQQASYPV 348
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 156/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +G+ Y +V E + R+ F NL I + N S+RLGLN
Sbjct: 41 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 100
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ VKDQG CGSCW FST
Sbjct: 101 YRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFSTI 160
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N QGCNGGL AFE+I NGG+DTE+ YPY
Sbjct: 161 AAVEGINQIVTGDLISLSEQELVDCDTSYN-QGCNGGLMDYAFEFIINNGGIDTEKDYPY 219
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSG 245
G DG C + +N V +DS ++ E LQ AV +PVSVA E F+ Y SG
Sbjct: 220 KGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVA-NQPVSVAIEAAGTAFQLYSSG 278
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ + CG ++H V AVGYG E+G YW++KNSWG +WG+ GY +ME
Sbjct: 279 IFTGS-CGTA---LDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGK 334
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 335 CGIAVEPSYPL 345
>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 132/212 (62%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++ N+GCNGGL
Sbjct: 126 VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+Y++ NGGLD+EE+YPY D C + E V+I E L AV V
Sbjct: 186 AFQYVQDNGGLDSEESYPYLATDTHTCNYKPECSAANDTGFVDIPQ-REKALMKAVATVG 244
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNS 282
P+SVA + + F+FYKSG+Y C + D++H V+ VGYG E + +W++KNS
Sbjct: 245 PISVAIDAGHESFQFYKSGIYYEPGC--SSKDLDHGVLLVGYGFEGKDSENNKFWIVKNS 302
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG +WG +GY KM + N CGIAT ASYP V
Sbjct: 303 WGTSWGTNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|208972988|dbj|BAG74343.1| silicatein-M2 [Ephydatia fluviatilis]
Length = 326
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 138/218 (63%), Gaps = 4/218 (1%)
Query: 98 CKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFN 157
+ + +R ++ VK QG CG+ + F+ TG+LE A A K ++LSEQ ++DC+ +
Sbjct: 111 AESIDWRTKGAVTSVKYQGQCGASYAFAATGALEGASALANDKQVTLSEQNIIDCSVPYG 170
Query: 158 NQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAED 217
N GC+GG AF+Y+ NGG+DTE +Y + GK C+++++ G V+I G+E+
Sbjct: 171 NHGCSGGDTYTAFKYVIDNGGIDTESSYSFKGKQSSCQYNNKTSGASATGVVSIAYGSEN 230
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
+L AV V PV+VA + + FRFY+SGV+ S+ C +T + NHA++ GYG +G Y
Sbjct: 231 DLLAAVATVGPVAVAIDANTNAFRFYQSGVFDSSSCSSTKL--NHAMLVTGYGSYNGKDY 288
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSW +NWGD GY M K N CGIA+ A YP++
Sbjct: 289 WLVKNSWSKNWGDSGYILMVRNKYNQCGIASDALYPML 326
>gi|75060921|sp|Q5E998.1|CATL2_BOVIN RecName: Full=Cathepsin L2; Flags: Precursor
gi|59858409|gb|AAX09039.1| cathepsin L2 preproprotein [Bos taurus]
Length = 334
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 130/212 (61%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++A NQGCNGGL
Sbjct: 126 VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK NG LD+EE+YPY D C + E V+I E L AV V
Sbjct: 186 AFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-REKALMKAVATVG 244
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNS 282
P+SVA + F+FYKSG+Y C + D++H V+ VGYG E + +W++KNS
Sbjct: 245 PISVAIDAGHTSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNS 302
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG WG +GY KM + N CGIAT ASYP V
Sbjct: 303 WGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|390476660|ref|XP_003735160.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin K [Callithrix jacchus]
Length = 329
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 103/247 (41%), Positives = 147/247 (59%), Gaps = 13/247 (5%)
Query: 74 ESVEEMK-LRFAT-FSKNLDLIRSTNCKG-----LSYRLGLNISPVKDQGHCGSCWTFST 126
E V++M L+ T +S++ D + + +G + YR ++PVK+QG CGSCW FS+
Sbjct: 85 EVVQKMTGLKVPTSYSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSS 144
Query: 127 TGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYP 186
G+LE + GK ++LS Q LVDC N GC GG + AF+Y++ N G+D+E+AYP
Sbjct: 145 VGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYVQKNRGIDSEDAYP 202
Query: 187 YTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
Y G++ C ++ + I G E L+ AV V P+SVA + + F+FY G
Sbjct: 203 YVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKG 262
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGI 304
VY C + ++NHAV+AVGYG+ G +W+IKNSWGENWG+ GY M K N CGI
Sbjct: 263 VYYDESCNSD--NLNHAVLAVGYGILKGNKHWIIKNSWGENWGNKGYILMARNKNNACGI 320
Query: 305 ATCASYP 311
A AS+P
Sbjct: 321 ANLASFP 327
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 115/309 (37%), Positives = 157/309 (50%), Gaps = 61/309 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN--------- 108
+ ++ +GK+Y + EE LR A + KNL +I N + ++ +G+N
Sbjct: 29 WNQWTAEHGKVYSTGEE-SLRRAVWEKNLKMIEQHNLEYSQGKHTFTMGMNAFGDMTNED 87
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVK+Q CGSCW FS TG+L
Sbjct: 88 FRQMMTGFQNQKYNKGEVFQPPQPLEVPESVDWREKGYVTPVKNQHRCGSCWAFSATGAL 147
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E + GK +SLSEQ LVDC+Q +N GC GGL +AF+Y+K NGGLD+EE+YPY
Sbjct: 148 EGQMFRKTGKLVSLSEQNLVDCSQPQHNSGCKGGLVIKAFQYVKDNGGLDSEESYPYEEM 207
Query: 191 DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSS 249
+ C++S N V +I E L+ AV V P+SVA + F+FY G+
Sbjct: 208 ESTCRYSPGNSAATVTGFKHIP-AEEKALEKAVASVGPISVAIDAHHHSFQFYTGGILHE 266
Query: 250 TKCGNTPMDVNHAVVAVGYGV----EDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGI 304
C +P +NHAV+ VGYGV + YWL+KNSWGE WG GY M K N CGI
Sbjct: 267 PNC--SPKWLNHAVLVVGYGVMQEGSNNNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGI 324
Query: 305 ATCASYPVV 313
A+ A YP+V
Sbjct: 325 ASDALYPIV 333
>gi|383860620|ref|XP_003705787.1| PREDICTED: counting factor associated protein D-like [Megachile
rotundata]
Length = 549
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 96/229 (41%), Positives = 135/229 (58%), Gaps = 2/229 (0%)
Query: 84 ATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGIS 143
A F N D L +RL ++PVKDQ CGSCW+F TTG++E AY+ +GK +
Sbjct: 318 APFPYNADEEVKKVPDSLDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYYMKYGKLVR 377
Query: 144 LSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKDGVCKFSSENVG 202
LS+Q L+DC+ F N GC+GG +++++I +GGL E+ Y Y G+DG C ++
Sbjct: 378 LSQQALIDCSWGFGNNGCDGGEDFRSYQWIMKHGGLPAEDEYGGYLGQDGYCHANNVTKV 437
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNH 261
++ VN+T G + L+ A+ P+SVA + F FY GVY CGNT ++H
Sbjct: 438 AKITGFVNVTPGDPNALKVAIAKHGPISVAIDAAHKTFSFYSHGVYYDESCGNTEESLDH 497
Query: 262 AVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
AV+AVGYG +G YWL+KNSW WG+ GY M KN CG+ T +Y
Sbjct: 498 AVLAVGYGKLNGKDYWLVKNSWSNYWGNDGYILMSQEKNNCGVLTAPTY 546
>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
Length = 379
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 104/245 (42%), Positives = 138/245 (56%), Gaps = 34/245 (13%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R +SP+K+QG CGSCW+FSTTGS+E A++ + GK + LSEQ LVDC+ + N G
Sbjct: 137 VDWRAKGAVSPIKNQGQCGSCWSFSTTGSVEGAHYISTGKMVPLSEQNLVDCSGSEGNMG 196
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDEL 219
C GGL + AF+YI N G+DTE++YPY+ + G C F+ NVG + NIT G E L
Sbjct: 197 CQGGLMNLAFDYIIKNEGIDTEDSYPYSAETGKKCLFNKTNVGATISSYKNITSGDESNL 256
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED------ 272
AV PVSVA + + F+ Y G+Y C + +D H V+ VGYG D
Sbjct: 257 ADAVKNAGPVSVAIDASHNSFQLYSHGIYYEKDCSSVNLD--HGVLVVGYGSGDPSSLAN 314
Query: 273 ------------------GVP-----YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCA 308
P YW++KNSWG WG HG+ M M + N CGIAT A
Sbjct: 315 NVGGRSGPKMVVFNNRMVKTPSSNGDYWIVKNSWGSTWGSHGFIFMSMNRDNNCGIATSA 374
Query: 309 SYPVV 313
SYP+V
Sbjct: 375 SYPIV 379
>gi|344257450|gb|EGW13554.1| Testin-2 [Cricetulus griseus]
Length = 401
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K +++R ++PVK QGHC S W FS TG+LE + K +LSEQ L+DC +
Sbjct: 184 KQVNWREQGYVTPVKSQGHCASSWAFSATGALEGQMFKKTRKLNALSEQNLLDCMEFNVT 243
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+ C+GG AF+Y++ NGGL TEE+YPY G C++ ++N V D V I G E+
Sbjct: 244 RSCSGGFMQSAFQYVRDNGGLATEESYPYQGHAMECRYQAKNSAANVKDFVQIP-GHEEA 302
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + F+FY+SG+Y KC + NHAV+ VGYG E DG
Sbjct: 303 LMKAVANVGPISVAIDARHSSFQFYESGIYYEPKCKR--VHQNHAVLVVGYGFEGEESDG 360
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY K+ N CGIAT A+YP+V
Sbjct: 361 NSYWLVKNSWGEEWGIKGYMKIAKDWNNHCGIATHATYPIV 401
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 120/309 (38%), Positives = 157/309 (50%), Gaps = 65/309 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
FA +A ++GK Y E+ RFA + NL IR + +Y LGL
Sbjct: 54 FAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSE-TNRTYSLGLTKFADLTNEEFRRM 112
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++ VKDQG CGSCW FS GS+E
Sbjct: 113 YTGTRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTSVKDQGSCGSCWAFSAVGSVEG 172
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
G+ +SLSEQ+LVDC +N QGCNGGL AF++I NGG+DTE+ YPY G DG
Sbjct: 173 INAIRNGEAVSLSEQELVDCDLEYN-QGCNGGLMDYAFDFIIQNGGIDTEKDYPYKGFDG 231
Query: 193 VCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C S +N V +D ++ E+ L+ AV +PVSVA E F+ Y GV+S
Sbjct: 232 RCDNSKKNAHVVTIDGYEDVPENDEEALKKAVA-GQPVSVAIEAGGRDFQLYAQGVFSG- 289
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-------GKNMCG 303
+CG D++H V+AVGYG EDGV YW++KNSWGE WG+ GY +M+ G +CG
Sbjct: 290 ECGT---DLDHGVLAVGYGTEDGVDYWIVKNSWGEYWGESGYLRMKRNMKDSNDGPGLCG 346
Query: 304 IATCASYPV 312
I SY V
Sbjct: 347 INIEPSYAV 355
>gi|340368358|ref|XP_003382719.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 329
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 126/214 (58%), Gaps = 4/214 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CG+CW FS TG+LE + G ISLSEQQL+DC+ +F N G
Sbjct: 115 VDWRKSGAVTGVKNQGKCGACWAFSATGALEGQHFINTGTLISLSEQQLMDCSSSFGNNG 174
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GGL AF Y++ G TEEAYPY + G C+++S V+ +I G ED LQ
Sbjct: 175 CKGGLMDNAFRYLETVAGDMTEEAYPYLAEVGTCRYNSSEAKVKNTVYKDIPEGDEDALQ 234
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV + P+SV+ F+ Y GVY C ++ +D H V+ +GYG D YWL+
Sbjct: 235 EAVATIGPISVSINSEHSSFQLYDQGVYYEPTCSSSKLD--HGVLVIGYGTSDNNDYWLV 292
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
KNSWG NWG GY M K N CGIAT ASYP
Sbjct: 293 KNSWGTNWGMDGYIMMSRNKENNCGIATRASYPT 326
>gi|392922428|ref|NP_001256719.1| Protein CPL-1, isoform b [Caenorhabditis elegans]
gi|379657173|emb|CCG28194.1| Protein CPL-1, isoform b [Caenorhabditis elegans]
Length = 198
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 93/199 (46%), Positives = 128/199 (64%), Gaps = 5/199 (2%)
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNG 177
CGSCW FS TG+LE + + G+ +SLSEQ LVDC+ + N GCNGGL QAFEYI+ N
Sbjct: 2 CGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNH 61
Query: 178 GLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD 237
G+DTEE+YPY G+D C F+ + VG V+ G E++L+ AV P+S+A +
Sbjct: 62 GVDTEESYPYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGH 121
Query: 238 -GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKM 295
F+ YK GVY +C + +D H V+ VGYG + + YW++KNSWG WG+ GY ++
Sbjct: 122 RSFQLYKKGVYYDEECSSEELD--HGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYIRI 179
Query: 296 EMGK-NMCGIATCASYPVV 313
+ N CG+AT ASYP+V
Sbjct: 180 ARNRNNHCGVATKASYPLV 198
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 136/221 (61%), Gaps = 13/221 (5%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVKDQG CGSCW FST ++E A G ISLSEQ+LVDC + FN
Sbjct: 95 QSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGINQIATGDLISLSEQELVDCDKGFN- 153
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAED 217
QGCNGG AFE+I NGG+DTE+ YPY G DG C + +N V ++ ++ E
Sbjct: 154 QGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDGQCDQNRKNAKVVTINGFEDVPQNDEK 213
Query: 218 ELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
L+ AV +PVSVA E F+ Y+SG+++ CG D++H VVAVGYG EDG Y
Sbjct: 214 SLKKAVAH-QPVSVAIEAGGRAFQLYESGIFNGL-CG---TDLDHGVVAVGYGTEDGKDY 268
Query: 277 WLIKNSWGENWGDHGYFKME-----MGKNMCGIATCASYPV 312
W+++NSWG NWG++GY ++E CGIA SYP
Sbjct: 269 WIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPSYPT 309
>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
Length = 329
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + + +G E L+
Sbjct: 177 CGGGYMTNAFQYVQQNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREVPVGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY GVY C ++NHAV+AVGYG++ G +W++
Sbjct: 237 RAVARVGPISVAIDASLTSFQFYSKGVYYDESCDGD--NLNHAVLAVGYGIQRGHKHWIL 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY + K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYVLLARNKNNTCGIANLASFP 327
>gi|410256886|gb|JAA16410.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWG WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGGEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|410256882|gb|JAA16408.1| cathepsin L1 [Pan troglodytes]
gi|410256884|gb|JAA16409.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWG WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGGEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
Length = 347
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 118/334 (35%), Positives = 160/334 (47%), Gaps = 76/334 (22%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
Q + ++ +F + + + Y S EE R+ F N+D ++ N KG LGLN
Sbjct: 19 QQFSELQYRNAFTNWMIQNQRHYAS-EEFAARYNIFKANMDYVQEWNSKGSETVLGLNTF 77
Query: 109 ---------------------------------------------ISPVKDQGHCGSCWT 123
++P+K+Q CG CW+
Sbjct: 78 ADITNQEFRSIYLGTPFDGSSIINTETEKIFAAPAASIDWRTKGAVTPIKNQQQCGGCWS 137
Query: 124 FSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEE 183
FSTTGS E A A G SLSEQ L+DC+ ++ N GCNGGL + AFEYI N G+DTE
Sbjct: 138 FSTTGSTEGATAIAKGNLPSLSEQNLIDCSGSYGNNGCNGGLMTLAFEYIINNKGIDTES 197
Query: 184 AYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRF 241
+YPYT KDG CK++ N+G + N+T G+E L+ A + PVSVA + + F+
Sbjct: 198 SYPYTAKDGKTCKYNPANIGATLSSYSNVTSGSEPSLESAAN-IGPVSVAIDASHNSFQL 256
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGY---------------------GVEDGVPYWLIK 280
Y SG+Y C T +D H V+ VGY G G YW++K
Sbjct: 257 YSSGIYYEPACSTTSLD--HGVLVVGYASGSGSGSGSGSGSGSGLAVEGASSG-NYWIVK 313
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
NSWG +WG GY M + N CGIAT AS+P V
Sbjct: 314 NSWGTSWGIEGYILMSKDRNNNCGIATMASFPKV 347
>gi|410990010|ref|XP_004001243.1| PREDICTED: cathepsin L1 isoform 2 [Felis catus]
Length = 337
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 101/223 (45%), Positives = 135/223 (60%), Gaps = 13/223 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG+C CW FS TG+LE + GK +SLSEQ LVDC+Q N+G
Sbjct: 118 VDWREKGYVTPVKDQGYCLCCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQTEGNEG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGK----DGVCKFSSENVGVQVLDSVNITLGAE 216
+GGL AF+Y+K NGGLD+EE+YPY + CK+ EN V D +I E
Sbjct: 178 YSGGLIDDAFQYVKDNGGLDSEESYPYHAQVKRASYSCKYRPENSVANVTDYWDIP-SKE 236
Query: 217 DELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE---- 271
+EL + V P+S A + +D FRFYK G+Y C + DV+H V+ VGYG +
Sbjct: 237 NELMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSE--DVDHGVLVVGYGADGTET 294
Query: 272 DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+ YW+IKNSWG +WG GY KM + N CGIA+ AS+P V
Sbjct: 295 ENKKYWIIKNSWGTDWGMDGYIKMAKDRDNHCGIASLASFPTV 337
>gi|380013206|ref|XP_003690657.1| PREDICTED: counting factor associated protein D-like [Apis florea]
Length = 549
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 139/227 (61%), Gaps = 2/227 (0%)
Query: 86 FSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLS 145
F N++ ++ L +RL ++PVKDQ CGSCW+F TTG++E AY +GK + LS
Sbjct: 320 FPYNIEQEITSIPDNLDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYFMKYGKLVRLS 379
Query: 146 EQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKDGVCKFSSENVGVQ 204
+Q L+DC+ F N GC+GG +++++I +GGL TE+ Y Y G+DG C ++ ++ +
Sbjct: 380 QQALIDCSWGFGNNGCDGGEDFRSYQWIMKHGGLPTEDEYGGYLGQDGYCHVNNISMIAK 439
Query: 205 VLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAV 263
+ VN+T G + L+ A+ P+SVA + F FY G+Y + CGN ++HAV
Sbjct: 440 ITGYVNVTSGDANALKIAIAKHGPISVAIDASHKTFSFYSHGIYYESTCGNIEESLDHAV 499
Query: 264 VAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
+AVGYG +G YWLIKNSW WG+ GY M KN CG+ T +Y
Sbjct: 500 LAVGYGKINGKDYWLIKNSWSNYWGNDGYILMSQEKNNCGVLTTPTY 546
>gi|118404242|ref|NP_001072435.1| cathepsin K precursor [Xenopus (Silurana) tropicalis]
gi|113197688|gb|AAI21683.1| hypothetical protein MGC147539 [Xenopus (Silurana) tropicalis]
Length = 331
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 132/215 (61%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++P+++QG CGSCW FS+ G+LE + GK + LS Q LVDC + N G
Sbjct: 121 IDYRKKGYVTPIRNQGSCGSCWAFSSVGALEGQLKKKKGKLVDLSPQNLVDCVK--KNDG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AFEY++ N G+D+E AYPY G+D C +++ + G+E L+
Sbjct: 179 CGGGYMTNAFEYVRDNKGIDSENAYPYVGEDQECMYNATGKAASCKGFKEVQEGSEKALK 238
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AVGLV PVSV + + F+FY GVY C ++NHAV+AVGYG + YW++
Sbjct: 239 KAVGLVGPVSVGIDAGLSSFQFYSKGVYYDKDC--NAENINHAVLAVGYGTQKKTKYWIV 296
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWGE+WG+ GY M K N CGI++ ASYPV+
Sbjct: 297 KNSWGEDWGNKGYILMAREKDNACGISSLASYPVM 331
>gi|403302732|ref|XP_003942007.1| PREDICTED: cathepsin S isoform 2 [Saimiri boliviensis boliviensis]
Length = 289
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 104/255 (40%), Positives = 143/255 (56%), Gaps = 11/255 (4%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLNISPVKDQGHCGS 120
+ + YGK Y+ E +R + KNL + N + G+ SY LG+N + D G CG+
Sbjct: 40 WKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN--HLGDMGSCGA 97
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLD 180
CW FS G+LEA GK +SLS Q LVDC++ + N+GCNGG ++AF+YI N G+D
Sbjct: 98 CWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGID 157
Query: 181 TEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GF 239
+E +YPY D C++ S+ + G ED L+ AV PV V + F
Sbjct: 158 SEASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHPSF 217
Query: 240 RFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK 299
Y+SGVY C VNH V+ +GYG +G YWL+KNSWG N+G+ GY +M K
Sbjct: 218 FLYRSGVYYDPAC---TQKVNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNK 274
Query: 300 -NMCGIATCASYPVV 313
N CGIA+ SYP +
Sbjct: 275 GNHCGIASYPSYPEI 289
>gi|66394764|gb|AAY46196.1| cathepsin L-like cysteine proteinase [Globodera pallida]
Length = 379
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 96/209 (45%), Positives = 132/209 (63%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS+TG+LEA + + G+ ISLSEQ L+DC++ + N GCNGG+
Sbjct: 173 VTEVKNQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDN 232
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK N G+D E YPY K G C F +VG +I G E++L+ AV
Sbjct: 233 AFQYIKDNNGVDKELDYPYKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQG 292
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGE 285
P SVA + F+ Y GVY +C +P +++H V+ VGYG + YW++KNSWG
Sbjct: 293 PASVAIDAGHRSFQLYTHGVYFEKEC--SPENLDHGVLVVGYGTDAQQGDYWIVKNSWGA 350
Query: 286 NWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+WG+ GY +M KN CGIA+ ASYP+V
Sbjct: 351 HWGEQGYIRMARNRKNNCGIASHASYPLV 379
>gi|45384464|ref|NP_990302.1| cathepsin K precursor [Gallus gallus]
gi|25089842|sp|Q90686.1|CATK_CHICK RecName: Full=Cathepsin K; AltName: Full=JTAP-1; Flags: Precursor
gi|1017831|gb|AAC59739.1| JTAP-1 [Gallus gallus]
Length = 334
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 126/213 (59%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FS+ G+LE + GK +SLS Q LV C NN G
Sbjct: 124 VDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVS--NNNG 181
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AFEY++ N G+D+E+AYPY G+D C +S + I E L+
Sbjct: 182 CGGGYMTNAFEYVRLNRGIDSEDAYPYIGQDESCMYSPTGKAAKCRGYREIPEDNEKALK 241
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV + PVSV + + F+FY GVY T C P ++NHAV+AVGYG + G +W+I
Sbjct: 242 RAVARIGPVSVGIDASLPSFQFYSRGVYYDTGC--NPENINHAVLAVGYGAQKGTKHWII 299
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYP 311
KNSWG WG+ GY + K CGIA AS+P
Sbjct: 300 KNSWGTEWGNKGYVLLARNMKQTCGIANLASFP 332
>gi|402585860|gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
Length = 242
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 88/205 (42%), Positives = 128/205 (62%), Gaps = 4/205 (1%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG++E+ + G ISLSEQ+L+DC N GCNGGLP
Sbjct: 41 VTPVKNQGSCGSCWAFSVTGNIESLWAIKTGNLISLSEQELIDCDVIDN--GCNGGLPIN 98
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF IK GGL+ E+ YPY K+G C + V + D++ I E ++ + P
Sbjct: 99 AFREIKRMGGLEPEDQYPYKAKNGTCHLVRAQIAVTIDDAIEIPRN-ETVMKAWIAQRGP 157
Query: 229 VSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG 288
+SV + + +YKSG+ +K P +NH V+ GYG+E+G+PYW IKNSWGE WG
Sbjct: 158 LSVGIDA-ELLAYYKSGILHPSKSRCPPSKINHGVLITGYGIENGLPYWTIKNSWGEEWG 216
Query: 289 DHGYFKMEMGKNMCGIATCASYPVV 313
++GYF++ GK++CG++ S ++
Sbjct: 217 ENGYFRLMRGKDICGVSDLVSSAII 241
>gi|452258|emb|CAA80446.1| cathepsin L-like protease [Fasciola hepatica]
Length = 326
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 104/257 (40%), Positives = 143/257 (55%), Gaps = 10/257 (3%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKN-LDLIRSTNCKGLSYRLGLNISPVKDQGHC 118
L+F F +Y E+ R + N L + S + + Y ++ VKDQG C
Sbjct: 75 LTFEEFKAKYLIEIPRSSELLSRGIPYKANKLAVPESIDWRDYYY-----VTEVKDQGQC 129
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCW FSTTG++E + + S SEQQLVDC + F N GC GG A+EY+K+N G
Sbjct: 130 GSCWAFSTTGAVEGQFRKNERASASFSEQQLVDCTRDFGNYGCGGGYMENAYEYLKHN-G 188
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG 238
L+TE YPY +G C++ +V + G E EL++ VG +VA +
Sbjct: 189 LETESYYPYQAVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEDLPAVALDADSD 248
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F Y+SG+Y S C P + HAV+AVGYG +DG YW++KNSWG WG+ GY +
Sbjct: 249 FMMYQSGIYQSQTC--LPDRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWGEDGYIRFARN 306
Query: 299 K-NMCGIATCASYPVVA 314
+ NMCGIA+ AS P+VA
Sbjct: 307 RGNMCGIASLASVPMVA 323
>gi|297287735|ref|XP_002803218.1| PREDICTED: putative cathepsin L-like protein 6-like [Macaca
mulatta]
Length = 270
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 129/211 (61%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE GK ISLSEQ LVDC+ N+G NGG
Sbjct: 63 VTPVKNQGMCGSCWAFSATGALEGQMFWKTGKLISLSEQNLVDCSWPQGNEGYNGGFMDN 122
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
+F Y++ NGGLD+E +YPY GK C+++ + V+I E +L AV V P
Sbjct: 123 SFRYVQENGGLDSEASYPYEGKVKTCRYNPKYSVANDTGFVDIP-SREKDLAKAVATVGP 181
Query: 229 VSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNSW 283
+SVA + F+FYK G+Y +C P ++HA++ VGYG E D YWL+KNSW
Sbjct: 182 ISVAVDASHFSFQFYKKGIYFEPRC--DPEGLDHAMLTVGYGYEGADSDNNKYWLVKNSW 239
Query: 284 GENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
G+NWG GY KM +N CGIAT ASYP V
Sbjct: 240 GKNWGMDGYIKMAKDRRNNCGIATAASYPTV 270
>gi|405976506|gb|EKC41011.1| Counting factor associated protein D [Crassostrea gigas]
Length = 349
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 111/286 (38%), Positives = 156/286 (54%), Gaps = 16/286 (5%)
Query: 43 RDFETSVLQVIGQARHALSFARFARRYG-KIYESVEEMK-----------LRFATFSKNL 90
+F +V + + R AL F+ K E + M L F T +L
Sbjct: 65 HNFRQNVRFIHSKNRAALGFSLAVNHLADKTQEEIRLMNGYRYSPGPHGGLAFDTSKYSL 124
Query: 91 -DLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
DL S + + Y ++PVKDQ CGSCW+F TTG++E AY G + LS+QQL
Sbjct: 125 RDLPDSMDWRLHGYLSQRAVTPVKDQAVCGSCWSFGTTGTIEGAYFLKTGDLVRLSQQQL 184
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKDGVCKFSSENVGVQVLDS 208
+DC+ N C+GG +A++++ NGGL +EE Y PY +DG C + + VQ+ +
Sbjct: 185 MDCSWGEGNNACDGGEDFRAYQWMMKNGGLTSEELYGPYKAQDGKCNKTITPI-VQLKNY 243
Query: 209 VNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
VN+T G L+ A+ PVSVA + FY +GVY +CGN P D++HAV+AVG
Sbjct: 244 VNVTSGDLQALKFAIAHQGPVSVAIDASHLSLSFYANGVYYEPQCGNKPDDLDHAVLAVG 303
Query: 268 YGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
YGV +G YWLIKNSW WG+ GY M N CG+AT ++ +V
Sbjct: 304 YGVMNGQAYWLIKNSWSTYWGNDGYVLMSQKDNNCGVATDPTFVIV 349
>gi|348542138|ref|XP_003458543.1| PREDICTED: counting factor associated protein D-like [Oreochromis
niloticus]
Length = 551
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 149/304 (49%), Gaps = 51/304 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F+ F ++ + Y E + R F NL I S N G+S+ L LN
Sbjct: 248 FSHFKDKFQRQYNDEREHEKREHAFVLNLRYIHSKNRAGMSFSLALNSLSDRTMSELATM 307
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F+TTG++E
Sbjct: 308 RGRKRGKTPNRGLPFPFKAYERVNLPESLDWRLYGAVTPVKDQAICGSCWSFATTGAVEG 367
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
A G LS+Q L+DC+ F N GC+GG +A+E+I +GG+ T E Y Y G +
Sbjct: 368 ALFVKTGSLQVLSQQMLIDCSWGFGNNGCDGGEEWRAYEWIMKHGGIATTETYGAYMGMN 427
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSST 250
G C S + ++ N+T G + L+ A+ PV+V+ + F FY GVY
Sbjct: 428 GFCHVDSSELTARIQSYTNVTSGDQLALKMALFKNGPVAVSIDASHRSFVFYSHGVYYEP 487
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
CGNT D++HAV+AVGYG G PYWLIKNSW WG+ GY M M N CG+AT A+Y
Sbjct: 488 ACGNTVDDLDHAVLAVGYGTLSGEPYWLIKNSWSTYWGNDGYILMSMKDNNCGVATDATY 547
Query: 311 PVVA 314
+A
Sbjct: 548 VTLA 551
>gi|358334193|dbj|GAA43174.2| cysteine proteinase 3, partial [Clonorchis sinensis]
Length = 374
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 130/221 (58%), Gaps = 6/221 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PV+ QG CG+CW F+ TG++E Y + + S QQLVDC Q
Sbjct: 154 QSIDWRRNGAVTPVRRQGDCGACWAFAATGAIEGRYFIFEKRLETFSPQQLVDCIQGDTT 213
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG-----KDGVCKFSSENVGVQVLDSVNITL 213
GCNGG PS+AFEY++ GGL+ E YPY + C + V++ V +
Sbjct: 214 NGCNGGYPSEAFEYVENVGGLELERDYPYVSVATGLPNPFCGYDQTKQQVKLTSHVILPS 273
Query: 214 GAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED 272
G E+ L AV + P+++ F+ F+ Y+S +YS CG T DV HA++ VGYG E
Sbjct: 274 GDEEALLQAVSIYGPIAILFDASHPSFKDYESDIYSEENCGTTLDDVTHAMLVVGYGEEL 333
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
G PYWL+KNSWG+ WG+ GY ++ G NMC +A +SYP++
Sbjct: 334 GEPYWLVKNSWGDKWGEKGYMRVRRGVNMCAVAGFSSYPLM 374
>gi|296168737|emb|CAQ54046.1| silicatein alpha 2 [Ephydatia muelleri]
Length = 340
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 139/215 (64%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CG+ + F+ TG++E A + K ++LSEQ ++DC+ A+ N G
Sbjct: 128 IDWRTKGAVTSVKNQGDCGASYAFAATGTMEGANALSNDKQVALSEQNIIDCSVAYGNHG 187
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GG A +Y+ NGG+DTE +Y + GK C+++S+N G +V+I+ G+E +L
Sbjct: 188 CSGGDTYTAIKYVVDNGGIDTESSYSFRGKQSSCQYNSKNSGASATGAVSISYGSESDLM 247
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + FRFY+SGV+ S+ C +T + NHA++ GYG +G YWL+
Sbjct: 248 SAVATVGPVAVAVDANTNAFRFYQSGVFDSSTCSSTKL--NHAMLVTGYGSYNGKDYWLV 305
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG+ WGD GY M K N CGIA+ A Y ++
Sbjct: 306 KNSWGKYWGDSGYIMMVRNKYNQCGIASDALYSML 340
>gi|66377984|gb|AAY45869.1| cathepsin L-like cysteine proteinase [Globodera pallida]
Length = 379
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 96/209 (45%), Positives = 132/209 (63%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS+TG+LEA + + G+ ISLSEQ L+DC++ + N GCNGG+
Sbjct: 173 VTEVKNQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDN 232
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK N G+D E YPY K G C F +VG +I G E++L+ AV
Sbjct: 233 AFQYIKDNNGVDKELDYPYKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQG 292
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGE 285
P SVA + F+ Y GVY +C +P +++H V+ VGYG + YW++KNSWG
Sbjct: 293 PASVAIDAGHRSFQLYTHGVYFEKEC--SPENLDHGVLVVGYGTDAQQGDYWIVKNSWGA 350
Query: 286 NWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+WG+ GY +M KN CGIA+ ASYP+V
Sbjct: 351 HWGEQGYIRMARNRKNNCGIASHASYPLV 379
>gi|94448670|emb|CAI91573.1| silicatein a4 [Lubomirskia baicalensis]
gi|312386085|gb|ADQ74587.1| silicatein alpha 4 [Lubomirskia baicalensis]
Length = 326
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 94/215 (43%), Positives = 135/215 (62%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK QG CG+ + F+ TG+LE A + K + LSEQ ++DC+ + N G
Sbjct: 114 IDWRTKGAVTSVKYQGQCGASYAFAATGALEGASALSNDKQVILSEQNIIDCSVPYGNHG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GG A +Y+ NGG+DTE +Y + GK C++SS+N G ++I G+E +L
Sbjct: 174 CSGGDTYTAMKYVIDNGGIDTESSYSFQGKQSSCQYSSKNSGASATGVISIASGSETDLF 233
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + FRFY+SGV+ S+ C NT + NHA++ GYG +G YWL+
Sbjct: 234 AAVATVGPVAVAVDANTNAFRFYQSGVFDSSSCSNTKL--NHAMLVTGYGSYNGKDYWLV 291
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSW +NWGD+GY M K N CGIAT A YP +
Sbjct: 292 KNSWSKNWGDNGYIMMVRNKYNQCGIATDALYPTL 326
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 152/307 (49%), Gaps = 59/307 (19%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KGL-SYRLGLN------- 108
+ F F ++GK Y++ E RFA F +NL I + N +G+ SY G+N
Sbjct: 24 VHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTR 83
Query: 109 ------------------------------------------ISPVKDQGHCGSCWTFST 126
++P+KDQ CGSCW+F+
Sbjct: 84 AEFKAMLATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFAV 143
Query: 127 TGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYP 186
GS E AY + GK SEQQLVDC N GC+GG F YI+ NG L+ E YP
Sbjct: 144 VGSTEGAYALSTGKLTRFSEQQLVDCTTDLN-YGCDGGYLDDTFPYIQTNG-LELESDYP 201
Query: 187 YTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGV 246
YTG DG C + S V +V V++ E L AVG PV++A D +FY SG+
Sbjct: 202 YTGYDGSCSYDSSKVVTKVSSYVSVP-ANEQALLEAVGTAGPVAIAINA-DDLQFYFSGI 259
Query: 247 YSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIAT 306
C P ++H V+AVGY E+G+ YWLIKNSWG +WG+ GYF+ G+N+CG+
Sbjct: 260 IDDKYCD--PEWLDHGVLAVGYNSENGLDYWLIKNSWGADWGESGYFRFLRGQNICGVKE 317
Query: 307 CASYPVV 313
A YP++
Sbjct: 318 DAVYPLI 324
>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
Length = 329
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 94/213 (44%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I +G E L+
Sbjct: 177 CGGGYMTNAFQYVQENRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPVGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C D+NHA++AVGYG++ G +W++
Sbjct: 237 RAVARVGPVSVAIDASLSSFQFYSKGVYYDESCNGE--DLNHALLAVGYGMQRGNKHWIL 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY + K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYVLLARNKNNACGIANLASFP 327
>gi|28932706|gb|AAO60047.1| midgut cysteine proteinase 4 [Rhipicephalus appendiculatus]
Length = 345
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 121/358 (33%), Positives = 170/358 (47%), Gaps = 80/358 (22%)
Query: 12 ILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGK 71
+L+LC +A A + S DD+ + + G A + +F + Y K
Sbjct: 8 VLILCVSALAQIAPSRQDDN------------------IDIYGHFGKA--WDKFRKIYNK 47
Query: 72 IYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN------------------- 108
Y + EE R F + + +R+ + K L Y + +N
Sbjct: 48 TYGTSEETVYREQVFRRTFNFLRTVDEKFKNGTLLYSVAVNHFADMTPDEVVANYTGYKP 107
Query: 109 ------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
++PVK+QG CGSCW FS+TG+LE +
Sbjct: 108 PSAQQLAEIPLYAPLFGDTPEFIEWRENGFVTPVKNQGQCGSCWAFSSTGALEGQVFKRT 167
Query: 139 GKGISLSEQQLVDCA-QAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY-TGKDGVCKF 196
+ ISLSEQ L+DCA Q + N GCNGG AF+Y++ GGLDTE YPY G + C+F
Sbjct: 168 RRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQYVQDAGGLDTEARYPYRQGTNFQCQF 227
Query: 197 SS--ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCG 253
S+ E V V + E LQ AV V P+S+A F FYK+G+Y C
Sbjct: 228 SNSFEARRVSVNGHTRVPPRNERVLQDAVANVGPISIAINASPQTFMFYKNGIYGEPNC- 286
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
P +NHAV+ VGYG E GVPYW++KNSWG WG+ GY K+ +N+CG++ S+P
Sbjct: 287 -DPRGLNHAVLLVGYGEERGVPYWIVKNSWGPGWGEGGYIKILRNRNVCGMSQDPSFP 343
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 154/303 (50%), Gaps = 60/303 (19%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK-GLSYRLGLN-------------- 108
++ +YG++Y+ E + R+ F +N+ I + N + G SY+LG+N
Sbjct: 41 QWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASR 100
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
++PVKDQG CG CW FS ++E
Sbjct: 101 NRFKGHMCSPQAGPFRYENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGIN 160
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
GK ISLSEQ++VDC +QGCNGGL AF++I+ N GL TE YPYTG DG C
Sbjct: 161 QLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTC 220
Query: 195 KFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKC 252
E ++ ++ +E L AV +PVSVA + F+FY SG+++ + C
Sbjct: 221 NTQKEATHAAKITGFEDVPANSEAALMKAVAK-QPVSVAIDAGGFEFQFYSSGIFTGS-C 278
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCA 308
G ++H V AVGYG+ DG YWL+KNSWG WG+ GY +M+ + +CGIA A
Sbjct: 279 GT---QLDHGVTAVGYGISDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQA 335
Query: 309 SYP 311
SYP
Sbjct: 336 SYP 338
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 131/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS G+LE G +SLSEQ LVDC+QA N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+Y+ N GLD+EE+YPY KDG CK+ E V+I E
Sbjct: 176 QGCNGGLMDFAFQYVLNNKGLDSEESYPYEAKDGTCKYKPEFAAANDTGYVDIPQ-LEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+++A + F+FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPIAIAIDASHPSFQFYSSGIYYEPNCSSKELD--HGVLVVGYGFEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSWG +WG G+F + K N CG+AT ASYP V
Sbjct: 293 KKYWIVKNSWGSSWGMGGFFHIAKDKNNHCGVATAASYPTV 333
>gi|218478069|dbj|BAH03395.1| cathepsin L-like cysteine peptidase [Taenia solium]
Length = 346
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 136/215 (63%), Gaps = 14/215 (6%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG+CGSCW FS+TG+LE A+ + GK ISLSEQQLVDC+ N GCNGG S
Sbjct: 136 VTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSY 195
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGV-QVLDSVNITLGAEDELQHAVGLVR 227
AF+Y++ + ++ E AYPY DG C++ +E++GV V D +I G E L AV V
Sbjct: 196 AFKYLEEH-FIEPESAYPYRATDGPCRY-NESLGVGTVTDIGDIPEGNETALMEAVATVG 253
Query: 228 PVSVAFEVVD-GFRFYKS-------GVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
P+S+A + GF FY+ G+Y S C + + NH V+A+GYG +DG PYWL+
Sbjct: 254 PISIAIDASSLGFMFYRQVATNPHHGIYKSHWCSSKFL--NHGVLAIGYGKQDGKPYWLV 311
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
KNSWG WG GY M NMCG+A+ A +P V
Sbjct: 312 KNSWGTRWGMKGYIMMAKDYHNMCGVASLADFPYV 346
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 158/303 (52%), Gaps = 63/303 (20%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLS--YRLGLN----------------- 108
YGK+Y+ ++E + R F +N++ I ++N G + Y+LG+N
Sbjct: 47 HYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFADLTNEEFIASRNKF 106
Query: 109 --------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQ 136
++PVK+QG CG CW FS + E +
Sbjct: 107 KGHMCSSITKTSTFKYENASVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKL 166
Query: 137 AFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF 196
+ GK +SLSEQ+LVDC +QGC GGL AF++I N GL+TE YPY G DG C
Sbjct: 167 STGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCSA 226
Query: 197 SSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGN 254
+ ++ V + ++ E LQ AV +P+SVA + F+FYKSGV++ + CG
Sbjct: 227 NKASIHAVTITGYEDVPANNEQALQKAVA-NQPISVAIDASGSDFQFYKSGVFTGS-CG- 283
Query: 255 TPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCAS 309
+++H V AVGYGV DG YWL+KNSWG +WG+ GY KM+ G + +CGIA AS
Sbjct: 284 --TELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEAS 341
Query: 310 YPV 312
YP
Sbjct: 342 YPT 344
>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 323
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 156/306 (50%), Gaps = 58/306 (18%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK--------------------- 99
+ +F +GK Y SV E K RF+ F KNL I+ N K
Sbjct: 22 EWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHE 81
Query: 100 ------------------------------GLSYRLGLNISPVKDQGHCGSCWTFSTTGS 129
+ +R ++PVK+QGHCGSCW FS G+
Sbjct: 82 EFLDLLKLQGVPALPSDAVYFEETDIEEKDAVDWRKEGAVTPVKNQGHCGSCWAFSAVGA 141
Query: 130 LEAAYHQAFGKGISLSEQQLVDCA-QAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
+E + + G +SLS Q+LVDCA + + N+GCNGGL QAF++++ + G+ TEE+YPY
Sbjct: 142 IEGQFFKKNGTLVSLSAQELVDCATEYYGNEGCNGGLMGQAFDFVE-DEGIQTEESYPYK 200
Query: 189 GKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYS 248
K +C+ + E V + + ++ L E E+ AV PV+VA + FY G+
Sbjct: 201 AKRSICQMNGEYV--TKVKTYHLLLN-EQEIARAVSAKGPVAVAIDASQ-LSFYDQGIVD 256
Query: 249 ST-KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATC 307
KC D+NH V+ VGYG E+GV YW++KNSWG +WG+ GYF+++ CGI
Sbjct: 257 EKCKCSKKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWGEKGYFRLKKDVKACGIGNY 316
Query: 308 ASYPVV 313
+YPV+
Sbjct: 317 NTYPVL 322
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 111/262 (42%), Positives = 146/262 (55%), Gaps = 11/262 (4%)
Query: 60 LSFARFARRY-GKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHC 118
L+F F+ +Y G VE+ K R A K+ RS + +R ++ VK+QG C
Sbjct: 86 LTFEEFSAQYLGYGGAEVEQPKTRRA--GKHERKSRSEIPASVDWREKGAVAEVKNQGAC 143
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCW FS +LE A+ G+ ISLSEQQLVDC++ F N GC GG AFEY N G
Sbjct: 144 GSCWAFSAVAALEGAHFLNSGELISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTG 203
Query: 179 L--DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV 236
D+E+ YPY G DG CKFS++ V + ++ G E +L AV V PVSVA
Sbjct: 204 HGDDSEKDYPYKGMDGKCKFSADGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHAG 263
Query: 237 DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-----GVPYWLIKNSWGENWGDHG 291
+FY GV++ G +NH V AVGYG + YW+IKNSWG WG+ G
Sbjct: 264 AALQFYLRGVFNGV-AGTCFGPLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKG 322
Query: 292 YFKMEMGKNMCGIATCASYPVV 313
+ + GKN+CG+A ASYP+V
Sbjct: 323 FVRFARGKNLCGVANGASYPLV 344
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/257 (40%), Positives = 140/257 (54%), Gaps = 10/257 (3%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
L +A F G ++ K + N+ + S + + Y ++ VK+QG CG
Sbjct: 134 LEYAEFVNFNGLKMTNLNNTKCSSHLSANNIVVPDSVDWRSKGY-----VTKVKNQGACG 188
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCW FS TGSLE Y + GK + LSE QLVDC+ +F N+GCNGG AF+Y+K GG+
Sbjct: 189 SCWAFSATGSLEGQYFRKNGKLVPLSESQLVDCSGSFGNEGCNGGFMENAFKYVKSVGGI 248
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDG 238
++E YPY + C F V V V++ G+E L+ V V PVSVA +
Sbjct: 249 ESESDYPYKARQRTCAFDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVSVAIDAGHSS 308
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEM 297
F+ Y GVY C + + NH V+ VGYG G YW++KNSWG WG GY KM
Sbjct: 309 FQLYAGGVYDEPLCSTSRL--NHGVLCVGYGTSLQGKDYWIVKNSWGVRWGVEGYIKMSR 366
Query: 298 GK-NMCGIATCASYPVV 313
K N CGIA+ ASYP+V
Sbjct: 367 NKNNQCGIASEASYPLV 383
>gi|294883332|ref|XP_002770713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873998|gb|EER02718.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 332
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 120/306 (39%), Positives = 149/306 (48%), Gaps = 57/306 (18%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L+F F ++GK YES EE R A F NL I N K LSY+LG+N
Sbjct: 26 LAFMGFKHKFGKNYESKEEEVKRNAIFQANLQHIEQVNAKDLSYKLGVNEHADLTHEEFA 85
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVKDQ CGSCW FS G+LE
Sbjct: 86 ALKLSTLDTSTRRDDEFVVEVNTTQLPTSVDWRNKSVLTPVKDQEFCGSCWAFSAIGALE 145
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
A Y A GK +SLSEQQLVDC+ + +GC GG A+EYIK + G+D E YPY G D
Sbjct: 146 AQYAIATGKLLSLSEQQLVDCSHKYGTKGCRGGYMGDAYEYIK-SAGIDQESTYPYKGWD 204
Query: 192 GVCK---FSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVY 247
C+ ++ + + I E L A+ PVSV + F Y+SGVY
Sbjct: 205 EPCRPREKKADGIPAGEVTGSYILYWTEQSLMDALAYA-PVSVTMDASGADFGLYESGVY 263
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATC 307
SST C T VNHAVVAVGYG E+G Y++ KNSWG +WG GYF ++ G G
Sbjct: 264 SSTTCNGT---VNHAVVAVGYGTENGSDYFIFKNSWGSSWGMGGYFYLKRGVGGFGECNI 320
Query: 308 ASYPVV 313
Y VV
Sbjct: 321 LEYMVV 326
>gi|83715950|dbj|BAE54434.1| silicatein [Ephydatia fluviatilis]
Length = 326
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 139/215 (64%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CG+ + F+ TG++E A + K ++LSEQ ++DC+ A+ N G
Sbjct: 114 IDWRTKGAVTSVKNQGDCGASYAFAATGTMEGANALSNDKQVALSEQNIIDCSVAYGNHG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GG A +Y+ NGG+DTE +Y + GK C+++S+N G +V+I+ G+E +L
Sbjct: 174 CSGGDTYTAIKYVVDNGGIDTESSYSFRGKQSSCQYNSKNSGASATGAVSISYGSESDLM 233
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + FRFY+SGV+ S+ C +T + NHA++ GYG +G YWL+
Sbjct: 234 SAVATVGPVAVAVDANTNAFRFYQSGVFDSSTCSSTKL--NHAMLVTGYGSYNGKDYWLV 291
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG+ WGD GY M K N CGIA+ A Y ++
Sbjct: 292 KNSWGKYWGDSGYIMMVRNKYNQCGIASDALYSML 326
>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
Length = 331
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/213 (44%), Positives = 129/213 (60%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 121 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 179 CGGGYMTNAFHYVQKNQGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKALK 238
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY GVY C + ++NHAV+AVGYG++ +W+I
Sbjct: 239 RAVARVGPISVAIDASLTSFQFYSKGVYYDKNCNSD--NLNHAVLAVGYGIQKRKKHWII 296
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY M K N CGIA AS+P
Sbjct: 297 KNSWGESWGNKGYILMARNKNNACGIANLASFP 329
>gi|339252572|ref|XP_003371509.1| cathepsin L1 [Trichinella spiralis]
gi|316968239|gb|EFV52542.1| cathepsin L1 [Trichinella spiralis]
Length = 448
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/243 (41%), Positives = 136/243 (55%), Gaps = 38/243 (15%)
Query: 109 ISPVKDQGHCGSCWTFST---------------TGSLEAAYHQAFGKGISLSEQQLVDCA 153
++PVKDQGHCGSCW FS TG+LE + GK +SLSEQ L+DC+
Sbjct: 206 VTPVKDQGHCGSCWAFSAVNSNALHVHSRAFQQTGALEGQNKRKTGKLVSLSEQNLIDCS 265
Query: 154 QAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK----DGVCKFSSENVGVQVLDSV 209
+ + N+GC+GGL AFEY+K N G+DTEE+YPY D C+F + +G V
Sbjct: 266 RKYGNKGCSGGLMDNAFEYVKENHGIDTEESYPYEAAVRMLDKKCRFKNSTIGATDKGFV 325
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNT------------- 255
+I G E L HAV + P+SVA + + F+FY SG+ NT
Sbjct: 326 DIEPGNETYLMHAVATIGPLSVAIDASHESFQFYSSGMLLMVDIFNTVEVMWTNLGVYFE 385
Query: 256 PM----DVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASY 310
PM ++H V+ VGYG G YW++KNSWG +WG+ GY M K N CGIA+ ASY
Sbjct: 386 PMCSSQFLDHGVLVVGYGSLKGKDYWIVKNSWGTSWGNDGYIFMARNKNNSCGIASFASY 445
Query: 311 PVV 313
P++
Sbjct: 446 PII 448
>gi|33242865|gb|AAQ01137.1| cathepsin [Branchiostoma lanceolatum]
Length = 328
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/207 (48%), Positives = 126/207 (60%), Gaps = 6/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+Q CGSCW FSTTGSLE + ++ K +SLSEQ LVDC++ G G L Q
Sbjct: 126 VTPVKNQEQCGSCWAFSTTGSLEGQHFKSTQKLVSLSEQNLVDCSRK-RGTGLPGRLMDQ 184
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
F+YIK NGG+DTEE YPY K+ C + + G + G LQ AV V P
Sbjct: 185 GFKYIKDNGGIDTEECYPYKAKNEKCNYQASCSGATLTAKRRQDEG-RGALQQAVATVGP 243
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
+SVA + F+ Y+SGVY C T MD H V+AVGYG E+G YWL+KNSWG +W
Sbjct: 244 ISVAIDAGHSSFQLYQSGVYHKFFCSETKMD--HGVLAVGYGTEEGKDYWLVKNSWGASW 301
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY KM + N GIAT ASYP V
Sbjct: 302 GEKGYIKMSRNRHNNWGIATSASYPTV 328
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 158/303 (52%), Gaps = 63/303 (20%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLS--YRLGLN----------------- 108
YGK+Y+ ++E + R F +N++ I ++N G + Y+LG+N
Sbjct: 47 HYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFADLTNEEFIASRNKF 106
Query: 109 --------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQ 136
++PVK+QG CG CW FS + E +
Sbjct: 107 KGHMCSSITKTSTFKYENASVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKL 166
Query: 137 AFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF 196
+ GK +SLSEQ+LVDC +QGC GGL AF++I N GL+TE YPY G DG C
Sbjct: 167 STGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCSA 226
Query: 197 SSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGN 254
+ ++ V + ++ E LQ AV +P+SVA + F+FYKSGV++ + CG
Sbjct: 227 NKASIHAVTITGYEDVPANNEQALQKAVA-NQPISVAIDASGSDFQFYKSGVFTGS-CG- 283
Query: 255 TPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCAS 309
+++H V AVGYGV DG YWL+KNSWG +WG+ GY KM+ G + +CGIA AS
Sbjct: 284 --TELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEAS 341
Query: 310 YPV 312
YP
Sbjct: 342 YPT 344
>gi|312091978|ref|XP_003147174.1| fibroinase [Loa loa]
gi|307757661|gb|EFO16895.1| fibroinase [Loa loa]
Length = 286
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/236 (41%), Positives = 144/236 (61%), Gaps = 24/236 (10%)
Query: 86 FSKNLDLIRSTNCK---GLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI 142
F K ++ S N + + +R+ ++PVKDQG CGSCW FS+TG+LE +++ G+ I
Sbjct: 54 FGKKNVILLSANSRLPEKVDWRIKGAVTPVKDQGRCGSCWAFSSTGALEGQHYRRTGRLI 113
Query: 143 SLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVG 202
SLSEQ L+DC++ + N GC+GGL AF+YIK NGG+D+E AYPY K+G C++S+
Sbjct: 114 SLSEQNLLDCSEDYGNSGCSGGLMDYAFDYIKENGGIDSESAYPYEAKEGPCRYSNRTRV 173
Query: 203 VQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRF---YKSGVYSSTKCGNTPMDV 259
V++ G E +LQ AV + P+SVA R+ Y+ G GN +
Sbjct: 174 STDNGEVDLPEGDEMQLQRAVAKIGPISVAMNA----RYLSSYEEGY------GNEKVKR 223
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
+ V + + YW++KNSWG++WG+ GYF++ K NMCGIA+ ASYP+V+
Sbjct: 224 ENGTV-------EDLDYWIVKNSWGKDWGEDGYFRLARNKDNMCGIASAASYPIVS 272
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 154/303 (50%), Gaps = 64/303 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------- 108
++GK Y S+ E + RF F NL I N + +YR+GLN
Sbjct: 48 KHGKAYNSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALS 107
Query: 109 ---------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYH 135
+ VKDQG CGSCW FS ++E
Sbjct: 108 GIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINK 167
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCK 195
G ISLSEQ+LVDC ++N +GCNGGL FE+I NGG+D+EE YPY +DG C
Sbjct: 168 IVTGDLISLSEQELVDCDNSYN-EGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCD 226
Query: 196 FSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCG 253
+N V +DS ++ + E LQ AV +PVSVA E F+ Y SGV+S +CG
Sbjct: 227 TYRKNARVVSIDSYEDVPVNNEAALQKAVA-NQPVSVAIEAGGRDFQLYSSGVFSG-RCG 284
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCAS 309
++H VVAVGYG E+G YW+++NSWG++WG+ GY +M +CGIA AS
Sbjct: 285 TA---LDHGVVAVGYGTENGQDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEAS 341
Query: 310 YPV 312
YP+
Sbjct: 342 YPI 344
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 112/274 (40%), Positives = 156/274 (56%), Gaps = 23/274 (8%)
Query: 56 ARHALSFARFAR----RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG------LSYRL 105
A + L +F Y K+Y R +KN++ S G + +R
Sbjct: 94 ATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQ 153
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++P+KDQG CGSCW FSTT ++E G+ ISLSEQ+LVDC +++N QGCNGGL
Sbjct: 154 KGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYN-QGCNGGL 212
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVG 224
AF++I NGGL+TE+ YPY G G C +N V +D ++ E L+ A+
Sbjct: 213 MDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAIS 272
Query: 225 LVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
+PVSVA E F+ Y+SG+++ + CG +++HAVVAVGYG E+GV YW+++NSW
Sbjct: 273 Y-QPVSVAIEAGGRIFQHYQSGIFTGS-CG---TNLDHAVVAVGYGSENGVDYWIVRNSW 327
Query: 284 GENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
G WG+ GY +ME CGIA ASYPV
Sbjct: 328 GPRWGEEGYIRMERNLAASKSGKCGIAVEASYPV 361
>gi|5822035|pdb|1CS8|A Chain A, Crystal Structure Of Procathepsin L
Length = 316
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGS W FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 99 RSVDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 158
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 159 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDAGFVDIPK-QEKA 217
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 218 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 275
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 276 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 316
>gi|307175098|gb|EFN65240.1| Cathepsin L [Camponotus floridanus]
Length = 319
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 144/244 (59%), Gaps = 31/244 (12%)
Query: 74 ESVEEMKLRFATFSK--NLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLE 131
E+V E +L ATF + N++L +S + +R ++ +KDQG CGSCW FS+TG+LE
Sbjct: 103 ETVSEEQLIGATFIEPVNVELAKSVD-----WRTNGAVTAIKDQGQCGSCWAFSSTGALE 157
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+ + G +SLSEQ L+DC+ + N GCNGGL AF YIK N GLDTE++YPY ++
Sbjct: 158 GQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAEN 217
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C+++ +N G + V+I G ED+L+ AV + P+SVA + + F+FY G
Sbjct: 218 DQCRYNPKNSGASDVGFVDIPEGDEDKLKAAVATIGPISVAIDASHESFQFYSEGT---- 273
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCAS 309
C +D YWL+KNSWGE WG+ GY KM KN CGIA+ AS
Sbjct: 274 -CYTCNID-----------------YWLVKNSWGETWGEKGYIKMARNKKNHCGIASSAS 315
Query: 310 YPVV 313
YP+V
Sbjct: 316 YPLV 319
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 112/274 (40%), Positives = 156/274 (56%), Gaps = 23/274 (8%)
Query: 56 ARHALSFARFAR----RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG------LSYRL 105
A + L +F Y K+Y R +KN++ S G + +R
Sbjct: 94 ATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQ 153
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++P+KDQG CGSCW FSTT ++E G+ ISLSEQ+LVDC +++N QGCNGGL
Sbjct: 154 KGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYN-QGCNGGL 212
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVG 224
AF++I NGGL+TE+ YPY G G C +N V +D ++ E L+ A+
Sbjct: 213 MDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAIS 272
Query: 225 LVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
+PVSVA E F+ Y+SG+++ + CG +++HAVVAVGYG E+GV YW+++NSW
Sbjct: 273 Y-QPVSVAIEAGGRIFQHYQSGIFTGS-CG---TNLDHAVVAVGYGSENGVDYWIVRNSW 327
Query: 284 GENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
G WG+ GY +ME CGIA ASYPV
Sbjct: 328 GPRWGEEGYIRMERNLAASKSGKCGIAVEASYPV 361
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 158/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +GK Y +V E + R+A F NL I N S+RLGLN
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ +KDQG CGSCW FS
Sbjct: 100 YRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAI 159
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N +GCNGGL AF++I NGG+DTE+ YPY
Sbjct: 160 AAVEGINQIVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFDFIINNGGIDTEDDYPY 218
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
GKD C + +N V +DS ++T +E LQ AV +PVSVA E F+ Y SG
Sbjct: 219 KGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVA-NQPVSVAIEAGGRAFQLYSSG 277
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ KCG ++H V AVGYG E+G YW+++NSWG++WG+ GY +ME
Sbjct: 278 IFTG-KCGTA---LDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGK 333
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 334 CGIAVEPSYPL 344
>gi|291463491|pdb|3IV2|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant
gi|291463492|pdb|3IV2|B Chain B, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant
gi|291463519|pdb|3K24|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant In
Complex With Gln-Leu-Ala Peptide
gi|291463520|pdb|3K24|B Chain B, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant In
Complex With Gln-Leu-Ala Peptide
Length = 220
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGS W FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 3 RSVDWREKGYVTPVKNQGQCGSAWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 63 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKA 121
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 122 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 179
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 180 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|294883340|ref|XP_002770717.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239874002|gb|EER02722.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 121/313 (38%), Positives = 158/313 (50%), Gaps = 69/313 (22%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L+F F ++GK YES EE R A F NL I N K LSY+LG+N
Sbjct: 26 LAFMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEHVNAKNLSYKLGVNEHADLTHEEFA 85
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCW FS G+LE
Sbjct: 86 ALKLGTLEMSTRRDDKFVVEADTTQLPTSVDWRNKSVLSPVKNQGSCGSCWAFSAAGALE 145
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
A Y A GK LS Q+LVDC+ ++ N+GC GGL + A++YIK + GLD E YPY G +
Sbjct: 146 AQYAIATGKLRPLSVQELVDCSSSYGNKGCLGGLMTNAYKYIK-SAGLDQESTYPYKGWN 204
Query: 192 GVCKFSSEN----VGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGV 246
C SSE + + ++ E L A+ PVS+A D FRFY+SGV
Sbjct: 205 KHCFRSSEKKADGIPAGEVTGSHMLAQTEQSLMKALAAA-PVSLAMYARDRNFRFYRSGV 263
Query: 247 YSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-------- 298
YSST C +++H VVAVGYG + G Y+++KNSWG +WG GYF ++ G
Sbjct: 264 YSSTTCNG---EIDHGVVAVGYGADKGSDYFILKNSWGSSWGIGGYFYLKRGVGGFGECK 320
Query: 299 --KNMCGIATCAS 309
+NMC +AT S
Sbjct: 321 ILENMC-VATLKS 332
>gi|208972990|dbj|BAG74344.1| silicatein-M3 [Ephydatia fluviatilis]
Length = 326
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 134/215 (62%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK QG CG+ + F+ TG+LE A A K + LSEQ ++DC+ + N G
Sbjct: 114 IDWRTKGAVTSVKYQGQCGASYAFAATGALEGASALANDKQVILSEQNIIDCSVPYGNHG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GG A +Y+ NGG+DTE +Y + GK C++SS+N G ++IT G+E +L
Sbjct: 174 CSGGDTYTAMKYVIDNGGIDTESSYSFQGKQSSCQYSSKNSGASATGVISITSGSETDLL 233
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + FRFY+SGV+ S+ C NT NHA++ GYG +G YWL+
Sbjct: 234 AAVATVGPVAVAVDANTNAFRFYQSGVFDSSSCSNTK--PNHAMLVTGYGSYNGKDYWLV 291
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSW +NWGD+GY M K N C IAT A YP +
Sbjct: 292 KNSWSKNWGDNGYILMVRNKYNQCAIATDALYPTL 326
>gi|31982433|ref|NP_031828.2| cathepsin K precursor [Mus musculus]
gi|12644320|sp|P55097.2|CATK_MOUSE RecName: Full=Cathepsin K; Flags: Precursor
gi|3550487|emb|CAA06825.1| cathepsin K [Mus musculus]
gi|12834090|dbj|BAB22783.1| unnamed protein product [Mus musculus]
gi|28277388|gb|AAH46320.1| Cathepsin K [Mus musculus]
gi|74209960|dbj|BAE21279.1| unnamed protein product [Mus musculus]
gi|148706870|gb|EDL38817.1| cathepsin K, isoform CRA_a [Mus musculus]
Length = 329
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 130/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVT--ENYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ NGG+D+E+AYPY G+D C +++ + I +G E L+
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SV+ + + F+FY GVY C +VNHAV+ VGYG + G +W+I
Sbjct: 237 RAVARVGPISVSIDASLASFQFYSRGVYYDENCDRD--NVNHAVLVVGYGTQKGSKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY + K N CGI AS+P
Sbjct: 295 KNSWGESWGNKGYALLARNKNNACGITNMASFP 327
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 158/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +GK Y +V E + R+A F NL I N S+RLGLN
Sbjct: 41 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 100
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ +KDQG CGSCW FS
Sbjct: 101 YRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAI 160
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N +GCNGGL AF++I NGG+DTE+ YPY
Sbjct: 161 AAVEGINQIVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFDFIINNGGIDTEDDYPY 219
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
GKD C + +N V +DS ++T +E LQ AV +PVSVA E F+ Y SG
Sbjct: 220 KGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVA-NQPVSVAIEAGGRAFQLYSSG 278
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ KCG ++H V AVGYG E+G YW+++NSWG++WG+ GY +ME
Sbjct: 279 IFTG-KCGTA---LDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGK 334
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 335 CGIAVEPSYPL 345
>gi|146386731|pdb|1VSN|A Chain A, Crystal Structure Of A Potent Small Molecule Inhibitor
Bound To Cathepsin K
Length = 215
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE +A G ++L+ Q LVDC N G
Sbjct: 5 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKATGALLNLAPQNLVDCVS--ENDG 62
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 63 CGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEAALK 122
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY +GVY C + + NHAV+AVGYG++ G +W+I
Sbjct: 123 RAVAAVGPVSVAIDASLTSFQFYSAGVYYDENCSSDAL--NHAVLAVGYGIQAGNKHWII 180
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY M K N CGIA AS+P
Sbjct: 181 KNSWGESWGNAGYILMARNKNNACGIANLASFP 213
>gi|157835400|pdb|2NQD|B Chain B, Crystal Structure Of Cysteine Protease Inhibitor,
Chagasin, In Complex With Human Cathepsin L
Length = 221
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGS W FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 4 RSVDWREKGYVTPVKNQGQCGSAWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 63
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 64 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKA 122
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 123 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 180
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 181 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 221
>gi|354502589|ref|XP_003513366.1| PREDICTED: testin-2-like [Cricetulus griseus]
Length = 333
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K +++R ++PVK QGHC S W FS TG+LE + K +LSEQ L+DC +
Sbjct: 116 KQVNWREQGYVTPVKSQGHCASSWAFSATGALEGQMFKKTRKLNALSEQNLLDCMEFNVT 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+ C+GG AF+Y++ NGGL TEE+YPY G C++ ++N V D V I G E+
Sbjct: 176 RSCSGGFMQSAFQYVRDNGGLATEESYPYQGHAMECRYQAKNSAANVKDFVQIP-GHEEA 234
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + F+FY+SG+Y KC NHAV+ VGYG E DG
Sbjct: 235 LMKAVANVGPISVAIDARHSSFQFYESGIYYEPKCKRVHQ--NHAVLVVGYGFEGEESDG 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY K+ N CGIAT A+YP+V
Sbjct: 293 NSYWLVKNSWGEEWGIKGYMKIAKDWNNHCGIATHATYPIV 333
>gi|291383488|ref|XP_002708302.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 344
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 133/218 (61%), Gaps = 9/218 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
L +R ++PVK+Q CGS W FS TG+LE + G+ +SLSEQ LVDC+ NQG
Sbjct: 118 LDWRERGYVTPVKNQELCGSSWAFSATGALEGQMFRKTGRLVSLSEQNLVDCSWPQGNQG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL AF+Y+K N GLD+EE+YPY + G CK++ V V+++ E L
Sbjct: 178 CSGGLMDYAFQYVKDNRGLDSEESYPYEQRKGSCKYNPRFSAANVTGFVDVS-KDEKALM 236
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED----GVP 275
AV V PVSV + F FY+ G+Y KC + +VNHAV+ VGYG E+
Sbjct: 237 EAVATVGPVSVGIATTPESFLFYEGGIYYDPKCSSE--NVNHAVLVVGYGFEEVGSKNNK 294
Query: 276 YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPV 312
YWLIKNSWG++WG GY KM + N CGIAT ASYP+
Sbjct: 295 YWLIKNSWGKDWGMGGYMKMAKDQNNHCGIATAASYPL 332
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 158/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +GK Y +V E + R+A F NL I N S+RLGLN
Sbjct: 40 YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ +KDQG CGSCW FS
Sbjct: 100 YRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAI 159
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N +GCNGGL AF++I NGG+DTE+ YPY
Sbjct: 160 AAVEGINQIVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFDFIINNGGIDTEDDYPY 218
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
GKD C + +N V +DS ++T +E LQ AV +PVSVA E F+ Y SG
Sbjct: 219 KGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVA-NQPVSVAIEAGGRAFQLYSSG 277
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ KCG ++H V AVGYG E+G YW+++NSWG++WG+ GY +ME
Sbjct: 278 IFTG-KCGTA---LDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGK 333
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 334 CGIAVEPSYPL 344
>gi|288764223|emb|CAQ03432.1| silcatein 1 [Spongilla lacustris]
gi|296168747|emb|CAQ54051.1| silicatein alpha 3 [Spongilla lacustris]
Length = 327
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/223 (43%), Positives = 138/223 (61%), Gaps = 10/223 (4%)
Query: 99 KGLSYRLGLN------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDC 152
KG+SY ++ ++ VK Q CGS + F+ G+LE A A K ++LSEQ ++DC
Sbjct: 107 KGVSYADSMDWRTKGVVTSVKTQSQCGSSYAFAAVGALEGASALATDKLVALSEQNIIDC 166
Query: 153 AQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNIT 212
+ + N GC+GG AF+Y+ NGG+DTE +YPY GK C+++S+N G V I
Sbjct: 167 SVPYGNHGCSGGDTYTAFKYVVDNGGIDTESSYPYKGKQSSCQYNSKNAGATATGVVKIA 226
Query: 213 LGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE 271
G+E +L AV PV+VA + V+ F FY+SGV+ S+ C NT + NHA++ GYG
Sbjct: 227 SGSESDLMSAVASGGPVAVAVDASVNSFMFYQSGVFDSSTCSNTKL--NHAMLVTGYGSV 284
Query: 272 DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+G YWL+KNSWG +WG+ GY +M K N CGIA+ A P++
Sbjct: 285 NGKDYWLVKNSWGTSWGESGYIRMVRNKYNQCGIASDALIPML 327
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 114/308 (37%), Positives = 157/308 (50%), Gaps = 64/308 (20%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG--LSYRLGLN------------- 108
++ +YGKIY+ +E + RF F +N++ I + N SY+LG+N
Sbjct: 41 QWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFADLTNEEFIAS 100
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVK+QG CG CW FS + E
Sbjct: 101 RNKFKGHMCSSIMRTTSFKYENVSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATE 160
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+ + GK ISLSEQ+LVDC +QGC GGL AF++I N GL TE YPY G D
Sbjct: 161 GIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVD 220
Query: 192 GVCKFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
G C + +V V + ++ +E LQ AV +P+SVA + F+FYKSGV++
Sbjct: 221 GTCNANKASVQAVTITGYEDVPANSEQALQKAVA-NQPISVAIDASGSDFQFYKSGVFTG 279
Query: 250 TKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGI 304
CG +++H V AVGYGV DG YWL+KNSWG +WG+ GY M+ G + +CGI
Sbjct: 280 A-CGT---ELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGIEAAEGICGI 335
Query: 305 ATCASYPV 312
A ASYP
Sbjct: 336 AMQASYPT 343
>gi|295321664|pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
gi|295321665|pdb|3H7D|E Chain E, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
Length = 215
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 129/213 (60%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 5 VDYREKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 62
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 63 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 122
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG G +W+I
Sbjct: 123 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGESKGNKHWII 180
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG GY KM K N CGIA AS+P
Sbjct: 181 KNSWGENWGMGGYIKMARNKNNACGIANLASFP 213
>gi|402856107|ref|XP_003892641.1| PREDICTED: cathepsin S isoform 2 [Papio anubis]
Length = 281
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 105/256 (41%), Positives = 145/256 (56%), Gaps = 12/256 (4%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLNISPVKDQGHCGS 120
+ + YGK Y+ E +R + KNL + N + G+ SY LG+N + D G CG+
Sbjct: 31 WKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN--HLGDMGSCGA 88
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA-QAFNNQGCNGGLPSQAFEYIKYNGGL 179
CW FS G+LEA GK +SLS Q LVDC+ + + N+GCNGG ++AF+YI N G+
Sbjct: 89 CWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYIIDNNGI 148
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-G 238
D++ +YPY D C++ S+ + G ED L+ V PVSV +
Sbjct: 149 DSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGVDASHPS 208
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F Y+SGVY C +VNH V+ VGYGV +G YWL+KNSWG N+G+ GY +M
Sbjct: 209 FFLYRSGVYYEPSC---TQNVNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGYIRMARN 265
Query: 299 K-NMCGIATCASYPVV 313
K N CGIA+ SYP +
Sbjct: 266 KGNHCGIASFPSYPEI 281
>gi|294890024|ref|XP_002773045.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239877748|gb|EER04861.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 329
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 147/303 (48%), Gaps = 54/303 (17%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L+F F ++GK YES EE R A F NL I N K LSY+LG+N
Sbjct: 26 LAFMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEHVNAKNLSYKLGVNEHADLTHEEFA 85
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVK+QG CGS W FSTTG+L
Sbjct: 86 ALKLGTLKMSTRRDDEFVVEADTTQLPTSVDWRNKSVLTPVKNQGSCGSSWAFSTTGALG 145
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
A Y A GK +SLSEQ+LVDC+ + N GC GG A+EYI GLD E YPY G D
Sbjct: 146 AQYAIATGKLLSLSEQELVDCSLKYGNDGCIGGYMGAAYEYIN-QAGLDQESTYPYKGWD 204
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSST 250
C SSE + + E L A+ PVSV D FRFY+SGVYSST
Sbjct: 205 EPCFRSSEKKADGIPVRFVLNTKTEQSLMKALADA-PVSVGMYASDPNFRFYRSGVYSST 263
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
C + +HAVVAVGYG + G Y+++KNSWG WG GYF ++ G G Y
Sbjct: 264 TCNG---ETDHAVVAVGYGADKGSDYFILKNSWGSKWGIGGYFFLKRGVGGHGECNILEY 320
Query: 311 PVV 313
+V
Sbjct: 321 MLV 323
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 158/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +GK Y +V E + R+A F NL I N S+RLGLN
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ +KDQG CGSCW FS
Sbjct: 100 YRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAI 159
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N +GCNGGL AF++I NGG+DTE+ YPY
Sbjct: 160 AAVEDINQIVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFDFIINNGGIDTEDDYPY 218
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
GKD C + +N V +DS ++T +E LQ AV +PVSVA E F+ Y SG
Sbjct: 219 KGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAV-RNQPVSVAIEAGGRAFQLYSSG 277
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ KCG ++H V AVGYG E+G YW+++NSWG++WG+ GY +ME
Sbjct: 278 IFTG-KCGTA---LDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGK 333
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 334 CGIAVEPSYPL 344
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 135/211 (63%), Gaps = 13/211 (6%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FST G++E G+ ISLSEQ+LVDC + NQGCNGGL
Sbjct: 162 VAPVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDNGY-NQGCNGGLMDY 220
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVR 227
AFE+I NGG+DTE+ YPY G DG+C + +N V ++ ++ E L+ AV +
Sbjct: 221 AFEFIVKNGGIDTEDDYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAH-Q 279
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
PVSVA E F+ Y+SGV++ +CG +++H VVAVGYG E+G YW+++NSWG +
Sbjct: 280 PVSVAIEAGGRAFQLYESGVFTG-QCGT---ELDHGVVAVGYGSENGKDYWIVRNSWGPD 335
Query: 287 WGDHGYFKME-----MGKNMCGIATCASYPV 312
WG+ GY ++E CGIA ASYP
Sbjct: 336 WGESGYIRLERNVASTSTGKCGIAMQASYPT 366
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/309 (38%), Positives = 160/309 (51%), Gaps = 64/309 (20%)
Query: 62 FARFARRYGKIYESV-EEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNI----------- 109
+ ++ ++GK++ ++ E + RF F NL I N + L YRLGLN+
Sbjct: 41 YDQWRAKHGKLHNNLGAEPENRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRS 100
Query: 110 ----------------------------------------SPVKDQGHCGSCWTFSTTGS 129
+PVKDQG CGSCW FST S
Sbjct: 101 RYLGGKFASGSRRNRTSNRYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVAS 160
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
+EA G I+LSEQ+LVDC +++ N+GCNGGL AFE+I NGGLDTEE YPY G
Sbjct: 161 VEAINQIVTGDLIALSEQELVDCDRSY-NEGCNGGLMDYAFEFIIENGGLDTEEDYPYYG 219
Query: 190 KDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFE-VVDGFRFYKSGVY 247
D C +N V +DS ++ + E LQ AV + VSVA E F+ Y+SG++
Sbjct: 220 FDSSCIQYKKNAKVVAIDSYEDVPVNNEKALQKAVS-KQVVSVAIEGGGRSFQLYQSGIF 278
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCG 303
+ +CG D++H V VGYG E GV YW+++NSWG +WG+ GY KM+ +CG
Sbjct: 279 TG-RCG---TDLDHGVNVVGYGSEGGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCG 334
Query: 304 IATCASYPV 312
IA SYP
Sbjct: 335 IAMEPSYPT 343
>gi|170292465|pdb|3BC3|A Chain A, Exploring Inhibitor Binding At The S Subsites Of Cathepsin
L
gi|170292466|pdb|3BC3|B Chain B, Exploring Inhibitor Binding At The S Subsites Of Cathepsin
L
gi|261824911|pdb|3H8C|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors (Compound 14)
gi|261824912|pdb|3H8C|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors (Compound 14)
Length = 220
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGS W FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 3 RSVDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 63 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKA 121
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 122 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 179
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 180 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 111/331 (33%), Positives = 161/331 (48%), Gaps = 65/331 (19%)
Query: 43 RDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRS------- 95
+++ +LQ Q + + F F +Y K+Y + EE ++RF F NL+LI
Sbjct: 712 QNYSQKMLQQSRQLKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMG 771
Query: 96 ------------TNCKGLSYRLGLN--------------------------------ISP 111
T + + LGL ++P
Sbjct: 772 TGRYGVTQFTDLTKAEFKARHLGLKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTP 831
Query: 112 VKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFE 171
VKDQG CGSCW FS TG++E Y G+ +SLSEQ+LVDC + + GCNGGLP A+
Sbjct: 832 VKDQGSCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDCDKL--DSGCNGGLPDTAYR 889
Query: 172 YIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR--PV 229
I+ GGL+ E YPY +D C F+ V V ++ +NIT +E Q A LV+ P+
Sbjct: 890 AIEELGGLELESDYPYDAEDEKCHFNKNKVKVNIVSGLNIT---SNETQMAQWLVKNGPM 946
Query: 230 SVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV------EDGVPYWLIKNSW 283
S+ + +FY GV K +P ++H V+ VGYGV + +PYW+IKNSW
Sbjct: 947 SIGIN-ANAMQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVKFYPIFKKTMPYWIIKNSW 1005
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
G WG+ GY+++ G CG+ + VVA
Sbjct: 1006 GPRWGEQGYYRVYRGDGTCGVNKMVTSAVVA 1036
>gi|6448469|dbj|BAA86911.1| homologue of Sarcophaga 26,29kDa proteinase [Periplaneta americana]
Length = 552
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 91/224 (40%), Positives = 135/224 (60%), Gaps = 2/224 (0%)
Query: 89 NLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
NLD I L +R+ ++PVKDQ CGSCW+F TTG++E AY +G + LS+Q
Sbjct: 326 NLDAIMDQIPDDLDWRIYGAVTPVKDQSVCGSCWSFGTTGTIEGAYFLKYGHLVRLSQQA 385
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKDGVCKFSSENVGVQVLD 207
L+DC+ + N GC+GG +++E++ +GG+ E+ Y Y G+DG C + + ++
Sbjct: 386 LIDCSWGYGNNGCDGGEDFRSYEWMMKHGGIPLEDEYGGYLGQDGYCHVENVTLTAKITG 445
Query: 208 SVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAV 266
VN+T G D L+ A+ P+SVA + F FY +G+Y +CGN ++HAV+ V
Sbjct: 446 YVNVTSGDIDALKVALAKHGPISVAIDASHKTFSFYSNGIYYDPECGNKLDQLDHAVLLV 505
Query: 267 GYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
GYG+ +G PYWL+KNSW WG+ GY M N CG+AT +Y
Sbjct: 506 GYGIINGNPYWLVKNSWSNYWGNDGYILMSPKDNNCGVATDPTY 549
>gi|395844675|ref|XP_003795081.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 131/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS+TG+LE + GK ISLSEQ LVDC+Q N
Sbjct: 116 KSVDWRKKGYVTPVKNQGQCGSCWAFSSTGALEGQMFRKTGKLISLSEQNLVDCSQRQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GGL + AF Y+K NGGLD+E +YPY +D CK+ E VNI E
Sbjct: 176 HGCSGGLMNFAFNYVKENGGLDSEVSYPYVARDEKCKYKPEYSVANDTGFVNIPT-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV ++ P+S+A + +FYKSG+Y C + +D H V+ +GYG E D
Sbjct: 235 LMKAVAIIGPISIAIDASHISIQFYKSGIYYEPNCSSKNLD--HGVLLIGYGFEGTDSDD 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W IKNSWG WG G K+ K N CGIA+ ASYP V
Sbjct: 293 NKFWFIKNSWGIEWGLDGCIKIAKDKNNHCGIASAASYPTV 333
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/206 (48%), Positives = 128/206 (62%), Gaps = 11/206 (5%)
Query: 112 VKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFE 171
VKDQG CGSCW FST ++E G ISLSEQ+LVDC ++N +GCNGGL AFE
Sbjct: 149 VKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFE 207
Query: 172 YIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVS 230
+I NGG+DTEE YPY +DG C +N V +D ++ + E LQ AV +PVS
Sbjct: 208 FIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVA-NQPVS 266
Query: 231 VAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGD 289
VA E F+FY+SGV++ GN ++H V AVGYG E+ V YW++KNSWG +WG+
Sbjct: 267 VAIEASGMAFQFYESGVFT----GNCGTALDHGVTAVGYGTENSVDYWIVKNSWGSSWGE 322
Query: 290 HGYFKMEM---GKNMCGIATCASYPV 312
GY +ME CGIA SYP+
Sbjct: 323 SGYIRMERNTGATGKCGIAVEPSYPI 348
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 146/290 (50%), Gaps = 61/290 (21%)
Query: 78 EMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------------------------- 108
E RF F NL I N K LSY+LGL
Sbjct: 70 EKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGAKPTKRVLKTSDR 129
Query: 109 --------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQ 148
++ VKDQG CGSCW FST G++E G ISLSEQ+
Sbjct: 130 YQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQE 189
Query: 149 LVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS 208
LVDC ++N QGCNGGL AFE+I NGG+DTE YPY DG C + +N V +DS
Sbjct: 190 LVDCDTSYN-QGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDS 248
Query: 209 V-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAV 266
++ +E L+ A+ +P+SVA E F+ Y SGV+ CG +++H VVAV
Sbjct: 249 YEDVPENSEASLKKALAH-QPISVAIEAGGRAFQLYSSGVFDGL-CGT---ELDHGVVAV 303
Query: 267 GYGVEDGVPYWLIKNSWGENWGDHGYFKM----EMGKNMCGIATCASYPV 312
GYG E+G YW+++NSWG WG+ GY KM E CGIA ASYP+
Sbjct: 304 GYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGKCGIAMEASYPI 353
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 112/305 (36%), Positives = 149/305 (48%), Gaps = 59/305 (19%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC---KGL-SYRLGLN--------- 108
F F ++GK Y++ E RFA F +NL I + N +G+ SY G+N
Sbjct: 26 FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85
Query: 109 ----------------------------------------ISPVKDQGHCGSCWTFSTTG 128
++P+KDQ CGSCW F+ G
Sbjct: 86 FKAMLATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVG 145
Query: 129 SLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
S E AY + GK SEQQLVDC N GC+GG F YI+ NG L+ E YPYT
Sbjct: 146 STEGAYALSTGKLTRFSEQQLVDCTTDLN-YGCDGGYLDDTFPYIQTNG-LELESDYPYT 203
Query: 189 GKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYS 248
G DG C + S V +V V++ E L AVG PV++A D +FY SG+
Sbjct: 204 GYDGYCSYESSKVVTKVSSYVSVP-ANEQALLEAVGTAGPVAIAINA-DDLQFYFSGIID 261
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCA 308
C P ++H V+AVGY E+G YWLIKNSWG +WG+ GYF+ G+N+CG+ A
Sbjct: 262 DKYC--DPEYLDHGVLAVGYDSENGRDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDA 319
Query: 309 SYPVV 313
YP++
Sbjct: 320 VYPLI 324
>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
Length = 335
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 130/220 (59%), Gaps = 10/220 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS G+LE + GK +SLSEQ LVDC+ + NQG
Sbjct: 119 VDWRQKGYVTPVKNQGQCGSCWAFSANGALEGQMFRKTGKLVSLSEQNLVDCSHSQGNQG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDEL 219
CNGGL AF+Y+K N GLD+EE+YPY G++ C + E V+I E L
Sbjct: 179 CNGGLMDNAFQYVKDNKGLDSEESYPYLGRESNTCNYRPEYSAANDTGFVDIPQ-HERGL 237
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGV 274
AV V P+SVA + F+FY G+Y C + D++H V+ VGYG E D
Sbjct: 238 MKAVATVGPISVAIDAGHSSFQFYSEGIYYEPNC--SSKDLDHGVLVVGYGSEGAQSDSN 295
Query: 275 PYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG GY KM + N CGIAT ASYP V
Sbjct: 296 KFWIVKNSWGTGWGMSGYVKMARDQSNHCGIATAASYPTV 335
>gi|344275472|ref|XP_003409536.1| PREDICTED: cathepsin S-like isoform 2 [Loxodonta africana]
Length = 281
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 103/257 (40%), Positives = 145/257 (56%), Gaps = 14/257 (5%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLG-----LNISPVKDQGHCG 119
+ + Y K Y+ E R + KNL + N L + +G L+++ + D G CG
Sbjct: 31 WKKTYSKQYKEKNEEVARRLIWEKNLKFVMLHN---LEHSMGMHSYDLSMNHLGDMGSCG 87
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA-QAFNNQGCNGGLPSQAFEYIKYNGG 178
+CW FS G+LEA GK +SLS Q LVDC+ + ++N+GCNGG ++AF+YI N G
Sbjct: 88 ACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNNG 147
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-D 237
+D+E +YPY DG C++ +N + G+ED L+ AV PVSV +
Sbjct: 148 IDSEASYPYKATDGKCQYDPKNRAATCSKYTELPYGSEDALKEAVANKGPVSVGIDASRP 207
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM 297
F YKSGVY C + +VNH V+ VGYG +G YWL+KNSWG N+G+ GY +M
Sbjct: 208 SFFLYKSGVYYDPSCTD---NVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGEQGYIRMAR 264
Query: 298 GK-NMCGIATCASYPVV 313
N CGIA+ SYP +
Sbjct: 265 NSGNHCGIASFPSYPEI 281
>gi|84660244|emb|CAI43319.1| silicatein alpha [Lubomirskia baicalensis]
gi|85677148|emb|CAI46306.1| silicatein alpha [Lubomirskia baicalensis]
gi|220675708|emb|CAP69653.1| silcatein [Lubomirskia baicalensis]
Length = 326
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 137/218 (62%), Gaps = 4/218 (1%)
Query: 98 CKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFN 157
+ + +R ++ VK QG CG+ + F+ TG+LE A A K ++LSEQ ++DC+ +
Sbjct: 111 AESIDWRTKGAVTSVKYQGQCGASYAFAATGALEGASALANDKQVTLSEQNIIDCSVPYG 170
Query: 158 NQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAED 217
N GC+GG AF+Y+ NGG+DTE +Y + GK C+++++ G V+I G+E
Sbjct: 171 NHGCSGGDTYTAFKYVIDNGGIDTESSYSFKGKQSSCQYNNKTSGASATGVVSIGYGSES 230
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
+L AV V PV+VA + + FRFY+SGV+ S+ C +T + NHA++ GYG +G Y
Sbjct: 231 DLLAAVATVGPVAVAVDANTNAFRFYQSGVFDSSSCSSTKL--NHAMLVTGYGSYNGKDY 288
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSW +NWGD GY M K N CGIA+ A YP++
Sbjct: 289 WLVKNSWSKNWGDSGYILMVRNKYNQCGIASDALYPML 326
>gi|410911058|ref|XP_003969007.1| PREDICTED: counting factor associated protein D-like [Takifugu
rubripes]
Length = 549
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 150/304 (49%), Gaps = 51/304 (16%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F F ++ + YE +E +R F NL I S N GLSY L LN
Sbjct: 246 FGHFKEKFQRRYEDDKEHDIRQQAFIHNLRYIHSKNRAGLSYTLALNSLSDRTMSELGTM 305
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F+TTG++E
Sbjct: 306 RGKKQRKTPNRGLPFPLKLYENVQVPDSLDWRLYGAVTPVKDQAICGSCWSFATTGTIEG 365
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
A G LS+Q L+DC+ F N C+GG +++E+I +GG+ E Y PY G +
Sbjct: 366 ALFLKTGFLQVLSQQILMDCSWGFGNNACDGGEEWRSYEWIMKHGGIALAETYGPYMGMN 425
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSST 250
G C +S + Q+ N+T G L+ A+ PV+V+ + F FY GVY
Sbjct: 426 GFCHVNSSELVAQIQSYTNVTSGDAMALKLALFKHGPVAVSIDASHRSFVFYSHGVYYEP 485
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
CG+T D++HAV+AVGYG +G PYWLIKNSW WG+ GY M M N CG+AT A++
Sbjct: 486 ACGSTIDDLDHAVLAVGYGNLNGEPYWLIKNSWSTYWGNDGYILMSMKDNNCGVATDATF 545
Query: 311 PVVA 314
+A
Sbjct: 546 VTLA 549
>gi|390363592|ref|XP_790934.3| PREDICTED: counting factor associated protein D-like
[Strongylocentrotus purpuratus]
Length = 560
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 154/311 (49%), Gaps = 54/311 (17%)
Query: 53 IGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---- 108
+G H L F + ++Y K Y++ E R F+KN+ +I S N L Y L +N
Sbjct: 245 MGDRFHQL-FDEYKQKYDKTYKTDVEHVQRKGHFTKNVRMIHSINRANLGYVLDINHMAD 303
Query: 109 ---------------------------------------------ISPVKDQGHCGSCWT 123
+SPVKDQ CGSCW+
Sbjct: 304 QSHQELKRMRGRLRQTRPNNGLPYDGSDISDDAVPDHIDWNVRGAVSPVKDQAVCGSCWS 363
Query: 124 FSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEE 183
F + ++E A GK + LS+Q L+DC A N GC+GG + +E++ NGG+ EE
Sbjct: 364 FGSAETIEGAVFMQSGKRVRLSQQMLMDCTWAAGNNGCDGGEEWRVYEWLMKNGGIPLEE 423
Query: 184 AY-PYTGKDGVCKFSSENVGVQVLDS-VNITLGAEDELQHAVGLVRPVSVAFE-VVDGFR 240
Y PY G++G+C + V + N+T G + +L+ A+ P++V + V F
Sbjct: 424 TYGPYLGQNGMCHYGKSTPAVASIKKYYNVTSGNQKDLKKALATKGPIAVGIDAAVPSFS 483
Query: 241 FYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMGK 299
FY G Y CGNT D++HAV+AVGYG + G YWLIKNSW +WG++GY + M
Sbjct: 484 FYSYGTYYDASCGNTVDDLDHAVLAVGYGTDSSGQDYWLIKNSWSTHWGNNGYVAISMKD 543
Query: 300 NMCGIATCASY 310
N CG+AT A+Y
Sbjct: 544 NNCGVATAATY 554
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 113/307 (36%), Positives = 156/307 (50%), Gaps = 63/307 (20%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRS-TNCKGLSYRLGLN-------------- 108
++ RYGK+Y+ +E + RF F +N++ I + N SY+LG+N
Sbjct: 41 QWMTRYGKVYKDPQEREKRFRVFKENVNYIEAFNNAANKSYKLGINQFADLTNKEFIAPR 100
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++P+KDQG CG CW FS + E
Sbjct: 101 NGFKGHMCSSIIRTTTFKFENVTATPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEG 160
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
+ + GK ISLSEQ+LVDC +QGC GGL AF++I N GL+TE YPY G DG
Sbjct: 161 IHALSAGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEANYPYKGVDG 220
Query: 193 VCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C + + ++ E LQ AV +PVSVA + F+FYKSGV++ +
Sbjct: 221 KCNANEAAKNAATITGYEDVPANNEMALQKAVA-NQPVSVAIDASGSDFQFYKSGVFTGS 279
Query: 251 KCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIA 305
CG +++H V AVGYGV +DG YWL+KNSWG WG+ GY +M+ G + +CGIA
Sbjct: 280 -CGT---ELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIA 335
Query: 306 TCASYPV 312
ASYP
Sbjct: 336 MQASYPT 342
>gi|148283737|gb|ABN50361.2| cathepsin L [Fasciola hepatica]
Length = 326
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/255 (39%), Positives = 142/255 (55%), Gaps = 10/255 (3%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKN-LDLIRSTNCKGLSYRLGLNISPVKDQGHC 118
L+F F +Y E+ R + N L + S + + Y ++ VK+QG C
Sbjct: 75 LTFEEFKAKYLIEIPRSSELLSRGIPYKANKLAVPESIDWRDYYY-----VTEVKNQGQC 129
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCW FSTTG++E + + S SEQQLV+C + F N GC GG A+EY+K+N G
Sbjct: 130 GSCWAFSTTGAVEGQFRKNERASASFSEQQLVNCTRDFGNYGCGGGYVENAYEYLKHN-G 188
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG 238
L+TE YPY +G C++ +V + G E EL++ VG P +VA +
Sbjct: 189 LETESYYPYQAVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEGPAAVALDADSD 248
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F Y+SG+Y S C P + HAV+AVGYG +DG YW++KNSWG WG+ GY +
Sbjct: 249 FMMYQSGIYQSQTC--LPDRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWGEDGYIRFARN 306
Query: 299 K-NMCGIATCASYPV 312
+ NMCGIA+ AS P+
Sbjct: 307 RGNMCGIASLASVPI 321
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 115/310 (37%), Positives = 157/310 (50%), Gaps = 64/310 (20%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
+ F ++G+ Y EE R F++N+ LI N KG +Y LG+N
Sbjct: 19 WEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSKT 78
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVK+QG CGSCW+FSTTGSL
Sbjct: 79 YMGFKKPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTTGSL 138
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 190
E A + GK +SLSEQQ VDCA + NQGCNGGL AF+Y + N L TE++YPY G
Sbjct: 139 EGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEAN-ALCTEQSYPYKGT 197
Query: 191 DGVCKFSSENVGV---QVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGV 246
DG C+ SS + G+ V +++ +E ++ AV +PVS+A E F+ Y GV
Sbjct: 198 DGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQ-QPVSIAIEADKSVFQLYSGGV 256
Query: 247 YSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK---NMCG 303
+ CG + ++H V+AVGYG G YW +KNSWG WG GY ++ GK CG
Sbjct: 257 LTGA-CGAS---LDHGVLAVGYGTLSGTDYWKVKNSWGSTWGMSGYVLLQRGKGGSGECG 312
Query: 304 IATCASYPVV 313
+ + SYP V
Sbjct: 313 LLSEPSYPQV 322
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 104/275 (37%), Positives = 154/275 (56%), Gaps = 25/275 (9%)
Query: 54 GQARHALSFARFA----RRYGKIYESVEE-------MKLRFATFSKNLDLIRSTNCKGLS 102
G+ ++ L +FA + + +Y + + K A SK + R + +
Sbjct: 97 GKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVD 156
Query: 103 YRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCN 162
+R ++PVK+QG CG CW FS G++E G +SLSEQQ++DC ++ NQGCN
Sbjct: 157 WRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCN 216
Query: 163 GGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHA 222
GG AF+Y+ NGG+ TE+AYPY+ G C+ + ++ G E+ L +A
Sbjct: 217 GGYMDNAFQYVINNGGVTTEDAYPYSAVQGTCQNVQPAATISGFQ--DLPSGDENALANA 274
Query: 223 VGLVRPVSVAFEVVDG----FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVED-GVPYW 277
V +PVSV VDG F+FY+ G+Y CG D+NHAV A+GYG +D G YW
Sbjct: 275 VA-NQPVSVG---VDGGSSPFQFYQGGIYDGDGCGT---DMNHAVTAIGYGADDQGTQYW 327
Query: 278 LIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPV 312
++KNSWG WG++G+ +++MG CGI+T ASYP
Sbjct: 328 ILKNSWGTGWGENGFMQLQMGVGACGISTMASYPT 362
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 118/332 (35%), Positives = 161/332 (48%), Gaps = 67/332 (20%)
Query: 41 GLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG 100
G R E + +V +A + L A R Y + E E RF F NL + + N +
Sbjct: 42 GARGLERTEPEV--RAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERA 99
Query: 101 --LSYRLGLN-------------------------------------------------- 108
+RLG+N
Sbjct: 100 GARGFRLGMNQFADLTNDEFRAAYLGAMVPAARRGAVVGERYRHDGAAEELPESVDWREK 159
Query: 109 --ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLP 166
++PVK+QG CGSCW FS S+E+ G+ ++LSEQ+LV+C+ N GCNGGL
Sbjct: 160 GAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLM 219
Query: 167 SQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGL 225
AF++I NGG+DTE+ YPY DG C + +N V +D ++ E LQ AV
Sbjct: 220 DAAFDFIIKNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAH 279
Query: 226 VRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWG 284
+PVSVA E F+ YKSGV+S G+ +++H VVAVGYG E+G YW+++NSWG
Sbjct: 280 -QPVSVAIEAGGREFQLYKSGVFS----GSCTTNLDHGVVAVGYGAENGKDYWIVRNSWG 334
Query: 285 ENWGDHGYFKMEMGKNM----CGIATCASYPV 312
WG+ GY +ME N CGIA ASYP
Sbjct: 335 PKWGEAGYIRMERNVNASTGKCGIAMMASYPT 366
>gi|66816665|ref|XP_642342.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
gi|60470393|gb|EAL68373.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
Length = 337
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 117/328 (35%), Positives = 158/328 (48%), Gaps = 74/328 (22%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
Q + ++++ +F + K Y S E R+ F N D I N KG LGLN
Sbjct: 19 QELSESQYRDAFTDWMISNQKSYSS-SEFITRYNIFKTNFDYIEEWNSKGSETVLGLNKM 77
Query: 109 ----------------------------------------------ISPVKDQGHCGSCW 122
++ VK+Q C CW
Sbjct: 78 ADITNEEYRSLYLGKPFDASSLIGTKEEILFSNKFSSTVDWRKKGAVTHVKNQQSCSGCW 137
Query: 123 TFSTTGSLEAAYHQAFGKG----ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
+FS TG+ E A H+ G +SLSEQ L+DC+ F N GCNGG+ + AFEYI NGG
Sbjct: 138 SFSATGATEGA-HKLANNGTNELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYIISNGG 196
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-D 237
+DTE++YP+ G DG C++ SEN G + VN+T G+E L+ AV V PV+ + +
Sbjct: 197 IDTEKSYPFEGTDGTCRYKSENSGATISSYVNVTFGSESSLESAVN-VNPVACSIDASHS 255
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVP-----------YWLIKNSWGEN 286
F FYKSG+Y C T +D H V+ VGYG E+ YW+ KNSWG N
Sbjct: 256 SFLFYKSGIYFEPACSRTNLD--HGVLVVGYGTENSQSQDSSSEPNHSNYWIAKNSWGIN 313
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
GY M + NMCGI+T AS+P+V
Sbjct: 314 ----GYILMSKDRDNMCGISTLASFPIV 337
>gi|340370388|ref|XP_003383728.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 398
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 125/215 (58%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++P+KDQG CGSCW FS GSLE + G +SLSEQQLVDC + N G
Sbjct: 187 VDWREKGAVTPIKDQGQCGSCWAFSAIGSLEGQHFINTGNLVSLSEQQLVDC--SLKNDG 244
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG+ S AF+YI+ G ++E YPYT K+G C++ +V + G ED L
Sbjct: 245 CNGGMLSTAFKYIESVAGEESETDYPYTAKNGTCQYDPSKAVAKVTGYTALPSGDEDSLN 304
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV P+SV + F+ Y GVY C +D H V+ VGYG ED YWL+
Sbjct: 305 DAVTSKGPISVCIDASHKSFQLYSEGVYYEKSCSYFLLD--HCVLVVGYGTEDTADYWLV 362
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
KNSWG +WG GY +M KN CGIAT A+YP+V
Sbjct: 363 KNSWGTSWGMKGYIRMSRNRKNNCGIATNAAYPLV 397
Score = 43.9 bits (102), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 59 ALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC-KGLSYRLGLNISPVKDQGH 117
++ FA + ++++ A + N L+ N + +R ++PV QG
Sbjct: 72 TVAMNEFADLDADAFSKLKKIPSHPAQANNNKVLLTGGNVPNSIDWRKKGAVTPVSSQGQ 131
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFN 157
CG W + GS+E+ Y G + LS QQ++DCA N
Sbjct: 132 CG-VWPWPIVGSVESQYFIKTGTLVPLSVQQILDCANITN 170
>gi|327239614|gb|AEA39651.1| cathepsin H [Epinephelus coioides]
Length = 261
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 96/168 (57%), Positives = 112/168 (66%)
Query: 104 RLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+ G ++ VK+QG CGSCWTFSTTG LE+ GK + LSEQQLVDCAQAFNN GCNG
Sbjct: 94 KKGNYVTDVKNQGGCGSCWTFSTTGCLESVIAINKGKLVPLSEQQLVDCAQAFNNHGCNG 153
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYI YN GL TE+ YPYT +G C ++ E V + VNIT E + AV
Sbjct: 154 GLPSQAFEYILYNKGLMTEDDYPYTSFEGTCVYNPERAAAFVNEVVNITAYDEMGMVDAV 213
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE 271
PVS+AFEV F Y GVY+ST+C VNHAV+AVGYG E
Sbjct: 214 ATRNPVSLAFEVTSDFMHYSQGVYTSTECHQNTNKVNHAVLAVGYGQE 261
>gi|313103779|pdb|3KSE|A Chain A, Unreduced Cathepsin L In Complex With Stefin A
gi|313103780|pdb|3KSE|B Chain B, Unreduced Cathepsin L In Complex With Stefin A
gi|313103781|pdb|3KSE|C Chain C, Unreduced Cathepsin L In Complex With Stefin A
Length = 220
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGS W FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 3 RSVDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 63 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDAGFVDIPK-QEKA 121
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 122 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDD 179
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 180 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 121/353 (34%), Positives = 169/353 (47%), Gaps = 79/353 (22%)
Query: 18 AAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVE 77
A A SAS + F ++SS LR+ + +++++ + + + + Y ++
Sbjct: 14 AMAGSASRADF------SIISSKDLRE-DDAIMEL---------YELWLAEHKRAYNGLD 57
Query: 78 EMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------------------------- 108
E + RF+ F N I N SY+LGLN
Sbjct: 58 EKQKRFSVFKDNFLYIHEHNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSRPP 117
Query: 109 -----------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLS 145
++ VKDQG CGSCW FST ++E G ISLS
Sbjct: 118 SRRYQYSDGEDLPESIDWREKGAVTSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLS 177
Query: 146 EQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQV 205
EQ+LVDC ++N QGCNGGL AFE+I NGGLD+EE YPYT DG C +N V
Sbjct: 178 EQELVDCDTSYN-QGCNGGLMDYAFEFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVT 236
Query: 206 LDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVV 264
+D +++ +P+SVA E F+FY SGV++ST CG ++H V
Sbjct: 237 IDDYEDVPENDEKSLKKAAANQPISVAIEASGREFQFYDSGVFTST-CGTQ---LDHGVT 292
Query: 265 AVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
VGYG E G YW +KNSWG++WG+ G+ +++ MCGIA ASYPV
Sbjct: 293 LVGYGSESGTDYWTVKNSWGKSWGEEGFIRLQRNIEVASTGMCGIAMEASYPV 345
>gi|225707912|gb|ACO09802.1| Cathepsin K precursor [Osmerus mordax]
Length = 331
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 128/218 (58%), Gaps = 7/218 (3%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + YR ++PVK+QG CGSCW FS+ G+LE + GK + LS Q LVDC N
Sbjct: 118 KSIDYRRKGMVTPVKNQGSCGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCVTE--N 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC GG + AF Y++ N G+D+E AYPY G+D C ++ + I G E
Sbjct: 176 NGCGGGYMTNAFNYVRDNQGIDSEAAYPYIGQDETCAYNVSGMTASCRGYKEIPEGNERA 235
Query: 219 LQHAVGLVRPVSVAFE-VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
L AV V PVSV + + F+FY+ GVY C D+NHAV+AVGYGV G Y
Sbjct: 236 LTVAVAKVGPVSVGIDATLSTFQFYQKGVYYDRNCNKD--DINHAVLAVGYGVTPKGKKY 293
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
W++KNSW E+WG+ GY M + N+CGIA ASYP++
Sbjct: 294 WIVKNSWSESWGNKGYILMARNRGNLCGIANLASYPIM 331
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 134/219 (61%), Gaps = 13/219 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++P+KDQG CGSCW FST G++E G SLSEQ+LVDC + +N G
Sbjct: 143 VDWREKGAVTPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYN-MG 201
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AFE+I NGG+DTEE YPY KD C + +N V +D ++ E L
Sbjct: 202 CNGGLMDYAFEFIVQNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSL 261
Query: 220 QHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
AV +PVSVA E F+ Y+SGV++ +CG +++H VVAVGYG E+G YWL
Sbjct: 262 MKAVA-NQPVSVAIEAGGMEFQLYQSGVFTG-RCG---TNLDHGVVAVGYGTENGTDYWL 316
Query: 279 IKNSWGENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
++NSWG WG++GY K+E CGIA ASYP+
Sbjct: 317 VRNSWGSAWGENGYIKLERNVQNTETGKCGIAIEASYPI 355
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 154/307 (50%), Gaps = 63/307 (20%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRS-TNCKGLSYRLGLN-------------- 108
++ RYGK+Y+ +E + RF F +N++ I + N Y+L +N
Sbjct: 41 QWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPR 100
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++P+KDQG CG CW FS + E
Sbjct: 101 NRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEG 160
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
+ GK ISLSEQ+LVDC +QGC GGL AF+++ N GL+TE YPY G DG
Sbjct: 161 IHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDG 220
Query: 193 VCKFS-SENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C + + N + ++ E LQ AV +PVSVA + F+FYKSGV++ +
Sbjct: 221 KCNVNEAANDAATITGYEDVPANNEKALQKAVA-NQPVSVAIDASGSDFQFYKSGVFTGS 279
Query: 251 KCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMGKN----MCGIA 305
CG +++H V AVGYGV DG YWL+KNSWG WG+ GY +M+ G N +CGIA
Sbjct: 280 -CGT---ELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCGIA 335
Query: 306 TCASYPV 312
ASYP
Sbjct: 336 MQASYPT 342
>gi|226821419|gb|ACO82385.1| cathepsin K [Lutjanus argentimaculatus]
Length = 330
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/232 (43%), Positives = 137/232 (59%), Gaps = 7/232 (3%)
Query: 85 TFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISL 144
+F+ LD + K + YR ++ VK+QG CGSCW FS+ G+LE + G+ + L
Sbjct: 103 SFTMALDDDVNRLPKYIDYRKKGMVTSVKNQGSCGSCWAFSSAGALEGQLAKKTGQLVDL 162
Query: 145 SEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQ 204
S Q LVDC N GC GG ++AF+Y+ NGG+D+EEAYPY G+D C++++ + Q
Sbjct: 163 SPQNLVDCVT--ENDGCGGGYMTKAFQYVADNGGIDSEEAYPYIGEDQPCRYNATGMAAQ 220
Query: 205 VLDSVNITLGAEDELQHAVGLVRPVSVAFE-VVDGFRFYKSGVYSSTKCGNTPMDVNHAV 263
I G E L A+ PVSV + + F+FY GVY C D+NHAV
Sbjct: 221 CKGYKEIPEGNEHALAVALFKAGPVSVGIDATLSSFQFYSKGVYYDPSCNKE--DINHAV 278
Query: 264 VAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+AVGYGV G YW++KNSWGE+WG GY M + N+CGIA ASYP++
Sbjct: 279 LAVGYGVTGKGKKYWIVKNSWGESWGKGGYILMARNRGNLCGIANLASYPIM 330
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 111/308 (36%), Positives = 156/308 (50%), Gaps = 64/308 (20%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNC--KGLSYRLGLN------------- 108
R+ YGK+Y+ +E + RF F++N+ I + N SY+LG+N
Sbjct: 41 RWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEEFVAS 100
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
++PVK+QG CG CW FS + E
Sbjct: 101 RNKFKGHMCSSIIRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATE 160
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+ + GK +SLSEQ+LVDC +QGC GGL AF++I N GL+TE YPY G D
Sbjct: 161 GIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVD 220
Query: 192 GVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSS 249
G C + ++ + ++ E LQ AV +P+SVA + F+FYKSGV++
Sbjct: 221 GTCNANKASIQATTITGYEDVPANNEQALQKAVA-NQPISVAIDASGSDFQFYKSGVFTG 279
Query: 250 TKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGI 304
+ CG +++H V AVGYGV DG YWL+KNSWG +WG+ GY M+ G + +CGI
Sbjct: 280 S-CG---TELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGI 335
Query: 305 ATCASYPV 312
A ASYP
Sbjct: 336 AMQASYPT 343
>gi|432117576|gb|ELK37815.1| Cathepsin L1 [Myotis davidii]
Length = 299
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 104/232 (44%), Positives = 131/232 (56%), Gaps = 30/232 (12%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FS TG+LE + GK +SLSEQ LVDC++A N+GC+GGL
Sbjct: 71 VTPVKDQGGCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCSGGLMDN 130
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y+K N GLDTEE+YPY G D CK+ E V+I E L AV V P
Sbjct: 131 AFQYVKDNEGLDTEESYPYYGTDDTCKYKPEFSAANDTGFVDIH-KDERSLMKAVASVGP 189
Query: 229 VSVAFEV-VDGFRFYKS---------------------GVYSSTKCGNTPMDVNHAVVAV 266
+SVA + ++ F+FY+ G+Y C + D+NH V+ V
Sbjct: 190 ISVALDASLESFQFYEKGKVTVSSYLEIFTPAMTSVFLGIYYDPDCSSE--DLNHGVLVV 247
Query: 267 GYGVE----DGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
GYG E D YW++KNSWG WG GY KM N CGIA+ ASYP V
Sbjct: 248 GYGFEGVEMDNNKYWIVKNSWGTKWGMDGYIKMAKDLDNHCGIASMASYPTV 299
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 113/304 (37%), Positives = 153/304 (50%), Gaps = 65/304 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------- 108
++GK Y ++ E + RF F NL I N + +Y++GLN
Sbjct: 59 KHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRT 118
Query: 109 ---------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYH 135
+ VKDQG CGSCW FST ++E
Sbjct: 119 AAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINK 178
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCK 195
G ISLSEQ+LVDC ++N +GCNGGL AFE+I NGG+D+EE YPY DG C
Sbjct: 179 IVTGGLISLSEQELVDCDTSYN-EGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCD 237
Query: 196 FSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCG 253
+N V +D ++ E L+ AV +PVSVA E F+ Y+SG+++ +CG
Sbjct: 238 QYRKNAXVVTIDGYEDVPENDEKSLEKAVA-NQPVSVAIEAGGREFQLYQSGIFTG-RCG 295
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-----GKNMCGIATCA 308
++H V AVGYG E+GV YW++KNSWG +WG+ GY +ME CGIA A
Sbjct: 296 TA---LDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEA 352
Query: 309 SYPV 312
SYP+
Sbjct: 353 SYPI 356
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 136/219 (62%), Gaps = 13/219 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG CGSCW FST GS+E G ISLSEQ+LVDC +A+N QG
Sbjct: 146 VDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGDLISLSEQELVDCDKAYN-QG 204
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AFE+I NGG+D+E YPY D +C + +N V +D ++ E+ L
Sbjct: 205 CNGGLMDYAFEFIIKNGGIDSEADYPYRASDNMCDSNRKNAHVVTIDGYEDVPENDEESL 264
Query: 220 QHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ AV +PVSVA E F+ Y+SGV++ +CG +++H VVAVGYG E+G+ YW+
Sbjct: 265 KKAVA-NQPVSVAIEAGGREFQLYQSGVFTG-RCGT---NLDHGVVAVGYGTENGIDYWI 319
Query: 279 IKNSWGENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
++NSWG WG+ GY +ME CGIA ASYP
Sbjct: 320 VRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYPT 358
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 117/334 (35%), Positives = 167/334 (50%), Gaps = 63/334 (18%)
Query: 36 LVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRS 95
LV GL F+ S + + H ++ RYGK+Y+ ++E + RF F +N+ I +
Sbjct: 14 LVLCLGLWAFQVSSRTLQDASMHE-RHEQWMARYGKVYKDLQEKEKRFNIFQENVKYIEA 72
Query: 96 TNCKG-LSYRLGLN---------------------------------------------- 108
+N G Y+LG+N
Sbjct: 73 SNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTTFKYENVTAPSTVDWRQ 132
Query: 109 ---ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++PVK+QG CG CW FS + E + + G +SLSEQ+LVDC + +QGC GGL
Sbjct: 133 EGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGL 192
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVG 224
AF++I NGGL+TE YPY G DG C + E V + ++ E LQ AV
Sbjct: 193 MDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNEQALQQAVA 252
Query: 225 LVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNS 282
+P+SVA + F+ Y+SGV++ + CG ++H V VGYGV +DG YWL+KNS
Sbjct: 253 -NQPISVAIDASGSDFQNYQSGVFTGS-CG---TQLDHGVAVVGYGVSDDGTKYWLVKNS 307
Query: 283 WGENWGDHGYFKM----EMGKNMCGIATCASYPV 312
WGE+WG+ GY +M E + +CGIA SYP
Sbjct: 308 WGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPT 341
>gi|443722452|gb|ELU11310.1| hypothetical protein CAPTEDRAFT_132308 [Capitella teleta]
Length = 235
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 103/217 (47%), Positives = 139/217 (64%), Gaps = 3/217 (1%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS+TGSLE + G+ S+SEQ LVDC++ N
Sbjct: 20 KTVDWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGN 79
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GGL AF YIK N G+D+E++YPY DG C++ + V+I G E
Sbjct: 80 MGCSGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGECRYKKSDSVTTDSGFVDIPHGDETA 139
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYW 277
L+ AV V PVSVA + F+FYK+GVY+ C +T +D + +V VGYGVE+G YW
Sbjct: 140 LRTAVASVGPVSVAIDASHTSFQFYKTGVYTEANCSSTQLD-HGVLVVVGYGVENGQDYW 198
Query: 278 LIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
L+KNSWG +WG+ GY KM N CGIA+ ASYP++
Sbjct: 199 LVKNSWGASWGEAGYIKMARNHGNQCGIASQASYPLL 235
>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
Length = 329
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 131/206 (63%), Gaps = 8/206 (3%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW+FS TGSLE Y GK +S SEQ+LVDC+ + N GC GGL
Sbjct: 127 VTPVKNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDY 186
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAE--DELQHAVGLV 226
AF+Y + N + E Y YT K+G CK++++ +GV DS + +E D L+ AV
Sbjct: 187 AFKYWETNLA-EKESDYTYTAKNGKCKYNAQ-LGV-TKDSSFTDIPSENCDALKEAVANK 243
Query: 227 RPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGE 285
P++VA + F+ Y SG+Y+ C T +D H V+ VGYG ++GV YWLIKNSWG
Sbjct: 244 GPIAVAMDASHTSFQMYHSGIYTPFLCSKTKLD--HGVLVVGYGTDNGVDYWLIKNSWGM 301
Query: 286 NWGDHGYFKMEMGKNMCGIATCASYP 311
WG GYFK+EM + CGI T ASYP
Sbjct: 302 AWGMDGYFKIEMKSDKCGICTQASYP 327
>gi|213623956|gb|AAI70449.1| LOC100127265 protein [Xenopus laevis]
Length = 331
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 129/215 (60%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++P+++QG CGSCW FS+ G+LE + GK + LS Q LVDC + N G
Sbjct: 121 IDYRKKGYVTPIRNQGSCGSCWAFSSVGALEGQLKKKKGKLVVLSPQNLVDCVK--KNDG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AFEY++ N G+D+E+AYPY G+D C ++ + G E L+
Sbjct: 179 CGGGYMTNAFEYVRDNKGIDSEKAYPYVGEDQECMYNVSGRAAACKGYKEVQEGNEKALK 238
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV LV PVSV + + F+FY GVY C + D+NHAV+AVGYG + YW++
Sbjct: 239 KAVALVGPVSVGIDAGLSSFQFYSKGVYYDKDC--SAEDINHAVLAVGYGTQKKAKYWIV 296
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWGE WGD GY M K N CGIA ASYPV+
Sbjct: 297 KNSWGEEWGDKGYILMAKDKGNACGIANLASYPVM 331
>gi|256535829|gb|ACU82389.1| cathepsin L 1 [Pheronema raphanus]
Length = 328
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 109/299 (36%), Positives = 151/299 (50%), Gaps = 60/299 (20%)
Query: 69 YGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------------------- 108
+ K+Y ++ E R + NL N +GLSY LG N
Sbjct: 36 HKKVYYTLIEENFRRLIWEDNLSTFNEMNSRGLSYTLGTNEFADMTSKEFVEIMNGYKPE 95
Query: 109 -------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQA 137
++PVK+QG CGSCW FS+TGSLE Y
Sbjct: 96 LRIDKLEDVNEVKNYSSIKLSDSVDWRSKGAVTPVKNQGQCGSCWAFSSTGSLEGQYFIN 155
Query: 138 FGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIK-YNGGLDTEEAYPYTGKDGVCKF 196
K +S SE +LVDC++ + N GC GGL AF Y + Y L+++ YPY KDG C++
Sbjct: 156 NDKLLSFSESELVDCSRRYGNNGCKGGLMDNAFRYWEVYKEELESD--YPYVAKDGPCRY 213
Query: 197 SSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGN 254
S++ GV + S N+ ++ LQ AV + P+SVA + F+ Y SGVYS ++C
Sbjct: 214 -SQDKGVTTISSYKNVPHFSQISLQDAVRTIGPISVAMDASHKSFQLYHSGVYSESECSQ 272
Query: 255 TPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
T +D H V+ VGYG P+WL+KNSWG WG GYF++ M NMCG+ T SYP++
Sbjct: 273 TKLD--HGVLVVGYGTS-SEPFWLVKNSWGAGWGMDGYFEIAMRNNMCGLETEPSYPIL 328
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 113/304 (37%), Positives = 153/304 (50%), Gaps = 65/304 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------- 108
++GK Y ++ E + RF F NL I N + +Y++GLN
Sbjct: 57 KHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRT 116
Query: 109 ---------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYH 135
+ VKDQG CGSCW FST ++E
Sbjct: 117 AAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINK 176
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCK 195
G ISLSEQ+LVDC ++N +GCNGGL AFE+I NGG+D+EE YPY DG C
Sbjct: 177 IVTGGLISLSEQELVDCDTSYN-EGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCD 235
Query: 196 FSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCG 253
+N V +D ++ E L+ AV +PVSVA E F+ Y+SG+++ +CG
Sbjct: 236 QYRKNAKVVTIDGYEDVPENDEKSLEKAVA-NQPVSVAIEAGGREFQLYQSGIFTG-RCG 293
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM-----GKNMCGIATCA 308
++H V AVGYG E+GV YW++KNSWG +WG+ GY +ME CGIA A
Sbjct: 294 TA---LDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEA 350
Query: 309 SYPV 312
SYP+
Sbjct: 351 SYPI 354
>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 164/313 (52%), Gaps = 67/313 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN--------- 108
+ ++ ++GK YE+ E+ LR A + KNL +I N + S++LG+N
Sbjct: 29 WHQWKAQHGKSYEANED-SLRRAIWEKNLKMIERHNQEYRAGKQSFQLGMNKFGDMTTEE 87
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++PVK+QG C SCW FS
Sbjct: 88 FQEAINFYNSSASQRRTKRYLHREPLLAQLPESVDWREEGYVTPVKNQGQCLSCWAFSAV 147
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G++E + + G+ +SLS Q LVDC + + C+GG +AF+Y++ NGG+DTEE YPY
Sbjct: 148 GAIEGQWFRKTGELVSLSIQNLVDCTTSDSISSCHGGFMDRAFQYVQDNGGIDTEECYPY 207
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG----FRFYK 243
G+ CK+ E G V+ V+I E L AV V P+SVA +DG F+FY+
Sbjct: 208 VGEVNECKYQPECSGANVVGFVDIPSMDERALMEAVATVGPISVA---IDGGNPSFKFYE 264
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGVE--DGVPYWLIKNSWGENWGDHGYFKMEMGK-N 300
SGVY +C ++ + NHA + VGYG E DG YW++KNSWGE WG++GY M + N
Sbjct: 265 SGVYYDPQCSSSQL--NHAGLVVGYGSEGIDGRKYWIVKNSWGELWGNNGYILMAKDEDN 322
Query: 301 MCGIATCASYPVV 313
CGIAT ASYP V
Sbjct: 323 HCGIATEASYPEV 335
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 115/311 (36%), Positives = 155/311 (49%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +G+ Y +V + R+ F NL I + N S+RLGLN
Sbjct: 44 YAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ VKDQG CG+CW FST
Sbjct: 104 YPATYLGARTRPQRDRKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAFSTI 163
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N QGCNGGL AFE+I NGG+DTE+ YPY
Sbjct: 164 AAVEGINQIVTGDLISLSEQELVDCDTSYN-QGCNGGLMDYAFEFIINNGGIDTEKDYPY 222
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSG 245
G DG C + +N V +DS ++ E LQ AV +PVSVA E F+ Y SG
Sbjct: 223 KGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVA-NQPVSVAIEAAGTAFQLYSSG 281
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ + CG ++H V AVGYG E+G YW++KNSWG +WG+ GY +ME
Sbjct: 282 IFTGS-CGTR---LDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGK 337
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 338 CGIAVEPSYPL 348
>gi|163914459|ref|NP_001106314.1| cathepsin K precursor [Xenopus laevis]
gi|159155477|gb|AAI54985.1| LOC100127265 protein [Xenopus laevis]
Length = 331
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 129/215 (60%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++P+++QG CGSCW FS+ G+LE + GK + LS Q LVDC + N G
Sbjct: 121 IDYRKKGYVTPIRNQGSCGSCWAFSSVGALEGQLKKKKGKLVVLSPQNLVDCVK--KNDG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AFEY++ N G+D+E+AYPY G+D C ++ + G E L+
Sbjct: 179 CGGGYMTNAFEYVRDNKGIDSEKAYPYVGEDQECMYNVSGRAAACKGYKEVQEGNEKALK 238
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV LV PVSV + + F+FY GVY C + D+NHAV+AVGYG + YW++
Sbjct: 239 KAVALVGPVSVGIDAGLSSFQFYSKGVYYDKDC--SAEDINHAVLAVGYGTQKKAKYWIV 296
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWGE WGD GY M K N CGIA ASYPV+
Sbjct: 297 KNSWGEEWGDKGYILMAKDKGNACGIANLASYPVM 331
>gi|114559420|ref|XP_001171183.1| PREDICTED: cathepsin S isoform 1 [Pan troglodytes]
gi|397492868|ref|XP_003817342.1| PREDICTED: cathepsin S isoform 2 [Pan paniscus]
Length = 281
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 105/256 (41%), Positives = 145/256 (56%), Gaps = 12/256 (4%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLNISPVKDQGHCGS 120
+ + YGK Y+ E +R + KNL + N + G+ SY LG+N + D G CG+
Sbjct: 31 WKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN--HLGDMGSCGA 88
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA-QAFNNQGCNGGLPSQAFEYIKYNGGL 179
CW FS G+LEA GK +SLS Q LVDC+ + + N+GCNGG + AF+YI N G+
Sbjct: 89 CWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGI 148
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-G 238
D++ +YPY D C++ S+ + G ED L+ AV PVSV + +
Sbjct: 149 DSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDALHPS 208
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG 298
F Y+SGVY C +VNH V+ VGYG +G YWL+KNSWG N+G+ GY +M
Sbjct: 209 FFLYRSGVYYEPSC---TQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARN 265
Query: 299 K-NMCGIATCASYPVV 313
K N CGIA+ SYP +
Sbjct: 266 KGNHCGIASFPSYPEI 281
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 105/257 (40%), Positives = 155/257 (60%), Gaps = 14/257 (5%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCG 119
L+ A F Y ++S R A K++D+ S+ L +R ++P+KDQG CG
Sbjct: 54 LTNAEFRANYVGKFKSPRYQDRRPA---KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCG 110
Query: 120 SCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGL 179
SCW FS S+E+A+ A + +SLSEQQL+DC +QGC GG P AF+++ NGG+
Sbjct: 111 SCWAFSAIASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGGV 168
Query: 180 DTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-G 238
TEEAYPYTG G C +++N V++ ++T + D L AV PV+V D
Sbjct: 169 TTEEAYPYTGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKT-PVTVGICGSDQN 226
Query: 239 FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEM- 297
F+ Y+SG+ S +C N+ +HAV+ +GYG E G+PYW+IKNSWG +WG++G+ K++
Sbjct: 227 FQNYRSGILSG-QCSNS---RDHAVLVIGYGTEGGMPYWIIKNSWGTSWGENGFMKIKKK 282
Query: 298 -GKNMCGIATCASYPVV 313
G+ MCG+ +SYP
Sbjct: 283 DGEGMCGMNGQSSYPTT 299
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 141/237 (59%), Gaps = 20/237 (8%)
Query: 81 LRFATFSKNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGK 140
++ F++ D ++ + +R ++PVK+QG CG CW FS G++E G
Sbjct: 140 FKYQNFTRLDDDVQ------VDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGN 193
Query: 141 GISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSEN 200
+SLSEQQ++DC ++ NQGCNGG AF+Y+ NGG+ TE+AYPY+ G C+
Sbjct: 194 LVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQPA 253
Query: 201 VGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDG----FRFYKSGVYSSTKCGNTP 256
+ ++ G E+ L +AV +PVSV VDG F+FY+ G+Y CG
Sbjct: 254 ATISGFQ--DLPSGDENALANAVA-NQPVSVG---VDGGSSPFQFYQGGIYDGDGCGT-- 305
Query: 257 MDVNHAVVAVGYGVED-GVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPV 312
D+NHAV A+GYG +D G YW++KNSWG WG++G+ +++MG CGI+T ASYP
Sbjct: 306 -DMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMASYPT 361
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 150/301 (49%), Gaps = 60/301 (19%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTN-CKGLSYRLGLN------------------ 108
R+G++Y E ++R+ F +N+ I S N G SY+LG+N
Sbjct: 45 RFGRVYNDGNEKEIRYKIFKENVQRIESFNKASGKSYKLGINQFADLTNEEFKTSRNRFK 104
Query: 109 ------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAF 138
++ +KDQG CGSCW FS ++E A
Sbjct: 105 GHMCSSQAGPFRYENLTAAPSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLAT 164
Query: 139 GKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSS 198
K ISLSEQ+LVDC +QGC GGL AF++I+ N GL TE YPY G DG C
Sbjct: 165 SKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQ 224
Query: 199 E-NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTP 256
E N ++ ++ E L AV +PVSVA + GF+FY SG+++ G+
Sbjct: 225 EANHAAKINGFEDVPANNEGALMKAVA-KQPVSVAIDAGGFGFQFYSSGIFT----GDCG 279
Query: 257 MDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
+++H V AVGYG +G+ YWL+KNSWG WG+ GY +M+ + +CGIA ASYP
Sbjct: 280 TELDHGVAAVGYGESNGMNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPT 339
Query: 313 V 313
Sbjct: 340 A 340
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 113/311 (36%), Positives = 150/311 (48%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +G Y ++ E + RF F NL I N S+RLGLN
Sbjct: 43 YAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEE 102
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
+ VKDQG CGSCW FS
Sbjct: 103 YRSTYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAI 162
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G I LSEQ+LVDC ++N QGCNGGL AFE+I NGG+D+EE YPY
Sbjct: 163 AAVEGINQIVTGDMIPLSEQELVDCDTSYN-QGCNGGLMDYAFEFIINNGGIDSEEDYPY 221
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSG 245
+D C + +N V +D ++ + +E LQ AV +P+SVA E F+ YKSG
Sbjct: 222 KERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVA-NQPISVAIEAGGRAFQLYKSG 280
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ T CG ++H V AVGYG E+G YWL++NSWG WG+ GY +ME
Sbjct: 281 IFTGT-CGTA---LDHGVAAVGYGTENGKDYWLVRNSWGSVWGEDGYIRMERNIKASSGK 336
Query: 302 CGIATCASYPV 312
CGIA SYP
Sbjct: 337 CGIAVEPSYPT 347
>gi|1149525|emb|CAA64218.1| preprocathepsin K [Mus musculus]
Length = 329
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 130/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVT--ENYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ NGG+D+E+A+PY G+D C +++ + I +G E L+
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAFPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SV+ + + F+FY GVY C +VNHAV+ VGYG + G +W+I
Sbjct: 237 RAVARVGPISVSIDASLASFQFYSRGVYYDENCDRD--NVNHAVLVVGYGTQKGSKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY + K N CGI AS+P
Sbjct: 295 KNSWGESWGNKGYALLARNKNNACGITNMASFP 327
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 155/307 (50%), Gaps = 63/307 (20%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRS-TNCKGLSYRLGLN-------------- 108
++ RYGK+Y+ +E + RF F +N++ I + N Y+L +N
Sbjct: 588 QWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPR 647
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++P+KDQG CG CW FS + E
Sbjct: 648 NRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEG 707
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
+ GK ISLSEQ+LVDC +QGC GGL AF+++ N GL+TE YPY G DG
Sbjct: 708 IHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDG 767
Query: 193 VCKFS-SENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
C + + N V + ++ E LQ AV +PVSVA + F+FYKSGV++ +
Sbjct: 768 KCNANEAANDVVTITGYEDVPANNEKALQKAVA-NQPVSVAIDASGSDFQFYKSGVFTGS 826
Query: 251 KCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIA 305
CG +++H V AVGYGV DG YWL+KNSWG WG+ GY +M+ G + +CGIA
Sbjct: 827 -CGT---ELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIA 882
Query: 306 TCASYPV 312
ASYP
Sbjct: 883 MQASYPT 889
>gi|213623960|gb|AAI70453.1| Hypothetical protein LOC100127265 [Xenopus laevis]
Length = 331
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 129/215 (60%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++P+++QG CGSCW FS+ G+LE + GK + LS Q LVDC + N G
Sbjct: 121 IDYRKKGYVTPIRNQGSCGSCWAFSSVGALEGQLKKKKGKLVVLSPQNLVDCVK--KNDG 178
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AFEY++ N G+D+E+AYPY G+D C ++ + G E L+
Sbjct: 179 CGGGYMTNAFEYVRDNKGIDSEKAYPYVGEDQECMYNVSGRAAACKGYKEVQEGNEKALK 238
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV LV PVSV + + F+FY GVY C + D+NHAV+AVGYG + YW++
Sbjct: 239 KAVALVGPVSVGIDAGLSSFQFYSKGVYYDKDC--SAEDINHAVLAVGYGTQKKAKYWIV 296
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWGE WGD GY M K N CGIA ASYPV+
Sbjct: 297 KNSWGEEWGDKGYILMAKDKGNACGIANLASYPVM 331
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 98/218 (44%), Positives = 135/218 (61%), Gaps = 11/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS ++E+ G+ I+LSEQ+LV+C+ N G
Sbjct: 144 VDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSG 203
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL + AF++I NGG+DTE+ YPY DG C + EN V +D ++ E L
Sbjct: 204 CNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSL 263
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
Q AV +PVSVA E F+ Y SGV+S +CG + ++H VVAVGYG ++G YW+
Sbjct: 264 QKAVAH-QPVSVAIEAGGREFQLYHSGVFSG-RCGTS---LDHGVVAVGYGTDNGKDYWI 318
Query: 279 IKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
++NSWG WG+ GY +ME N+ CGIA ASYP
Sbjct: 319 VRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPT 356
>gi|358334194|dbj|GAA34712.2| cathepsin L [Clonorchis sinensis]
Length = 401
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 108/288 (37%), Positives = 159/288 (55%), Gaps = 25/288 (8%)
Query: 25 ASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARR-----YGKIYESVEEM 79
A + + R+ + + +R + +V + G + + RF+ R +I+++ EE
Sbjct: 80 AGPVEQAKRFRIFTENFIRINQHNVRYIQGDTFYTMGINRFSDRVSWTILSQIFQTKEEF 139
Query: 80 KLRFATFSKNLDLIRSTNCK----------GLSYRLGLNISPVKDQGHCGSCWTFSTTGS 129
R F + L N K + +R ++PVKDQG CGSCW FS TG+
Sbjct: 140 G-RLLGF-RGLRNTSRANSKYITIAAEPPASIDWRSTGAVTPVKDQGQCGSCWAFSATGA 197
Query: 130 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY-T 188
+E + A + +SLSEQQLVDC+ F N GC+GG AF+Y+K+ G+ TE YPY +
Sbjct: 198 IEGQHFMATKQLVSLSEQQLVDCSSHFGNFGCSGGWMDNAFKYVKHTHGITTETKYPYIS 257
Query: 189 GKDGV----CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYK 243
G+ G C+F + + V V++ E L+ AVGL P+SVA ++ F YK
Sbjct: 258 GETGTPNPRCEFHGQAIAATVTGIVDLPRSNEFALKQAVGLHGPISVAIHASLESFMGYK 317
Query: 244 SGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHG 291
SGVYS +C + +D HAV+ VGYG E+G+PYWLIKNSWG +WG+ G
Sbjct: 318 SGVYSDEECSSDQLD--HAVLVVGYGEENGIPYWLIKNSWGFDWGEMG 363
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 135/213 (63%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CGSCW+F+ TGS E AY++ + +SLSEQQLVDC+ + N G
Sbjct: 115 VDWRSAGQVTGVKNQGSCGSCWSFALTGSTEGAYYRKHKQLVSLSEQQLVDCSTSIN-YG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG F YI+ GL TE +YPYTG DG CK+ S V ++ + V++ G+E ++
Sbjct: 174 CNGGFLDATFPYIE-QYGLQTESSYPYTGVDGSCKYDSSKVVTKISNYVSLH-GSESKVL 231
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
VG + PV++ + Y SG+Y++ KC T ++NHAV+ VGYG ++G YW++K
Sbjct: 232 EPVGSIGPVAITMDA-SYLSSYSSGIYAANKC--TTTNLNHAVLVVGYGSQNGQNYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NSWG WG+ GYF++ G N CG A YP +
Sbjct: 289 NSWGSGWGEQGYFRLLRGSNECGCAQDPVYPNI 321
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 111/274 (40%), Positives = 155/274 (56%), Gaps = 23/274 (8%)
Query: 56 ARHALSFARFAR----RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG------LSYRL 105
A + L +F Y K+Y R +KN++ S G + +R
Sbjct: 94 ATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQ 153
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++P+KDQG CGSCW FSTT ++E G+ ISLSEQ+LVDC +++N QGCNGGL
Sbjct: 154 KGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYN-QGCNGGL 212
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVG 224
AF++I NGGL+TE+ YPY G G C +N V +D ++ E L+ A+
Sbjct: 213 MDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAIS 272
Query: 225 LVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
+PV VA E F+ Y+SG+++ + CG +++HAVVAVGYG E+GV YW+++NSW
Sbjct: 273 Y-QPVRVAIEAGGRIFQHYQSGIFTGS-CG---TNLDHAVVAVGYGSENGVDYWIVRNSW 327
Query: 284 GENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
G WG+ GY +ME CGIA ASYPV
Sbjct: 328 GPRWGEEGYIRMERNLAASKSGKCGIAVEASYPV 361
>gi|294883334|ref|XP_002770714.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873999|gb|EER02719.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 330
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 156/317 (49%), Gaps = 59/317 (18%)
Query: 49 VLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN 108
+++ + + L+F F ++GK YES EE R A F NL LI N K LSY+LG+N
Sbjct: 15 LVKCLDEGTVELAFMGFQHKFGKNYESKEEEVKRNAIFQANLHLIEQVNAKNLSYKLGVN 74
Query: 109 --------------------------------------------------ISPVKDQGHC 118
+SPVKDQG C
Sbjct: 75 EYADLTHEEFAALKLGTLKMRPAEHASLSLFVSADTTQLPTSVDWRNKSVLSPVKDQGSC 134
Query: 119 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGG 178
GSCW FS G+LEA Y A GK LSEQQLVDC+ + GC GG + A++YIK + G
Sbjct: 135 GSCWAFSAAGALEAQYAIATGKLRPLSEQQLVDCSHKYGTNGCFGGFMADAYKYIK-SAG 193
Query: 179 LDTEEAYPYTGKDGVCKFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD 237
LD E YPY G + C+ + G+ V ++ E L A+ PVSVA D
Sbjct: 194 LDQESTYPYKGVNEPCRPREKKADGIPVRFVLDTK--TEQSLMKALADA-PVSVAMYASD 250
Query: 238 G-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKME 296
F Y SGVYSST C +++HAVVAVGYG ++G Y+++KNSWG +WG GYF ++
Sbjct: 251 FLFHLYLSGVYSSTTCNG---EIDHAVVAVGYGADEGSDYFILKNSWGSSWGMGGYFFLK 307
Query: 297 MGKNMCGIATCASYPVV 313
G G Y VV
Sbjct: 308 RGVGGHGECNILEYMVV 324
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 113/274 (41%), Positives = 156/274 (56%), Gaps = 22/274 (8%)
Query: 55 QARHALSFARFAR----RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLS------YR 104
A + L FA Y +Y +R T +KN+++ S + +R
Sbjct: 48 NATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWR 107
Query: 105 LGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGG 164
++ +KDQG CGSCW FST ++E G+ +SLSEQ+LVDC +++N QGCNGG
Sbjct: 108 QKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYN-QGCNGG 166
Query: 165 LPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAV 223
L AF++I NGGL+TE+ YPY G +G C +N V +D ++ E L+ AV
Sbjct: 167 LMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAV 226
Query: 224 GLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
+PVSVA + F+ Y+SG+++ KCG T MD HAVVAVGYG E+GV YW+++NS
Sbjct: 227 SY-QPVSVAIDAGGRAFQHYQSGIFTG-KCG-TNMD--HAVVAVGYGSENGVDYWIVRNS 281
Query: 283 WGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
WG WG+ GY +ME CGIA ASYPV
Sbjct: 282 WGTRWGEDGYIRMERNVASKSGKCGIAIEASYPV 315
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 135/220 (61%), Gaps = 11/220 (5%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS ++E+ G+ I+LSEQ+LV+C+ N
Sbjct: 142 ESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQN 201
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAED 217
GCNGGL AF++I NGG+DTE+ YPY DG C + EN V +D ++ E
Sbjct: 202 SGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEK 261
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
LQ AV +PVSVA E F+ Y SGV+S +CG + ++H VVAVGYG ++G Y
Sbjct: 262 SLQKAVAH-QPVSVAIEAGGREFQLYHSGVFSG-RCGTS---LDHGVVAVGYGTDNGKDY 316
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
W+++NSWG WG+ GY +ME N+ CGIA ASYP
Sbjct: 317 WIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPT 356
>gi|7770062|ref|NP_036137.1| cathepsin J precursor [Mus musculus]
gi|6467374|gb|AAF13142.1|AF136272_1 cathepsin J precursor [Mus musculus]
gi|15418834|gb|AAK58455.1| cathepsin J [Mus musculus]
gi|148709364|gb|EDL41310.1| cathepsin J, isoform CRA_b [Mus musculus]
Length = 333
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 97/211 (45%), Positives = 125/211 (59%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PV++QG CGSCW F+ G++E G LS Q L+DC++ N+GC G Q
Sbjct: 125 VTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQ 184
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEY+ N GL+ E YPY GKDG C++ SEN + D VN+ E L AV + P
Sbjct: 185 AFEYVLKNKGLEAEATYPYEGKDGPCRYRSENASANITDYVNLPPN-ELYLWVAVASIGP 243
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG----VEDGVPYWLIKNSW 283
VS A + D FRFY G+Y C + VNHAV+ VGYG V+DG YWLIKNSW
Sbjct: 244 VSAAIDASHDSFRFYNGGIYYEPNC--SSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSW 301
Query: 284 GENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
GE WG +GY ++ N CGIA+ ASYP +
Sbjct: 302 GEEWGMNGYMQIAKDHNNHCGIASLASYPNI 332
>gi|84028184|sp|Q9R014.2|CATJ_MOUSE RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
protein; AltName: Full=Cathepsin P; AltName:
Full=Catlrp-p; Flags: Precursor
gi|5306071|gb|AAD41898.1|AF158182_1 preprocathepsin P [Mus musculus]
gi|12838143|dbj|BAB24099.1| unnamed protein product [Mus musculus]
gi|74199838|dbj|BAE20748.1| unnamed protein product [Mus musculus]
gi|74355544|gb|AAI03770.1| Cathepsin J [Mus musculus]
gi|148709363|gb|EDL41309.1| cathepsin J, isoform CRA_a [Mus musculus]
Length = 334
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 97/211 (45%), Positives = 125/211 (59%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PV++QG CGSCW F+ G++E G LS Q L+DC++ N+GC G Q
Sbjct: 126 VTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQ 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEY+ N GL+ E YPY GKDG C++ SEN + D VN+ E L AV + P
Sbjct: 186 AFEYVLKNKGLEAEATYPYEGKDGPCRYRSENASANITDYVNLPPN-ELYLWVAVASIGP 244
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG----VEDGVPYWLIKNSW 283
VS A + D FRFY G+Y C + VNHAV+ VGYG V+DG YWLIKNSW
Sbjct: 245 VSAAIDASHDSFRFYNGGIYYEPNC--SSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSW 302
Query: 284 GENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
GE WG +GY ++ N CGIA+ ASYP +
Sbjct: 303 GEEWGMNGYMQIAKDHNNHCGIASLASYPNI 333
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 98/218 (44%), Positives = 134/218 (61%), Gaps = 11/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS ++E+ G+ I+LSEQ+LV+C+ N G
Sbjct: 145 VDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSG 204
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AF++I NGG+DTE+ YPY DG C + EN V +D ++ E L
Sbjct: 205 CNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSL 264
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
Q AV +PVSVA E F+ Y SGV+S +CG + ++H VVAVGYG ++G YW+
Sbjct: 265 QKAVAH-QPVSVAIEAGGREFQLYHSGVFSG-RCGTS---LDHGVVAVGYGTDNGKDYWI 319
Query: 279 IKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
++NSWG WG+ GY +ME N+ CGIA ASYP
Sbjct: 320 VRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPT 357
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 116/306 (37%), Positives = 154/306 (50%), Gaps = 67/306 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG-LSYRLGLN------------------ 108
++G+ Y ++ E + RF F NL I N G SY+LGLN
Sbjct: 31 KHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRSVYLGTR 90
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
++PVKDQG CGSCW FST G++E
Sbjct: 91 MDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGI 150
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
G SLSEQ+LVDC + + N GCNGGL AF++I NGG+DTEE YPY D +
Sbjct: 151 NQIVTGNLTSLSEQELVDCDKTY-NLGCNGGLMDYAFDFIIENGGIDTEEDYPYKAIDSM 209
Query: 194 CKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTK 251
C + +N V +D ++ E L+ AV +PVSVA E GF+ Y+SGV++ +
Sbjct: 210 CDPNRKNARVVTIDGYEDVPQNDEKSLKKAVA-NQPVSVAIEAGGRGFQLYQSGVFTGS- 267
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-----KNMCGIAT 306
CG ++H VV VGYG E GV YW+++NSWG WG++GY +ME CGIA
Sbjct: 268 CGTQ---LDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRMERDVASTETGKCGIAM 324
Query: 307 CASYPV 312
ASYP
Sbjct: 325 EASYPT 330
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS S+E+ G+ ++LSEQ+LV+C+ N
Sbjct: 147 ESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGN 206
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAED 217
GCNGGL AF +I NGG+DTE+ YPY DG C + N V +D+ ++ E
Sbjct: 207 SGCNGGLMDAAFNFIIKNGGIDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEK 266
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
LQ AV +PVSVA E F+ YKSGV+S G+ +++H VVAVGYG E+G Y
Sbjct: 267 SLQKAVAH-QPVSVAIEAGGRQFQLYKSGVFS----GSCTTNLDHGVVAVGYGTENGKDY 321
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
W+++NSWG WG+ GY +ME N CGIA ASYP
Sbjct: 322 WIVRNSWGPKWGEAGYIRMERNINATTGKCGIAMMASYPT 361
>gi|354472953|ref|XP_003498701.1| PREDICTED: cathepsin K [Cricetulus griseus]
Length = 329
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 129/213 (60%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGECGSCWAFSSAGALEGQLKKKTGKLLNLSPQNLVDCVS--ENYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF Y++ NGG+D+E+AYPY G+D C ++ + I +G+E L+
Sbjct: 177 CGGGYMTTAFRYVQTNGGIDSEDAYPYVGQDQSCMYNPTAKAAKCRGYREIPVGSEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SV+ + + F+FY GVY C +VNHAV+ VGYG + G +W+I
Sbjct: 237 RAVARVGPISVSIDASLTSFQFYSRGVYYDENCDGD--NVNHAVLVVGYGAQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY + + N CGI AS+P
Sbjct: 295 KNSWGESWGNKGYVLLARNRNNACGITNLASFP 327
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 113/274 (41%), Positives = 156/274 (56%), Gaps = 22/274 (8%)
Query: 55 QARHALSFARFAR----RYGKIYESVEEMKLRFATFSKNLDLIRSTNCK------GLSYR 104
A + L FA Y +Y +R T +KN+++ S + +R
Sbjct: 48 NATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWR 107
Query: 105 LGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGG 164
++ +KDQG CGSCW FST ++E G+ +SLSEQ+LVDC +++N QGCNGG
Sbjct: 108 QKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYN-QGCNGG 166
Query: 165 LPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAV 223
L AF++I NGGL+TE+ YPY G +G C +N V +D ++ E L+ AV
Sbjct: 167 LMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAV 226
Query: 224 GLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
+PVSVA + F+ Y+SG+++ KCG T MD HAVVAVGYG E+GV YW+++NS
Sbjct: 227 SY-QPVSVAIDAGGRAFQHYQSGIFTG-KCG-TNMD--HAVVAVGYGSENGVDYWIVRNS 281
Query: 283 WGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
WG WG+ GY +ME CGIA ASYPV
Sbjct: 282 WGTRWGEDGYIRMERNVASKSGKCGIAIEASYPV 315
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 116/306 (37%), Positives = 153/306 (50%), Gaps = 67/306 (21%)
Query: 68 RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------------ISP 111
++GK Y ++ E + RF F NL I N + L+YRLGLN + P
Sbjct: 55 KHGKAYNALGEKEKRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGVKP 114
Query: 112 --------------------------------------VKDQGHCGSCWTFSTTGSLEAA 133
VKDQG CGSCW FST ++E
Sbjct: 115 GATRVTRKVSRKSDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGI 174
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
G ISLSEQ+LVDC ++N +GCNGGL AFE+I NGG+D+EE YPY D
Sbjct: 175 NQIVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQK 233
Query: 194 CKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTK 251
C +N V +D ++ E L+ AV +PVSVA E F+ Y+SGV++ K
Sbjct: 234 CDQYRKNANVVSIDGYEDVPENDEAALKKAVAK-QPVSVAIEAGGRAFQLYQSGVFTG-K 291
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-----KNMCGIAT 306
CG + ++H V AVGYG E+G YW++ NSWG+NWG+ GY +ME CGIA
Sbjct: 292 CGTS---LDHGVAAVGYGTENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAI 348
Query: 307 CASYPV 312
SYP+
Sbjct: 349 GPSYPI 354
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 107/305 (35%), Positives = 153/305 (50%), Gaps = 60/305 (19%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK-GLSYRLGLN-------------- 108
++ +YG++Y+ E R++ F +N+ I + N + G SY+LG+N
Sbjct: 41 QWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKASR 100
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
++PVKDQG CG CW FS ++E
Sbjct: 101 NRFKGHMCSPQAGPFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGIN 160
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
GK ISLSEQ++VDC +QGCNGGL AF++I+ N GL TE YPY G DG C
Sbjct: 161 KLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTC 220
Query: 195 KFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKC 252
+ + ++ ++ +E L AV +PVSVA + F+FY SG+++
Sbjct: 221 NTNKAAIHAAKITGFEDVPANSEAALMKAVAK-QPVSVAIDAGGSDFQFYSSGIFT---- 275
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCA 308
G+ ++H V AVGYGV DG YWL+KNSWG WG+ GY +M+ + +CGIA A
Sbjct: 276 GSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQA 335
Query: 309 SYPVV 313
SYP
Sbjct: 336 SYPTA 340
>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
Length = 333
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 130/221 (58%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS G+LE G +SLSEQ LVDC++ N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+Y+ N GLD+EE+YPY KDG CK+ E V+I E
Sbjct: 176 QGCNGGLMDFAFQYVLNNKGLDSEESYPYEAKDGTCKYKPEFAAANDTGYVDIPQ-LEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P++VA + F+FY SG+Y C + D++H V+ +GYG E +
Sbjct: 235 LMKAVATVGPIAVAIDASHPSFQFYSSGIYFEPNC--SSKDLDHGVLVIGYGFEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YW++KNSWG WG G+F + K N CGIAT ASYP V
Sbjct: 293 KKYWIVKNSWGTGWGMGGFFHIAKDKNNHCGIATAASYPTV 333
>gi|312381834|gb|EFR27484.1| hypothetical protein AND_05795 [Anopheles darlingi]
Length = 508
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 131/206 (63%), Gaps = 11/206 (5%)
Query: 114 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 173
+QG CGSCW FS+TG++E + + K +SLSEQ LVDC + N+GC GG ++F+YI
Sbjct: 308 EQGKCGSCWAFSSTGAVEGQHFRKTNKLVSLSEQNLVDCTSNYRNKGCKGGAIYRSFQYI 367
Query: 174 KYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAF 233
+ N G+DTE++YPY K+G C ++ + +G +V V+I G ED L AV V P+S+
Sbjct: 368 EQNHGIDTEKSYPYQAKEGPCAYNPKAIGAKVKGYVHIPTGDEDALMKAVATVGPISI-- 425
Query: 234 EVVDG----FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWG 288
VVD F+ Y GVY ++C T ++ HA++ VGYG + G +WL+KNSWG +WG
Sbjct: 426 -VVDSRHHTFKHYADGVYYDSQCSAT--NLTHAMLVVGYGTSKKGEDFWLVKNSWGTSWG 482
Query: 289 DHGYFKMEMGK-NMCGIATCASYPVV 313
GY KM + N CGIA A YP+V
Sbjct: 483 IKGYIKMARNRNNSCGIANKAYYPLV 508
>gi|283046734|ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum]
gi|270001247|gb|EEZ97694.1| cathepsin L precursor [Tribolium castaneum]
Length = 328
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 93/207 (44%), Positives = 130/207 (62%), Gaps = 6/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI-SLSEQQLVDCAQAFNNQGCNGGLPS 167
+S VK+QG CGSCW+FSTTG++E + G+G+ SLSEQ LVDC+ A+ N GCNGG
Sbjct: 126 VSEVKNQGQCGSCWSFSTTGAVEGQLAIS-GRGLTSLSEQNLVDCSSAYGNAGCNGGWMD 184
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YI ++ G+ +E AYPYT +G C+F+ + ++ G E+ L+ AV
Sbjct: 185 SAFDYI-HDNGIMSESAYPYTASEGSCRFNPSESVTSLQGYYDLPSGDENALKSAVANNG 243
Query: 228 PVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
P++VA + D +FY GV T C + +NH V+ VGYG E G YW++KNSWG W
Sbjct: 244 PIAVALDATDELQFYSGGVLYDTTC--SAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGW 301
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY++ + N CGIAT ASYP +
Sbjct: 302 GEQGYWRQARNRNNNCGIATAASYPAL 328
>gi|156384930|ref|XP_001633385.1| predicted protein [Nematostella vectensis]
gi|156220454|gb|EDO41322.1| predicted protein [Nematostella vectensis]
Length = 548
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 103/300 (34%), Positives = 144/300 (48%), Gaps = 51/300 (17%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F ++ +++ K Y+ +E R F NL I S N + Y L +N
Sbjct: 245 FDKYVKKHKKNYKDNKEHHTRREHFKHNLRFIHSKNRRHAGYYLAMNHLGDRSDKELRVL 304
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQ CGSCW+F TTG++E
Sbjct: 305 RGRRYTKGYNGGLPYKPDMASINDVPDEMNWVIRGAVTPVKDQAVCGSCWSFGTTGTIEG 364
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 191
LS+Q L+DC+ N C+GG ++++YI +GG+ TEE+Y PY G D
Sbjct: 365 TLFLKTKYLTRLSQQNLMDCSWGEGNNACDGGEDFRSYQYIMKSGGIATEESYGPYLGAD 424
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
G C +G + VNIT G L+ A+ P+SV+ + FY GVY
Sbjct: 425 GYCHKKDAEIGATITGYVNITEGDLSALKTAIAQKGPISVSIDASHKSLSFYSYGVYYEP 484
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASY 310
KCGN D++H+V+AVGYG DG PYW+IKNSW +WG +GY M N CG+AT A+Y
Sbjct: 485 KCGNKNEDLDHSVLAVGYGTMDGKPYWMIKNSWSTHWGMNGYVLMSQKDNNCGVATAATY 544
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 107/305 (35%), Positives = 152/305 (49%), Gaps = 60/305 (19%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK-GLSYRLGLN-------------- 108
++ +YG++Y+ E R++ F +N+ I + N + G SY+LG+N
Sbjct: 7 QWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKASR 66
Query: 109 ----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAY 134
++PVKDQG CG CW FS ++E
Sbjct: 67 NRFKGHMCSPQAGPFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGIN 126
Query: 135 HQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVC 194
GK ISLSEQ++VDC +QGCNGGL AF++I+ N GL TE YPY G DG C
Sbjct: 127 KLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTC 186
Query: 195 KFSSENV-GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKC 252
+ ++ ++ +E L AV +PVSVA + F+FY SG+++
Sbjct: 187 NTKKSAIHAAKITGFEDVPANSEAALMKAVAK-QPVSVAIDAGGSDFQFYSSGIFT---- 241
Query: 253 GNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCA 308
G+ ++H V AVGYGV DG YWL+KNSWG WG+ GY +M+ + +CGIA A
Sbjct: 242 GSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQA 301
Query: 309 SYPVV 313
SYP
Sbjct: 302 SYPTA 306
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 131/218 (60%), Gaps = 11/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS S+E+ G+ ++LSEQ+LV+C+ N G
Sbjct: 203 VDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSG 262
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AF++I NGG+DTE YPY DG C + EN V +D ++ E L
Sbjct: 263 CNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSL 322
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
Q AV +PVSVA E F+ YK+GV++ G +++H VVAVGYG E+G YW+
Sbjct: 323 QKAVAH-QPVSVAIEAGGREFQLYKAGVFT----GTCTTNLDHGVVAVGYGTENGKDYWI 377
Query: 279 IKNSWGENWGDHGYFKMEMGKN----MCGIATCASYPV 312
++NSWG WG+ GY +ME N CGIA ASYP
Sbjct: 378 VRNSWGAKWGEDGYIRMERNVNATTGKCGIAMMASYPT 415
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 98/218 (44%), Positives = 131/218 (60%), Gaps = 11/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS S+E+ G+ ++LSEQ+LV+C+ N G
Sbjct: 143 VDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSG 202
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AF++I NGG+DTE YPY DG C + EN V +D ++ E L
Sbjct: 203 CNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSL 262
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
Q AV +PVSVA E F+ YK+GV+S G +++H VVAVGYG E+G YW+
Sbjct: 263 QKAVAH-QPVSVAIEAGGREFQLYKAGVFS----GTCTTNLDHGVVAVGYGTENGKDYWI 317
Query: 279 IKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
++NSWG WG+ GY +ME N CGIA ASYP
Sbjct: 318 VRNSWGAKWGEDGYIRMERNVNATTGKCGIAMMASYPT 355
>gi|94448666|emb|CAI91571.1| silicatein a2 [Lubomirskia baicalensis]
Length = 326
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 137/215 (63%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CG+ + F+ TG++E A + K +SLSEQ ++DC+ + N G
Sbjct: 114 IDWRTKGAVTSVKNQGDCGASYAFAATGTMEGANALSNDKQVSLSEQNIIDCSVPYGNHG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GG A +Y+ NGG+DTE +Y + GK C+++S+N G +V I G+E +L
Sbjct: 174 CSGGDTYTAIKYVVDNGGIDTESSYSFRGKQSSCQYNSKNSGASATGAVGIPYGSESDLM 233
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + FRFY+SGV+ S+ C +T + NHA++ GYG +G YWL+
Sbjct: 234 AAVATVGPVAVAVDANTNAFRFYQSGVFDSSTCSSTKL--NHAMLVTGYGSYNGKDYWLV 291
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG+ WGD+GY M K N CGIA+ A Y ++
Sbjct: 292 KNSWGKYWGDNGYIMMVRNKYNQCGIASDALYSML 326
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 107/271 (39%), Positives = 150/271 (55%), Gaps = 21/271 (7%)
Query: 58 HALSFARFA----RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG------LSYRLGL 107
+ L RFA Y Y V+ ++R ++ R + G + +R
Sbjct: 81 YTLGLTRFADLTNEEYRSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKG 140
Query: 108 NISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPS 167
++P+KDQG CGSCW FST ++E G I LSEQ+LVDC A+N +GCNGGL
Sbjct: 141 AVAPIKDQGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYN-EGCNGGLMD 199
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF++I NGG+DTEE YPY +DG+C + +N V +DS L ++ +
Sbjct: 200 YAFQFIISNGGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQ 259
Query: 228 PVSVAFE-VVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
PVSVA E F+ YKSG++ +CG +D++H VVAVGYG E G YW+++NSWG++
Sbjct: 260 PVSVAIEGGGRSFQLYKSGIFDG-RCG---IDLDHGVVAVGYGTESGKDYWIVRNSWGKS 315
Query: 287 WGDHGYFKMEMG-----KNMCGIATCASYPV 312
WG+ GY +ME CGIA SYP+
Sbjct: 316 WGEAGYIRMERNLPSSSSGKCGIAIEPSYPI 346
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 155/303 (51%), Gaps = 65/303 (21%)
Query: 69 YGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-------------------- 108
+GK Y ++ E + RF F NL I N + +Y++GL
Sbjct: 69 HGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADLTNEEYRARFLGGRFS 128
Query: 109 --------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQ 136
++ VKDQG CGSCW FS+ ++E
Sbjct: 129 RKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGSCWAFSSVAAVEGINQI 188
Query: 137 AFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKF 196
G+ I LSEQ+LVDC ++FN GCNGGL AF++I NGG+DTEE YPY G+D C
Sbjct: 189 VTGELIPLSEQELVDCDKSFN-MGCNGGLMDYAFQFIIGNGGIDTEEDYPYKGRDAACDP 247
Query: 197 SSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGN 254
+ +N V +D ++ E L+ AV +PVSVA E F+ Y+SGV++ +CG
Sbjct: 248 NRKNAKVVTIDGYEDVPENDESSLKKAVA-NQPVSVAIEAGGRAFQLYQSGVFTG-RCG- 304
Query: 255 TPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKME-----MGKNMCGIATCAS 309
D++H VVAVGYG ++G YW+++NSWG++WG+ GY ++E + CGIA S
Sbjct: 305 --TDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANITTGKCGIAVQPS 362
Query: 310 YPV 312
YP
Sbjct: 363 YPT 365
>gi|327289219|ref|XP_003229322.1| PREDICTED: cathepsin K-like, partial [Anolis carolinensis]
Length = 289
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 94/213 (44%), Positives = 125/213 (58%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LEA GK ++LS Q LVDC NN G
Sbjct: 79 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEAQLKMKTGKLLNLSPQNLVDCVS--NNDG 136
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AFEY+ N G+D+++ YPY G+D C ++ + I G E L+
Sbjct: 137 CGGGYMTNAFEYVHVNRGIDSDDTYPYIGQDENCMYNPTGKAAKCRGYKEIPEGDEKALK 196
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV PVSV + + F+FY GVY C ++NHAV+AVGYG + G +W++
Sbjct: 197 RAVARKGPVSVGIDASLASFQFYSRGVYYDENCNAD--NINHAVLAVGYGSQKGTKHWIV 254
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYP 311
KNSWGE+WGD GY M N CGIA AS+P
Sbjct: 255 KNSWGEDWGDKGYILMARNMNNACGIANLASFP 287
>gi|312386081|gb|ADQ74585.1| silicatein alpha 2 [Lubomirskia baicalensis]
Length = 326
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 137/215 (63%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VK+QG CG+ + F+ TG++E A + K +SLSEQ ++DC+ + N G
Sbjct: 114 IDWRTKGAVTSVKNQGDCGASYAFAATGTMEGANALSNDKQVSLSEQNIIDCSVPYGNHG 173
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GG A +Y+ NGG+DTE +Y + GK C+++S+N G +V I G+E +L
Sbjct: 174 CSGGDTYTAIKYVVDNGGIDTESSYSFRGKQSSCQYNSKNSGASATGAVGIPYGSESDLM 233
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PV+VA + + FRFY+SGV+ S+ C +T + NHA++ GYG +G YWL+
Sbjct: 234 AAVATVGPVAVAVDANTNAFRFYQSGVFDSSTCSSTKL--NHAMLVTGYGSYNGKDYWLV 291
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG+ WGD+GY M K N CGIA+ A Y ++
Sbjct: 292 KNSWGKYWGDNGYIMMVRNKYNQCGIASDALYSML 326
>gi|281427380|ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
gi|281427798|ref|NP_001164001.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
gi|270001241|gb|EEZ97688.1| cathepsin L precursor [Tribolium castaneum]
gi|270016928|gb|EFA13374.1| hypothetical protein TcasGA2_TC001950 [Tribolium castaneum]
Length = 328
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 127/207 (61%), Gaps = 6/207 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGI-SLSEQQLVDCAQAFNNQGCNGGLPS 167
++ VKDQG CGSCW+FSTTG++E + GKG+ SLSEQ LVDC+ + N GCNGG
Sbjct: 126 VTEVKDQGQCGSCWSFSTTGAVEGQLAIS-GKGLTSLSEQNLVDCSSQYGNAGCNGGWMD 184
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YI ++ G+ +E AYPYT DG C+F + + +I G E LQ AV
Sbjct: 185 SAFDYI-HDNGIMSESAYPYTAMDGNCRFDASQSVTSLQGYYDIPSGDESALQDAVANNG 243
Query: 228 PVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENW 287
PV+VA + + + Y GV T C + +NH V+ VGYG E G YW++KNSWG W
Sbjct: 244 PVAVALDATEELQLYSGGVLYDTTC--SAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGW 301
Query: 288 GDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ GY++ + N CGIAT ASYP +
Sbjct: 302 GEQGYWRQARNRNNNCGIATAASYPAL 328
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 127/208 (61%), Gaps = 6/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA-QAFNNQGCNGGLPS 167
++ VK QG CG+CW FS G+LEA GK +SLS Q LVDC+ + + N+GCNGG +
Sbjct: 136 VTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMT 195
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
+AF+YI N G+D+E +YPY DG C++ S+N + G+ED+L+ AV
Sbjct: 196 EAFQYIIDNNGIDSEASYPYKATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKG 255
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
PVSVA + F Y+SGVY C +VNH V+ VGYG +G YWL+KNSWG N
Sbjct: 256 PVSVAIDARHSSFFLYRSGVYYDPSC---TQNVNHGVLVVGYGNLNGKDYWLVKNSWGLN 312
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
+GD GY +M N CGIA+ SYP +
Sbjct: 313 FGDQGYIRMARNSGNHCGIASYPSYPEI 340
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 122/345 (35%), Positives = 167/345 (48%), Gaps = 74/345 (21%)
Query: 30 DSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKN 89
D + I + G+R E S + + + + ++G+ Y ++ E + RF F N
Sbjct: 24 DMSIISYDEAHGVRGLERS------EEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDN 77
Query: 90 LDLIRSTNCKG----LSYRLGLN------------------------------------- 108
+ I + N S+RLGLN
Sbjct: 78 VLFIDAHNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRRRARVGSDRYRYNA 137
Query: 109 ---------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA 153
++ VKDQG CGSCW FST ++E G ISLSEQ+LVDC
Sbjct: 138 GEDLPESVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCD 197
Query: 154 QAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NIT 212
+N QGCNGGL FE+I NGG+DTEE YPYT +DG C +N V +D ++
Sbjct: 198 NGYN-QGCNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVP 256
Query: 213 LGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE 271
+ E LQ AV +PVSVA E F+ Y SG+++ +CG D++H VVAVGYG E
Sbjct: 257 VNDEKALQKAVA-NQPVSVAIEAGGREFQLYHSGIFTG-RCG---TDLDHGVVAVGYGTE 311
Query: 272 DGVPYWLIKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
+G YW+++NSWG +WG+ GY +ME N CGIA SYP
Sbjct: 312 NGKDYWIVRNSWGGDWGESGYIRMERNVNTSTGKCGIAIEPSYPT 356
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS ++E+ G+ I+LSEQ+LV+C+ N
Sbjct: 138 ESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQN 197
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAED 217
GCNGGL AF++I NGG+DTE+ YPY DG C + EN V +D ++ E
Sbjct: 198 SGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEK 257
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
LQ AV +PVSVA E F+ Y SGV+S +CG + ++H VVAVGYG ++G Y
Sbjct: 258 SLQKAVAH-QPVSVAIEAGGREFQLYHSGVFSG-RCGTS---LDHGVVAVGYGTDNGKDY 312
Query: 277 WLIKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
W+++NSWG WG+ GY +ME N CGIA ASYP
Sbjct: 313 WIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYPT 352
>gi|164519063|ref|NP_001002813.2| cathepsin Q-like 2 precursor [Rattus norvegicus]
gi|67678196|gb|AAH97257.1| Ctsql2 protein [Rattus norvegicus]
gi|149039735|gb|EDL93851.1| rCG24202 [Rattus norvegicus]
Length = 343
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 98/226 (43%), Positives = 130/226 (57%), Gaps = 10/226 (4%)
Query: 94 RSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA 153
R K + +R ++ V++QG C SCW F G++E + GK LS Q LVDC+
Sbjct: 122 RDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCS 181
Query: 154 QAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITL 213
+ N+GC GG AF+Y+ NGGL++E YPY GK+G+CK++ +N ++ V +
Sbjct: 182 KPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYAKITRFVALP- 240
Query: 214 GAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE- 271
ED L A+ PV+ VV RFYK G+Y KC N VNHAV+ VGYG E
Sbjct: 241 EDEDVLMDALATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNR---VNHAVLVVGYGFEG 297
Query: 272 ---DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
DG YWLIKNSWG+ WG GY K+ + N CGIAT A YP+V
Sbjct: 298 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 343
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 127/208 (61%), Gaps = 6/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA-QAFNNQGCNGGLPS 167
++ VK QG CG+CW FS G+LEA GK +SLS Q LVDC+ + + N+GCNGG +
Sbjct: 124 VTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMT 183
Query: 168 QAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
+AF+YI N G+D+E +YPY DG C++ S+N + G+ED+L+ AV
Sbjct: 184 EAFQYIIDNNGIDSEASYPYKATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKG 243
Query: 228 PVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGEN 286
PVSVA + F Y+SGVY C +VNH V+ VGYG +G YWL+KNSWG N
Sbjct: 244 PVSVAIDARHSSFFLYRSGVYYDPSC---TQNVNHGVLVVGYGNLNGKDYWLVKNSWGLN 300
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
+GD GY +M N CGIA+ SYP +
Sbjct: 301 FGDQGYIRMARNSGNHCGIASYPSYPEI 328
>gi|54020916|ref|NP_001005702.1| cathepsin K (pycnodysostosis) precursor [Xenopus (Silurana)
tropicalis]
gi|49671274|gb|AAH75275.1| cathepsin K (pycnodysostosis) [Xenopus (Silurana) tropicalis]
Length = 329
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 132/215 (61%), Gaps = 6/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PV +QG CGSCW FS+ G+LE + GK +SLS Q LVDC +N G
Sbjct: 119 IDYRKKGYVTPVHNQGICGSCWAFSSVGALEGQLMKKTGKLVSLSPQNLVDCDT--DNYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF Y++ NGG+D++ YPY G+D C ++ + I +G+E L+
Sbjct: 177 CEGGYMTNAFGYVRDNGGIDSDAEYPYVGQDEGCHYNPADKAATCKGYKEIPVGSEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSV+ + + F+FYK GVY + C P VNHAV+ VGYG E G+ +W+I
Sbjct: 237 RAVANVGPVSVSIDASLPSFQFYKKGVYYDSSC--NPDAVNHAVLVVGYGNEKGIKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
KNSWG+ WG GY + KN CGIA+ AS+PV+
Sbjct: 295 KNSWGDWWGKKGYVLLARDKKNACGIASLASFPVM 329
>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
Length = 358
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 112/334 (33%), Positives = 154/334 (46%), Gaps = 85/334 (25%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SF + +++ + Y S E R++ + KN+D + N KG LGLN
Sbjct: 29 SFTNWMQKHSRSYAS-HEFNTRYSVYKKNMDYVNEWNSKGSETVLGLNSLADMTNQEYQA 87
Query: 109 ----------------------------------------ISPVKDQGHCGSCWTFSTTG 128
++ VK+QG CGSCW+FS TG
Sbjct: 88 IYLGTKTDATARLAAASASASFGKVQGALPASIDWVAQGAVTQVKNQGQCGSCWSFSATG 147
Query: 129 SLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 188
S E A+ + ++LSEQ L+DC+ ++ N GCNGGL AF+YI NGG+DTE +YPY
Sbjct: 148 STEGAHQISTSNLVALSEQNLIDCSSSYGNDGCNGGLMDNAFKYIIANGGIDTEASYPYV 207
Query: 189 GKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVY 247
K CK++ N G + V++T G+E LQ + PVSVA + F+ Y SGVY
Sbjct: 208 AKVQKCKYNPANSGATLSSYVDVTSGSESALQSQT-VKGPVSVAIDASHQSFQLYDSGVY 266
Query: 248 SSTKCGNTPMDVNHAVVAVGYGV---------------------------EDGVPYWLIK 280
C +T +D H V+ VGYG G +W +K
Sbjct: 267 YEPACSSTNLD--HGVLVVGYGTASANGSSDSDSSAASQSSSSESSDDQATQGAQFWKVK 324
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
NSWG WG GY +M + N CGIAT AS P+V
Sbjct: 325 NSWGPEWGLSGYIQMARNRDNNCGIATTASQPIV 358
>gi|2239107|emb|CAA70693.1| cathepsin L-like cysteine proteinase [Heterodera glycines]
Length = 374
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 94/209 (44%), Positives = 130/209 (62%), Gaps = 6/209 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++ VK+QG CGSCW FS TG+LE + + G +SLSEQ L+DC++ + N GCNGG+
Sbjct: 168 VTEVKNQGMCGSCWAFSATGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDN 227
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK N G+D E AYPY K G C F +VG +I G E++L+ AV
Sbjct: 228 AFQYIKDNKGIDKETAYPYKAKTGKKCLFKRNDVGATDSGYNDIAEGDEEDLKMAVATQG 287
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV-PYWLIKNSWGE 285
PVSVA + F+ Y +GVY +C P +++H V+ VGYG + YW++KNSWG
Sbjct: 288 PVSVAIDAGHRSFQLYTNGVYFEKEC--DPENLDHGVLVVGYGTDPTQGDYWIVKNSWGT 345
Query: 286 NWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY +M + N CGIA+ AS+P+V
Sbjct: 346 RWGEQGYIRMARNRNNNCGIASHASFPLV 374
>gi|291383484|ref|XP_002708316.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 94/211 (44%), Positives = 127/211 (60%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK QG CGSCW FS TG+LE + G+ +SLSEQ L+DC+ N GC GGL
Sbjct: 126 VTPVKKQGRCGSCWAFSATGALEGQMFRKTGRLVSLSEQNLIDCSWPAGNHGCRGGLTDH 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+Y+K NGGLD+E++YPY ++ C++ + V I E+ L AV V P
Sbjct: 186 AFQYVKDNGGLDSEDSYPYEARNLPCRYDPQKSVANGTGFVRIPR-QENALMEAVATVGP 244
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNSW 283
++VA + F+FYK G+Y C + NHAV+ VGYG E D YWL+KNSW
Sbjct: 245 IAVAIDAGHPSFQFYKEGIYYEPNCSSK--HHNHAVLVVGYGYEGAESDSNKYWLVKNSW 302
Query: 284 GENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G+ WG+ GY ++ + N CGIA+ ASYP V
Sbjct: 303 GKRWGEAGYIRIAKDRNNHCGIASHASYPTV 333
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 112/321 (34%), Positives = 156/321 (48%), Gaps = 63/321 (19%)
Query: 49 VLQVIGQARHALSF----ARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG-LSY 103
+ QV+ + H S ++ YGK+Y+ E RF F N++ I S N G Y
Sbjct: 21 ISQVMCRKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPY 80
Query: 104 RLGLN-----------------------------------------------ISPVKDQG 116
+LG+N ++P+KDQG
Sbjct: 81 KLGVNHLADLTVEEFKASRNGFKRPHEFSTTTFKYENVTAIPAAIDWRTKGAVTPIKDQG 140
Query: 117 HCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYN 176
CGSCW FST + E + GK +SLSEQ+LVDC +QGC GG FE+I N
Sbjct: 141 QCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKN 200
Query: 177 GGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV 236
GG+ +E YPY DG C ++ V Q+ + +E LQ AV +PVSV+ +
Sbjct: 201 GGITSETNYPYKAVDGKCNKATSPV-AQIKGYEKVPPNSETALQKAVA-NQPVSVSIDAD 258
Query: 237 D-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKM 295
GF FY SG+Y+ +CG +++H V AVGYG +G YW++KNSWG WG+ GY +M
Sbjct: 259 GAGFMFYSSGIYNG-ECGT---ELDHGVTAVGYGTANGTDYWIVKNSWGTQWGEKGYVRM 314
Query: 296 EMG----KNMCGIATCASYPV 312
+ G +CGIA +SYP
Sbjct: 315 QRGIAAKHGLCGIALDSSYPT 335
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 99/219 (45%), Positives = 135/219 (61%), Gaps = 13/219 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++P+KDQG CGSCW FST ++E H K +SLSEQ+LVDC + NQG
Sbjct: 130 VDWRKKGAVTPIKDQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTS-ENQG 188
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLD-SVNITLGAEDEL 219
CNGGL AFE+IK GG+ TE++YPYT +DG C S N V +D + ED L
Sbjct: 189 CNGGLMGYAFEFIKEKGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDAL 248
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYW 277
A +P+SVA + F+FY GV++ +CG D++H V VGYG DG YW
Sbjct: 249 LKAAA-NQPISVAIDAGGSAFQFYSEGVFAG-RCG---TDLDHGVAIVGYGTTLDGTKYW 303
Query: 278 LIKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
++KNSWG +WG++GY +M+ G + +CGIA ASYP+
Sbjct: 304 IVKNSWGTDWGENGYIRMKRGISAKEGLCGIAVEASYPI 342
>gi|27960477|gb|AAO27843.1|AF456459_1 cathepsin R [Rattus norvegicus]
Length = 334
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 97/221 (43%), Positives = 134/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PV+ QG+C +CW FS TG++EA GK I LS Q LVDC+++ N
Sbjct: 117 KFVDWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQTGKLIPLSVQNLVDCSKSQGN 176
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GC G P A+EY+ NGGL+ E YPY GK+GVC+++ ++ ++ V++ +ED
Sbjct: 177 EGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKEGVCRYNPKHSKAEITGFVSLP-ESEDI 235
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV + P+SVA + + F FYK G+Y C N VNH+V+ VGYG E DG
Sbjct: 236 LMEAVATIGPISVAVDASFNSFGFYKKGLYDEPNCSNN--TVNHSVLVVGYGFEGNETDG 293
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWLIKNSWG WG GY K+ + N C IA+ A YP V
Sbjct: 294 NSYWLIKNSWGRKWGLRGYMKIPKDQNNFCAIASYAHYPTV 334
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 108/307 (35%), Positives = 158/307 (51%), Gaps = 63/307 (20%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLS--YRLGLN------------- 108
++ YGK+Y++ +E + R F++NL I ++N G + Y+LG+N
Sbjct: 41 QWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNNKPYKLGINQFADLTNEEFIAS 100
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVK+QG CG CW FS + E
Sbjct: 101 RNKFKGHMCSSIIRTTTFKYENTSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEG 160
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
+ + GK +SLSEQ+LVDC +QGC GGL AF++I N G+ TE YPY G DG
Sbjct: 161 IHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDG 220
Query: 193 VCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
CK + + + ++ E+ LQ AV +P+SVA + F+FYKSGV++ +
Sbjct: 221 TCKANEASTSAATITGYEDVPANNENALQKAVA-NQPISVAIDASGSDFQFYKSGVFTGS 279
Query: 251 KCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKME----MGKNMCGIA 305
CG +++H V AVGYG+ DG YWL+KNSWG +WG+ GY +M+ + +CGIA
Sbjct: 280 -CG---TELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIA 335
Query: 306 TCASYPV 312
ASYP
Sbjct: 336 MQASYPT 342
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 183 bits (465), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 160/320 (50%), Gaps = 57/320 (17%)
Query: 48 SVLQVIGQARHALS--FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL- 101
+++ V+G L + + + +GK Y E R AT+ KNL L+ N + GL
Sbjct: 12 TLVAVMGHPDPTLDQHWQLWKKAHGKEYRHQAEEGQRRATWEKNLRLVMLHNLEHSLGLH 71
Query: 102 SYRLGLN----------------------------------------------ISPVKDQ 115
SY+LG+N ++ VK+Q
Sbjct: 72 SYQLGMNHMGDMTSEDVAALLTGLRVPYGHNQTSTYRRRGGAPDAMDWREKGCVTEVKNQ 131
Query: 116 GHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKY 175
G CG+CW FS G+LEA GK +SLS Q LVDC+ + N+GC GG ++AF+YI
Sbjct: 132 GACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIID 191
Query: 176 NGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV 235
N G+D+EE+YPY ++G C+++ V + E L+ AV V PVSVA +
Sbjct: 192 NNGIDSEESYPYMAQNGTCQYNVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDA 251
Query: 236 VD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFK 294
F Y+SGVY +C +VNH V+ VGYG + +WL+KNSWGE +GD GY +
Sbjct: 252 TQPTFFLYRSGVYDDPRC---TQEVNHGVLVVGYGTLNEKDFWLVKNSWGERFGDGGYIR 308
Query: 295 MEMGK-NMCGIATCASYPVV 313
M N CGIA+ ASYP +
Sbjct: 309 MSRNHANHCGIASYASYPQI 328
>gi|156554010|ref|XP_001605879.1| PREDICTED: counting factor associated protein D-like [Nasonia
vitripennis]
Length = 553
Score = 183 bits (465), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 113/346 (32%), Positives = 164/346 (47%), Gaps = 58/346 (16%)
Query: 22 SASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKL 81
+AS SF R+ + + +++F + QA ++F RF + + K Y E K
Sbjct: 213 NASCVSFPGPGEHRIYTFNPMKEFIHN-----HQAHVDMAFDRFKKTHNKNYAHDLEHKQ 267
Query: 82 RFATFSKNLDLIRSTNCKGLSYRLGLN--------------------------------- 108
R F NL I S N L + L +N
Sbjct: 268 RKEHFRHNLRFIHSINRANLGFTLDVNHLADRNEAELKVLRGKQYTQHGYNGGMPFPHDV 327
Query: 109 ------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLV 150
++PVKDQ CGSCW+F TTG++E AY + K + LS+Q L+
Sbjct: 328 EKEKADVPDSFDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYFMKYKKLVRLSQQALI 387
Query: 151 DCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKDGVCKFSSENVGVQVLDSV 209
DC+ F N GC+GG +++++I +GGL TEE Y Y G+DG C + ++ V
Sbjct: 388 DCSWGFGNNGCDGGEDFRSYQWIIKHGGLPTEEEYGGYLGQDGYCHIKNVTQIAKLKGFV 447
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
N+ D ++ A+ P+SVA + F FY +GVY CGNT ++HAV+AVGY
Sbjct: 448 NVDTNNVDAMKLALFKHGPISVAIDASHKTFSFYSNGVYYEPACGNTENSLDHAVLAVGY 507
Query: 269 GVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
G +G +WLIKNSW WG+ GY M N CG+ T +Y + A
Sbjct: 508 GTINGKGFWLIKNSWSNYWGNDGYILMAQKNNNCGVMTAPTYAIAA 553
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 183 bits (465), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 174/359 (48%), Gaps = 74/359 (20%)
Query: 12 ILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGK 71
IL L A +SA S + VS+ G R E V+ + + + ++GK
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRS-EAEVMSI---------YEAWLVKHGK 59
Query: 72 IY--ESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN--------------------- 108
S+ E RF F NL + N K LSYRLGL
Sbjct: 60 AQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEK 119
Query: 109 -----------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
++ VKDQG CGSCW FST G++E G
Sbjct: 120 KGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTG 179
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
I+LSEQ+LVDC ++N +GCNGGL AFE+I NGG+DT++ YPY G DG C +
Sbjct: 180 DLITLSEQELVDCDTSYN-EGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRK 238
Query: 200 NVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPM 257
N V +DS ++ +E+ L+ AV +P+S+A E F+ Y SG++ + CG
Sbjct: 239 NAKVVTIDSYEDVPTYSEESLKKAVAH-QPISIAIEAGGRAFQLYDSGIFDGS-CGTQ-- 294
Query: 258 DVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
++H VVAVGYG E+G YW+++NSWG++WG+ GY +M CGIA SYP+
Sbjct: 295 -LDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPI 352
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 99/219 (45%), Positives = 135/219 (61%), Gaps = 13/219 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++P+KDQG CGSCW FST ++E H K +SLSEQ+LVDC + NQG
Sbjct: 43 VDWRKKGAVTPIKDQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTS-ENQG 101
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDS-VNITLGAEDEL 219
CNGGL AFE+IK GG+ TE++YPYT +DG C S N V +D + ED L
Sbjct: 102 CNGGLMGYAFEFIKEKGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDAL 161
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYW 277
A +P+SVA + F+FY GV++ +CG D++H V VGYG DG YW
Sbjct: 162 LKAAAN-QPISVAIDAGGSAFQFYSEGVFAG-RCG---TDLDHGVAIVGYGTTLDGTKYW 216
Query: 278 LIKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
++KNSWG +WG++GY +M+ G + +CGIA ASYP+
Sbjct: 217 IVKNSWGTDWGENGYIRMKRGISAKEGLCGIAVEASYPI 255
>gi|157128512|ref|XP_001661463.1| cathepsin l [Aedes aegypti]
gi|91992510|gb|ABE72971.1| cathepsin L [Aedes aegypti]
gi|108872552|gb|EAT36777.1| AAEL011167-PA [Aedes aegypti]
Length = 327
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 101/223 (45%), Positives = 131/223 (58%), Gaps = 8/223 (3%)
Query: 95 STNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQ 154
+T + +R ++PVKDQG CGSC+ FS G+LE A GK ++LSEQ +VDC
Sbjct: 109 TTTVTSIDWRTKGAVTPVKDQGRCGSCYAFSALGALEGATFTKTGKLVNLSEQNIVDCTS 168
Query: 155 AFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK-DGVCKFSSENVGVQVLDSVNITL 213
+ N GCNGG + F+YIK N G+DT YPY C F+ VG D+ + L
Sbjct: 169 TYGNYGCNGGSMTSVFKYIKTNNGVDTGAFYPYKAAVAATCGFNPAYVG--ATDTGYVLL 226
Query: 214 GA-EDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE 271
A E LQ AV + PVSVA + + F+ YKSG+Y C ++ + NH V+ VGYG E
Sbjct: 227 PANETALQTAVANIGPVSVAIDASNPSFQQYKSGIYYEPLCSSSKL--NHGVLVVGYGTE 284
Query: 272 DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+G YW +KNSWG WG+ GY KM K N CGIA+ ASYP V
Sbjct: 285 NGTDYWQVKNSWGTTWGEKGYIKMARNKNNHCGIASFASYPTV 327
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 136/226 (60%), Gaps = 4/226 (1%)
Query: 88 KNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQ 147
K +L + + +R ++PVK+QG CGSCW FS TG++E + GK ISLSEQ
Sbjct: 250 KKFNLTFNNLPEQFDWRTKGVVTPVKNQGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQ 309
Query: 148 QLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLD 207
+L+DC + ++GCNGGLP AF I+ GGL+ E+ YPY ++G C + V + D
Sbjct: 310 ELIDCDRI--DKGCNGGLPINAFREIQRMGGLEPEDQYPYKARNGTCHLIRSAIAVTIDD 367
Query: 208 SVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
+V I E ++ + P+SV + +YKSG+ ++ P ++H V+ G
Sbjct: 368 AVEIPRN-ETVMKAWIVQRGPLSVGIDA-KLLAYYKSGILHPSRSRCPPSGIDHGVLITG 425
Query: 268 YGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
YGVE+G+PYW IKNSWG+ WG+ GYF++ +GK++CG++ S ++
Sbjct: 426 YGVENGLPYWTIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSSAII 471
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 108/307 (35%), Positives = 157/307 (51%), Gaps = 63/307 (20%)
Query: 64 RFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG--LSYRLGLN------------- 108
++ YGK+Y++ +E + R F++NL I ++N G Y+LG+N
Sbjct: 41 QWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNKKPYKLGINQFADLTNEEFIAS 100
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVK+QG CG CW FS + E
Sbjct: 101 RNKFKGHMCSSIIRTTTFKYENTSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEG 160
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
+ + GK +SLSEQ+LVDC +QGC GGL AF++I N G+ TE YPY G DG
Sbjct: 161 IHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDG 220
Query: 193 VCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSST 250
CK + + + ++ E+ LQ AV +P+SVA + F+FYKSGV++ +
Sbjct: 221 TCKANEASTSAATITGYEDVPANNENALQKAVA-NQPISVAIDASGSDFQFYKSGVFTGS 279
Query: 251 KCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEM----GKNMCGIA 305
CG +++H V AVGYG+ DG YWL+KNSWG +WG+ GY +M+ + +CGIA
Sbjct: 280 -CG---TELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIA 335
Query: 306 TCASYPV 312
ASYP
Sbjct: 336 MQASYPT 342
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 109/317 (34%), Positives = 157/317 (49%), Gaps = 61/317 (19%)
Query: 53 IGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTN-CKGLSYRLGLN--- 108
+ +A + ++ RYG++Y++ E R F +NL I++ N Y+LG+N
Sbjct: 30 LNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQTFNKANNKPYKLGVNEFA 89
Query: 109 ---------------------------------------------ISPVKDQGHCGSCWT 123
++P+K+QG CG CW
Sbjct: 90 DLTNEEFTTSRNKFKSHVCATVTNVFRYENVTAVPATMDWRKKGAVTPIKNQGQCGCCWA 149
Query: 124 FSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEE 183
FS ++E GK ISLSEQ+LVDC +QGC GGL AF++I+ N GL TE
Sbjct: 150 FSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTET 209
Query: 184 AYPYTGKDGVCKFSSE-NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRF 241
YPY+G DG C + E N + ++ +E L AV +P+SVA + F+F
Sbjct: 210 NYPYSGTDGTCNANKEANHAATITGHEDVPANSESALLKAVA-NQPISVAIDASGSDFQF 268
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKMEMG-- 298
Y SGV++ +CG +++H V AVGYG DG YWL+KNSWG +WG+ GY +M+ G
Sbjct: 269 YSSGVFTG-ECGT---ELDHGVTAVGYGTAADGTKYWLVKNSWGTSWGEEGYIQMQRGVA 324
Query: 299 --KNMCGIATCASYPVV 313
+ +CGIA ASYP
Sbjct: 325 AAEGLCGIAMQASYPTA 341
>gi|149030666|gb|EDL85703.1| cathepsin S [Rattus norvegicus]
Length = 291
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 104/257 (40%), Positives = 143/257 (55%), Gaps = 13/257 (5%)
Query: 65 FARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK---GL-SYRLGLNISPVKDQGHCGS 120
+ + + K Y+ E +R + KNL I N + G+ SY +G+N + D G CGS
Sbjct: 40 WKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMN--HMGDMGSCGS 97
Query: 121 CWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCA--QAFNNQGCNGGLPSQAFEYIKYNGG 178
CW FS G+LE GK +SLS Q LVDC+ + + N+GC GG ++AF+YI NGG
Sbjct: 98 CWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGG 157
Query: 179 LDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-D 237
+D+E +YPY D C + +N + + G E+ L+ AV PVSV +
Sbjct: 158 IDSEASYPYKAMDEKCHYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGIDASHS 217
Query: 238 GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKM-E 296
F Y+SGVY C +VNH V+ VGYG DG YWL+KNSWG ++GD GY +M
Sbjct: 218 SFFLYQSGVYDDPSCTE---NVNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMAR 274
Query: 297 MGKNMCGIATCASYPVV 313
KN CGIA+ SYP +
Sbjct: 275 NNKNHCGIASYCSYPEI 291
>gi|326430129|gb|EGD75699.1| hypothetical protein PTSG_07816 [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/301 (33%), Positives = 152/301 (50%), Gaps = 48/301 (15%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
+F F ++G++YE+ +E R F N + + N + L+Y L LN
Sbjct: 245 AFDSFKAQHGRMYETEQEHAKRLNNFRHNKKFVDAMNRRNLTYTLALNHLADLHDEERAQ 304
Query: 109 ---------------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYH 135
++ VKDQG CGSCW+F ++E Y
Sbjct: 305 MRGTFSSRTDYAYVAETPSPVRSAARDWRTTGAVTGVKDQGICGSCWSFGAAQAIEGQYF 364
Query: 136 QAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAY-PYTGKDGVC 194
A + + +S+Q L+DC+ F N C+GG +A+E++ NG + TE +Y PY DG C
Sbjct: 365 LATNRTVPMSQQALMDCSWGFGNNACDGGEAFRAYEWVLQNGYIPTEASYGPYLMADGYC 424
Query: 195 KFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCG 253
+ G + VNIT G +++ + P++VA + + F FY SGVY + CG
Sbjct: 425 HPEKADKGPGIKGYVNITSGDMNKVLDMLDNDGPLAVAIDASLKSFSFYSSGVYYDSDCG 484
Query: 254 NTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPV 312
NTP D++HAV+AVG+G DG YW+IKNSW N+GD GY +M N CG+AT A P+
Sbjct: 485 NTPDDLDHAVLAVGFGTSVDGEDYWIIKNSWSTNYGDRGYVRMSRRNNNCGVATDAHIPL 544
Query: 313 V 313
+
Sbjct: 545 L 545
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 121/310 (39%), Positives = 158/310 (50%), Gaps = 65/310 (20%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F ++ R+ ++Y S+ E + RF F NL I + N + SY LGLN
Sbjct: 52 FHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEKSYWLGLNKFSDLTHDEFRAL 111
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+S VKDQG CGSCW FS GS+E
Sbjct: 112 YLGIRPAGRAHGLRNGDRFIYEDVVAEEMVDWRKKGAVSDVKDQGSCGSCWAFSAIGSVE 171
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
G+ ISLSEQ+LVDC + NQGCNGGL AF++I NGG+DTEE YPY D
Sbjct: 172 GVNAIVTGELISLSEQELVDCDRG-QNQGCNGGLMDYAFDFIIKNGGIDTEEDYPYKATD 230
Query: 192 GVC-KFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYS 248
G C + E V V+D ++ +E L AV PVSVA E F+ Y+ GV++
Sbjct: 231 GQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSK-NPVSVAIEAGGRDFQHYQGGVFT 289
Query: 249 STKCGNTPMDVNHAVVAVGYGV-EDGVPYWLIKNSWGENWGDHGYFKME-MGKN----MC 302
CG D++H V+AVGYG +DGV YW++KNSWG +WG+ GY +ME MG N C
Sbjct: 290 GP-CGT---DLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERMGSNSTSGKC 345
Query: 303 GIATCASYPV 312
GI S+P+
Sbjct: 346 GINIEPSFPI 355
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 136/226 (60%), Gaps = 4/226 (1%)
Query: 88 KNLDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQ 147
K +L + + +R ++PVK+QG CGSCW FS TG++E + GK ISLSEQ
Sbjct: 215 KKFNLTFNNLPEQFDWRTKGVVTPVKNQGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQ 274
Query: 148 QLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLD 207
+L+DC + ++GCNGGLP AF I+ GGL+ E+ YPY ++G C + V + D
Sbjct: 275 ELIDCDRI--DKGCNGGLPINAFREIQRMGGLEPEDQYPYKARNGTCHLIRSAIAVTIDD 332
Query: 208 SVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVG 267
+V I E ++ + P+SV + +YKSG+ ++ P ++H V+ G
Sbjct: 333 AVEIPRN-ETVMKAWIVQRGPLSVGIDA-KLLAYYKSGILHPSRSRCPPSGIDHGVLITG 390
Query: 268 YGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
YGVE+G+PYW IKNSWG+ WG+ GYF++ +GK++CG++ S ++
Sbjct: 391 YGVENGLPYWTIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSSAII 436
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 138/221 (62%), Gaps = 13/221 (5%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++ VKDQG CGSCW FST ++E G+ +SLSEQ+LVDC ++N+
Sbjct: 149 EAVDWRKNGAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNS 208
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAED 217
GC+GGL A+E+I NGG+DT+ YPYT KDG C +N V +D ++ E
Sbjct: 209 -GCDGGLMDYAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEK 267
Query: 218 ELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
LQ AV +PVSVA E F+FY+SGV++ KCG D++H VVAVGYG +DG Y
Sbjct: 268 ALQKAVAH-QPVSVAIEAGGSTFQFYQSGVFTG-KCG---ADLDHGVVAVGYGSDDGKDY 322
Query: 277 WLIKNSWGENWGDHGYFKME-----MGKNMCGIATCASYPV 312
W+++NSWG +WG+ GY +ME + CGIA SYP+
Sbjct: 323 WIVRNSWGADWGESGYIRMERNLETVKTGKCGIAIEPSYPI 363
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 99/218 (45%), Positives = 132/218 (60%), Gaps = 12/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R + PVKDQG CGSCW FS G++E G+ +SLSEQ+LVDC ++NN G
Sbjct: 127 VDWRAKGAVVPVKDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNN-G 185
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTG-KDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
C GGL AF++I NGG+DTEE YPYT D +C +N V +D E+ L
Sbjct: 186 CGGGLMDYAFQFIISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPENENSL 245
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
+ A+ +P+SVA E GF+ YKSGV++ T CG ++H VVAVGYG +G YW+
Sbjct: 246 KKALA-NQPISVAIEAGGRGFQLYKSGVFTGT-CGTA---LDHGVVAVGYGTSEGQDYWI 300
Query: 279 IKNSWGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
I+NSWG NWG+ GY K++ CG+A ASYP
Sbjct: 301 IRNSWGSNWGESGYIKLQRNIKDSSGKCGVAMMASYPT 338
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/217 (43%), Positives = 132/217 (60%), Gaps = 9/217 (4%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS TG+LE + + +SLSEQ LVDC+QA N+G
Sbjct: 118 VDWRKKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGNEG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL AF+Y+K NGGLD+EE+YPY +D CK+ E ++I E+ L+
Sbjct: 178 CSGGLMDYAFQYVKDNGGLDSEESYPYRAQDESCKYKPEQSAANDTGFMDIHP-EEESLK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVP 275
AV V P+S A + + F+FY G+Y C + +D H ++ VGYG + +
Sbjct: 237 LAVATVGPISAAIDASLSTFQFYHKGIYYDPDCSSENLD--HGILVVGYGSQGEDSEKQK 294
Query: 276 YWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
YW++KNSWG +WG GY M + N CGIAT AS+P
Sbjct: 295 YWIVKNSWGTDWGTQGYILMAKDRDNHCGIATAASFP 331
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 131/218 (60%), Gaps = 11/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS S+E+ G+ ++LSEQ+LV+C+ N G
Sbjct: 146 VDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSG 205
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AF++I NGG+DTE YPY DG C + EN V +D ++ E L
Sbjct: 206 CNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSL 265
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
Q AV +PVSVA E F+ YK+GV++ G +++H VVAVGYG E+G YW+
Sbjct: 266 QKAVAH-QPVSVAIEAGGREFQLYKAGVFT----GTCTTNLDHGVVAVGYGTENGKDYWI 320
Query: 279 IKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
++NSWG WG+ GY +ME N CGIA ASYP
Sbjct: 321 VRNSWGAKWGEDGYIRMERNVNATTGKCGIAMMASYPT 358
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,938,835,519
Number of Sequences: 23463169
Number of extensions: 210110348
Number of successful extensions: 473620
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6645
Number of HSP's successfully gapped in prelim test: 1088
Number of HSP's that attempted gapping in prelim test: 447257
Number of HSP's gapped (non-prelim): 11084
length of query: 314
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 172
effective length of database: 9,027,425,369
effective search space: 1552717163468
effective search space used: 1552717163468
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)