BLASTP 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 036910
(314 letters)

Database: swissprot
539,616 sequences; 191,569,459 total letters

Searching..................................................done

>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 492 bits (1266), Expect = e-138, Method: Compositional matrix adjust.
Identities = 242/361 (67%), Positives = 277/361 (76%), Gaps = 50/361 (13%)
Query: 1 MARPVQLVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHAL 60
M+ + L SS++L+L AAA++ FD+SNPI++VS D L + E +V+Q++GQ+RH L
Sbjct: 1 MSVKLNLSSSILLILF--AAAASKEIGFDESNPIKMVS-DNLHELEDTVVQILGQSRHVL 57
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------ 108
SF+RF RYGK Y+SVEEMKLRF+ F +NLDLIRSTN KGLSY+L LN
Sbjct: 58 SFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQR 117
Query: 109 -----------------------------------ISPVKDQGHCGSCWTFSTTGSLEAA 133
+SPVK+QGHCGSCWTFSTTG+LEAA
Sbjct: 118 YKLGAAQNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177
Query: 134 YHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV 193
YHQAFGKGISLSEQQLVDCA FNN GC+GGLPSQAFEYIKYNGGLDTEEAYPYTGKDG
Sbjct: 178 YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237
Query: 194 CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCG 253
CKFS++N+GVQV DSVNITLGAEDEL+HAVGLVRPVSVAFEVV FRFYK GV++S CG
Sbjct: 238 CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCG 297
Query: 254 NTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVV 313
NTPMDVNHAV+AVGYGVED VPYWLIKNSWG WGD+GYFKMEMGKNMCG+ATC+SYPVV
Sbjct: 298 NTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGVATCSSYPVV 357
Query: 314 A 314
A
Sbjct: 358 A 358
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 488 bits (1255), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/334 (71%), Positives = 263/334 (78%), Gaps = 48/334 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FD+SNPIR+VS DGLR+ E SV Q++GQ+RH LSFARF RYGK Y++VEEMKLRF+ F
Sbjct: 26 FDESNPIRMVS-DGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
+NLDLIRSTN KGLSY+LG+N
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKGSHKVTEAALPET 144
Query: 109 --------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+SPVKDQG CGSCWTFSTTG+LEAAYHQAFGKGISLSEQQLVDCA AFNN G
Sbjct: 145 KDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYG 204
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGGLPSQAFEYIK NGGLDTE+AYPYTGKD CKFS+ENVGVQVL+SVNITLGAEDEL+
Sbjct: 205 CNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNSVNITLGAEDELK 264
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
HAVGLVRPVS+AFEV+ FR YKSGVY+ + CG+TPMDVNHAV+AVGYGVEDGVPYWLIK
Sbjct: 265 HAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIK 324
Query: 281 NSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NSWG +WGD GYFKMEMGKNMCGIATCASYPVVA
Sbjct: 325 NSWGADWGDKGYFKMEMGKNMCGIATCASYPVVA 358
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 451 bits (1159), Expect = e-126, Method: Compositional matrix adjust.
Identities = 224/353 (63%), Positives = 253/353 (71%), Gaps = 49/353 (13%)
Query: 11 VILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYG 70
V+ ++ A A+ S F DSNPIR V+ E++V +G+ R AL FARFA RYG
Sbjct: 8 VLAVVVLADTAAVVNSGFADSNPIRPVTDRAASALESTVFAALGRTRDALRFARFAVRYG 67
Query: 71 KIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN---------------------- 108
K YES E+ RF FS++L L+RSTN KGLSYRLG+N
Sbjct: 68 KSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRATRLGAAQNCS 127
Query: 109 ---------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKG 141
+SPVK+QGHCGSCWTFSTTG+LEAAY QA GK
Sbjct: 128 ATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKP 187
Query: 142 ISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENV 201
ISLSEQQLVDC AFNN GCNGGLPSQAFEYIKYNGGLDTEE+YPY G +G+CKF +ENV
Sbjct: 188 ISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGICKFKNENV 247
Query: 202 GVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNH 261
GV+VLDSVNITLGAEDEL+ AVGLVRPVSVAFEV+ GFR YKSGVY+S CG TPMDVNH
Sbjct: 248 GVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNH 307
Query: 262 AVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
AV+AVGYGVEDGVPYWLIKNSWG +WGD GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 308 AVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCASYPIVA 360
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 443 bits (1140), Expect = e-124, Method: Compositional matrix adjust.
Identities = 225/355 (63%), Positives = 260/355 (73%), Gaps = 53/355 (14%)
Query: 10 SVILLLCCA--AAASASASSFDDSNPIR-LVSSDGLRDFETSVLQVIGQARHALSFARFA 66
S++L+L A A A ++F D NPIR +V D + E +LQV+GQ R ALSFARFA
Sbjct: 5 SLVLILVAGLFATALAGPATFADKNPIRQVVFPD---ELENGILQVVGQTRSALSFARFA 61
Query: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------------ 108
R+ K Y+SVEE+K RF F NL +IRS N KGLSY+LG+N
Sbjct: 62 IRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRKHKLGAS 121
Query: 109 -----------------------------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFG 139
+SPVK QG CGSCWTFSTTG+LEAAY QAFG
Sbjct: 122 QNCSATTKGNLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQAFG 181
Query: 140 KGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSE 199
KGISLSEQQLVDCA AFNN GCNGGLPSQAFEYIK+NGGLDTEEAYPYTGK+G+CKFS
Sbjct: 182 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGICKFSQA 241
Query: 200 NVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDV 259
N+GV+V+ SVNITLGAE EL++AV LVRPVSVAFEVV GF+ YKSGVY+ST+CG+TPMDV
Sbjct: 242 NIGVKVISSVNITLGAEYELKYAVALVRPVSVAFEVVKGFKQYKSGVYASTECGDTPMDV 301
Query: 260 NHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
NHAV+AVGYGVE+G PYWLIKNSWG +WG+ GYFKMEMGKNMCG+ATCASYP+VA
Sbjct: 302 NHAVLAVGYGVENGTPYWLIKNSWGADWGEDGYFKMEMGKNMCGVATCASYPIVA 356
>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
PE=2 SV=2
Length = 362
Score = 441 bits (1134), Expect = e-123, Method: Compositional matrix adjust.
Identities = 214/335 (63%), Positives = 249/335 (74%), Gaps = 48/335 (14%)
Query: 28 FDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFS 87
FDDSNPIR V+ E++V+ +G+ R AL FARFA R+GK Y E++ RF FS
Sbjct: 28 FDDSNPIRSVTDHAASALESTVIAALGRTRDALRFARFAVRHGKRYGDAAEVQRRFRIFS 87
Query: 88 KNLDLIRSTNCKGLSYRLGLN--------------------------------------- 108
++L+L+RSTN +GL YRLG+N
Sbjct: 88 ESLELVRSTNRRGLPYRLGINRFADMSWEEFQASRLGAAQNCSATLAGNHRMRDAAALPE 147
Query: 109 ---------ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQ 159
+SPVKDQGHCGSCWTFSTTGSLEAAY QA GK +SLSEQQLVDCA A+NN
Sbjct: 148 TKDWREDGIVSPVKDQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNF 207
Query: 160 GCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDEL 219
GC+GGLPSQAFEYIKYNGGLDTEEAYPYTG +G+C + ENVGV+VLDSVNITLGAEDEL
Sbjct: 208 GCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICHYKPENVGVKVLDSVNITLGAEDEL 267
Query: 220 QHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
++AVGLVRPVSVAF+V++GFR YKSGVY+S CG +PMDVNHAV+AVGYGVE+GVPYWLI
Sbjct: 268 KNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLI 327
Query: 280 KNSWGENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
KNSWG +WGD+GYFKMEMGKNMCGIATCASYP+VA
Sbjct: 328 KNSWGADWGDNGYFKMEMGKNMCGIATCASYPIVA 362
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 429 bits (1102), Expect = e-119, Method: Compositional matrix adjust.
Identities = 211/331 (63%), Positives = 240/331 (72%), Gaps = 48/331 (14%)
Query: 32 NPIRLVSSDGLRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLD 91
NPIR V+ E++VL +G+ RHAL FARFA RYGK YES E++ RF FS++L+
Sbjct: 31 NPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVRYGKSYESAAEVRRRFRIFSESLE 90
Query: 92 LIRSTNCKGLSYRLGLN------------------------------------------- 108
+RSTN KGL YRLG+N
Sbjct: 91 EVRSTNRKGLPYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDW 150
Query: 109 -----ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNG 163
+SPVK+Q HCGSCWTFSTTG+LEAAY QA GK ISLSEQQLVDCA FNN GCNG
Sbjct: 151 REDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNG 210
Query: 164 GLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAV 223
GLPSQAFEYIKYNGG+DTEE+YPY G +GVC + +EN VQVLDSVNITL AEDEL++AV
Sbjct: 211 GLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAV 270
Query: 224 GLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
GLVRPVSVAF+V+DGFR YKSGVY+S CG TP DVNHAV+AVGYGVE+GVPYWLIKNSW
Sbjct: 271 GLVRPVSVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSW 330
Query: 284 GENWGDHGYFKMEMGKNMCGIATCASYPVVA 314
G +WGD+GYFKMEMGKNMC IATCASYPVVA
Sbjct: 331 GADWGDNGYFKMEMGKNMCAIATCASYPVVA 361
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 277 bits (709), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 143/305 (46%), Positives = 181/305 (59%), Gaps = 49/305 (16%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------- 108
A F + +++ K Y S E R F+ N I++ N + ++++GLN
Sbjct: 27 AIEKFHFTSWMKQHQKTYSS-REYSHRLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSF 85
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
+SPVK+QG CGSCWTFSTT
Sbjct: 86 AEIKHKYLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTT 145
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE+A A GK ++L+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E++YPY
Sbjct: 146 GALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPY 205
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVY 247
GK+G CKF+ E V + VNITL E + AV L PVS AFEV + F YKSGVY
Sbjct: 206 IGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVY 265
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATC 307
SS C TP VNHAV+AVGYG ++G+ YW++KNSWG NWG++GYF +E GKNMCG+A C
Sbjct: 266 SSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAAC 325
Query: 308 ASYPV 312
ASYP+
Sbjct: 326 ASYPI 330
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 276 bits (706), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 142/305 (46%), Positives = 180/305 (59%), Gaps = 49/305 (16%)
Query: 56 ARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------- 108
A F + +++ K Y SVE R F+ N I++ N + ++++ LN
Sbjct: 27 AIEKFHFKSWMKQHQKTYSSVE-YNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSF 85
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
+SPVK+QG CGSCWTFSTT
Sbjct: 86 AEIKHKFLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTT 145
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
G+LE+A A GK +SL+EQQLVDCAQAFNN GC GGLPSQAFEYI YN G+ E++YPY
Sbjct: 146 GALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPY 205
Query: 188 TGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVY 247
GKD C+F+ + V + VNITL E + AV L PVS AFEV + F YKSGVY
Sbjct: 206 IGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVY 265
Query: 248 SSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATC 307
SS C TP VNHAV+AVGYG ++G+ YW++KNSWG WG++GYF +E GKNMCG+A C
Sbjct: 266 SSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAAC 325
Query: 308 ASYPV 312
ASYP+
Sbjct: 326 ASYPI 330
>sp|O46427|CATH_PIG Pro-cathepsin H OS=Sus scrofa GN=CTSH PE=1 SV=1
Length = 335
Score = 275 bits (703), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 142/301 (47%), Positives = 176/301 (58%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
L F + ++ K Y S+EE R F N I + N +++LGLN
Sbjct: 33 LHFKSWMVQHQKKY-SLEEYHHRLQVFVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIR 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKGNYLRGTGPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E+ YPY G+D
Sbjct: 152 SAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
CKF + V D NIT+ E+ + AV L PVS AFEV + F Y+ G+YSST
Sbjct: 212 DHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG E+G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
Length = 335
Score = 271 bits (693), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 174/301 (57%), Gaps = 49/301 (16%)
Query: 60 LSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN----------- 108
F + ++ K Y S EE R TF+ N I + N ++++ LN
Sbjct: 33 FHFKSWMSKHRKTY-STEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK 91
Query: 109 -------------------------------------ISPVKDQGHCGSCWTFSTTGSLE 131
+SPVK+QG CGSCWTFSTTG+LE
Sbjct: 92 HKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALE 151
Query: 132 AAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 191
+A A GK +SL+EQQLVDCAQ FNN GC GGLPSQAFEYI YN G+ E+ YPY GKD
Sbjct: 152 SAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD 211
Query: 192 GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFYKSGVYSSTK 251
G CKF V D NIT+ E+ + AV L PVS AFEV F Y++G+YSST
Sbjct: 212 GYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTS 271
Query: 252 CGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMCGIATCASYP 311
C TP VNHAV+AVGYG ++G+PYW++KNSWG WG +GYF +E GKNMCG+A CASYP
Sbjct: 272 CHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
Query: 312 V 312
+
Sbjct: 332 I 332
>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
Length = 335
Score = 271 bits (692), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 140/310 (45%), Positives = 176/310 (56%), Gaps = 49/310 (15%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
++ + F + ++ K Y S EE R F+ NL I + N + ++++GLN
Sbjct: 24 ELAANSLEKFHFQSWMVQHQKKYSS-EEYYHRLQAFASNLREINAHNARNHTFKMGLNQF 82
Query: 109 ----------------------------------------------ISPVKDQGHCGSCW 122
++PVK+QG CGSCW
Sbjct: 83 SDMSFDELKRKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCW 142
Query: 123 TFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTE 182
TFSTTG+LE+A A GK L+EQQLVDCAQ FNN GC GGLPSQAFEYI+YN G+ E
Sbjct: 143 TFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGE 202
Query: 183 EAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVDGFRFY 242
+ YPY G+DG CK+ V D NITL E+ + AV L PVS AFEV F Y
Sbjct: 203 DTYPYRGQDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMY 262
Query: 243 KSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGKNMC 302
+ G+YSST C TP VNHAV+AVGYG E G+PYW++KNSWG NWG GYF +E GKNMC
Sbjct: 263 RKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNMC 322
Query: 303 GIATCASYPV 312
G+A CAS+P+
Sbjct: 323 GLAACASFPI 332
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 223 bits (567), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 111/215 (51%), Positives = 140/215 (65%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FSTTGSLE + G ISL+EQQLVDC++ + QG
Sbjct: 111 VDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQG 170
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
CNGG + AF+YIK N G+DTE AYPY +DG C+F S +V NI G+E LQ
Sbjct: 171 CNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQ 230
Query: 221 HAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV + P+SV + F+FY SGVY C +P ++HAV+AVGYG E G +WL+
Sbjct: 231 QAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSC--SPSYLDHAVLAVGYGSEGGQDFWLV 288
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSW +WGD GY KM + N CGIAT ASYP+V
Sbjct: 289 KNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323
>sp|P09648|CATL1_CHICK Cathepsin L1 (Fragments) OS=Gallus gallus GN=CTSL1 PE=1 SV=1
Length = 218
Score = 217 bits (552), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 111/218 (50%), Positives = 143/218 (65%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVKDQG CGSCW FSTTG+LE + + GK +SLSEQ LVDC++ N
Sbjct: 3 RSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEGN 62
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV-CKFSSENVGVQVLDSVNITLGAED 217
QGCNGGL QAF+Y++ NGG+D+EE+YPYT KD C++ +E V+I G E
Sbjct: 63 QGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHER 122
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPY 276
L AV V PVSVA + F+FY+SG+Y C + D++H V+ VGYG E G Y
Sbjct: 123 ALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGGKKY 180
Query: 277 WLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
W++KNSWGE WGD GY M KN CGIAT ASYP+V
Sbjct: 181 WIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 211 bits (537), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 139/218 (63%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 124 KSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGN 183
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY G D C F+ +G V+I G E++
Sbjct: 184 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEK 243
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
++ AV + PVSVA + + F+ Y GVY+ +C +D H V+ VGYG E G+ Y
Sbjct: 244 MKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLD--HGVLVVGYGTDESGMDY 301
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WG+ GY KM + N CGIAT +SYP V
Sbjct: 302 WLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 208 bits (529), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VKDQGHCGSCW FS+TG+LE + + G +SLSEQ LVDC+ + N
Sbjct: 156 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 215
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GCNGGL AF YIK NGG+DTE++YPY D C F+ VG +I G E +
Sbjct: 216 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKK 275
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV-EDGVPY 276
+ AV V PVSVA + + F+FY GVY+ +C +D H V+ VG+G E G Y
Sbjct: 276 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLD--HGVLVVGFGTDESGEDY 333
Query: 277 WLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WL+KNSWG WGD G+ KM K N CGIA+ +SYP+V
Sbjct: 334 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 371
>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
Length = 326
Score = 208 bits (529), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 142/215 (66%), Gaps = 4/215 (1%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++ VKDQG+CGSCW FSTTG++E Y + IS SEQQLVDC+ + N G
Sbjct: 112 IDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNG 171
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C+GGL A++Y+K GL+TE +YPYT +G C+++ + +V + G+E EL+
Sbjct: 172 CSGGLMENAYQYLK-QFGLETESSYPYTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELK 230
Query: 221 HAVGLVRPVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIK 280
+ VG RP +VA +V F Y+SG+Y S C +P+ VNHAV+AVGYG + G YW++K
Sbjct: 231 NLVGARRPAAVAVDVESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYGTQGGTDYWIVK 288
Query: 281 NSWGENWGDHGYFKMEMGK-NMCGIATCASYPVVA 314
NSWG WG+ GY +M + NMCGIA+ AS P+VA
Sbjct: 289 NSWGTYWGERGYIRMARNRGNMCGIASLASLPMVA 323
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 205 bits (521), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 159/306 (51%), Gaps = 55/306 (17%)
Query: 61 SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCK----GLSYRLGLN-------- 108
S+ F ++G+ Y +EE + R F NL I N K ++Y L +N
Sbjct: 19 SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNE 78
Query: 109 --------------------------------------ISPVKDQGHCGSCWTFSTTGSL 130
++PVKDQG CGSCW FSTTG +
Sbjct: 79 KFNAVMKGYKKGPRPAAVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGI 138
Query: 131 EAAYHQAFGKGISLSEQQLVDCAQ-AFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 189
E + G+ +SLSEQQLVDCA ++ NQGCNGG +A Y++ NGG+DTE +YPY
Sbjct: 139 EGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEA 198
Query: 190 KDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVVD-GFRFYKSGVYS 248
+D C+F+S +G V I G+E L+ A + P+SVA + F+ Y +GVY
Sbjct: 199 RDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYY 258
Query: 249 STKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATC 307
C ++ +D HAV+AVGYG E G +WL+KNSW +WG+ GY KM + N CGIAT
Sbjct: 259 EPSCSSSQLD--HAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATD 316
Query: 308 ASYPVV 313
A YP V
Sbjct: 317 ACYPTV 322
>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
Length = 334
Score = 204 bits (518), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 108/230 (46%), Positives = 139/230 (60%), Gaps = 13/230 (5%)
Query: 90 LDLIRSTNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQL 149
LDL +S + + Y ++PVK+Q CGSCW FS TG+LE + GK +SLSEQ L
Sbjct: 112 LDLPKSVDWRKKGY-----VTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 150 VDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV 209
VDC++ NQGCNGG ++AF+Y+K NGGLD+EE+YPY D +CK+ EN
Sbjct: 167 VDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFT 226
Query: 210 NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 268
+ G E L AV V P+SVA + F+FYKSG+Y C + +D H V+ VGY
Sbjct: 227 VVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD--HGVLVVGY 284
Query: 269 GVE----DGVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
G E + YWL+KNSWG WG +GY K+ K N CGIAT ASYP V
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 202 bits (515), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 103/215 (47%), Positives = 135/215 (62%), Gaps = 5/215 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQ CGSCW FS TG+LE + + +SLSEQQLVDC+ + N G
Sbjct: 110 VDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDG 169
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+YIK NGG+DTE +YPY +D C+F + ++G SV + E+ LQ
Sbjct: 170 CGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQ 228
Query: 221 HAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + F+FY SGVY C +P ++H V+AVGYG E YWL+
Sbjct: 229 EAVSGVGPISVAIDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLV 286
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
KNSWG +WGD GY KM + N CGIA+ SYP V
Sbjct: 287 KNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321
>sp|Q54TR1|CFAD_DICDI Counting factor associated protein D OS=Dictyostelium discoideum
GN=cfaD PE=1 SV=1
Length = 531
Score = 202 bits (514), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 158/303 (52%), Gaps = 52/303 (17%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN------------- 108
F + +Y K Y S +E RF F +I + N K SY+LG+N
Sbjct: 225 FKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYADLSNKEFNTL 284
Query: 109 ------------------------------------ISPVKDQGHCGSCWTFSTTGSLEA 132
++PVKDQG CGSCWTF +TGSLE
Sbjct: 285 VKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEG 344
Query: 133 AYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 192
G+ +SLSEQQLVDCA +QGC GG S AF+Y+ G L TE YPY ++G
Sbjct: 345 TNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYLMQNG 404
Query: 193 VCKFSS-ENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSST 250
+C+ + GV + VN+T G+E LQ+A+ PV++A + VD FR+Y SGVY++
Sbjct: 405 LCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNP 464
Query: 251 KCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCAS 309
C N D++H V+A+GYG G Y+L+KNSW NWG GY M N+CG+++ A+
Sbjct: 465 ACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYVYMARNDNNLCGVSSQAT 524
Query: 310 YPV 312
YP+
Sbjct: 525 YPI 527
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 199 bits (507), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/318 (37%), Positives = 166/318 (52%), Gaps = 60/318 (18%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
V ++ SF + R K Y E M R+ F KN+D + + N KG LGLN
Sbjct: 23 NVFSHKQYQDSFIDWMRSNNKAYTHKEFMP-RYEEFKKNMDYVHNWNSKGSKTVLGLNQH 81
Query: 109 ---------------------------------------------------ISPVKDQGH 117
++PVKDQG
Sbjct: 82 ADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQ 141
Query: 118 CGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNG 177
CGSC++FSTTGS+E GK +SLSEQ ++DC+ +F N+GCNGGL + AFEYI N
Sbjct: 142 CGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNN 201
Query: 178 GLDTEEAYPYTGK-DGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV 236
GL++EE YPY K + CKF +V ++ I G E++LQ+A+ L+ PVSVA +
Sbjct: 202 GLNSEEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQNAL-LLNPVSVAIDAS 260
Query: 237 -DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKM 295
+ F+ Y +GVY C + D++H V+AVG G ++G Y+++KNSWG +WG +GY M
Sbjct: 261 HNSFQLYTAGVYYEPAC--SSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHM 318
Query: 296 EMGK-NMCGIATCASYPV 312
K N CGI+T ASYP+
Sbjct: 319 ARNKDNNCGISTMASYPI 336
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 195 bits (496), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ A N
Sbjct: 116 KSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + +D H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD--HGVLLVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG WG GY K+ + N CG+AT ASYPVV
Sbjct: 293 NKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 194 bits (493), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 106/221 (47%), Positives = 134/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS +G LE GK ISLSEQ LVDC+ N
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
QGCNGGL AF+YIK NGGLD+EE+YPY KDG CK+ +E V+I E
Sbjct: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKA 234
Query: 219 LQHAVGLVRPVSVAFEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + +FY SG+Y C + D++H V+ VGYG E +
Sbjct: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNK 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
YWL+KNSWG+ WG GY K+ + N CG+AT ASYP+V
Sbjct: 293 DKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
>sp|Q80UB0|TEST2_MOUSE Testin-2 OS=Mus musculus PE=2 SV=1
Length = 333
Score = 193 bits (491), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R+ ++PVK+QG+C S W FS TGSLE + G+ + LSEQ L+DC +
Sbjct: 116 KYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVT 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
C+GG AF+Y+K NGGL TEE+YPY G C++ +EN V D V I G E+
Sbjct: 176 HDCSGGFMQNAFQYVKDNGGLATEESYPYIGPGRKCRYHAENSAANVRDFVQIP-GREEA 234
Query: 219 LQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + D F+FY SG+Y +C + +NHAV+ VGYG E DG
Sbjct: 235 LMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKR--VHLNHAVLVVGYGFEGEESDG 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY K+ N CGIAT A+YP+V
Sbjct: 293 NSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333
>sp|Q10991|CATL1_SHEEP Cathepsin L1 OS=Ovis aries GN=CTSL PE=1 SV=1
Length = 217
Score = 192 bits (489), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 102/208 (49%), Positives = 129/208 (62%), Gaps = 6/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVD ++ NQGCNGGL
Sbjct: 13 VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQGCNGGLMDN 72
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AF+YIK NGGLD+EE+YPY D C + E + V+I E L AV V P
Sbjct: 73 AFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDIPQ-REKALMKAVATVGP 131
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
+SVA + F+FYKSG+Y C + D++H V+ VGYG E +W++KNSWG
Sbjct: 132 ISVAIDAGHSSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTNNKFWIVKNSWGPE 189
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY KM + N CGIAT ASYP V
Sbjct: 190 WGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 192 bits (488), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 132/221 (59%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y+ NGGLD+EE+YPY + CK++ E V+I E
Sbjct: 176 EGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 SKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2
Length = 329
Score = 192 bits (487), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY+ GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPISVAIDASLTSFQFYRKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|P15242|TEST2_RAT Testin-2 OS=Rattus norvegicus GN=Testin PE=1 SV=2
Length = 333
Score = 192 bits (487), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 136/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QGHC S W FS TGSLE + + I LSEQ L+DC +
Sbjct: 116 KRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVT 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
GC+GG AF+Y+K NGGL TEE+YPY G+ C++ +EN V D V I G+E+
Sbjct: 176 HGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGRECRYHAENSAANVRDFVQIP-GSEEA 234
Query: 219 LQHAVGLVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + G F+FY SG+Y +C + +NHAV+ VGYG E DG
Sbjct: 235 LMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKR--VHLNHAVLVVGYGFEGEESDG 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
+WL+KNSWGE WG GY K+ N CGIAT ++YP+V
Sbjct: 293 NSFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPIV 333
>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330
Score = 191 bits (486), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 238 RAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGKKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 191 bits (486), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++A N
Sbjct: 116 KSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLGAED 217
+GCNGGL AF Y+K NGGLD+EE+YPY G+D C + E V++ E
Sbjct: 176 EGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-REK 234
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE---DG 273
L AV + P+SVA + F+FYKSG+Y C + D++H V+ VGYG E
Sbjct: 235 ALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDC--SSKDLDHGVLVVGYGFEGTDSN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG +GY KM + N CGIAT ASYP V
Sbjct: 293 NKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 191 bits (485), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 126/208 (60%), Gaps = 5/208 (2%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVKDQG CGSCW FS +LE A+ G +SLSEQ LVDC+ ++ NQGCNGG P Q
Sbjct: 118 VTPVKDQGQCGSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQ 177
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
A++YI N G+DTE +YPY D C++ + N+G V V G E LQHAV P
Sbjct: 178 AYQYIIANRGIDTESSYPYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGP 237
Query: 229 VSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-DGVPYWLIKNSWGEN 286
VSV + F Y GVY C + NHAV AVGYG + +G YW++KNSWG
Sbjct: 238 VSVCIDAGQSSFGSYGGGVYYEPNCDS--WYANHAVTAVGYGTDANGGDYWIVKNSWGAW 295
Query: 287 WGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG+ GY KM + N C IAT + YPVV
Sbjct: 296 WGESGYIKMARNRDNNCAIATYSVYPVV 323
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 191 bits (485), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 120 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 177
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALK 237
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 238 RAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWII 295
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 191 bits (484), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 131/212 (61%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++A NQGCNGGL
Sbjct: 126 VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK NGGLD+EE+YPY D C + E V+I E L AV V
Sbjct: 186 AFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-REKALMKAVATVG 244
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNS 282
P+SVA + F+FYKSG+Y C + D++H V+ VGYG E + +W++KNS
Sbjct: 245 PISVAIDAGHTSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNS 302
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG WG +GY KM + N CGIAT ASYP V
Sbjct: 303 WGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
Length = 333
Score = 190 bits (483), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 9/221 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
+ + +R ++PVK+QG CGSCW FS TG+LE + G+ ISLSEQ LVDC+ N
Sbjct: 116 RSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDE 218
+GCNGGL AF+Y++ NGGLD+EE+YPY + CK++ + V+I E
Sbjct: 176 EGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKA 234
Query: 219 LQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DG 273
L AV V P+SVA + + F FYK G+Y C + MD H V+ VGYG E D
Sbjct: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDN 292
Query: 274 VPYWLIKNSWGENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
YWL+KNSWGE WG GY KM +N CGIA+ ASYP V
Sbjct: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 190 bits (482), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/332 (35%), Positives = 164/332 (49%), Gaps = 75/332 (22%)
Query: 51 QVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLN-- 108
Q + ++ +F + + K Y S EE R+ F N+D ++ N KG LGLN
Sbjct: 19 QQFSELQYRNAFTDWMITHQKSYTS-EEFGARYNIFKANMDYVQQWNSKGSETVLGLNNF 77
Query: 109 ----------------------------------------------ISPVKDQGHCGSCW 122
++PVK+QG CG CW
Sbjct: 78 ADITNEEYRNTYLGTKFDASSLIGTQEEKVFTTSSAASKDWRSEGAVTPVKNQGQCGGCW 137
Query: 123 TFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTE 182
+FSTTGS E A+ Q+ G+ +SLSEQ L+DC+ N GC+GGL + AFEYI N G+DTE
Sbjct: 138 SFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE--NSGCDGGLMTYAFEYIINNNGIDTE 195
Query: 183 EAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVAFEVV-DGFRF 241
+YPY ++G C++ SEN G + +T G+E L+ AV V PVSVA + F+
Sbjct: 196 SSYPYKAENGKCEYKSENSGATLSSYKTVTAGSESSLESAVN-VNPVSVAIDASHQSFQL 254
Query: 242 YKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGV-------------------PYWLIKNS 282
Y SG+Y +C + +D H V+AVGYG G YW++KNS
Sbjct: 255 YTSGIYYEPECSSENLD--HGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNS 312
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG +WG GY M + N CGIA+ AS+PVV
Sbjct: 313 WGTSWGIEGYILMSRNRDNNCGIASSASFPVV 344
>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK PE=1 SV=1
Length = 329
Score = 189 bits (481), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G+D C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + +VNHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSD--NVNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGESWGNKGYILMARNKNNACGIANLASFP 327
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 189 bits (480), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 189 bits (480), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329
Score = 189 bits (480), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ N G+D+E+AYPY G++ C ++ + I G E L+
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSVA + + F+FY GVY C + ++NHAV+AVGYG++ G +W+I
Sbjct: 237 RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGENWG+ GY M K N CGIA AS+P
Sbjct: 295 KNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 189 bits (479), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/222 (46%), Positives = 135/222 (60%), Gaps = 10/222 (4%)
Query: 99 KGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNN 158
K + +R ++ VK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++ N
Sbjct: 116 KSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGN 175
Query: 159 QGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAED 217
QGCNGGL AF+Y+K NGGLDTEE+YPY G++ C + E V+I E
Sbjct: 176 QGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ-REK 234
Query: 218 ELQHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----D 272
L AV V P+SVA + F+FYKSG+Y C + D++H V+ VGYG E +
Sbjct: 235 ALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSN 292
Query: 273 GVPYWLIKNSWGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
+W++KNSWG WG +GY KM + N CGI+T ASYP V
Sbjct: 293 SSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
Length = 329
Score = 188 bits (478), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 130/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVS--ENYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ NGG+D+E+AYPY G+D C +++ + I +G E L+
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V PVSV+ + + F+FY GVY C +VNHAV+ VGYG + G YW+I
Sbjct: 237 RAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDRD--NVNHAVLVVGYGTQKGNKYWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY + K N CGI AS+P
Sbjct: 295 KNSWGESWGNKGYVLLARNKNNACGITNLASFP 327
>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
Length = 334
Score = 187 bits (475), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 130/212 (61%), Gaps = 10/212 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PVK+QG CGSCW FS TG+LE + GK +SLSEQ LVDC++A NQGCNGGL
Sbjct: 126 VTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDN 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKD-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 227
AF+YIK NG LD+EE+YPY D C + E V+I E L AV V
Sbjct: 186 AFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-REKALMKAVATVG 244
Query: 228 PVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE----DGVPYWLIKNS 282
P+SVA + F+FYKSG+Y C + D++H V+ VGYG E + +W++KNS
Sbjct: 245 PISVAIDAGHTSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNS 302
Query: 283 WGENWGDHGYFKMEMGK-NMCGIATCASYPVV 313
WG WG +GY KM + N CGIAT ASYP V
Sbjct: 303 WGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|Q90686|CATK_CHICK Cathepsin K OS=Gallus gallus GN=CTSK PE=2 SV=1
Length = 334
Score = 187 bits (474), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 126/213 (59%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVKDQG CGSCW FS+ G+LE + GK +SLS Q LV C NN G
Sbjct: 124 VDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVS--NNNG 181
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AFEY++ N G+D+E+AYPY G+D C +S + I E L+
Sbjct: 182 CGGGYMTNAFEYVRLNRGIDSEDAYPYIGQDESCMYSPTGKAAKCRGYREIPEDNEKALK 241
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV + PVSV + + F+FY GVY T C P ++NHAV+AVGYG + G +W+I
Sbjct: 242 RAVARIGPVSVGIDASLPSFQFYSRGVYYDTGC--NPENINHAVLAVGYGAQKGTKHWII 299
Query: 280 KNSWGENWGDHGYFKMEMG-KNMCGIATCASYP 311
KNSWG WG+ GY + K CGIA AS+P
Sbjct: 300 KNSWGTEWGNKGYVLLARNMKQTCGIANLASFP 332
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 186 bits (472), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/274 (40%), Positives = 156/274 (56%), Gaps = 23/274 (8%)
Query: 56 ARHALSFARFAR----RYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG------LSYRL 105
A + L +F Y K+Y R +KN++ S G + +R
Sbjct: 94 ATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQ 153
Query: 106 GLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGL 165
++P+KDQG CGSCW FSTT ++E G+ ISLSEQ+LVDC +++N QGCNGGL
Sbjct: 154 KGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYN-QGCNGGL 212
Query: 166 PSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVG 224
AF++I NGGL+TE+ YPY G G C +N V +D ++ E L+ A+
Sbjct: 213 MDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAIS 272
Query: 225 LVRPVSVAFEVVDG-FRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSW 283
+PVSVA E F+ Y+SG+++ + CG +++HAVVAVGYG E+GV YW+++NSW
Sbjct: 273 Y-QPVSVAIEAGGRIFQHYQSGIFTGS-CG---TNLDHAVVAVGYGSENGVDYWIVRNSW 327
Query: 284 GENWGDHGYFKMEMG-----KNMCGIATCASYPV 312
G WG+ GY +ME CGIA ASYPV
Sbjct: 328 GPRWGEEGYIRMERNLAASKSGKCGIAVEASYPV 361
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 186 bits (471), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 158/311 (50%), Gaps = 66/311 (21%)
Query: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKG----LSYRLGLN--------- 108
+A + +GK Y +V E + R+A F NL I N S+RLGLN
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 109 -----------------------------------------ISPVKDQGHCGSCWTFSTT 127
++ +KDQG CGSCW FS
Sbjct: 100 YRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAI 159
Query: 128 GSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPY 187
++E G ISLSEQ+LVDC ++N +GCNGGL AF++I NGG+DTE+ YPY
Sbjct: 160 AAVEGINQIVTGDLISLSEQELVDCDTSYN-EGCNGGLMDYAFDFIINNGGIDTEDDYPY 218
Query: 188 TGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAVGLVRPVSVAFEV-VDGFRFYKSG 245
GKD C + +N V +DS ++T +E LQ AV +PVSVA E F+ Y SG
Sbjct: 219 KGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVA-NQPVSVAIEAGGRAFQLYSSG 277
Query: 246 VYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHGYFKMEMG----KNM 301
+++ KCG ++H V AVGYG E+G YW+++NSWG++WG+ GY +ME
Sbjct: 278 IFTG-KCGTA---LDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGK 333
Query: 302 CGIATCASYPV 312
CGIA SYP+
Sbjct: 334 CGIAVEPSYPL 344
>sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2
Length = 329
Score = 186 bits (471), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 130/213 (61%), Gaps = 6/213 (2%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ YR ++PVK+QG CGSCW FS+ G+LE + GK ++LS Q LVDC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVT--ENYG 176
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQ 220
C GG + AF+Y++ NGG+D+E+AYPY G+D C +++ + I +G E L+
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236
Query: 221 HAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLI 279
AV V P+SV+ + + F+FY GVY C +VNHAV+ VGYG + G +W+I
Sbjct: 237 RAVARVGPISVSIDASLASFQFYSRGVYYDENCDRD--NVNHAVLVVGYGTQKGSKHWII 294
Query: 280 KNSWGENWGDHGYFKMEMGK-NMCGIATCASYP 311
KNSWGE+WG+ GY + K N CGI AS+P
Sbjct: 295 KNSWGESWGNKGYALLARNKNNACGITNMASFP 327
>sp|Q9R014|CATJ_MOUSE Cathepsin J OS=Mus musculus GN=Ctsj PE=2 SV=2
Length = 334
Score = 184 bits (467), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 97/211 (45%), Positives = 125/211 (59%), Gaps = 9/211 (4%)
Query: 109 ISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQ 168
++PV++QG CGSCW F+ G++E G LS Q L+DC++ N+GC G Q
Sbjct: 126 VTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQ 185
Query: 169 AFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRP 228
AFEY+ N GL+ E YPY GKDG C++ SEN + D VN+ E L AV + P
Sbjct: 186 AFEYVLKNKGLEAEATYPYEGKDGPCRYRSENASANITDYVNLPPN-ELYLWVAVASIGP 244
Query: 229 VSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYG----VEDGVPYWLIKNSW 283
VS A + D FRFY G+Y C + VNHAV+ VGYG V+DG YWLIKNSW
Sbjct: 245 VSAAIDASHDSFRFYNGGIYYEPNC--SSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSW 302
Query: 284 GENWGDHGYFKMEMG-KNMCGIATCASYPVV 313
GE WG +GY ++ N CGIA+ ASYP +
Sbjct: 303 GEEWGMNGYMQIAKDHNNHCGIASLASYPNI 333
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 184 bits (467), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 98/218 (44%), Positives = 134/218 (61%), Gaps = 11/218 (5%)
Query: 101 LSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQG 160
+ +R ++PVK+QG CGSCW FS ++E+ G+ I+LSEQ+LV+C+ N G
Sbjct: 145 VDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSG 204
Query: 161 CNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDEL 219
CNGGL AF++I NGG+DTE+ YPY DG C + EN V +D ++ E L
Sbjct: 205 CNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSL 264
Query: 220 QHAVGLVRPVSVAFEV-VDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWL 278
Q AV +PVSVA E F+ Y SGV+S +CG + ++H VVAVGYG ++G YW+
Sbjct: 265 QKAVAH-QPVSVAIEAGGREFQLYHSGVFSG-RCGTS---LDHGVVAVGYGTDNGKDYWI 319
Query: 279 IKNSWGENWGDHGYFKMEMGKNM----CGIATCASYPV 312
++NSWG WG+ GY +ME N+ CGIA ASYP
Sbjct: 320 VRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPT 357
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 184 bits (466), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 113/274 (41%), Positives = 156/274 (56%), Gaps = 22/274 (8%)
Query: 55 QARHALSFARFAR----RYGKIYESVEEMKLRFATFSKNLDLIRSTNCK------GLSYR 104
A + L FA Y +Y +R T +KN+++ S + +R
Sbjct: 48 NATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWR 107
Query: 105 LGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGG 164
++ +KDQG CGSCW FST ++E G+ +SLSEQ+LVDC +++N QGCNGG
Sbjct: 108 QKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYN-QGCNGG 166
Query: 165 LPSQAFEYIKYNGGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSV-NITLGAEDELQHAV 223
L AF++I NGGL+TE+ YPY G +G C +N V +D ++ E L+ AV
Sbjct: 167 LMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAV 226
Query: 224 GLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNS 282
+PVSVA + F+ Y+SG+++ KCG T MD HAVVAVGYG E+GV YW+++NS
Sbjct: 227 SY-QPVSVAIDAGGRAFQHYQSGIFTG-KCG-TNMD--HAVVAVGYGSENGVDYWIVRNS 281
Query: 283 WGENWGDHGYFKMEMG----KNMCGIATCASYPV 312
WG WG+ GY +ME CGIA ASYPV
Sbjct: 282 WGTRWGEDGYIRMERNVASKSGKCGIAIEASYPV 315
>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
Length = 376
Score = 183 bits (465), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 105/257 (40%), Positives = 143/257 (55%), Gaps = 43/257 (16%)
Query: 96 TNCKGLSYRLGLNISPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQA 155
TN K + +R ++P+KDQG CGSCW+FSTTGS E A+ K +SLSEQ LVDC+
Sbjct: 122 TNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGP 181
Query: 156 FNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG-VCKFSSENVGVQVLDSVNITLG 214
N GC+GGL + AF+YI N G+DTE +YPYT + G C F+ ++G + VNIT G
Sbjct: 182 EENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNKSDIGATIKGYVNITAG 241
Query: 215 AEDELQHAVGLVRPVSVAFEVV-DGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVE-- 271
+E L++ PVSVA + + F+ Y SG+Y KC +P +++H V+ VGYGV+
Sbjct: 242 SEISLENGAQH-GPVSVAIDASHNSFQLYTSGIYYEPKC--SPTELDHGVLVVGYGVQGK 298
Query: 272 -DGVP----------------------------------YWLIKNSWGENWGDHGYFKME 296
D P YW++KNSWG +WG GY M
Sbjct: 299 DDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKANNYWIVKNSWGTSWGIKGYILMS 358
Query: 297 MG-KNMCGIATCASYPV 312
KN CGIA+ +SYP+
Sbjct: 359 KDRKNNCGIASVSSYPL 375
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.134 0.409
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 117,320,289
Number of Sequences: 539616
Number of extensions: 5024492
Number of successful extensions: 11934
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 218
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 11055
Number of HSP's gapped (non-prelim): 339
length of query: 314
length of database: 191,569,459
effective HSP length: 117
effective length of query: 197
effective length of database: 128,434,387
effective search space: 25301574239
effective search space used: 25301574239
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)
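
Note on the statistics footer: the effective lengths and the bit values in parentheses can be re-derived from the other numbers printed above. The following is a minimal sketch of that arithmetic, not BLAST's own code. It assumes the usual Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, and it assumes that X1 and S1 use the first Lambda/K pair (ungapped) while X2, X3 and S2 use the second pair (gapped). That pairing is inferred only from the fact that it reproduces the printed values. The variable and function names are illustrative.

```python
import math

# Values copied from the report footer.
query_len = 314            # "length of query"
db_letters = 191_569_459   # "length of database"
db_seqs = 539_616          # "Number of sequences in database"
hsp_len = 117              # "effective HSP length"

# Assumed pairing: first Lambda/K line is ungapped, second is gapped.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.319, 0.134
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410

# Effective lengths: the HSP length is subtracted once from the query
# and once per database sequence.
eff_query = query_len - hsp_len
eff_db = db_letters - db_seqs * hsp_len
search_space = eff_query * eff_db


def bits(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def drop_bits(raw_drop, lam):
    """Convert an X-drop (a score difference) to bits; K does not enter."""
    return lam * raw_drop / math.log(2)


if __name__ == "__main__":
    print("effective length of query:   ", eff_query)
    print("effective length of database:", eff_db)
    print("effective search space:      ", search_space)
    print("X1 bits:", round(drop_bits(16, LAMBDA_UNGAPPED), 1))
    print("X2 bits:", round(drop_bits(38, LAMBDA_GAPPED), 1))
    print("X3 bits:", round(drop_bits(64, LAMBDA_GAPPED), 1))
    print("S1 bits:", round(bits(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
    print("S2 bits:", round(bits(61, LAMBDA_GAPPED, K_GAPPED), 1))
```

The per-hit bit scores and E-values in the alignments are not recomputed here, because those hits use compositional matrix adjustment and the adjusted parameters are not printed in the report.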