BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 009271
(538 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9LX20|ASPL1_ARATH Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana
GN=At5g10080 PE=1 SV=1
Length = 528
Score = 504 bits (1297), Expect = e-142, Method: Compositional matrix adjust.
Identities = 261/502 (51%), Positives = 347/502 (69%), Gaps = 26/502 (5%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
+ C+LF + + + A FSS+L+HRFSDE + + S +DS P K S+EY
Sbjct: 7 FLLFCVLF--LATEETLASLFSSRLIHRFSDEGRASIKTPSS----SDSLPNKQSLEYYR 60
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL +D++RQ+ N ++ Q L PSEGS+T GN F WLHYTWIDIGTP+VSF
Sbjct: 61 LLAESDFRRQRM-------NLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSF 113
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
LVALD GSNLLW+PC C+QCAPL+++YY+SL ++L+EY+PSSSS+SK CSH LC S
Sbjct: 114 LVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSA 173
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRK 239
S C+S K+ CPY +Y + +TSSSG LV+DILHL + + SSV++ V+IGCG+K
Sbjct: 174 SDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKK 233
Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
Q+G YLDG APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ Q
Sbjct: 234 QSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293
Query: 300 QSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
QST FL + KY Y VGVE+ CIGNSCL Q+ F +DSG SFT+LP EIY +V ++
Sbjct: 294 QSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEI 353
Query: 359 DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV 418
D+ +++ + +G SW+YCY +S+E KVP ++L FS N +FV+ +F F +++G
Sbjct: 354 DRHINATSKNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQ 411
Query: 419 FCLTVMSTDGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAG 476
FCL + S G GI IGQN+M G+R+VFDREN+KL WS SKC+E DK P +
Sbjct: 412 FCLPI-SPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKIEPPQASPGST 468
Query: 477 QSPNPLPTTEQQSTSNGQAAAP 498
SPNPLPT EQQS G A +P
Sbjct: 469 SSPNPLPTDEQQS-RGGHAVSP 489
>sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana
GN=At1g65240 PE=1 SV=2
Length = 475
Score = 114 bits (285), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 166/369 (44%), Gaps = 25/369 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +G+P + V +D GS++LW+ C+ C +C T+L+ LS +D ++SS
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPT-----KTNLNFRLSLFDMNASS 127
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+SK V C C S S + C Y Y+ E TS G + D+L L +
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKT 186
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ V+ GCG Q+G +G +A DGVMG G + SV S LA G + FS C D
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246
Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCLTQSGFQALVDS 339
G +F G ++T +P Y+ +G++ S + S + G +VDS
Sbjct: 247 KGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVDS 304
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKN 398
G + + P +Y ++ + +++ + + L + C++ S+ P + F +
Sbjct: 305 GTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDS 361
Query: 399 QSFVVRNHIFSFPENEGFTVFCLTV--MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAW 454
V H + F E F ++TD + ++G + +V+D +N + W
Sbjct: 362 VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421
Query: 455 SHSKCEEVI 463
+ C I
Sbjct: 422 ADHNCSSSI 430
>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2
PE=1 SV=1
Length = 438
Score = 87.4 bits (215), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/425 (23%), Positives = 176/425 (41%), Gaps = 63/425 (14%)
Query: 56 KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFF-GNQFYWLHYTWI 114
KN +Y L+ KR + R++ S N +L S G +T + G+ Y ++ +
Sbjct: 53 KNLTKYE--LIKRAIKRGERRMR-------SINAMLQSSSGIETPVYAGDGEYLMN---V 100
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
IGTP+ SF +D GS+L+W C+ C QC + ++P SSS +
Sbjct: 101 AIGTPDSSFSAIMDTGSDLIWTQCEPCTQC----------FSQPTPIFNPQDSSSFSTLP 150
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C C+ S + C Y Y + +++ GY+ + ++S ++
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYG-DGSTTQGYMATETFTF--------ETSSVPNIA 201
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VF 290
GCG G A G++G+G G +S+PS L FS C S S +
Sbjct: 202 FGCGEDNQGFGQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPSTLA 254
Query: 291 FGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSGFQ--------ALV 337
G P ST+ + Y++ ++ +G N + S FQ ++
Sbjct: 255 LGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMII 314
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFS 396
DSG + T+LP + Y V F ++ + + C+ S+ ++VP++ + F
Sbjct: 315 DSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFD 374
Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAW 454
+ +I P EG V CL M + G I G ++++D +NL +++
Sbjct: 375 GGVLNLGEQNILISPA-EG--VICL-AMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSF 430
Query: 455 SHSKC 459
++C
Sbjct: 431 VPTQC 435
>sp|Q9LS40|ASPG1_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana
GN=ASPG1 PE=1 SV=1
Length = 500
Score = 84.0 bits (206), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 161/367 (43%), Gaps = 45/367 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ I +GTP + LD GS++ W IQC P + Y ++ ++P+SSS+
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNW-----IQCEPCADCY----QQSDPVFNPTSSSTY 212
Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
K+++CS P C S+C+S K C Y Y + + + G L D + + K
Sbjct: 213 KSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGNSGKI----- 264
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
++V +GCG G + A G+ V S+ + + SFS C + DSG
Sbjct: 265 --NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFSYCLVDRDSG 314
Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSGFQ------- 334
S+ F +T+ L +K D Y+VG+ + +G L + F
Sbjct: 315 KSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG 374
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++D G + T L T+ Y + F KL V+ K+ S + + CY+ SS +KVP +
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVA 434
Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
F+ +S + + P ++ T FC T IIG G RI +D +
Sbjct: 435 FHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVI 493
Query: 453 AWSHSKC 459
S +KC
Sbjct: 494 GLSGNKC 500
>sp|Q6XBF8|CDR1_ARATH Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
Length = 437
Score = 80.9 bits (198), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 106/445 (23%), Positives = 185/445 (41%), Gaps = 72/445 (16%)
Query: 22 VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRV-KLQ 80
+ F++ L+HR S ++ P N +E L N R RV
Sbjct: 29 LGFTADLIHRDSPKS-----------------PFYNPMETSSQRLRNAIHRSVNRVFHFT 71
Query: 81 SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
+N+ + Q+ S + Y ++ + IGTP + D GS+LLW
Sbjct: 72 EKDNTPQPQIDLTSNSGE--------YLMN---VSIGTPPFPIMAIADTGSDLLWT---- 116
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIAD 197
QCAP YT +D +DP +SS+ K+VSCS C ++++SC + + C Y
Sbjct: 117 -QCAPCD-DCYTQVD---PLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLS 171
Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
Y +++ + G + D L L S Q ++IIGCG G++ +
Sbjct: 172 YG-DNSYTKGNIAVDTLTLGSSDTRPMQ---LKNIIIGCGHNNAGTF------NKKGSGI 221
Query: 258 LGDVSVP-SLLAKAG-LIQNSFSICF-----DENDSGSVFFGDQGPATQQ---STSFLPI 307
+G P SL+ + G I FS C ++ + + FG + ST +
Sbjct: 222 VGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAK 281
Query: 308 GEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ Y++ ++S +G+ + S ++DSG + T LPTE Y+E+
Sbjct: 282 ASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVAS 341
Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 420
+ +++ + CY+A+ + LKVP + + F + ++ F +E F
Sbjct: 342 SIDAEKKQDPQSGLSLCYSATGD--LKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFA 398
Query: 421 LTVMSTDGDYGIIGQ-NFMMGHRIV 444
+ YG + Q NF++G+ V
Sbjct: 399 FRGSPSFSIYGNVAQMNFLVGYDTV 423
>sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1
PE=2 SV=1
Length = 410
Score = 78.2 bits (191), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/404 (21%), Positives = 162/404 (40%), Gaps = 58/404 (14%)
Query: 93 PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
PS GN + H+ ++IG P S+ + +D GS L W+ C C C +
Sbjct: 20 PSSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHV 79
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTED 202
Y + L V+C+ LC + C S K C Y+ Y D
Sbjct: 80 LYKPTPKKL-------------VTCADSLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--D 123
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDV 261
+SS G LV D FS A + +++ GCG Q + P D ++GL G V
Sbjct: 124 SSSMGVLVID-----RFSLSASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKV 178
Query: 262 SVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVE 319
++ S L G+I ++ C G +FFGD Q P + + + + KY + G
Sbjct: 179 TLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTL 238
Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSW 374
+ + ++ + + DSGA++T+ + Y + ++S+ ++ + +
Sbjct: 239 HFDSNSKAISAAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRAL 298
Query: 375 KYCYNASSEEMLKVPDMRLIF----------SKNQSFVVRNHIFSFPENEGFTVFCLTVM 424
C+ ++++ + +++ F K + + + EG CL ++
Sbjct: 299 TVCWKG-KDKIVTIDEVKKCFRSLSLEFADGDKKATLEIPPEHYLIISQEGHV--CLGIL 355
Query: 425 STDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
++ +IG M+ +++D E L W + +C+ +
Sbjct: 356 DGSKEHLSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399
>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1
PE=1 SV=1
Length = 437
Score = 73.2 bits (178), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 85/386 (22%), Positives = 157/386 (40%), Gaps = 51/386 (13%)
Query: 93 PSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYY 151
PS + + G+ Y ++ + IGTP F +D GS+L+W CQ C QC
Sbjct: 81 PSGVETSVYAGDGEYLMN---LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC-------- 129
Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
+++ ++P SSS + CS LC++ SS + C Y Y + + + G +
Sbjct: 130 --FNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYG-DGSETQGSMGT 186
Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
+ L S S ++ GCG G A G++G+G G +S+PS L
Sbjct: 187 ETLTFGSVSI--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT- 235
Query: 272 LIQNSFSICFDENDSGS---VFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
FS C S + + G + A +T+ + + Y++ + +G+
Sbjct: 236 ----KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 291
Query: 326 SCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
+ L S F ++DSG + T+ Y V +F ++ ++ + +
Sbjct: 292 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGF 351
Query: 375 KYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 433
C+ S+ L++P + F + + F P N + CL + S+ I
Sbjct: 352 DLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFISPSNG---LICLAMGSSSQGMSIF 408
Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKC 459
G +V+D N ++++ ++C
Sbjct: 409 GNIQQQNMLVVYDTGNSVVSFASAQC 434
>sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2
SV=2
Length = 410
Score = 70.9 bits (172), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/398 (20%), Positives = 158/398 (39%), Gaps = 46/398 (11%)
Query: 93 PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
PS GN + H+ ++IG P + + +D GS L W+ C CI C +
Sbjct: 20 PSSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
Y E + + + + + + C K+ C Y Y SS G L
Sbjct: 80 LYK------PELKYAVKCTEQRCADLYADLRKPMKCGP-KNQCHYGIQYV--GGSSIGVL 130
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
+ D SFS A + +S+ GCG Q + + P +G++GLG G V++ S L
Sbjct: 131 IVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLK 185
Query: 269 KAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY--FVGVESYCIGN 325
G+I ++ C G +FFGD T T + P+ ++ Y G + +
Sbjct: 186 SQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTLQFNSNS 244
Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNA 380
++ + + + DSGA++T+ + Y + +S + + + + C+
Sbjct: 245 KPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKG 304
Query: 381 SSEEMLKVPDMRLIFS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
+++ + +++ F K + + + EG CL ++ ++
Sbjct: 305 -KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEH 361
Query: 431 ------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
+IG M+ +++D E L W + +C+ +
Sbjct: 362 PSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399
>sp|P00793|PEPA_CHICK Pepsin A OS=Gallus gallus GN=PGA PE=1 SV=1
Length = 367
Score = 63.5 bits (153), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 158/372 (42%), Gaps = 92/372 (24%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
+Y I IGTP F V D GS+ LWVP C+ C+ N +DPS S
Sbjct: 59 YYGTISIGTPQQDFSVIFDTGSSNLWVPSIYCKSSACS------------NHKRFDPSKS 106
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+ VS + + YIA Y T S SG L D + ++S
Sbjct: 107 STY--VSTNETV---------------YIA-YGT--GSMSGILGYDTVAVSSI------- 139
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-------VPSLLAKAGLIQNSFSI 279
VQ+ I G + GS+ DG++GL +S +++++ + Q+ FS+
Sbjct: 140 DVQNQ-IFGLSETEPGSFFYYCNFDGILGLAFPSISSSGATPVFDNMMSQHLVAQDLFSV 198
Query: 280 CFDEN-DSGS-VFFGDQGP-ATQQSTSFLPI-GEKYDAYFVGVESYCIGN---SCLTQSG 332
++ ++GS V FG P T + ++P+ E Y + + ++ +GN +C
Sbjct: 199 YLSKDGETGSFVLFGGIDPNYTTKGIYWVPLSAETY--WQITMDRVTVGNKYVACFFTC- 255
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
QA+VD+G S +P Y ++++ +S G S +++ K+PD+
Sbjct: 256 -QAIVDTGTSLLVMPQGAY-------NRIIKDLGVSSDG-------EISCDDISKLPDV- 299
Query: 393 LIFSKNQSFVVRNHIFSFP------ENEGFTVFCLTVMSTDGDYG---IIGQNFMMGHRI 443
+F + H F+ P +G + M T + G I+G F+ + +
Sbjct: 300 -------TFHINGHAFTLPASAYVLNEDGSCMLGFENMGTPTELGEQWILGDVFIREYYV 352
Query: 444 VFDRENLKLAWS 455
+FDR N K+ S
Sbjct: 353 IFDRANNKVGLS 364
>sp|P18242|CATD_MOUSE Cathepsin D OS=Mus musculus GN=Ctsd PE=1 SV=1
Length = 410
Score = 61.2 bits (147), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 150/371 (40%), Gaps = 65/371 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y I IGTP F V D GS+ LWVP I C L
Sbjct: 79 YYGDIGIGTPPQCFTVVFDTGSSNLWVP--SIHCKIL----------------------- 113
Query: 170 KNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
+++C H S S +K+ + Y + S SGYL D + + S + +
Sbjct: 114 -DIACWVHHKYNSDKSSTYVKNGTSFDIHYGS--GSLSGYLSQDTVSVPCKSDQSKARGI 170
Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQNSFSIC 280
+ I G KQ G A DG++G+G +SV +L+ + + +N FS
Sbjct: 171 KVEKQIFGEATKQPGIVFVAAKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFY 230
Query: 281 FDENDSGS-----VFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNS-CLTQSGF 333
+ + G + G S+L + K AY+ V ++ +GN L + G
Sbjct: 231 LNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRK--AYWQVHMDQLEVGNELTLCKGGC 288
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
+A+VD+G S P E E+ K + + + +QG C SS +P + L
Sbjct: 289 EAIVDTGTSLLVGPVEEVKEL----QKAIGAVPL-IQGEYMIPCEKVSS-----LPTVYL 338
Query: 394 -IFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHRIVFD 446
+ KN +I ++G CL+ M D G I+G F+ + VFD
Sbjct: 339 KLGGKNYELHPDKYILKV--SQGGKTICLSGFMGMDIPPPSGPLWILGDVFIGSYYTVFD 396
Query: 447 RENLKLAWSHS 457
R+N ++ ++++
Sbjct: 397 RDNNRVGFANA 407
>sp|P10977|CARPV_CANAX Vacuolar aspartic protease OS=Candida albicans GN=APR1 PE=3 SV=3
Length = 419
Score = 60.5 bits (145), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 105/467 (22%), Positives = 180/467 (38%), Gaps = 87/467 (18%)
Query: 21 AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
A++ +S LV + K +S + ++ NS+ L L N + LQ
Sbjct: 12 ALALTSSLVDAKAHSIKLSKLSNEETLDASNFQEYTNSLANKYLNLFNTAHGNPSNFGLQ 71
Query: 81 S--NNNSSRNQLLFPSEGSQ-----THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNL 133
N + + P +G + T++ Q++ T I IGTP F V LD GS+
Sbjct: 72 HVLTNQEAEVPFVTPKKGGKYDAPLTNYLNAQYF----TEIQIGTPGQPFKVILDTGSSN 127
Query: 134 LWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP 193
LWVP Q C L+ + D + +SS+ V+ S + S
Sbjct: 128 LWVPSQ--DCTSLACFLHAKYDHD--------ASSTYKVNGSEFSIQYGSG--------- 168
Query: 194 YIADYSTEDTSSSGYLVDDILHLASF---SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
S GY+ D+L + + +++ + + G+
Sbjct: 169 ----------SMEGYISQDVLTIGDLVIPGQDFAEATSEPGLAFAFGKF----------- 207
Query: 251 DGVMGLGLGDVSVPSLL------AKAGLIQN-SFSICF-----DENDSG-SVFFGDQGPA 297
DG++GL +SV ++ GL++ F DEND G + F G
Sbjct: 208 DGILGLAYDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDGGLATFGGYDASL 267
Query: 298 TQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVV 356
Q ++LPI K AY+ V E +G+ A +D+G S LP+ + AE++
Sbjct: 268 FQGKITWLPIRRK--AYWEVSFEGIGLGDEYAELHKTGAAIDTGTSLITLPSSL-AEIIN 324
Query: 357 KFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEG 415
K+ ++K SW Y + +PD+ L F+ N + ++I E G
Sbjct: 325 A--KIGATK-------SWSGQYQVDCAKRDSLPDLTLTFAGYNFTLTPYDYIL---EVSG 372
Query: 416 FTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
+ T M GD I+G F+ + ++D + + + +K
Sbjct: 373 SCISVFTPMDFPQPIGDLAIVGDAFLRKYYSIYDLDKNAVGLAPTKV 419
>sp|P22929|CARP_SACFI Acid protease OS=Saccharomycopsis fibuligera GN=PEP1 PE=3 SV=1
Length = 390
Score = 60.1 bits (144), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 97/420 (23%), Positives = 173/420 (41%), Gaps = 71/420 (16%)
Query: 59 VEYLELLLSNDWKRQKTRVKLQSNNNSS----RNQLLFPSEGSQTHFFGNQFYWLHYTWI 114
VE E L+ D+ ++ K ++ +S R L S+ T N+ Y + T I
Sbjct: 21 VEKREKTLTLDFDVKRISSKAKNVTVASSPGFRRNLRAASDAGVTISLENE-YSFYLTTI 79
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+IGTP V +D GS+ LWVP Q ++S Y + YD + S+S K
Sbjct: 80 EIGTPGQKLQVDVDTGSSDLWVPGQG------TSSLYGT-------YDHTKSTSYK---- 122
Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
K RS S D D++ E S G + + F Q Q +
Sbjct: 123 -----KDRSGFSISYGDGSSARGDWAQETVSIGGASITGL----EFGDATSQDVGQGLLG 173
Query: 234 IGC-GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEND--SGSV 289
IG G + + + D ++P L GLI + ++S+ + D SGS+
Sbjct: 174 IGLKGNEASAQSSNSFTYD----------NLPLKLKDQGLIDKAAYSLYLNSEDATSGSI 223
Query: 290 FFGDQGPATQQST----SFLPIGEKYD------AYFVGVESYCIGNSCLTQSGFQALVDS 339
FG + + + I ++ D A+FV +E G+S +T++ + AL+DS
Sbjct: 224 LFGGSDSSKYSGSLATLDLVNIDDEGDSTSGAVAFFVELEGIEAGSSSITKTTYPALLDS 283
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV-PDMRLIFSKN 398
G + + P+ I + + ++ ++ Y Y PD + F+
Sbjct: 284 GTTLIYAPSSIASSIGREY-------------GTYSYSYGGYVTSCDATGPDFKFSFNGK 330
Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
V +++ F +EG + + V+S+ +Y I+G F+ + +D +N ++ + +K
Sbjct: 331 TITVPFSNLL-FQNSEGDSECLVGVLSSGSNYYILGDAFLRSAYVYYDIDNSQVGIAQAK 389
>sp|Q9LHE3|ASPG2_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana
GN=ASPG2 PE=2 SV=1
Length = 470
Score = 58.2 bits (139), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 100/447 (22%), Positives = 179/447 (40%), Gaps = 46/447 (10%)
Query: 32 FSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN--NSSRNQ 89
FSDE+ ++ + + S +N L + D R ++ S SS ++
Sbjct: 51 FSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSR 110
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSA 148
GS +Q ++ I +G+P + +D+GS+++WV CQ C C
Sbjct: 111 YEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC----- 165
Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
Y D +DP+ S S VSC +C + C Y Y + + + G
Sbjct: 166 --YKQSD---PVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYG-DGSYTKGT 219
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
L L +F+K +V +V +GCG + G ++ A G+ G + V
Sbjct: 220 LA---LETLTFAK-----TVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVG-----Q 266
Query: 269 KAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYC 322
+G +F C + +GS+ FG + A S++P+ A Y+VG++
Sbjct: 267 LSGQTGGAFGYCLVSRGTDSTGSLVFGRE--ALPVGASWVPLVRNPRAPSFYYVGLKGLG 324
Query: 323 I---------GNSCLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 372
+ G LT++G +V D+G + T LPT Y F ++ + +
Sbjct: 325 VGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVS 384
Query: 373 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 432
+ CY+ S ++VP + F++ + F P ++ T +C ++ I
Sbjct: 385 IFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGT-YCFAFAASPTGLSI 443
Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKC 459
IG G ++ FD N + + + C
Sbjct: 444 IGNIQQEGIQVSFDGANGFVGFGPNVC 470
>sp|O93428|CATD_CHIHA Cathepsin D OS=Chionodraco hamatus GN=ctsd PE=1 SV=2
Length = 396
Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 97/414 (23%), Positives = 162/414 (39%), Gaps = 72/414 (17%)
Query: 66 LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQT-HFFGNQFYWLHYTWIDIGTPNVSFL 124
L++ KR + +L ++++S + L FP+ + T N +Y I +GTP F
Sbjct: 34 LTDSGKRAE---ELLADHHSLKYNLSFPASNAPTPETLKNYLDAQYYGEIGLGTPPQPFT 90
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHPLCKSRS 183
V D GS+ LWVP I C+ L +++C H S
Sbjct: 91 VVFDTGSSNLWVP--SIHCSLL------------------------DIACLLHHKYNSGK 124
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
S +K+ + Y + S SGYL D + + S + G KQ G
Sbjct: 125 SSTYVKNGTAFAIQYGS--GSLSGYLSQDTCTIGDLAI--------DSQLFGEAIKQPGV 174
Query: 244 YLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQNSFSICFDEN----DSGSVFFG 292
A DG++G+ +SV +++++ + QN FS + N G + G
Sbjct: 175 AFIAAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLG 234
Query: 293 DQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSC-LTQSGFQALVDSGASFTFLPTEI 350
P + F + AY+ + V+S +G+ L G +A+VDSG S P+
Sbjct: 235 GTDP-KYYTGDFNYVNVTRQAYWQIRVDSMAVGDQLSLCTGGCEAIVDSGTSLITGPS-- 291
Query: 351 YAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
V VK + +QG +Y N + L V + Q + + +
Sbjct: 292 ---VEVKALQKAIGAFPLIQG---EYMVNCDTVPSLPVISFTV---GGQVYTLTGEQYIL 342
Query: 411 PENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
+ CL+ M D G I+G FM + VFDR+ ++ ++ +K
Sbjct: 343 KVTQAGKTMCLSGFMGLDIPAPAGPLWILGDVFMGQYYTVFDRDANRVGFAKAK 396
>sp|Q9GMY8|PEPA_SORUN Pepsin A OS=Sorex unguiculatus GN=PGA PE=2 SV=1
Length = 387
Score = 57.0 bits (136), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 90/357 (25%), Positives = 143/357 (40%), Gaps = 62/357 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP I C+ + S N + +DP SS+
Sbjct: 75 YFGTISIGTPPQEFTVIFDTGSSNLWVP--SIYCSSPACS-------NHNRFDPQKSSTF 125
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
K S + + S +G L D + +A +
Sbjct: 126 KPTSQTVSIAYGTGSM--------------------TGVLGYDTVQVAGIAD-------- 157
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
++ I G + + GS+L + DG++GL G V + GL+ Q+ FS+
Sbjct: 158 TNQIFGLSQSEPGSFLYYSPFDGILGLAYPSISSSGATPVFDNMWNQGLVSQDLFSVYLS 217
Query: 283 END-SGSV--FFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
ND SGSV F G S +++P+ E Y + + V+S + G S G QA+V
Sbjct: 218 SNDQSGSVVMFGGIDSSYYTGSLNWVPLSSEGY--WQITVDSITMNGQSIACNGGCQAIV 275
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
D+G S PT A + K +S QG C + +PD+ +
Sbjct: 276 DTGTSLLSGPTNAIANIQSKIGASQNS-----QGQMAVSC-----SSIKNLPDIVFTING 325
Query: 398 NQ-SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
Q +I E + + ++ G+ I+G F+ + VFDR N ++
Sbjct: 326 IQYPLPASAYILQSQEGCSSGFQGMDIPTSSGELWILGDVFIRQYFTVFDRANNQVG 382
>sp|C4YMJ3|CARP2_CANAW Candidapepsin-2 OS=Candida albicans (strain WO-1) GN=SAP2 PE=1 SV=1
Length = 398
Score = 56.6 bits (135), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 149/368 (40%), Gaps = 77/368 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +G+ N V +D GS+ LWVP + C + + YDPS SS+S++++
Sbjct: 74 ITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQTADFCKQKGTYDPSGSSASQDLN 133
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS-KHAPQSSVQSSV 232
P+ Y + +SS G L D + S K+ + V S+
Sbjct: 134 ------------------TPFKIGYG-DGSSSQGTLYKDTVGFGGVSIKNQVLADVDSTS 174
Query: 233 ----IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEND-- 285
I+G G K + G + D +VP L K G+I +N++S+ + D
Sbjct: 175 IDQGILGVGYKTNEA---GGSYD----------NVPVTLKKQGVIAKNAYSLYLNSPDAA 221
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFT 344
+G + FG A + S S + + D + + S + + L+DSG + T
Sbjct: 222 TGQIIFGGVDNA-KYSGSLIALPVTSDRELRISLGSVEVSGKTINTDNVDVLLDSGTTIT 280
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
+L ++ +++ F+ ++ GNS ++ N S D+ FSKN
Sbjct: 281 YLQQDLADQIIKAFNGKLTQDS---NGNSFYEVDCNLSG-------DVVFNFSKNAK--- 327
Query: 404 RNHIFSFPENEGFTVFCLTVMSTDG-------------DYGIIGQNFMMGHRIVFDRENL 450
S P +E F ++ DG D I+G NF+ IV+D +N
Sbjct: 328 ----ISVPASE----FAASLQGDDGQPYDKCQLLFDVNDANILGDNFLRSAYIVYDLDNN 379
Query: 451 KLAWSHSK 458
+++ + K
Sbjct: 380 EISLAQVK 387
>sp|Q9LZL3|PCS1L_ARATH Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
Length = 453
Score = 56.6 bits (135), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 158/385 (41%), Gaps = 78/385 (20%)
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS- 183
+ +D GS L W+ C +S ++ +DP+ SSS + CS P C++R+
Sbjct: 88 MVIDTGSELSWLRCN-----------RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTR 136
Query: 184 ------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
SC S K C Y+ + +SS G L +I H + S+ S++I GC
Sbjct: 137 DFLIPASCDSDKL-CHATLSYA-DASSSEGNLAAEIFHFGN-------STNDSNLIFGCM 187
Query: 238 RKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG- 295
+GS + G++G+ G + S +++ G + S+ I ++ G + GD
Sbjct: 188 GSVSGSDPEEDTKTTGLLGMNRGSL---SFISQMGFPKFSYCISGTDDFPGFLLLGDSNF 244
Query: 296 ---------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQA 335
P + ST LP ++ AY V + + L T +G Q
Sbjct: 245 TWLTPLNYTPLIRISTP-LPYFDRV-AYTVQLTGIKVNGKLLPIPKSVLVPDHTGAG-QT 301
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDK-------LVSSKRISLQGNSWKYCYNAS-----SE 383
+VDSG FTFL +Y + F + QG + CY S S
Sbjct: 302 MVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQG-TMDLCYRISPVRIRSG 360
Query: 384 EMLKVPDMRLIFSKNQSFVV-RNHIFSFPE----NEGFTVFCLTVMSTD---GDYGIIGQ 435
+ ++P + L+F + V + ++ P N+ +V+C T ++D + +IG
Sbjct: 361 ILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGND--SVYCFTFGNSDLMGMEAYVIGH 418
Query: 436 NFMMGHRIVFDRENLKLAWSHSKCE 460
+ I FD + ++ + +C+
Sbjct: 419 HHQQNMWIEFDLQRSRIGLAPVECD 443
>sp|P0DJ06|CARP2_CANAL Candidapepsin-2 OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
GN=SAP2 PE=1 SV=1
Length = 398
Score = 56.2 bits (134), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 149/368 (40%), Gaps = 77/368 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +G+ N V +D GS+ LWVP + C + + YDPS SS+S++++
Sbjct: 74 ITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQTADFCKQKGTYDPSGSSASQDLN 133
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS-KHAPQSSVQSSV 232
P+ Y + +SS G L D + S K+ + V S+
Sbjct: 134 ------------------TPFKIGYG-DGSSSQGTLYKDTVGFGGVSIKNQVLADVDSTS 174
Query: 233 ----IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEND-- 285
I+G G K + G + D +VP L K G+I +N++S+ + D
Sbjct: 175 IDQGILGVGYKTNEA---GGSYD----------NVPVTLKKQGVIAKNAYSLYLNSPDAA 221
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFT 344
+G + FG A + S S + + D + + S + + LVDSG + T
Sbjct: 222 TGQIIFGGVDNA-KYSGSLIALPVTSDRELRISLGSVEVSGKTINTDNVDVLVDSGTTIT 280
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
+L ++ +++ F+ ++ GNS ++ N S D+ FSKN
Sbjct: 281 YLQQDLADQIIKAFNGKLTQDS---NGNSFYEVDCNLSG-------DVVFNFSKNAK--- 327
Query: 404 RNHIFSFPENEGFTVFCLTVMSTDG-------------DYGIIGQNFMMGHRIVFDRENL 450
S P +E F ++ DG D I+G NF+ IV+D ++
Sbjct: 328 ----ISVPASE----FAASLQGDDGQPYDKCQLLFDVNDANILGDNFLRSAYIVYDLDDN 379
Query: 451 KLAWSHSK 458
+++ + K
Sbjct: 380 EISLAQVK 387
>sp|P0CS83|CARP2_CANAX Candidapepsin-2 OS=Candida albicans GN=SAP2 PE=1 SV=1
Length = 398
Score = 54.7 bits (130), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 149/368 (40%), Gaps = 77/368 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +G+ N V +D GS+ LWVP + C + + YDPS SS+S++++
Sbjct: 74 ITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQTADFCKQKGTYDPSGSSASQDLN 133
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS-KHAPQSSVQSSV 232
P+ Y + +SS G L D + S K+ + V S+
Sbjct: 134 ------------------TPFKIGYG-DGSSSQGTLYKDTVGFGGVSIKNQVLADVDSTS 174
Query: 233 ----IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEND-- 285
I+G G K + G + D +VP L K G+I +N++S+ + D
Sbjct: 175 IDQGILGVGYKTNEA---GGSYD----------NVPVTLKKQGVIAKNAYSLYLNSPDAA 221
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFT 344
+G + FG A + S S + + D + + S + + L+DSG + T
Sbjct: 222 TGQIIFGGVDNA-KYSGSLIALPVTSDRELRISLGSVEVSGKTINTDNVDVLLDSGTTIT 280
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
+L ++ +++ F+ ++ GNS ++ N S D+ FSKN
Sbjct: 281 YLQQDLADQIIKAFNGKLTQDS---NGNSFYEVDCNLSG-------DVVFNFSKNAK--- 327
Query: 404 RNHIFSFPENEGFTVFCLTVMSTDG-------------DYGIIGQNFMMGHRIVFDRENL 450
S P +E F ++ DG D I+G NF+ IV+D ++
Sbjct: 328 ----ISVPASE----FAASLQGDDGQPYDKCQLLFDVNDANILGDNFLRSAYIVYDLDDN 379
Query: 451 KLAWSHSK 458
+++ + K
Sbjct: 380 EISLAQVK 387
>sp|Q9DEX3|CATD_CLUHA Cathepsin D OS=Clupea harengus GN=ctsd PE=1 SV=1
Length = 396
Score = 53.9 bits (128), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 94/405 (23%), Positives = 162/405 (40%), Gaps = 77/405 (19%)
Query: 78 KLQSNNNSSRNQLLFPSEGSQT-HFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
+L + NS ++ FPS + T N +Y I +GTP F V D GS+ LW+
Sbjct: 43 QLLAGTNSLQHNQGFPSSNAPTPETLKNYMDAQYYGEIGLGTPVQMFTVVFDTGSSNLWL 102
Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYI 195
P I C S +++C H S +K+ +
Sbjct: 103 P--SIHC------------------------SFTDIACLLHHKYNGAKSSTYVKNGTEFA 136
Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
Y + S SGYL D + V + G KQ G A DG++G
Sbjct: 137 IQYGS--GSLSGYLSQDSCTIGDI--------VVEKQLFGEAIKQPGVAFIAAKFDGILG 186
Query: 256 LGLGDVSVPS-------LLAKAGLIQNSFSICFDEN----DSGSVFFGDQGPATQQST-S 303
+ +SV ++++ + QN FS + N G + G P +
Sbjct: 187 MAYPRISVDGVPPVFDMMMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFN 246
Query: 304 FLPIGEKYDAYF-VGVESYCIGNS-CLTQSGFQALVDSGASF-TFLPTEIYAEVVVKFDK 360
++P+ + AY+ + ++ IG+ L + G +A+VD+G S T P E+ A K
Sbjct: 247 YVPVTRQ--AYWQIHMDGMSIGSQLTLCKDGCEAIVDTGTSLITGPPAEVRA-----LQK 299
Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI-FS-KNQSFVVRNHIFSFPENEGFTV 418
+ + + +QG C KVP + I F+ +++ + + E++G
Sbjct: 300 AIGAIPL-IQGEYMIDCK--------KVPTLPTISFNVGGKTYSLTGEQYVLKESQGGKT 350
Query: 419 FCLT-VMSTD-----GDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
CL+ +M + G I+G F+ + VFDRE+ ++ ++ S
Sbjct: 351 ICLSGLMGLEIPPPAGPLWILGDVFIGQYYTVFDRESNRVGFAKS 395
>sp|P16476|PEPE_CHICK Embryonic pepsinogen OS=Gallus gallus PE=2 SV=1
Length = 383
Score = 52.4 bits (124), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 80/359 (22%), Positives = 136/359 (37%), Gaps = 63/359 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y I IGTP F V D GS+ LWVP
Sbjct: 76 YYGTISIGTPPQDFTVVFDTGSSNLWVP-------------------------------- 103
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+VSC+ P C+S + +P ST S Y D+ S +
Sbjct: 104 -SVSCTSPACQSHQ----MFNPSQSSTYKSTGQNLSIHYGTGDMEGTVGCDTVTVASLMD 158
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL----GDVSVP---SLLAKAGLIQNSFSICFD 282
++ + G + G + DG++GLG D P +++ ++ L QN FS+
Sbjct: 159 TNQLFGLSTSEPGQFFVYVKFDGILGLGYPSLAADGITPVFDNMVNESLLEQNLFSVYLS 218
Query: 283 ENDSGS--VFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLT-QSGFQALVD 338
GS VF G S +++P+ Y Y+ + ++S + + SG QA++D
Sbjct: 219 REPMGSMVVFGGIDESYFTGSINWIPV--SYQGYWQISMDSIIVNKQEIACSSGCQAIID 276
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
+G S P ++ S + Q +Y N S +L +PD+ +
Sbjct: 277 TGTSLVAGPASDINDI--------QSAVGANQNTYGEYSVNCS--HILAMPDVVFVIGGI 326
Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
Q V ++ E G + ++ D I+G F+ + +FDR N ++ + +
Sbjct: 327 QYPVPA---LAYTEQNGQGTCMSSFQNSSADLWILGDVFIRVYYSIFDRANNRVGLAKA 382
>sp|P0DJD9|PEPA5_HUMAN Pepsin A-5 OS=Homo sapiens GN=PGA5 PE=1 SV=1
Length = 388
Score = 51.2 bits (121), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 87/357 (24%), Positives = 144/357 (40%), Gaps = 62/357 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP + C+ L+ + N + ++P SS+
Sbjct: 76 YFGTIGIGTPAQDFTVVFDTGSSNLWVP--SVYCSSLACT-------NHNRFNPEDSSTY 126
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++ S + + S +G L D + + S
Sbjct: 127 QSTSETVSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL G V + GL+ Q+ FS+
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS 218
Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI-GNSCLTQSGFQALV 337
+D SGSV F G S +++P+ + Y+ + V+S + G + G QA+V
Sbjct: 219 ADDKSGSVVIFGGIDSSYYTGSLNWVPV--TVEGYWQITVDSITMNGETIACAEGCQAIV 276
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
D+G S PT A + +S G+ C SS +PD+ +
Sbjct: 277 DTGTSLLTGPTSPIANIQSDIGASENSD-----GDMVVSCSAISS-----LPDIVFTING 326
Query: 398 NQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
Q V + E + F + V + G+ I+G F+ + VFDR N ++
Sbjct: 327 VQYPVPPSAYILQSEGSCISGFQGMNVPTESGELWILGDVFIRQYFTVFDRANNQVG 383
>sp|P00791|PEPA_PIG Pepsin A OS=Sus scrofa GN=PGA PE=1 SV=3
Length = 385
Score = 50.8 bits (120), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 148/362 (40%), Gaps = 72/362 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP + C+ L+ S D N +++P SS+
Sbjct: 73 YFGTIGIGTPAQDFTVIFDTGSSNLWVPS--VYCSSLACS-----DHN--QFNPDDSSTF 123
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ S + S +G L D + + S
Sbjct: 124 EATSQELSITYGTGSM--------------------TGILGYDTVQVGGIS--------D 155
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL +S L GL+ Q+ FS+
Sbjct: 156 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISASGATPVFDNLWDQGLVSQDLFSVYLS 215
Query: 283 END-SGSVFF--GDQGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
ND SGSV G S +++P+ E Y + + ++S + G + G QA+V
Sbjct: 216 SNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGY--WQITLDSITMDGETIACSGGCQAIV 273
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
D+G S PT A + S + + + + + SS + L PD+ +
Sbjct: 274 DTGTSLLTGPTSAIANI--------QSDIGASENSDGEMVISCSSIDSL--PDIVFTING 323
Query: 398 NQ------SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
Q ++++++ EG + V ++ G+ I+G F+ + VFDR N K
Sbjct: 324 VQYPLSPSAYILQDDDSCTSGFEG-----MDVPTSSGELWILGDVFIRQYYTVFDRANNK 378
Query: 452 LA 453
+
Sbjct: 379 VG 380
>sp|Q42456|ASPR1_ORYSJ Aspartic proteinase oryzasin-1 OS=Oryza sativa subsp. japonica
GN=Os05g0567100 PE=2 SV=2
Length = 509
Score = 50.8 bits (120), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 72/298 (24%), Positives = 119/298 (39%), Gaps = 60/298 (20%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I +GTP F V D GS+ LWVP SA Y S
Sbjct: 85 YFGEIGVGTPPQKFTVIFDTGSSNLWVP---------SAKCYFS---------------- 119
Query: 170 KNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
++C H KS S K+ P Y T S +G+ +D + + Q +
Sbjct: 120 --IACFFHSRYKSGQSSTYQKNGKPAAIQYGT--GSIAGFFSEDSVTVGDLVVK-DQEFI 174
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICF 281
+++ K+ G A DG++GLG ++SV + + GL+ + FS F
Sbjct: 175 EAT-------KEPGLTFMVAKFDGILGLGFQEISVGDAVPVWYKMVEQGLVSEPVFSFWF 227
Query: 282 ----DENDSGSVFFGDQGPATQQST-SFLPIGEK-YDAYFVGVESYCIGNSCLTQSGFQA 335
DE + G + FG P+ + +++P+ +K Y + +G + SG A
Sbjct: 228 NRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQFEMGDVLIGGKTTGFCASGCSA 287
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
+ DSG S PT I E+ +++I G + C S+ ++ D+ L
Sbjct: 288 IADSGTSLLAGPTAIITEI---------NEKIGATGVVSQECKTVVSQYGQQILDLLL 336
>sp|Q4LAL9|CATD_CANFA Cathepsin D OS=Canis familiaris GN=CTSD PE=2 SV=1
Length = 410
Score = 50.4 bits (119), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 88/375 (23%), Positives = 154/375 (41%), Gaps = 73/375 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y I IGTP F V D GS+ LWVP I C L
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVP--SIHCKLL----------------------- 113
Query: 170 KNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
+++C H S S +K+ + Y + S SGYL D + + S + + +
Sbjct: 114 -DIACWIHHKYNSGKSSTYVKNGTSFDIHYGS--GSLSGYLSQDTVSVPCKSALSGLAGI 170
Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQNSFSIC 280
+ G KQ G A DG++G+ +SV +L+ + + +N FS
Sbjct: 171 KVERQTFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIFSFY 230
Query: 281 FDENDS----GSVFFGD------QGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNS-CL 328
+ + + G + G +GP S+L + K AY+ V +E +G+S L
Sbjct: 231 LNRDPNAQPGGELMLGGTDSKYYKGP-----LSYLNVTRK--AYWQVHMEQVDVGSSLTL 283
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+ G +A+VD+G S P + V + K + + + +QG Y E++ +
Sbjct: 284 CKGGCEAIVDTGTSLIVGP----VDEVRELQKAIGAVPL-IQGE-----YMIPCEKVSTL 333
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHR 442
PD+ L + + + + ++ ++G CL+ M D G I+G F+ +
Sbjct: 334 PDVTLKLG-GKLYKLSSEDYTLKVSQGGKTICLSGFMGMDIPPPGGPLWILGDVFIGCYY 392
Query: 443 IVFDRENLKLAWSHS 457
VFDR+ ++ + +
Sbjct: 393 TVFDRDQNRVGLAQA 407
>sp|Q9D7R7|PEPC_MOUSE Gastricsin OS=Mus musculus GN=Pgc PE=2 SV=1
Length = 392
Score = 50.4 bits (119), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 84/376 (22%), Positives = 146/376 (38%), Gaps = 88/376 (23%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
+Y I IGTP +FLV D GS+ LWV CQ C + Y+PS S
Sbjct: 76 YYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQSEACT------------THTRYNPSKS 123
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+ + L Y T S +G+ D L + S P
Sbjct: 124 STYYTQGQTFSL------------------QYGT--GSLTGFFGYDTLRVQSI--QVPNQ 161
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-------GLGDVSVPSLLAKAGLIQNSFSI 279
G + G+ A DG+MGL G ++ +L + L Q F +
Sbjct: 162 E------FGLSENEPGTNFVYAQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGV 215
Query: 280 CFDE---NDSGSVFFG--DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC---LTQS 331
++ G + FG D+ T + T ++P+ ++ + + ++ + IGN + S
Sbjct: 216 YLGSQQGSNGGQIVFGGVDENLYTGELT-WIPVTQEL-YWQITIDDFLIGNQASGWCSSS 273
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
G Q +VD+G S +P + E++ + I Q + Y S + + +P +
Sbjct: 274 GCQGIVDTGTSLLVMPAQYLNELL---------QTIGAQEGEYGQ-YFVSCDSVSSLPTL 323
Query: 392 RLIFSKNQ------SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG----IIGQNFMMGH 441
+ + Q S+++ + EG + L +S + + G I+G F+ +
Sbjct: 324 TFVLNGVQFPLSPSSYII--------QEEGSCMVGLESLSLNAESGQPLWILGDVFLRSY 375
Query: 442 RIVFDRENLKLAWSHS 457
VFD N ++ + S
Sbjct: 376 YAVFDMGNNRVGLAPS 391
>sp|P24268|CATD_RAT Cathepsin D OS=Rattus norvegicus GN=Ctsd PE=1 SV=1
Length = 407
Score = 50.1 bits (118), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 146/369 (39%), Gaps = 64/369 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y I IGTP F V D GS+ LWVP I C L
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVP--SIHCKLL----------------------- 113
Query: 170 KNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
+++C H S S +K+ + Y + S SGYL D + + S
Sbjct: 114 -DIACWVHHKYNSDKSSTYVKNGTSFDIHYGS--GSLSGYLSQDTVSVPCKSDLGGIKVE 170
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLIQ-NSFSICF 281
+ I G KQ G A DG++G+G +SV + L K L++ N FS
Sbjct: 171 KQ--IFGEATKQPGVVFIAAKFDGILGMGYPFISVNKVLPVFDNLMKQKLVEKNIFSFYL 228
Query: 282 DENDSGS-----VFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNS-CLTQSGFQ 334
+ + +G + G S+L + K AY+ V ++ +G+ L + G +
Sbjct: 229 NRDPTGQPGGELMLGGTDSRYYHGELSYLNVTRK--AYWQVHMDQLEVGSELTLCKGGCE 286
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
A+VD+G S P + E+ K + + + +QG C SS L + +L
Sbjct: 287 AIVDTGTSLLVGPVDEVKEL----QKAIGAVPL-IQGEYMIPCEKVSS---LPIITFKL- 337
Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHRIVFDRE 448
Q++ + + ++ CL+ M D G I+G F+ + VFDRE
Sbjct: 338 --GGQNYELHPEKYILKVSQAGKTICLSGFMGMDIPPPSGPLWILGDVFIGCYYTVFDRE 395
Query: 449 NLKLAWSHS 457
++ ++ +
Sbjct: 396 YNRVGFAKA 404
>sp|P0DJD7|PEPA4_HUMAN Pepsin A-4 OS=Homo sapiens GN=PGA4 PE=1 SV=1
Length = 388
Score = 50.1 bits (118), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 144/357 (40%), Gaps = 62/357 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP + C+ L+ + N + ++P SS+
Sbjct: 76 YFGTIGIGTPAQDFTVVFDTGSSNLWVP--SVYCSSLACT-------NHNRFNPEDSSTY 126
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++ S + + S +G L D + + S
Sbjct: 127 QSTSETVSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL G V + GL+ Q+ FS+
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS 218
Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI-GNSCLTQSGFQALV 337
+D SGSV F G S +++P+ + Y+ + V+S + G + G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPV--TVEGYWQITVDSITMNGEAIACAEGCQAIV 276
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
D+G S PT A + +S G+ C SS +PD+ +
Sbjct: 277 DTGTSLLTGPTSPIANIQSDIGASENSD-----GDMVVSCSAISS-----LPDIVFTING 326
Query: 398 NQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
Q V + E + F + + + G+ I+G F+ + VFDR N ++
Sbjct: 327 VQYPVPPSAYILQSEGSCISGFQGMNLPTESGELWILGDVFIRQYFTVFDRANNQVG 383
>sp|P0DJD8|PEPA3_HUMAN Pepsin A-3 OS=Homo sapiens GN=PGA3 PE=1 SV=1
Length = 388
Score = 50.1 bits (118), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 144/357 (40%), Gaps = 62/357 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP + C+ L+ + N + ++P SS+
Sbjct: 76 YFGTIGIGTPAQDFTVVFDTGSSNLWVP--SVYCSSLACT-------NHNRFNPEDSSTY 126
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++ S + + S +G L D + + S
Sbjct: 127 QSTSETVSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL G V + GL+ Q+ FS+
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS 218
Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI-GNSCLTQSGFQALV 337
+D SGSV F G S +++P+ + Y+ + V+S + G + G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPV--TVEGYWQITVDSITMNGEAIACAEGCQAIV 276
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
D+G S PT A + +S G+ C SS +PD+ +
Sbjct: 277 DTGTSLLTGPTSPIANIQSDIGASENSD-----GDMVVSCSAISS-----LPDIVFTING 326
Query: 398 NQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
Q V + E + F + + + G+ I+G F+ + VFDR N ++
Sbjct: 327 VQYPVPPSAYILQSEGSCISGFQGMNLPTESGELWILGDVFIRQYFTVFDRANNQVG 383
>sp|Q9XFX3|CARDA_CYNCA Procardosin-A OS=Cynara cardunculus GN=cardA PE=1 SV=1
Length = 504
Score = 50.1 bits (118), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 101/256 (39%), Gaps = 49/256 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ I IGTP F V D GS++LWVP +CI A S Y+ S SS+
Sbjct: 85 YFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKCINSKACRAH---------SMYESSDSST 135
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
K + S I + ++D+ + G LV V
Sbjct: 136 YKENGTFGAIIYGTGS----------ITGFFSQDSVTIGDLV-----------------V 168
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP---SLLAKAGLIQNSFSICF---- 281
+ I + +L DG++GL +SVP ++L + + + FS
Sbjct: 169 KEQDFIEATDEADNVFLH-RLFDGILGLSFQTISVPVWYNMLNQGLVKERRFSFWLNRNV 227
Query: 282 DENDSGSVFFGDQGPAT-QQSTSFLPIGEKYDAYFVGVESYCIGN--SCLTQSGFQALVD 338
DE + G + FG P + +++P+ +Y F G+ IG+ + G QA D
Sbjct: 228 DEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQF-GIGDVLIGDKSTGFCAPGCQAFAD 286
Query: 339 SGASFTFLPTEIYAEV 354
SG S PT I ++
Sbjct: 287 SGTSLLSGPTAIVTQI 302
>sp|P32329|YPS1_YEAST Aspartic proteinase 3 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=YPS1 PE=1 SV=2
Length = 569
Score = 49.7 bits (117), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 54/243 (22%), Positives = 107/243 (44%), Gaps = 49/243 (20%)
Query: 252 GVMGLGLGDVSV-------------------PSLLAKAGLIQ-NSFSICFDENDS--GSV 289
GV+G+GL ++ V P +L +G I+ N++S+ +++D+ G++
Sbjct: 249 GVLGIGLPELEVTYSGSTASHSGKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTI 308
Query: 290 FFGDQGPATQQSTSF-LPIGE-----------KYDAYF--VGVESYCIGNSCLTQSGFQA 335
FG + T + +PI ++D +G+ N LT + A
Sbjct: 309 LFGAVDHSKYTGTLYTIPIVNTLSASGFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPA 368
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
L+DSG + T+LP + + + + SS RI Y + S++ M ++F
Sbjct: 369 LLDSGTTLTYLPQTVVSMIATELGAQYSS-RIGY------YVLDCPSDD-----SMEIVF 416
Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
F + + SF + G T + ++D I+G +F+ +V+D ENL+++ +
Sbjct: 417 DFG-GFHINAPLSSFILSTGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEISMA 475
Query: 456 HSK 458
++
Sbjct: 476 QAR 478
>sp|P27677|PEPA2_MACFU Pepsin A-2/A-3 OS=Macaca fuscata fuscata PE=1 SV=1
Length = 388
Score = 49.3 bits (116), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 90/370 (24%), Positives = 142/370 (38%), Gaps = 88/370 (23%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP + C+ L+ + N + ++P SS+
Sbjct: 76 YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SVYCSSLACT-------NHNRFNPQDSSTY 126
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++ S + + S +G L D + + S
Sbjct: 127 QSTSGTVSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL G V + GL+ Q+ FS+
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS 218
Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
+D SGSV F G S +++P+ E Y + + V+S + G + G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGY--WQISVDSITMNGEAIACAEGCQAIV 276
Query: 338 DSGASFTFLPTEIYA--------------EVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
D+G S PT A E+VV + S I N +Y
Sbjct: 277 DTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISSLPDIVFTINGIQY------- 329
Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
VP I S + GF + V + G+ I+G F+ +
Sbjct: 330 ---PVPPSAYILQSQGSCI-----------SGFQ--GMDVPTESGELWILGDVFIRQYFT 373
Query: 444 VFDRENLKLA 453
VFDR N ++
Sbjct: 374 VFDRANNQVG 383
>sp|Q3EBM5|ASPR1_ARATH Probable aspartic protease At2g35615 OS=Arabidopsis thaliana
GN=At2g35615 PE=3 SV=1
Length = 447
Score = 49.3 bits (116), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 87/397 (21%), Positives = 151/397 (38%), Gaps = 87/397 (21%)
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ I IGTP + D GS+L WV C+ C QC N +D SS+
Sbjct: 86 FMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQC----------YKENGPIFDKKKSSTY 135
Query: 170 KNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
K+ C C++ SS C + C Y Y + + S G + + + + S S +P
Sbjct: 136 KSEPCDSRNCQALSSTERGCDESNNICKYRYSYG-DQSFSKGDVATETVSIDSASG-SPV 193
Query: 226 SSVQSSVIIGCGRKQTGSYLD----------------------------------GAAPD 251
S + GCG G++ + A +
Sbjct: 194 SF--PGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTN 251
Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVFFGDQGPATQQSTSF--LPI 307
G + LG S+PS L+K DSG S D+ P T + + +
Sbjct: 252 GTSVINLGTNSIPSSLSK---------------DSGVVSTPLVDKEPLTYYYLTLEAISV 296
Query: 308 GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS-SKR 366
G+K Y G + L+++ ++DSG + T L + + ++ V+ +KR
Sbjct: 297 GKKKIPY-TGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKR 355
Query: 367 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR----NHIFSFPENEGFTVFCLT 422
+S +C+ + S E + +P++ + F+ VR N E+ + CL+
Sbjct: 356 VSDPQGLLSHCFKSGSAE-IGLPEITVHFTGAD---VRLSPINAFVKLSED----MVCLS 407
Query: 423 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
++ T + I G M + +D E +++ H C
Sbjct: 408 MVPTT-EVAIYGNFAQMDFLVGYDLETRTVSFQHMDC 443
>sp|P07339|CATD_HUMAN Cathepsin D OS=Homo sapiens GN=CTSD PE=1 SV=1
Length = 412
Score = 49.3 bits (116), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 89/388 (22%), Positives = 152/388 (39%), Gaps = 65/388 (16%)
Query: 94 SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS 153
+EG N +Y I IGTP F V D GS+ LWVP I C L
Sbjct: 63 TEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVP--SIHCKLL------- 113
Query: 154 LDRNLSEYDPSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 212
+++C H S S +K+ + Y + S SGYL D
Sbjct: 114 -----------------DIACWIHHKYNSDKSSTYVKNGTSFDIHYGS--GSLSGYLSQD 154
Query: 213 ILHLASFSKHAPQSSVQSSV---IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV------ 263
+ + S + + V + G KQ G A DG++G+ +SV
Sbjct: 155 TVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPV 214
Query: 264 -PSLLAKAGLIQNSFSICF----DENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYF-V 316
+L+ + + QN FS D G + G + S S+L + K AY+ V
Sbjct: 215 FDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRK--AYWQV 272
Query: 317 GVESYCIGNS-CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 375
++ + + L + G +A+VD+G S P + V + K + + + +QG
Sbjct: 273 HLDQVEVASGLTLCKEGCEAIVDTGTSLMVGPV----DEVRELQKAIGAVPL-IQGE--- 324
Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GD 429
Y E++ +P + L + + + ++ ++ CL+ M D G
Sbjct: 325 --YMIPCEKVSTLPAITLKLG-GKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGP 381
Query: 430 YGIIGQNFMMGHRIVFDRENLKLAWSHS 457
I+G F+ + VFDR+N ++ ++ +
Sbjct: 382 LWILGDVFIGRYYTVFDRDNNRVGFAEA 409
>sp|O96009|NAPSA_HUMAN Napsin-A OS=Homo sapiens GN=NAPSA PE=1 SV=1
Length = 420
Score = 49.3 bits (116), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 96/410 (23%), Positives = 151/410 (36%), Gaps = 81/410 (19%)
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
L L W+ KL + + + + S +FG I +GTP +F
Sbjct: 41 LNLLRGWREPAELPKLGAPSPGDKPIFVPLSNYRDVQYFGE---------IGLGTPPQNF 91
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
VA D GS+ LWVP + +C S + +DP +SSS +
Sbjct: 92 TVAFDTGSSNLWVPSR--RCHFFSVPCWLH-----HRFDPKASSSFQANGTK-------- 136
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
+ Y T G L +D L + +SVI G +
Sbjct: 137 ----------FAIQYGTGRV--DGILSEDKLTIGGIKG--------ASVIFGEALWEPSL 176
Query: 244 YLDGAAPDGVMGLGLGDVSVPS------LLAKAGLIQN---SFSICFD--ENDSGSVFFG 292
A DG++GLG +SV +L + GL+ SF + D E D G + G
Sbjct: 177 VFAFAHFDGILGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLG 236
Query: 293 DQGPATQ-QSTSFLPIGEKYDAYF-VGVESYCIGNS-CLTQSGFQALVDSGASFTFLPT- 348
PA +F+P+ AY+ + +E +G L G A++D+G S PT
Sbjct: 237 GSDPAHYIPPLTFVPV--TVPAYWQIHMERVKVGPGLTLCAKGCAAILDTGTSLITGPTE 294
Query: 349 EIYA-EVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
EI A + L++ + I L E+ K+P + + F + H
Sbjct: 295 EIRALHAAIGGIPLLAGEYIIL------------CSEIPKLPAVSFLLG-GVWFNLTAHD 341
Query: 408 FSFPENEGFTVFCLT------VMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
+ CL+ V G + I+G F+ + VFDR ++K
Sbjct: 342 YVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFLGTYVAVFDRGDMK 391
>sp|Q05744|CATD_CHICK Cathepsin D OS=Gallus gallus GN=CTSD PE=1 SV=1
Length = 398
Score = 48.9 bits (115), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 91/373 (24%), Positives = 150/373 (40%), Gaps = 76/373 (20%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
+Y I IGTP F V D GS+ LWVP C + A L +YD S S
Sbjct: 78 YYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLH----------HKYDASKS 127
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+ +++ + Y T S SG+L D + L +
Sbjct: 128 ST------------------YVENGTEFAIHYGT--GSLSGFLSQDTVTLGNLKI----- 162
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLIQ-NSFSI 279
+ I G KQ G A DG++G+ +SV + + + LI+ N FS
Sbjct: 163 ---KNQIFGEAVKQPGITFIAAKFDGILGMAFPRISVDKVTPFFDNVMQQKLIEKNIFSF 219
Query: 280 CFDENDS----GSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNS-CLTQSGF 333
+ + + G + G P S F + AY+ V ++S + N L + G
Sbjct: 220 YLNRDPTAQPGGELLLGGTDPK-YYSGDFSWVNVTRKAYWQVHMDSVDVANGLTLCKGGC 278
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
+A+VD+G S PT+ E+ + +K + ++G Y S +++ +P + L
Sbjct: 279 EAIVDTGTSLITGPTKEVKEL----QTAIGAKPL-IKGQ-----YVISCDKISSLPVVTL 328
Query: 394 IF-SKNQSFVVRNHIFSFPENEGFTVFCLT------VMSTDGDYGIIGQNFMMGHRIVFD 446
+ K ++F +G T+ CL+ V G I+G F+ + VFD
Sbjct: 329 MLGGKPYQLTGEQYVFKV-SAQGETI-CLSGFSGLDVPPPGGPLWILGDVFIGPYYTVFD 386
Query: 447 RENLKLAWSHSKC 459
R+N + + +KC
Sbjct: 387 RDNDSVGF--AKC 397
>sp|P00792|PEPA_BOVIN Pepsin A OS=Bos taurus GN=PGA PE=1 SV=2
Length = 372
Score = 48.5 bits (114), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 151/362 (41%), Gaps = 72/362 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP I C+ + + N + ++P SS+
Sbjct: 60 YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SIYCSSEACT-------NHNRFNPQDSSTY 110
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ S + + S +G L D + + S
Sbjct: 111 EATSETLSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 142
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL G V + GL+ Q+ FS+
Sbjct: 143 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLS 202
Query: 283 EN-DSGS-VFFGD-QGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
N +SGS V FGD S +++P+ E Y + + V+S + G S G QA+V
Sbjct: 203 SNEESGSVVIFGDIDSSYYSGSLNWVPVSVEGY--WQITVDSITMNGESIACSDGCQAIV 260
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
D+G S PT + + S + + +S + + SS + L PD+ +
Sbjct: 261 DTGTSLLAGPTTAISN--------IQSYIGASEDSSGEVVISCSSIDSL--PDIVFTING 310
Query: 398 NQ------SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
Q +++++++ EG + + ++ GD I+G F+ + VFDR N +
Sbjct: 311 VQYPVPPSAYILQSNGICSSGFEG-----MDISTSSGDLWILGDVFIRQYFTVFDRGNNQ 365
Query: 452 LA 453
+
Sbjct: 366 IG 367
>sp|P07267|CARP_YEAST Saccharopepsin OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=PEP4 PE=1 SV=1
Length = 405
Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/382 (23%), Positives = 149/382 (39%), Gaps = 71/382 (18%)
Query: 86 SRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAP 145
SR F +EG N +YT I +GTP +F V LD GS+ LWVP +C
Sbjct: 68 SREHPFF-TEGGHDVPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSN--ECGS 124
Query: 146 LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS 205
L+ + S+YD +SSS K + S + Y ++DT S
Sbjct: 125 LACFLH-------SKYDHEASSSYKANGTEFAIQYGTGSLEG----------YISQDTLS 167
Query: 206 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 265
G L A +++ + + G+ DG++GLG +SV
Sbjct: 168 IGDLTIPKQDFA-------EATSEPGLTFAFGKF-----------DGILGLGYDTISVDK 209
Query: 266 L-------LAKAGLIQNSFSICFD------ENDSGSVFFGDQGPATQQSTSFLPIGEKYD 312
+ + + L + F+ EN + F G + ++LP+ K
Sbjct: 210 VVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRK-- 267
Query: 313 AYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
AY+ V E +G+ A +D+G S LP+ + AE++ + + +K+
Sbjct: 268 AYWEVKFEGIGLGDEYAELESHGAAIDTGTSLITLPSGL-AEMI---NAEIGAKK----- 318
Query: 372 NSWKYCYNASSEEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEGFTVFCLTVMSTD--- 427
W Y +PD LIF+ N +F + + ++ E G + +T M
Sbjct: 319 -GWTGQYTLDCNTRDNLPD--LIFNFNGYNFTIGPYDYTL-EVSGSCISAITPMDFPEPV 374
Query: 428 GDYGIIGQNFMMGHRIVFDREN 449
G I+G F+ + ++D N
Sbjct: 375 GPLAIVGDAFLRKYYSIYDLGN 396
>sp|P80209|CATD_BOVIN Cathepsin D OS=Bos taurus GN=CTSD PE=1 SV=2
Length = 390
Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 85/373 (22%), Positives = 143/373 (38%), Gaps = 69/373 (18%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y I IGTP F V D GS LWVP I C L + +T N
Sbjct: 59 YYGEIGIGTPPQCFTVVFDTGSANLWVP--SIHCKLLDIACWTHRKYN------------ 104
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA-SFSKHAPQSSV 228
S S +K+ + Y + S SGYL D + + + S +P
Sbjct: 105 -----------SDKSSTYVKNGTTFDIHYGS--GSLSGYLSQDTVSVPCNPSSSSPGGVT 151
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSG 287
G KQ G A DG++G+ +SV ++L L+Q D+N
Sbjct: 152 VQRQTFGEAIKQPGVVFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKL---VDKNVFS 208
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVG----------------VESYCIGNS-CLTQ 330
FF ++ P Q + +G Y+ G ++ +G+S + +
Sbjct: 209 --FFLNRDPKAQPGGELM-LGGTDSKYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLTVCK 265
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
G +A+VD+G S P E V + K + + + +QG C SS +P+
Sbjct: 266 GGCEAIVDTGTSLIVGPV----EEVRELQKAIGAVPL-IQGEYMIPCEKVSS-----LPE 315
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHRIV 444
+ + + + + ++ ++ T CL+ M D G I+G F+ + V
Sbjct: 316 VTVKLG-GKDYALSPEDYALKVSQAETTVCLSGFMGMDIPPPGGPLWILGDVFIGRYYTV 374
Query: 445 FDRENLKLAWSHS 457
FDR+ ++ + +
Sbjct: 375 FDRDQNRVGLAEA 387
>sp|P11489|PEPA_MACMU Pepsin A OS=Macaca mulatta GN=PGA PE=2 SV=1
Length = 388
Score = 47.4 bits (111), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 109/257 (42%), Gaps = 51/257 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP + C+ L+ + + NL ++P SS+
Sbjct: 76 YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SVYCSSLACT-----NHNL--FNPQDSSTY 126
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++ S + + S +G L D + + S
Sbjct: 127 QSTSGTLSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL G V + GL+ Q+ FS+
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLS 218
Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
+D SGSV F G S +++P+ E Y + + V+S + G + G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGY--WQISVDSITMNGEAIACAEGCQAIV 276
Query: 338 DSGASFTFLPTEIYAEV 354
D+G S PT A +
Sbjct: 277 DTGTSLLTGPTSPIANI 293
>sp|P03954|PEPA1_MACFU Pepsin A-1 OS=Macaca fuscata fuscata GN=PGA PE=1 SV=2
Length = 388
Score = 47.4 bits (111), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 109/257 (42%), Gaps = 51/257 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP + C+ L+ + + NL ++P SS+
Sbjct: 76 YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SVYCSSLACT-----NHNL--FNPQDSSTY 126
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++ S + + S +G L D + + S
Sbjct: 127 QSTSGTLSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL G V + GL+ Q+ FS+
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLS 218
Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
+D SGSV F G S +++P+ E Y + + V+S + G + G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGY--WQISVDSITMNGEAIACAEGCQAIV 276
Query: 338 DSGASFTFLPTEIYAEV 354
D+G S PT A +
Sbjct: 277 DTGTSLLTGPTSPIANI 293
>sp|Q9GMY6|PEPA_CANFA Pepsin A OS=Canis familiaris GN=PGA PE=2 SV=1
Length = 386
Score = 47.0 bits (110), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 152/401 (37%), Gaps = 93/401 (23%)
Query: 77 VKLQSNNNSSRNQLLFPSEGS--QTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
+K QS N +S+ FP E + T N ++ I IGTP F V D GS+ L
Sbjct: 42 LKNQSPNPASK---YFPQEPTVLATQSLKNYMDMEYFGTIGIGTPPQEFTVIFDTGSSNL 98
Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
WVP + C+ + S N + ++P SS+ + + P
Sbjct: 99 WVP--SVYCSSPACS-------NHNRFNPQESSTYQGTN------------------RPV 131
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
Y T S +G L D + + + ++ I G + GS+L A DG++
Sbjct: 132 SIAYGT--GSMTGILGYDTVQVGGIAD--------TNQIFGLSETEPGSFLYYAPFDGIL 181
Query: 255 GLGLGDVSVPSL------LAKAGLI-QNSFSICFDEND-SGSV--FFGDQGPATQQSTSF 304
GL +S + GL+ Q+ FS+ +D SGSV F G + ++
Sbjct: 182 GLAYPQISASGATPVFDNMWNEGLVSQDLFSVYLSSDDQSGSVVMFGGIDSSYYSGNLNW 241
Query: 305 LPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALVDSGASFTFLPTEI------------ 350
+P+ E Y + + V+S + G + G QA+VD+G S PT
Sbjct: 242 VPVSVEGY--WQITVDSVTMNGQAIACSDGCQAIVDTGTSLLAGPTNAIANIQSYIGASQ 299
Query: 351 --YAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
Y ++V+ + S I N +Y +P I Q V
Sbjct: 300 NSYGQMVISCSAINSLPDIVFTINGIQY----------PLPPSAYILQSQQGCV------ 343
Query: 409 SFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
GF L S G+ I+G F+ + VFDR N
Sbjct: 344 -----SGFQGMNLPTAS--GELWILGDVFIRQYFAVFDRAN 377
>sp|O65390|APA1_ARATH Aspartic proteinase A1 OS=Arabidopsis thaliana GN=APA1 PE=1 SV=1
Length = 506
Score = 47.0 bits (110), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 101/254 (39%), Gaps = 49/254 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y I IGTP F V D GS+ LWVP S+ Y SL L
Sbjct: 82 YYGEIAIGTPPQKFTVVFDTGSSNLWVP---------SSKCYFSLACLL----------- 121
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
HP KS S K+ Y T + +G+ +D + + Q ++
Sbjct: 122 ------HPKYKSSRSSTYEKNGKAAAIHYGT--GAIAGFFSNDAVTVGDLVVK-DQEFIE 172
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLIQNS-FSICF- 281
++ K+ G A DG++GLG ++SV + K GLI+ FS
Sbjct: 173 AT-------KEPGITFVVAKFDGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLN 225
Query: 282 ---DENDSGSVFFGDQGPAT-QQSTSFLPIGEK-YDAYFVGVESYCIGNSCLTQSGFQAL 336
DE + G + FG P + +++P+ +K Y + +G + +SG A+
Sbjct: 226 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAI 285
Query: 337 VDSGASFTFLPTEI 350
DSG S PT I
Sbjct: 286 ADSGTSLLAGPTTI 299
>sp|P03955|PEPC_MACFU Gastricsin (Fragment) OS=Macaca fuscata fuscata GN=PGC PE=1 SV=2
Length = 377
Score = 46.2 bits (108), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 66/257 (25%), Positives = 100/257 (38%), Gaps = 59/257 (22%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
++ I IGTP +FLV D GS+ LWVP CQ C + S ++PS S
Sbjct: 62 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACT------------SHSRFNPSES 109
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+ + L S +G+ D L + S P
Sbjct: 110 STYSTNGQTFSLQYGSGSL--------------------TGFFGYDTLTVQSI--QVPNQ 147
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNS------FSI 279
G + G+ A DG+MGL +SV + A G++Q FS+
Sbjct: 148 E------FGLSENEPGTNFVYAQFDGIMGLAYPTLSVDGATTAMQGMVQEGALTSPIFSV 201
Query: 280 CFDENDS---GSVFFG--DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN--SCLTQSG 332
+ G+V FG D T Q + P+ ++ + +G+E + IG S G
Sbjct: 202 YLSDQQGSSGGAVVFGGVDSSLYTGQ-IYWAPVTQEL-YWQIGIEEFLIGGQASGWCSEG 259
Query: 333 FQALVDSGASFTFLPTE 349
QA+VD+G S +P +
Sbjct: 260 CQAIVDTGTSLLTVPQQ 276
>sp|P27822|PEPA3_RABIT Pepsin-3 OS=Oryctolagus cuniculus PE=2 SV=1
Length = 387
Score = 46.2 bits (108), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 145/360 (40%), Gaps = 68/360 (18%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I IGTP F V D GS+ LWVP + C+ + S + ++++P SS+
Sbjct: 75 YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SVYCSSAACSVH-------NQFNPEDSSTF 125
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ S S + S +G+L D + + +
Sbjct: 126 QATSESLSITYGTGSM--------------------TGFLGYDTVKVGNIE--------D 157
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS------VPSLLAKAGLI-QNSFSICFD 282
++ I G + GS+L A DG++GL +S V + GL+ ++ FS+
Sbjct: 158 TNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATPVFDNMWNEGLVSEDLFSVYLS 217
Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI-GNSCLTQSGFQALV 337
+D SGSV F G S +++P+ Y+ Y+ + ++S + G + QA+V
Sbjct: 218 SDDESGSVVMFGGIDSSYYTGSLNWVPV--SYEGYWQITLDSITMDGETIACADSCQAIV 275
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
D+G S PT + + +S + S M +P++ +
Sbjct: 276 DTGTSLLAGPTSAISNIQSYIGASENSDGEMI----------VSCSSMYSLPNIVFTING 325
Query: 398 NQSFVVRNHIFSFPENE----GFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
Q + V + E++ GF L + G+ I+G F+ + VFDR N +L
Sbjct: 326 VQ-YPVPASAYILEEDDACISGFEGMNLDTYT--GELWILGDVFIRQYFTVFDRANNQLG 382
>sp|P0CY27|CARP1_CANAL Candidapepsin-1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
GN=SAP1 PE=1 SV=1
Length = 391
Score = 46.2 bits (108), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 76/361 (21%), Positives = 139/361 (38%), Gaps = 64/361 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I IG+ F V +D GS+ LWVP + C + Y P SS++S+N+
Sbjct: 68 ITIGSNKQKFNVIVDTGSSDLWVPDASVTCDKPRPGQSADFCKGKGIYTPKSSTTSQNLG 127
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQSSVQSS 231
P+ Y + +SS G L D + AS +K ++S
Sbjct: 128 T------------------PFYIGYG-DGSSSQGTLYKDTVGFGGASITKQVFADITKTS 168
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGL------GDV-SVPSLLAKAGLI-QNSFSICFDE 283
+ P G++G+G GD +VP L G+I +N++S+ +
Sbjct: 169 I-----------------PQGILGIGYKTNEAAGDYDNVPVTLKNQGVIAKNAYSLYLNS 211
Query: 284 ND--SGSVFFGDQGPATQQSTSF-LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 340
+ +G + FG A + +P+ + +G + L+DSG
Sbjct: 212 PNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI--NGNIDVLLDSG 269
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
+ T+L ++ +++ F + S QG+++ Y + + + F N
Sbjct: 270 TTITYLQQDVAQDIIDAFQAELKSDG---QGHTF-YVTDCQTSGTVD-----FNFDNNAK 320
Query: 401 FVVRNHIFSFP---ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
V F+ P N C ++ D I+G NF+ +V+D ++ K++ +
Sbjct: 321 ISVPASEFTAPLSYANGQPYPKCQLLLGIS-DANILGDNFLRSAYLVYDLDDDKISLAQV 379
Query: 458 K 458
K
Sbjct: 380 K 380
>sp|C4YSF6|CARP1_CANAW Candidapepsin-1 OS=Candida albicans (strain WO-1) GN=SAP1 PE=1 SV=1
Length = 391
Score = 45.8 bits (107), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 89/426 (20%), Positives = 160/426 (37%), Gaps = 76/426 (17%)
Query: 49 VADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYW 108
+ D+ P K S ++ L D+ KT V + Q L P + H
Sbjct: 15 LVDASPAKRSPGFVTL----DFDVIKTPVNATGQEGKVKRQAL-PVTLNNEHVS------ 63
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+ I IG+ F V +D GS+ LWVP + C + Y P SS++
Sbjct: 64 -YAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDKPRPGQSADFCKGKGIYTPKSSTT 122
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQS 226
S+N+ P+ Y + +SS G L D + AS +K
Sbjct: 123 SQNLGT------------------PFYIGYG-DGSSSQGTLYKDTVGFGGASITKQVFAD 163
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDV-SVPSLLAKAGLI-QNSFS 278
++S+ P G++G+G GD +VP L G+I +N++S
Sbjct: 164 ITKTSI-----------------PQGILGIGYKTNEAAGDYDNVPVTLKNQGVIAKNAYS 206
Query: 279 ICFDEND--SGSVFFGDQGPATQQSTSF-LPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 335
+ + + +G + FG A + +P+ + +G +
Sbjct: 207 LYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI--NGNIDV 264
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
L+DSG + T+L ++ +++ F + S QG+++ Y + + + F
Sbjct: 265 LLDSGTTITYLQQDVAQDIIDAFQAELKSDG---QGHTF-YVTDCQTSGTVD-----FNF 315
Query: 396 SKNQSFVVRNHIFSFP---ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
N V F+ P N C ++ D I+G NF+ +V+D ++ K+
Sbjct: 316 DNNVKISVPASEFTAPLSYANGQPYPKCQLLLGIS-DANILGDNFLRSAYLVYDLDDDKI 374
Query: 453 AWSHSK 458
+ + K
Sbjct: 375 SLAQVK 380
>sp|P20142|PEPC_HUMAN Gastricsin OS=Homo sapiens GN=PGC PE=1 SV=1
Length = 388
Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/257 (25%), Positives = 99/257 (38%), Gaps = 59/257 (22%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
++ I IGTP +FLV D GS+ LWVP CQ C + S ++PS S
Sbjct: 73 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACT------------SHSRFNPSES 120
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+ + L S +G+ D L + S P
Sbjct: 121 STYSTNGQTFSLQYGSGSL--------------------TGFFGYDTLTVQSI--QVPNQ 158
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-PSLLAKAGLIQNS------FSI 279
G + G+ A DG+MGL +SV + A G++Q FS+
Sbjct: 159 E------FGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQGMVQEGALTSPVFSV 212
Query: 280 CFDEND---SGSVFFG--DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN--SCLTQSG 332
G+V FG D T Q + P+ ++ + +G+E + IG S G
Sbjct: 213 YLSNQQGSSGGAVVFGGVDSSLYTGQ-IYWAPVTQEL-YWQIGIEEFLIGGQASGWCSEG 270
Query: 333 FQALVDSGASFTFLPTE 349
QA+VD+G S +P +
Sbjct: 271 CQAIVDTGTSLLTVPQQ 287
>sp|P55956|ASP3_CAEEL Aspartic protease 3 OS=Caenorhabditis elegans GN=asp-3 PE=1 SV=2
Length = 398
Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/373 (20%), Positives = 137/373 (36%), Gaps = 70/373 (18%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y + IGTP +F V D GS+ LWVPC ++ + D
Sbjct: 69 YYGPVTIGTPPQNFQVLFDTGSSNLWVPCANCPFGDIACRMHNRFD-------------- 114
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
CK SSC + + Y T S G + +D++ H
Sbjct: 115 ---------CKKSSSCTATG--ASFEIQYGT--GSMKGTVDNDVVCFG----HDTTYCTD 157
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQN---SFSI 279
+ + C + G A DG+ G+G +SV + A + + +N +F +
Sbjct: 158 KNQGLACATSEPGITFVAAKFDGIFGMGWDTISVNKISQPMDQIFANSAICKNQLFAFWL 217
Query: 280 CFDEND---SGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 335
D ND G + + P + ++ P+ + D + + + S I + T +
Sbjct: 218 SRDANDITNGGEITLCETDPNHYVGNIAWEPLVSE-DYWRIKLASVVIDGTTYTSGPIDS 276
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDM-RL 393
+VD+G S PT++ ++ K + +N E E K+P + +
Sbjct: 277 IVDTGTSLLTGPTDVIKKIQHKIGGIP--------------LFNGEYEVECSKIPSLPNI 322
Query: 394 IFS---KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNFMMGHRIVF 445
F+ +N +++I G + M D G I+G F+ VF
Sbjct: 323 TFNLGGQNFDLQGKDYILQMSNGNGGSTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVF 382
Query: 446 DRENLKLAWSHSK 458
D N ++ ++ S+
Sbjct: 383 DHGNKRVGFATSR 395
>sp|P42211|ASPRX_ORYSJ Aspartic proteinase OS=Oryza sativa subsp. japonica GN=RAP PE=2
SV=2
Length = 496
Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/262 (25%), Positives = 109/262 (41%), Gaps = 57/262 (21%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSS 168
+Y I +G+P +F V D GS+ LWVP SA Y S+ L S Y+ SSS
Sbjct: 77 YYGVIGLGSPPQNFTVIFDTGSSNLWVP---------SAKCYFSIACYLHSRYNSKKSSS 127
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
K +CK + I+ + ++D G LV V
Sbjct: 128 YK---------ADGETCK-ITYGSGAISGFFSKDNVLVGDLV-----------------V 160
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQNSFSICF 281
++ I R+ + +++ G DG++GLG ++SV S+ + L + FS
Sbjct: 161 KNQKFIEATRETSVTFIIGKF-DGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWL 219
Query: 282 ----DENDSGSVFFGDQGPATQQST-SFLPIGEK-YDAYFVG---VESYCIGNSCLTQSG 332
D + G + FG P + +++P+ K Y + +G ++ + G G
Sbjct: 220 NRDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTG---FCAKG 276
Query: 333 FQALVDSGASFTFLPTEIYAEV 354
A+VDSG S PT I A+V
Sbjct: 277 CAAIVDSGTSLLAGPTAIVAQV 298
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.131 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 199,400,780
Number of Sequences: 539616
Number of extensions: 8448450
Number of successful extensions: 30679
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 63
Number of HSP's successfully gapped in prelim test: 131
Number of HSP's that attempted gapping in prelim test: 30261
Number of HSP's gapped (non-prelim): 380
length of query: 538
length of database: 191,569,459
effective HSP length: 122
effective length of query: 416
effective length of database: 125,736,307
effective search space: 52306303712
effective search space used: 52306303712
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (29.3 bits)