BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 009271
         (538 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9LX20|ASPL1_ARATH Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana
           GN=At5g10080 PE=1 SV=1
          Length = 528

 Score =  504 bits (1297), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 261/502 (51%), Positives = 347/502 (69%), Gaps = 26/502 (5%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
            +  C+LF  +  + + A  FSS+L+HRFSDE +    + S     +DS P K S+EY  
Sbjct: 7   FLLFCVLF--LATEETLASLFSSRLIHRFSDEGRASIKTPSS----SDSLPNKQSLEYYR 60

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL  +D++RQ+        N  ++ Q L PSEGS+T   GN F WLHYTWIDIGTP+VSF
Sbjct: 61  LLAESDFRRQRM-------NLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSF 113

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
           LVALD GSNLLW+PC C+QCAPL+++YY+SL  ++L+EY+PSSSS+SK   CSH LC S 
Sbjct: 114 LVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSA 173

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRK 239
           S C+S K+ CPY  +Y + +TSSSG LV+DILHL   + +      SSV++ V+IGCG+K
Sbjct: 174 SDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKK 233

Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
           Q+G YLDG APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ Q
Sbjct: 234 QSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293

Query: 300 QSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
           QST FL +   KY  Y VGVE+ CIGNSCL Q+ F   +DSG SFT+LP EIY +V ++ 
Sbjct: 294 QSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEI 353

Query: 359 DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV 418
           D+ +++   + +G SW+YCY +S+E   KVP ++L FS N +FV+   +F F +++G   
Sbjct: 354 DRHINATSKNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQ 411

Query: 419 FCLTVMSTDGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAG 476
           FCL + S  G  GI  IGQN+M G+R+VFDREN+KL WS SKC+E  DK       P + 
Sbjct: 412 FCLPI-SPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKIEPPQASPGST 468

Query: 477 QSPNPLPTTEQQSTSNGQAAAP 498
            SPNPLPT EQQS   G A +P
Sbjct: 469 SSPNPLPTDEQQS-RGGHAVSP 489


>sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana
           GN=At1g65240 PE=1 SV=2
          Length = 475

 Score =  114 bits (285), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/369 (26%), Positives = 166/369 (44%), Gaps = 25/369 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +G+P   + V +D GS++LW+ C+ C +C        T+L+  LS +D ++SS
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPT-----KTNLNFRLSLFDMNASS 127

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +SK V C    C   S   S +    C Y   Y+ E TS  G  + D+L L   +     
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKT 186

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
             +   V+ GCG  Q+G   +G +A DGVMG G  + SV S LA  G  +  FS C D  
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246

Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCLTQSGFQALVDS 339
             G +F  G       ++T  +P    Y+   +G++    S  +  S +   G   +VDS
Sbjct: 247 KGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVDS 304

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKN 398
           G +  + P  +Y  ++   + +++ + + L      + C++ S+      P +   F  +
Sbjct: 305 GTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDS 361

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTV--MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAW 454
               V  H + F   E    F      ++TD   +  ++G   +    +V+D +N  + W
Sbjct: 362 VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421

Query: 455 SHSKCEEVI 463
           +   C   I
Sbjct: 422 ADHNCSSSI 430


>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2
           PE=1 SV=1
          Length = 438

 Score = 87.4 bits (215), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/425 (23%), Positives = 176/425 (41%), Gaps = 63/425 (14%)

Query: 56  KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFF-GNQFYWLHYTWI 114
           KN  +Y   L+    KR + R++       S N +L  S G +T  + G+  Y ++   +
Sbjct: 53  KNLTKYE--LIKRAIKRGERRMR-------SINAMLQSSSGIETPVYAGDGEYLMN---V 100

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
            IGTP+ SF   +D GS+L+W  C+ C QC            +    ++P  SSS   + 
Sbjct: 101 AIGTPDSSFSAIMDTGSDLIWTQCEPCTQC----------FSQPTPIFNPQDSSSFSTLP 150

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C    C+   S     + C Y   Y  + +++ GY+  +            ++S   ++ 
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYG-DGSTTQGYMATETFTF--------ETSSVPNIA 201

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VF 290
            GCG    G      A  G++G+G G +S+PS L         FS C     S S   + 
Sbjct: 202 FGCGEDNQGFGQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPSTLA 254

Query: 291 FGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSGFQ--------ALV 337
            G      P    ST+ +        Y++ ++   +G  N  +  S FQ         ++
Sbjct: 255 LGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMII 314

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFS 396
           DSG + T+LP + Y  V   F   ++   +    +    C+   S+   ++VP++ + F 
Sbjct: 315 DSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFD 374

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAW 454
                +   +I   P  EG  V CL  M +    G  I G       ++++D +NL +++
Sbjct: 375 GGVLNLGEQNILISPA-EG--VICL-AMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSF 430

Query: 455 SHSKC 459
             ++C
Sbjct: 431 VPTQC 435


>sp|Q9LS40|ASPG1_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana
           GN=ASPG1 PE=1 SV=1
          Length = 500

 Score = 84.0 bits (206), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 161/367 (43%), Gaps = 45/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ I +GTP     + LD GS++ W     IQC P +  Y     ++   ++P+SSS+ 
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNW-----IQCEPCADCY----QQSDPVFNPTSSSTY 212

Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           K+++CS P C     S+C+S K  C Y   Y  + + + G L  D +   +  K      
Sbjct: 213 KSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGNSGKI----- 264

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
             ++V +GCG    G +   A   G+         V S+  +  +   SFS C  + DSG
Sbjct: 265 --NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFSYCLVDRDSG 314

Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSGFQ------- 334
              S+ F         +T+ L   +K D  Y+VG+  + +G     L  + F        
Sbjct: 315 KSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG 374

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             ++D G + T L T+ Y  +   F KL V+ K+ S   + +  CY+ SS   +KVP + 
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVA 434

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F+  +S  +    +  P ++  T FC     T     IIG     G RI +D     +
Sbjct: 435 FHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVI 493

Query: 453 AWSHSKC 459
             S +KC
Sbjct: 494 GLSGNKC 500


>sp|Q6XBF8|CDR1_ARATH Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
          Length = 437

 Score = 80.9 bits (198), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 106/445 (23%), Positives = 185/445 (41%), Gaps = 72/445 (16%)

Query: 22  VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRV-KLQ 80
           + F++ L+HR S ++                 P  N +E     L N   R   RV    
Sbjct: 29  LGFTADLIHRDSPKS-----------------PFYNPMETSSQRLRNAIHRSVNRVFHFT 71

Query: 81  SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
             +N+ + Q+   S   +        Y ++   + IGTP    +   D GS+LLW     
Sbjct: 72  EKDNTPQPQIDLTSNSGE--------YLMN---VSIGTPPFPIMAIADTGSDLLWT---- 116

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIAD 197
            QCAP     YT +D     +DP +SS+ K+VSCS   C   ++++SC +  + C Y   
Sbjct: 117 -QCAPCD-DCYTQVD---PLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLS 171

Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
           Y  +++ + G +  D L L S      Q     ++IIGCG    G++      +      
Sbjct: 172 YG-DNSYTKGNIAVDTLTLGSSDTRPMQ---LKNIIIGCGHNNAGTF------NKKGSGI 221

Query: 258 LGDVSVP-SLLAKAG-LIQNSFSICF-----DENDSGSVFFGDQGPATQQ---STSFLPI 307
           +G    P SL+ + G  I   FS C       ++ +  + FG     +     ST  +  
Sbjct: 222 VGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAK 281

Query: 308 GEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
             +   Y++ ++S  +G+  +         S    ++DSG + T LPTE Y+E+      
Sbjct: 282 ASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVAS 341

Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 420
            + +++     +    CY+A+ +  LKVP + + F      +  ++ F    +E    F 
Sbjct: 342 SIDAEKKQDPQSGLSLCYSATGD--LKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFA 398

Query: 421 LTVMSTDGDYGIIGQ-NFMMGHRIV 444
                +   YG + Q NF++G+  V
Sbjct: 399 FRGSPSFSIYGNVAQMNFLVGYDTV 423


>sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1
           PE=2 SV=1
          Length = 410

 Score = 78.2 bits (191), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 86/404 (21%), Positives = 162/404 (40%), Gaps = 58/404 (14%)

Query: 93  PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
           PS        GN +   H+   ++IG P  S+ + +D GS L W+ C   C  C  +   
Sbjct: 20  PSSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHV 79

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTED 202
            Y    + L             V+C+  LC    +       C S K  C Y+  Y   D
Sbjct: 80  LYKPTPKKL-------------VTCADSLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--D 123

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDV 261
           +SS G LV D      FS  A   +  +++  GCG  Q     +   P D ++GL  G V
Sbjct: 124 SSSMGVLVID-----RFSLSASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKV 178

Query: 262 SVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVE 319
           ++ S L   G+I ++    C      G +FFGD Q P +  + + +    KY +   G  
Sbjct: 179 TLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTL 238

Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSW 374
            +   +  ++ +    + DSGA++T+   + Y   +      ++S+      ++ +  + 
Sbjct: 239 HFDSNSKAISAAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRAL 298

Query: 375 KYCYNASSEEMLKVPDMRLIF----------SKNQSFVVRNHIFSFPENEGFTVFCLTVM 424
             C+    ++++ + +++  F           K  +  +    +     EG    CL ++
Sbjct: 299 TVCWKG-KDKIVTIDEVKKCFRSLSLEFADGDKKATLEIPPEHYLIISQEGHV--CLGIL 355

Query: 425 STDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
               ++       +IG   M+   +++D E   L W + +C+ +
Sbjct: 356 DGSKEHLSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399


>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1
           PE=1 SV=1
          Length = 437

 Score = 73.2 bits (178), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 85/386 (22%), Positives = 157/386 (40%), Gaps = 51/386 (13%)

Query: 93  PSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYY 151
           PS    + + G+  Y ++   + IGTP   F   +D GS+L+W  CQ C QC        
Sbjct: 81  PSGVETSVYAGDGEYLMN---LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC-------- 129

Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
              +++   ++P  SSS   + CS  LC++ SS     + C Y   Y  + + + G +  
Sbjct: 130 --FNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYG-DGSETQGSMGT 186

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
           + L   S S          ++  GCG    G      A  G++G+G G +S+PS L    
Sbjct: 187 ETLTFGSVSI--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT- 235

Query: 272 LIQNSFSICFDENDSGS---VFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
                FS C     S +   +  G   +   A   +T+ +   +    Y++ +    +G+
Sbjct: 236 ----KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 291

Query: 326 SCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
           + L    S F           ++DSG + T+     Y  V  +F   ++   ++   + +
Sbjct: 292 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGF 351

Query: 375 KYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 433
             C+   S+   L++P   + F      +   + F  P N    + CL + S+     I 
Sbjct: 352 DLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFISPSNG---LICLAMGSSSQGMSIF 408

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKC 459
           G        +V+D  N  ++++ ++C
Sbjct: 409 GNIQQQNMLVVYDTGNSVVSFASAQC 434


>sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2
           SV=2
          Length = 410

 Score = 70.9 bits (172), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/398 (20%), Positives = 158/398 (39%), Gaps = 46/398 (11%)

Query: 93  PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
           PS        GN +   H+   ++IG P   + + +D GS L W+ C   CI C  +   
Sbjct: 20  PSSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
            Y        E   +   + +  +  +   +    C   K+ C Y   Y     SS G L
Sbjct: 80  LYK------PELKYAVKCTEQRCADLYADLRKPMKCGP-KNQCHYGIQYV--GGSSIGVL 130

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
           + D     SFS  A   +  +S+  GCG  Q  +  +   P +G++GLG G V++ S L 
Sbjct: 131 IVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLK 185

Query: 269 KAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY--FVGVESYCIGN 325
             G+I ++    C      G +FFGD    T   T + P+  ++  Y    G   +   +
Sbjct: 186 SQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTLQFNSNS 244

Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNA 380
             ++ +  + + DSGA++T+   + Y   +      +S +      +  +  +   C+  
Sbjct: 245 KPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKG 304

Query: 381 SSEEMLKVPDMRLIFS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
             +++  + +++  F           K  +  +    +     EG    CL ++    ++
Sbjct: 305 -KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEH 361

Query: 431 ------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
                  +IG   M+   +++D E   L W + +C+ +
Sbjct: 362 PSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399


>sp|P00793|PEPA_CHICK Pepsin A OS=Gallus gallus GN=PGA PE=1 SV=1
          Length = 367

 Score = 63.5 bits (153), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 94/372 (25%), Positives = 158/372 (42%), Gaps = 92/372 (24%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           +Y  I IGTP   F V  D GS+ LWVP   C+   C+            N   +DPS S
Sbjct: 59  YYGTISIGTPQQDFSVIFDTGSSNLWVPSIYCKSSACS------------NHKRFDPSKS 106

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+   VS +  +               YIA Y T   S SG L  D + ++S        
Sbjct: 107 STY--VSTNETV---------------YIA-YGT--GSMSGILGYDTVAVSSI------- 139

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-------VPSLLAKAGLIQNSFSI 279
            VQ+  I G    + GS+      DG++GL    +S         +++++  + Q+ FS+
Sbjct: 140 DVQNQ-IFGLSETEPGSFFYYCNFDGILGLAFPSISSSGATPVFDNMMSQHLVAQDLFSV 198

Query: 280 CFDEN-DSGS-VFFGDQGP-ATQQSTSFLPI-GEKYDAYFVGVESYCIGN---SCLTQSG 332
              ++ ++GS V FG   P  T +   ++P+  E Y  + + ++   +GN   +C     
Sbjct: 199 YLSKDGETGSFVLFGGIDPNYTTKGIYWVPLSAETY--WQITMDRVTVGNKYVACFFTC- 255

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
            QA+VD+G S   +P   Y       ++++    +S  G         S +++ K+PD+ 
Sbjct: 256 -QAIVDTGTSLLVMPQGAY-------NRIIKDLGVSSDG-------EISCDDISKLPDV- 299

Query: 393 LIFSKNQSFVVRNHIFSFP------ENEGFTVFCLTVMSTDGDYG---IIGQNFMMGHRI 443
                  +F +  H F+ P        +G  +     M T  + G   I+G  F+  + +
Sbjct: 300 -------TFHINGHAFTLPASAYVLNEDGSCMLGFENMGTPTELGEQWILGDVFIREYYV 352

Query: 444 VFDRENLKLAWS 455
           +FDR N K+  S
Sbjct: 353 IFDRANNKVGLS 364


>sp|P18242|CATD_MOUSE Cathepsin D OS=Mus musculus GN=Ctsd PE=1 SV=1
          Length = 410

 Score = 61.2 bits (147), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 92/371 (24%), Positives = 150/371 (40%), Gaps = 65/371 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I IGTP   F V  D GS+ LWVP   I C  L                       
Sbjct: 79  YYGDIGIGTPPQCFTVVFDTGSSNLWVP--SIHCKIL----------------------- 113

Query: 170 KNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            +++C  H    S  S   +K+   +   Y +   S SGYL  D + +   S  +    +
Sbjct: 114 -DIACWVHHKYNSDKSSTYVKNGTSFDIHYGS--GSLSGYLSQDTVSVPCKSDQSKARGI 170

Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQNSFSIC 280
           +    I G   KQ G     A  DG++G+G   +SV        +L+ +  + +N FS  
Sbjct: 171 KVEKQIFGEATKQPGIVFVAAKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFY 230

Query: 281 FDENDSGS-----VFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNS-CLTQSGF 333
            + +  G      +  G          S+L +  K  AY+ V ++   +GN   L + G 
Sbjct: 231 LNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRK--AYWQVHMDQLEVGNELTLCKGGC 288

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
           +A+VD+G S    P E   E+     K + +  + +QG     C   SS     +P + L
Sbjct: 289 EAIVDTGTSLLVGPVEEVKEL----QKAIGAVPL-IQGEYMIPCEKVSS-----LPTVYL 338

Query: 394 -IFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHRIVFD 446
            +  KN       +I     ++G    CL+  M  D     G   I+G  F+  +  VFD
Sbjct: 339 KLGGKNYELHPDKYILKV--SQGGKTICLSGFMGMDIPPPSGPLWILGDVFIGSYYTVFD 396

Query: 447 RENLKLAWSHS 457
           R+N ++ ++++
Sbjct: 397 RDNNRVGFANA 407


>sp|P10977|CARPV_CANAX Vacuolar aspartic protease OS=Candida albicans GN=APR1 PE=3 SV=3
          Length = 419

 Score = 60.5 bits (145), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 105/467 (22%), Positives = 180/467 (38%), Gaps = 87/467 (18%)

Query: 21  AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
           A++ +S LV   +   K   +S    +  ++     NS+    L L N      +   LQ
Sbjct: 12  ALALTSSLVDAKAHSIKLSKLSNEETLDASNFQEYTNSLANKYLNLFNTAHGNPSNFGLQ 71

Query: 81  S--NNNSSRNQLLFPSEGSQ-----THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNL 133
               N  +    + P +G +     T++   Q++    T I IGTP   F V LD GS+ 
Sbjct: 72  HVLTNQEAEVPFVTPKKGGKYDAPLTNYLNAQYF----TEIQIGTPGQPFKVILDTGSSN 127

Query: 134 LWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP 193
           LWVP Q   C  L+   +   D +        +SS+  V+ S    +  S          
Sbjct: 128 LWVPSQ--DCTSLACFLHAKYDHD--------ASSTYKVNGSEFSIQYGSG--------- 168

Query: 194 YIADYSTEDTSSSGYLVDDILHLASF---SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
                     S  GY+  D+L +       +   +++ +  +    G+            
Sbjct: 169 ----------SMEGYISQDVLTIGDLVIPGQDFAEATSEPGLAFAFGKF----------- 207

Query: 251 DGVMGLGLGDVSVPSLL------AKAGLIQN-SFSICF-----DENDSG-SVFFGDQGPA 297
           DG++GL    +SV  ++         GL++   F         DEND G + F G     
Sbjct: 208 DGILGLAYDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDGGLATFGGYDASL 267

Query: 298 TQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVV 356
            Q   ++LPI  K  AY+ V  E   +G+         A +D+G S   LP+ + AE++ 
Sbjct: 268 FQGKITWLPIRRK--AYWEVSFEGIGLGDEYAELHKTGAAIDTGTSLITLPSSL-AEIIN 324

Query: 357 KFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEG 415
              K+ ++K       SW   Y     +   +PD+ L F+  N +    ++I    E  G
Sbjct: 325 A--KIGATK-------SWSGQYQVDCAKRDSLPDLTLTFAGYNFTLTPYDYIL---EVSG 372

Query: 416 FTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             +   T M      GD  I+G  F+  +  ++D +   +  + +K 
Sbjct: 373 SCISVFTPMDFPQPIGDLAIVGDAFLRKYYSIYDLDKNAVGLAPTKV 419


>sp|P22929|CARP_SACFI Acid protease OS=Saccharomycopsis fibuligera GN=PEP1 PE=3 SV=1
          Length = 390

 Score = 60.1 bits (144), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 97/420 (23%), Positives = 173/420 (41%), Gaps = 71/420 (16%)

Query: 59  VEYLELLLSNDWKRQKTRVKLQSNNNSS----RNQLLFPSEGSQTHFFGNQFYWLHYTWI 114
           VE  E  L+ D+  ++   K ++   +S    R  L   S+   T    N+ Y  + T I
Sbjct: 21  VEKREKTLTLDFDVKRISSKAKNVTVASSPGFRRNLRAASDAGVTISLENE-YSFYLTTI 79

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +IGTP     V +D GS+ LWVP Q       ++S Y +       YD + S+S K    
Sbjct: 80  EIGTPGQKLQVDVDTGSSDLWVPGQG------TSSLYGT-------YDHTKSTSYK---- 122

Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
                K RS    S  D      D++ E  S  G  +  +     F     Q   Q  + 
Sbjct: 123 -----KDRSGFSISYGDGSSARGDWAQETVSIGGASITGL----EFGDATSQDVGQGLLG 173

Query: 234 IGC-GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEND--SGSV 289
           IG  G + +    +    D          ++P  L   GLI + ++S+  +  D  SGS+
Sbjct: 174 IGLKGNEASAQSSNSFTYD----------NLPLKLKDQGLIDKAAYSLYLNSEDATSGSI 223

Query: 290 FFGDQGPATQQST----SFLPIGEKYD------AYFVGVESYCIGNSCLTQSGFQALVDS 339
            FG    +    +      + I ++ D      A+FV +E    G+S +T++ + AL+DS
Sbjct: 224 LFGGSDSSKYSGSLATLDLVNIDDEGDSTSGAVAFFVELEGIEAGSSSITKTTYPALLDS 283

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV-PDMRLIFSKN 398
           G +  + P+ I + +  ++              ++ Y Y           PD +  F+  
Sbjct: 284 GTTLIYAPSSIASSIGREY-------------GTYSYSYGGYVTSCDATGPDFKFSFNGK 330

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
              V  +++  F  +EG +   + V+S+  +Y I+G  F+    + +D +N ++  + +K
Sbjct: 331 TITVPFSNLL-FQNSEGDSECLVGVLSSGSNYYILGDAFLRSAYVYYDIDNSQVGIAQAK 389


>sp|Q9LHE3|ASPG2_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana
           GN=ASPG2 PE=2 SV=1
          Length = 470

 Score = 58.2 bits (139), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 100/447 (22%), Positives = 179/447 (40%), Gaps = 46/447 (10%)

Query: 32  FSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN--NSSRNQ 89
           FSDE+  ++  +  +     S   +N    L   +  D  R    ++  S     SS ++
Sbjct: 51  FSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSR 110

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSA 148
                 GS      +Q    ++  I +G+P     + +D+GS+++WV CQ C  C     
Sbjct: 111 YEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC----- 165

Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
             Y   D     +DP+ S S   VSC   +C    +       C Y   Y  + + + G 
Sbjct: 166 --YKQSD---PVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYG-DGSYTKGT 219

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           L    L   +F+K     +V  +V +GCG +  G ++  A   G+ G  +  V       
Sbjct: 220 LA---LETLTFAK-----TVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVG-----Q 266

Query: 269 KAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYC 322
            +G    +F  C      + +GS+ FG +  A     S++P+     A   Y+VG++   
Sbjct: 267 LSGQTGGAFGYCLVSRGTDSTGSLVFGRE--ALPVGASWVPLVRNPRAPSFYYVGLKGLG 324

Query: 323 I---------GNSCLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 372
           +         G   LT++G   +V D+G + T LPT  Y      F    ++   +   +
Sbjct: 325 VGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVS 384

Query: 373 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 432
            +  CY+ S    ++VP +   F++     +    F  P ++  T +C    ++     I
Sbjct: 385 IFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGT-YCFAFAASPTGLSI 443

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKC 459
           IG     G ++ FD  N  + +  + C
Sbjct: 444 IGNIQQEGIQVSFDGANGFVGFGPNVC 470


>sp|O93428|CATD_CHIHA Cathepsin D OS=Chionodraco hamatus GN=ctsd PE=1 SV=2
          Length = 396

 Score = 57.8 bits (138), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 97/414 (23%), Positives = 162/414 (39%), Gaps = 72/414 (17%)

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQT-HFFGNQFYWLHYTWIDIGTPNVSFL 124
           L++  KR +   +L ++++S +  L FP+  + T     N     +Y  I +GTP   F 
Sbjct: 34  LTDSGKRAE---ELLADHHSLKYNLSFPASNAPTPETLKNYLDAQYYGEIGLGTPPQPFT 90

Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHPLCKSRS 183
           V  D GS+ LWVP   I C+ L                        +++C  H    S  
Sbjct: 91  VVFDTGSSNLWVP--SIHCSLL------------------------DIACLLHHKYNSGK 124

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
           S   +K+   +   Y +   S SGYL  D   +   +          S + G   KQ G 
Sbjct: 125 SSTYVKNGTAFAIQYGS--GSLSGYLSQDTCTIGDLAI--------DSQLFGEAIKQPGV 174

Query: 244 YLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQNSFSICFDEN----DSGSVFFG 292
               A  DG++G+    +SV        +++++  + QN FS   + N      G +  G
Sbjct: 175 AFIAAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLG 234

Query: 293 DQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSC-LTQSGFQALVDSGASFTFLPTEI 350
              P    +  F  +     AY+ + V+S  +G+   L   G +A+VDSG S    P+  
Sbjct: 235 GTDP-KYYTGDFNYVNVTRQAYWQIRVDSMAVGDQLSLCTGGCEAIVDSGTSLITGPS-- 291

Query: 351 YAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
              V VK  +        +QG   +Y  N  +   L V    +     Q + +    +  
Sbjct: 292 ---VEVKALQKAIGAFPLIQG---EYMVNCDTVPSLPVISFTV---GGQVYTLTGEQYIL 342

Query: 411 PENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
              +     CL+  M  D     G   I+G  FM  +  VFDR+  ++ ++ +K
Sbjct: 343 KVTQAGKTMCLSGFMGLDIPAPAGPLWILGDVFMGQYYTVFDRDANRVGFAKAK 396


>sp|Q9GMY8|PEPA_SORUN Pepsin A OS=Sorex unguiculatus GN=PGA PE=2 SV=1
          Length = 387

 Score = 57.0 bits (136), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 90/357 (25%), Positives = 143/357 (40%), Gaps = 62/357 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   I C+  + S       N + +DP  SS+ 
Sbjct: 75  YFGTISIGTPPQEFTVIFDTGSSNLWVP--SIYCSSPACS-------NHNRFDPQKSSTF 125

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           K  S +  +     S                     +G L  D + +A  +         
Sbjct: 126 KPTSQTVSIAYGTGSM--------------------TGVLGYDTVQVAGIAD-------- 157

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
           ++ I G  + + GS+L  +  DG++GL        G   V   +   GL+ Q+ FS+   
Sbjct: 158 TNQIFGLSQSEPGSFLYYSPFDGILGLAYPSISSSGATPVFDNMWNQGLVSQDLFSVYLS 217

Query: 283 END-SGSV--FFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
            ND SGSV  F G        S +++P+  E Y  + + V+S  + G S     G QA+V
Sbjct: 218 SNDQSGSVVMFGGIDSSYYTGSLNWVPLSSEGY--WQITVDSITMNGQSIACNGGCQAIV 275

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           D+G S    PT   A +  K     +S     QG     C       +  +PD+    + 
Sbjct: 276 DTGTSLLSGPTNAIANIQSKIGASQNS-----QGQMAVSC-----SSIKNLPDIVFTING 325

Query: 398 NQ-SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            Q       +I    E        + + ++ G+  I+G  F+  +  VFDR N ++ 
Sbjct: 326 IQYPLPASAYILQSQEGCSSGFQGMDIPTSSGELWILGDVFIRQYFTVFDRANNQVG 382


>sp|C4YMJ3|CARP2_CANAW Candidapepsin-2 OS=Candida albicans (strain WO-1) GN=SAP2 PE=1 SV=1
          Length = 398

 Score = 56.6 bits (135), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 86/368 (23%), Positives = 149/368 (40%), Gaps = 77/368 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +G+ N    V +D GS+ LWVP   + C    +       +    YDPS SS+S++++
Sbjct: 74  ITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQTADFCKQKGTYDPSGSSASQDLN 133

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS-KHAPQSSVQSSV 232
                              P+   Y  + +SS G L  D +     S K+   + V S+ 
Sbjct: 134 ------------------TPFKIGYG-DGSSSQGTLYKDTVGFGGVSIKNQVLADVDSTS 174

Query: 233 ----IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEND-- 285
               I+G G K   +   G + D          +VP  L K G+I +N++S+  +  D  
Sbjct: 175 IDQGILGVGYKTNEA---GGSYD----------NVPVTLKKQGVIAKNAYSLYLNSPDAA 221

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFT 344
           +G + FG    A + S S + +    D    + + S  +    +       L+DSG + T
Sbjct: 222 TGQIIFGGVDNA-KYSGSLIALPVTSDRELRISLGSVEVSGKTINTDNVDVLLDSGTTIT 280

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           +L  ++  +++  F+  ++       GNS ++   N S        D+   FSKN     
Sbjct: 281 YLQQDLADQIIKAFNGKLTQDS---NGNSFYEVDCNLSG-------DVVFNFSKNAK--- 327

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDG-------------DYGIIGQNFMMGHRIVFDRENL 450
                S P +E    F  ++   DG             D  I+G NF+    IV+D +N 
Sbjct: 328 ----ISVPASE----FAASLQGDDGQPYDKCQLLFDVNDANILGDNFLRSAYIVYDLDNN 379

Query: 451 KLAWSHSK 458
           +++ +  K
Sbjct: 380 EISLAQVK 387


>sp|Q9LZL3|PCS1L_ARATH Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
          Length = 453

 Score = 56.6 bits (135), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 90/385 (23%), Positives = 158/385 (41%), Gaps = 78/385 (20%)

Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS- 183
           + +D GS L W+ C             +S    ++ +DP+ SSS   + CS P C++R+ 
Sbjct: 88  MVIDTGSELSWLRCN-----------RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTR 136

Query: 184 ------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
                 SC S K  C     Y+ + +SS G L  +I H  +       S+  S++I GC 
Sbjct: 137 DFLIPASCDSDKL-CHATLSYA-DASSSEGNLAAEIFHFGN-------STNDSNLIFGCM 187

Query: 238 RKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG- 295
              +GS   +     G++G+  G +   S +++ G  + S+ I   ++  G +  GD   
Sbjct: 188 GSVSGSDPEEDTKTTGLLGMNRGSL---SFISQMGFPKFSYCISGTDDFPGFLLLGDSNF 244

Query: 296 ---------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQA 335
                    P  + ST  LP  ++  AY V +    +    L           T +G Q 
Sbjct: 245 TWLTPLNYTPLIRISTP-LPYFDRV-AYTVQLTGIKVNGKLLPIPKSVLVPDHTGAG-QT 301

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDK-------LVSSKRISLQGNSWKYCYNAS-----SE 383
           +VDSG  FTFL   +Y  +   F         +        QG +   CY  S     S 
Sbjct: 302 MVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQG-TMDLCYRISPVRIRSG 360

Query: 384 EMLKVPDMRLIFSKNQSFVV-RNHIFSFPE----NEGFTVFCLTVMSTD---GDYGIIGQ 435
            + ++P + L+F   +  V  +  ++  P     N+  +V+C T  ++D    +  +IG 
Sbjct: 361 ILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGND--SVYCFTFGNSDLMGMEAYVIGH 418

Query: 436 NFMMGHRIVFDRENLKLAWSHSKCE 460
           +      I FD +  ++  +  +C+
Sbjct: 419 HHQQNMWIEFDLQRSRIGLAPVECD 443


>sp|P0DJ06|CARP2_CANAL Candidapepsin-2 OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
           GN=SAP2 PE=1 SV=1
          Length = 398

 Score = 56.2 bits (134), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 86/368 (23%), Positives = 149/368 (40%), Gaps = 77/368 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +G+ N    V +D GS+ LWVP   + C    +       +    YDPS SS+S++++
Sbjct: 74  ITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQTADFCKQKGTYDPSGSSASQDLN 133

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS-KHAPQSSVQSSV 232
                              P+   Y  + +SS G L  D +     S K+   + V S+ 
Sbjct: 134 ------------------TPFKIGYG-DGSSSQGTLYKDTVGFGGVSIKNQVLADVDSTS 174

Query: 233 ----IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEND-- 285
               I+G G K   +   G + D          +VP  L K G+I +N++S+  +  D  
Sbjct: 175 IDQGILGVGYKTNEA---GGSYD----------NVPVTLKKQGVIAKNAYSLYLNSPDAA 221

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFT 344
           +G + FG    A + S S + +    D    + + S  +    +       LVDSG + T
Sbjct: 222 TGQIIFGGVDNA-KYSGSLIALPVTSDRELRISLGSVEVSGKTINTDNVDVLVDSGTTIT 280

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           +L  ++  +++  F+  ++       GNS ++   N S        D+   FSKN     
Sbjct: 281 YLQQDLADQIIKAFNGKLTQDS---NGNSFYEVDCNLSG-------DVVFNFSKNAK--- 327

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDG-------------DYGIIGQNFMMGHRIVFDRENL 450
                S P +E    F  ++   DG             D  I+G NF+    IV+D ++ 
Sbjct: 328 ----ISVPASE----FAASLQGDDGQPYDKCQLLFDVNDANILGDNFLRSAYIVYDLDDN 379

Query: 451 KLAWSHSK 458
           +++ +  K
Sbjct: 380 EISLAQVK 387


>sp|P0CS83|CARP2_CANAX Candidapepsin-2 OS=Candida albicans GN=SAP2 PE=1 SV=1
          Length = 398

 Score = 54.7 bits (130), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 85/368 (23%), Positives = 149/368 (40%), Gaps = 77/368 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +G+ N    V +D GS+ LWVP   + C    +       +    YDPS SS+S++++
Sbjct: 74  ITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQTADFCKQKGTYDPSGSSASQDLN 133

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS-KHAPQSSVQSSV 232
                              P+   Y  + +SS G L  D +     S K+   + V S+ 
Sbjct: 134 ------------------TPFKIGYG-DGSSSQGTLYKDTVGFGGVSIKNQVLADVDSTS 174

Query: 233 ----IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEND-- 285
               I+G G K   +   G + D          +VP  L K G+I +N++S+  +  D  
Sbjct: 175 IDQGILGVGYKTNEA---GGSYD----------NVPVTLKKQGVIAKNAYSLYLNSPDAA 221

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFT 344
           +G + FG    A + S S + +    D    + + S  +    +       L+DSG + T
Sbjct: 222 TGQIIFGGVDNA-KYSGSLIALPVTSDRELRISLGSVEVSGKTINTDNVDVLLDSGTTIT 280

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           +L  ++  +++  F+  ++       GNS ++   N S        D+   FSKN     
Sbjct: 281 YLQQDLADQIIKAFNGKLTQDS---NGNSFYEVDCNLSG-------DVVFNFSKNAK--- 327

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDG-------------DYGIIGQNFMMGHRIVFDRENL 450
                S P +E    F  ++   DG             D  I+G NF+    IV+D ++ 
Sbjct: 328 ----ISVPASE----FAASLQGDDGQPYDKCQLLFDVNDANILGDNFLRSAYIVYDLDDN 379

Query: 451 KLAWSHSK 458
           +++ +  K
Sbjct: 380 EISLAQVK 387


>sp|Q9DEX3|CATD_CLUHA Cathepsin D OS=Clupea harengus GN=ctsd PE=1 SV=1
          Length = 396

 Score = 53.9 bits (128), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 94/405 (23%), Positives = 162/405 (40%), Gaps = 77/405 (19%)

Query: 78  KLQSNNNSSRNQLLFPSEGSQT-HFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
           +L +  NS ++   FPS  + T     N     +Y  I +GTP   F V  D GS+ LW+
Sbjct: 43  QLLAGTNSLQHNQGFPSSNAPTPETLKNYMDAQYYGEIGLGTPVQMFTVVFDTGSSNLWL 102

Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYI 195
           P   I C                        S  +++C  H       S   +K+   + 
Sbjct: 103 P--SIHC------------------------SFTDIACLLHHKYNGAKSSTYVKNGTEFA 136

Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
             Y +   S SGYL  D   +           V    + G   KQ G     A  DG++G
Sbjct: 137 IQYGS--GSLSGYLSQDSCTIGDI--------VVEKQLFGEAIKQPGVAFIAAKFDGILG 186

Query: 256 LGLGDVSVPS-------LLAKAGLIQNSFSICFDEN----DSGSVFFGDQGPATQQST-S 303
           +    +SV         ++++  + QN FS   + N      G +  G   P       +
Sbjct: 187 MAYPRISVDGVPPVFDMMMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFN 246

Query: 304 FLPIGEKYDAYF-VGVESYCIGNS-CLTQSGFQALVDSGASF-TFLPTEIYAEVVVKFDK 360
           ++P+  +  AY+ + ++   IG+   L + G +A+VD+G S  T  P E+ A       K
Sbjct: 247 YVPVTRQ--AYWQIHMDGMSIGSQLTLCKDGCEAIVDTGTSLITGPPAEVRA-----LQK 299

Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI-FS-KNQSFVVRNHIFSFPENEGFTV 418
            + +  + +QG     C         KVP +  I F+   +++ +    +   E++G   
Sbjct: 300 AIGAIPL-IQGEYMIDCK--------KVPTLPTISFNVGGKTYSLTGEQYVLKESQGGKT 350

Query: 419 FCLT-VMSTD-----GDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
            CL+ +M  +     G   I+G  F+  +  VFDRE+ ++ ++ S
Sbjct: 351 ICLSGLMGLEIPPPAGPLWILGDVFIGQYYTVFDRESNRVGFAKS 395


>sp|P16476|PEPE_CHICK Embryonic pepsinogen OS=Gallus gallus PE=2 SV=1
          Length = 383

 Score = 52.4 bits (124), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 80/359 (22%), Positives = 136/359 (37%), Gaps = 63/359 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I IGTP   F V  D GS+ LWVP                                
Sbjct: 76  YYGTISIGTPPQDFTVVFDTGSSNLWVP-------------------------------- 103

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            +VSC+ P C+S      + +P       ST    S  Y   D+            S + 
Sbjct: 104 -SVSCTSPACQSHQ----MFNPSQSSTYKSTGQNLSIHYGTGDMEGTVGCDTVTVASLMD 158

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL----GDVSVP---SLLAKAGLIQNSFSICFD 282
           ++ + G    + G +      DG++GLG      D   P   +++ ++ L QN FS+   
Sbjct: 159 TNQLFGLSTSEPGQFFVYVKFDGILGLGYPSLAADGITPVFDNMVNESLLEQNLFSVYLS 218

Query: 283 ENDSGS--VFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLT-QSGFQALVD 338
               GS  VF G        S +++P+   Y  Y+ + ++S  +    +   SG QA++D
Sbjct: 219 REPMGSMVVFGGIDESYFTGSINWIPV--SYQGYWQISMDSIIVNKQEIACSSGCQAIID 276

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           +G S    P     ++         S   + Q    +Y  N S   +L +PD+  +    
Sbjct: 277 TGTSLVAGPASDINDI--------QSAVGANQNTYGEYSVNCS--HILAMPDVVFVIGGI 326

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           Q  V      ++ E  G      +  ++  D  I+G  F+  +  +FDR N ++  + +
Sbjct: 327 QYPVPA---LAYTEQNGQGTCMSSFQNSSADLWILGDVFIRVYYSIFDRANNRVGLAKA 382


>sp|P0DJD9|PEPA5_HUMAN Pepsin A-5 OS=Homo sapiens GN=PGA5 PE=1 SV=1
          Length = 388

 Score = 51.2 bits (121), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 87/357 (24%), Positives = 144/357 (40%), Gaps = 62/357 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   + C+ L+ +       N + ++P  SS+ 
Sbjct: 76  YFGTIGIGTPAQDFTVVFDTGSSNLWVP--SVYCSSLACT-------NHNRFNPEDSSTY 126

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           ++ S +  +     S                     +G L  D + +   S         
Sbjct: 127 QSTSETVSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL        G   V   +   GL+ Q+ FS+   
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS 218

Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI-GNSCLTQSGFQALV 337
            +D SGSV  F G        S +++P+    + Y+ + V+S  + G +     G QA+V
Sbjct: 219 ADDKSGSVVIFGGIDSSYYTGSLNWVPV--TVEGYWQITVDSITMNGETIACAEGCQAIV 276

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           D+G S    PT   A +        +S      G+    C   SS     +PD+    + 
Sbjct: 277 DTGTSLLTGPTSPIANIQSDIGASENSD-----GDMVVSCSAISS-----LPDIVFTING 326

Query: 398 NQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            Q  V  +      E    + F  + V +  G+  I+G  F+  +  VFDR N ++ 
Sbjct: 327 VQYPVPPSAYILQSEGSCISGFQGMNVPTESGELWILGDVFIRQYFTVFDRANNQVG 383


>sp|P00791|PEPA_PIG Pepsin A OS=Sus scrofa GN=PGA PE=1 SV=3
          Length = 385

 Score = 50.8 bits (120), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 88/362 (24%), Positives = 148/362 (40%), Gaps = 72/362 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   + C+ L+ S     D N  +++P  SS+ 
Sbjct: 73  YFGTIGIGTPAQDFTVIFDTGSSNLWVPS--VYCSSLACS-----DHN--QFNPDDSSTF 123

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           +  S    +     S                     +G L  D + +   S         
Sbjct: 124 EATSQELSITYGTGSM--------------------TGILGYDTVQVGGIS--------D 155

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL    +S          L   GL+ Q+ FS+   
Sbjct: 156 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISASGATPVFDNLWDQGLVSQDLFSVYLS 215

Query: 283 END-SGSVFF--GDQGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
            ND SGSV    G        S +++P+  E Y  + + ++S  + G +     G QA+V
Sbjct: 216 SNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGY--WQITLDSITMDGETIACSGGCQAIV 273

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           D+G S    PT   A +         S   + + +  +   + SS + L  PD+    + 
Sbjct: 274 DTGTSLLTGPTSAIANI--------QSDIGASENSDGEMVISCSSIDSL--PDIVFTING 323

Query: 398 NQ------SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            Q      ++++++        EG     + V ++ G+  I+G  F+  +  VFDR N K
Sbjct: 324 VQYPLSPSAYILQDDDSCTSGFEG-----MDVPTSSGELWILGDVFIRQYYTVFDRANNK 378

Query: 452 LA 453
           + 
Sbjct: 379 VG 380


>sp|Q42456|ASPR1_ORYSJ Aspartic proteinase oryzasin-1 OS=Oryza sativa subsp. japonica
           GN=Os05g0567100 PE=2 SV=2
          Length = 509

 Score = 50.8 bits (120), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 72/298 (24%), Positives = 119/298 (39%), Gaps = 60/298 (20%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I +GTP   F V  D GS+ LWVP         SA  Y S                
Sbjct: 85  YFGEIGVGTPPQKFTVIFDTGSSNLWVP---------SAKCYFS---------------- 119

Query: 170 KNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
             ++C  H   KS  S    K+  P    Y T   S +G+  +D + +        Q  +
Sbjct: 120 --IACFFHSRYKSGQSSTYQKNGKPAAIQYGT--GSIAGFFSEDSVTVGDLVVK-DQEFI 174

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICF 281
           +++       K+ G     A  DG++GLG  ++SV         + + GL+ +  FS  F
Sbjct: 175 EAT-------KEPGLTFMVAKFDGILGLGFQEISVGDAVPVWYKMVEQGLVSEPVFSFWF 227

Query: 282 ----DENDSGSVFFGDQGPATQQST-SFLPIGEK-YDAYFVGVESYCIGNSCLTQSGFQA 335
               DE + G + FG   P+  +   +++P+ +K Y  + +G        +    SG  A
Sbjct: 228 NRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQFEMGDVLIGGKTTGFCASGCSA 287

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
           + DSG S    PT I  E+         +++I   G   + C    S+   ++ D+ L
Sbjct: 288 IADSGTSLLAGPTAIITEI---------NEKIGATGVVSQECKTVVSQYGQQILDLLL 336


>sp|Q4LAL9|CATD_CANFA Cathepsin D OS=Canis familiaris GN=CTSD PE=2 SV=1
          Length = 410

 Score = 50.4 bits (119), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 88/375 (23%), Positives = 154/375 (41%), Gaps = 73/375 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I IGTP   F V  D GS+ LWVP   I C  L                       
Sbjct: 79  YYGEIGIGTPPQCFTVVFDTGSSNLWVP--SIHCKLL----------------------- 113

Query: 170 KNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            +++C  H    S  S   +K+   +   Y +   S SGYL  D + +   S  +  + +
Sbjct: 114 -DIACWIHHKYNSGKSSTYVKNGTSFDIHYGS--GSLSGYLSQDTVSVPCKSALSGLAGI 170

Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQNSFSIC 280
           +      G   KQ G     A  DG++G+    +SV        +L+ +  + +N FS  
Sbjct: 171 KVERQTFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIFSFY 230

Query: 281 FDENDS----GSVFFGD------QGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNS-CL 328
            + + +    G +  G       +GP      S+L +  K  AY+ V +E   +G+S  L
Sbjct: 231 LNRDPNAQPGGELMLGGTDSKYYKGP-----LSYLNVTRK--AYWQVHMEQVDVGSSLTL 283

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
            + G +A+VD+G S    P     + V +  K + +  + +QG      Y    E++  +
Sbjct: 284 CKGGCEAIVDTGTSLIVGP----VDEVRELQKAIGAVPL-IQGE-----YMIPCEKVSTL 333

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHR 442
           PD+ L     + + + +  ++   ++G    CL+  M  D     G   I+G  F+  + 
Sbjct: 334 PDVTLKLG-GKLYKLSSEDYTLKVSQGGKTICLSGFMGMDIPPPGGPLWILGDVFIGCYY 392

Query: 443 IVFDRENLKLAWSHS 457
            VFDR+  ++  + +
Sbjct: 393 TVFDRDQNRVGLAQA 407


>sp|Q9D7R7|PEPC_MOUSE Gastricsin OS=Mus musculus GN=Pgc PE=2 SV=1
          Length = 392

 Score = 50.4 bits (119), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 84/376 (22%), Positives = 146/376 (38%), Gaps = 88/376 (23%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           +Y  I IGTP  +FLV  D GS+ LWV    CQ   C               + Y+PS S
Sbjct: 76  YYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQSEACT------------THTRYNPSKS 123

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+      +  L                   Y T   S +G+   D L + S     P  
Sbjct: 124 STYYTQGQTFSL------------------QYGT--GSLTGFFGYDTLRVQSI--QVPNQ 161

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-------GLGDVSVPSLLAKAGLIQNSFSI 279
                   G    + G+    A  DG+MGL       G    ++  +L +  L Q  F +
Sbjct: 162 E------FGLSENEPGTNFVYAQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGV 215

Query: 280 CFDE---NDSGSVFFG--DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC---LTQS 331
                  ++ G + FG  D+   T + T ++P+ ++   + + ++ + IGN      + S
Sbjct: 216 YLGSQQGSNGGQIVFGGVDENLYTGELT-WIPVTQEL-YWQITIDDFLIGNQASGWCSSS 273

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
           G Q +VD+G S   +P +   E++         + I  Q   +   Y  S + +  +P +
Sbjct: 274 GCQGIVDTGTSLLVMPAQYLNELL---------QTIGAQEGEYGQ-YFVSCDSVSSLPTL 323

Query: 392 RLIFSKNQ------SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG----IIGQNFMMGH 441
             + +  Q      S+++        + EG  +  L  +S + + G    I+G  F+  +
Sbjct: 324 TFVLNGVQFPLSPSSYII--------QEEGSCMVGLESLSLNAESGQPLWILGDVFLRSY 375

Query: 442 RIVFDRENLKLAWSHS 457
             VFD  N ++  + S
Sbjct: 376 YAVFDMGNNRVGLAPS 391


>sp|P24268|CATD_RAT Cathepsin D OS=Rattus norvegicus GN=Ctsd PE=1 SV=1
          Length = 407

 Score = 50.1 bits (118), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 89/369 (24%), Positives = 146/369 (39%), Gaps = 64/369 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I IGTP   F V  D GS+ LWVP   I C  L                       
Sbjct: 79  YYGEIGIGTPPQCFTVVFDTGSSNLWVP--SIHCKLL----------------------- 113

Query: 170 KNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            +++C  H    S  S   +K+   +   Y +   S SGYL  D + +   S        
Sbjct: 114 -DIACWVHHKYNSDKSSTYVKNGTSFDIHYGS--GSLSGYLSQDTVSVPCKSDLGGIKVE 170

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLIQ-NSFSICF 281
           +   I G   KQ G     A  DG++G+G   +SV  +      L K  L++ N FS   
Sbjct: 171 KQ--IFGEATKQPGVVFIAAKFDGILGMGYPFISVNKVLPVFDNLMKQKLVEKNIFSFYL 228

Query: 282 DENDSGS-----VFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNS-CLTQSGFQ 334
           + + +G      +  G          S+L +  K  AY+ V ++   +G+   L + G +
Sbjct: 229 NRDPTGQPGGELMLGGTDSRYYHGELSYLNVTRK--AYWQVHMDQLEVGSELTLCKGGCE 286

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
           A+VD+G S    P +   E+     K + +  + +QG     C   SS   L +   +L 
Sbjct: 287 AIVDTGTSLLVGPVDEVKEL----QKAIGAVPL-IQGEYMIPCEKVSS---LPIITFKL- 337

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHRIVFDRE 448
               Q++ +    +    ++     CL+  M  D     G   I+G  F+  +  VFDRE
Sbjct: 338 --GGQNYELHPEKYILKVSQAGKTICLSGFMGMDIPPPSGPLWILGDVFIGCYYTVFDRE 395

Query: 449 NLKLAWSHS 457
             ++ ++ +
Sbjct: 396 YNRVGFAKA 404


>sp|P0DJD7|PEPA4_HUMAN Pepsin A-4 OS=Homo sapiens GN=PGA4 PE=1 SV=1
          Length = 388

 Score = 50.1 bits (118), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 86/357 (24%), Positives = 144/357 (40%), Gaps = 62/357 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   + C+ L+ +       N + ++P  SS+ 
Sbjct: 76  YFGTIGIGTPAQDFTVVFDTGSSNLWVP--SVYCSSLACT-------NHNRFNPEDSSTY 126

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           ++ S +  +     S                     +G L  D + +   S         
Sbjct: 127 QSTSETVSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL        G   V   +   GL+ Q+ FS+   
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS 218

Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI-GNSCLTQSGFQALV 337
            +D SGSV  F G        S +++P+    + Y+ + V+S  + G +     G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPV--TVEGYWQITVDSITMNGEAIACAEGCQAIV 276

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           D+G S    PT   A +        +S      G+    C   SS     +PD+    + 
Sbjct: 277 DTGTSLLTGPTSPIANIQSDIGASENSD-----GDMVVSCSAISS-----LPDIVFTING 326

Query: 398 NQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            Q  V  +      E    + F  + + +  G+  I+G  F+  +  VFDR N ++ 
Sbjct: 327 VQYPVPPSAYILQSEGSCISGFQGMNLPTESGELWILGDVFIRQYFTVFDRANNQVG 383


>sp|P0DJD8|PEPA3_HUMAN Pepsin A-3 OS=Homo sapiens GN=PGA3 PE=1 SV=1
          Length = 388

 Score = 50.1 bits (118), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 86/357 (24%), Positives = 144/357 (40%), Gaps = 62/357 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   + C+ L+ +       N + ++P  SS+ 
Sbjct: 76  YFGTIGIGTPAQDFTVVFDTGSSNLWVP--SVYCSSLACT-------NHNRFNPEDSSTY 126

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           ++ S +  +     S                     +G L  D + +   S         
Sbjct: 127 QSTSETVSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL        G   V   +   GL+ Q+ FS+   
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS 218

Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI-GNSCLTQSGFQALV 337
            +D SGSV  F G        S +++P+    + Y+ + V+S  + G +     G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPV--TVEGYWQITVDSITMNGEAIACAEGCQAIV 276

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           D+G S    PT   A +        +S      G+    C   SS     +PD+    + 
Sbjct: 277 DTGTSLLTGPTSPIANIQSDIGASENSD-----GDMVVSCSAISS-----LPDIVFTING 326

Query: 398 NQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            Q  V  +      E    + F  + + +  G+  I+G  F+  +  VFDR N ++ 
Sbjct: 327 VQYPVPPSAYILQSEGSCISGFQGMNLPTESGELWILGDVFIRQYFTVFDRANNQVG 383


>sp|Q9XFX3|CARDA_CYNCA Procardosin-A OS=Cynara cardunculus GN=cardA PE=1 SV=1
          Length = 504

 Score = 50.1 bits (118), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 64/256 (25%), Positives = 101/256 (39%), Gaps = 49/256 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  I IGTP   F V  D GS++LWVP  +CI      A          S Y+ S SS+
Sbjct: 85  YFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKCINSKACRAH---------SMYESSDSST 135

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            K       +     S          I  + ++D+ + G LV                 V
Sbjct: 136 YKENGTFGAIIYGTGS----------ITGFFSQDSVTIGDLV-----------------V 168

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP---SLLAKAGLIQNSFSICF---- 281
           +    I    +    +L     DG++GL    +SVP   ++L +  + +  FS       
Sbjct: 169 KEQDFIEATDEADNVFLH-RLFDGILGLSFQTISVPVWYNMLNQGLVKERRFSFWLNRNV 227

Query: 282 DENDSGSVFFGDQGPAT-QQSTSFLPIGEKYDAYFVGVESYCIGN--SCLTQSGFQALVD 338
           DE + G + FG   P   +   +++P+  +Y   F G+    IG+  +     G QA  D
Sbjct: 228 DEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQF-GIGDVLIGDKSTGFCAPGCQAFAD 286

Query: 339 SGASFTFLPTEIYAEV 354
           SG S    PT I  ++
Sbjct: 287 SGTSLLSGPTAIVTQI 302


>sp|P32329|YPS1_YEAST Aspartic proteinase 3 OS=Saccharomyces cerevisiae (strain ATCC
           204508 / S288c) GN=YPS1 PE=1 SV=2
          Length = 569

 Score = 49.7 bits (117), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 54/243 (22%), Positives = 107/243 (44%), Gaps = 49/243 (20%)

Query: 252 GVMGLGLGDVSV-------------------PSLLAKAGLIQ-NSFSICFDENDS--GSV 289
           GV+G+GL ++ V                   P +L  +G I+ N++S+  +++D+  G++
Sbjct: 249 GVLGIGLPELEVTYSGSTASHSGKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTI 308

Query: 290 FFGDQGPATQQSTSF-LPIGE-----------KYDAYF--VGVESYCIGNSCLTQSGFQA 335
            FG    +    T + +PI             ++D     +G+      N  LT +   A
Sbjct: 309 LFGAVDHSKYTGTLYTIPIVNTLSASGFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPA 368

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           L+DSG + T+LP  + + +  +     SS RI        Y  +  S++      M ++F
Sbjct: 369 LLDSGTTLTYLPQTVVSMIATELGAQYSS-RIGY------YVLDCPSDD-----SMEIVF 416

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
                F +   + SF  + G T     + ++D    I+G +F+    +V+D ENL+++ +
Sbjct: 417 DFG-GFHINAPLSSFILSTGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEISMA 475

Query: 456 HSK 458
            ++
Sbjct: 476 QAR 478


>sp|P27677|PEPA2_MACFU Pepsin A-2/A-3 OS=Macaca fuscata fuscata PE=1 SV=1
          Length = 388

 Score = 49.3 bits (116), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 142/370 (38%), Gaps = 88/370 (23%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   + C+ L+ +       N + ++P  SS+ 
Sbjct: 76  YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SVYCSSLACT-------NHNRFNPQDSSTY 126

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           ++ S +  +     S                     +G L  D + +   S         
Sbjct: 127 QSTSGTVSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL        G   V   +   GL+ Q+ FS+   
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS 218

Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
            +D SGSV  F G        S +++P+  E Y  + + V+S  + G +     G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGY--WQISVDSITMNGEAIACAEGCQAIV 276

Query: 338 DSGASFTFLPTEIYA--------------EVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
           D+G S    PT   A              E+VV    + S   I    N  +Y       
Sbjct: 277 DTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISSLPDIVFTINGIQY------- 329

Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
               VP    I     S +            GF    + V +  G+  I+G  F+  +  
Sbjct: 330 ---PVPPSAYILQSQGSCI-----------SGFQ--GMDVPTESGELWILGDVFIRQYFT 373

Query: 444 VFDRENLKLA 453
           VFDR N ++ 
Sbjct: 374 VFDRANNQVG 383


>sp|Q3EBM5|ASPR1_ARATH Probable aspartic protease At2g35615 OS=Arabidopsis thaliana
           GN=At2g35615 PE=3 SV=1
          Length = 447

 Score = 49.3 bits (116), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 87/397 (21%), Positives = 151/397 (38%), Gaps = 87/397 (21%)

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +  I IGTP +      D GS+L WV C+ C QC             N   +D   SS+ 
Sbjct: 86  FMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQC----------YKENGPIFDKKKSSTY 135

Query: 170 KNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           K+  C    C++ SS    C    + C Y   Y  + + S G +  + + + S S  +P 
Sbjct: 136 KSEPCDSRNCQALSSTERGCDESNNICKYRYSYG-DQSFSKGDVATETVSIDSASG-SPV 193

Query: 226 SSVQSSVIIGCGRKQTGSYLD----------------------------------GAAPD 251
           S      + GCG    G++ +                                   A  +
Sbjct: 194 SF--PGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTN 251

Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVFFGDQGPATQQSTSF--LPI 307
           G   + LG  S+PS L+K               DSG  S    D+ P T    +   + +
Sbjct: 252 GTSVINLGTNSIPSSLSK---------------DSGVVSTPLVDKEPLTYYYLTLEAISV 296

Query: 308 GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS-SKR 366
           G+K   Y  G       +  L+++    ++DSG + T L    + +     ++ V+ +KR
Sbjct: 297 GKKKIPY-TGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKR 355

Query: 367 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR----NHIFSFPENEGFTVFCLT 422
           +S       +C+ + S E + +P++ + F+      VR    N      E+    + CL+
Sbjct: 356 VSDPQGLLSHCFKSGSAE-IGLPEITVHFTGAD---VRLSPINAFVKLSED----MVCLS 407

Query: 423 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           ++ T  +  I G    M   + +D E   +++ H  C
Sbjct: 408 MVPTT-EVAIYGNFAQMDFLVGYDLETRTVSFQHMDC 443


>sp|P07339|CATD_HUMAN Cathepsin D OS=Homo sapiens GN=CTSD PE=1 SV=1
          Length = 412

 Score = 49.3 bits (116), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 89/388 (22%), Positives = 152/388 (39%), Gaps = 65/388 (16%)

Query: 94  SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS 153
           +EG       N     +Y  I IGTP   F V  D GS+ LWVP   I C  L       
Sbjct: 63  TEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVP--SIHCKLL------- 113

Query: 154 LDRNLSEYDPSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 212
                            +++C  H    S  S   +K+   +   Y +   S SGYL  D
Sbjct: 114 -----------------DIACWIHHKYNSDKSSTYVKNGTSFDIHYGS--GSLSGYLSQD 154

Query: 213 ILHLASFSKHAPQSSVQSSV---IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV------ 263
            + +   S  +  +     V   + G   KQ G     A  DG++G+    +SV      
Sbjct: 155 TVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPV 214

Query: 264 -PSLLAKAGLIQNSFSICF----DENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYF-V 316
             +L+ +  + QN FS       D    G +  G       + S S+L +  K  AY+ V
Sbjct: 215 FDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRK--AYWQV 272

Query: 317 GVESYCIGNS-CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 375
            ++   + +   L + G +A+VD+G S    P     + V +  K + +  + +QG    
Sbjct: 273 HLDQVEVASGLTLCKEGCEAIVDTGTSLMVGPV----DEVRELQKAIGAVPL-IQGE--- 324

Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GD 429
             Y    E++  +P + L     + + +    ++   ++     CL+  M  D     G 
Sbjct: 325 --YMIPCEKVSTLPAITLKLG-GKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGP 381

Query: 430 YGIIGQNFMMGHRIVFDRENLKLAWSHS 457
             I+G  F+  +  VFDR+N ++ ++ +
Sbjct: 382 LWILGDVFIGRYYTVFDRDNNRVGFAEA 409


>sp|O96009|NAPSA_HUMAN Napsin-A OS=Homo sapiens GN=NAPSA PE=1 SV=1
          Length = 420

 Score = 49.3 bits (116), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 96/410 (23%), Positives = 151/410 (36%), Gaps = 81/410 (19%)

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           L L   W+      KL + +   +   +  S      +FG          I +GTP  +F
Sbjct: 41  LNLLRGWREPAELPKLGAPSPGDKPIFVPLSNYRDVQYFGE---------IGLGTPPQNF 91

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
            VA D GS+ LWVP +  +C   S   +         +DP +SSS +             
Sbjct: 92  TVAFDTGSSNLWVPSR--RCHFFSVPCWLH-----HRFDPKASSSFQANGTK-------- 136

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
                     +   Y T      G L +D L +             +SVI G    +   
Sbjct: 137 ----------FAIQYGTGRV--DGILSEDKLTIGGIKG--------ASVIFGEALWEPSL 176

Query: 244 YLDGAAPDGVMGLGLGDVSVPS------LLAKAGLIQN---SFSICFD--ENDSGSVFFG 292
               A  DG++GLG   +SV        +L + GL+     SF +  D  E D G +  G
Sbjct: 177 VFAFAHFDGILGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLG 236

Query: 293 DQGPATQ-QSTSFLPIGEKYDAYF-VGVESYCIGNS-CLTQSGFQALVDSGASFTFLPT- 348
              PA      +F+P+     AY+ + +E   +G    L   G  A++D+G S    PT 
Sbjct: 237 GSDPAHYIPPLTFVPV--TVPAYWQIHMERVKVGPGLTLCAKGCAAILDTGTSLITGPTE 294

Query: 349 EIYA-EVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
           EI A    +    L++ + I L              E+ K+P +  +      F +  H 
Sbjct: 295 EIRALHAAIGGIPLLAGEYIIL------------CSEIPKLPAVSFLLG-GVWFNLTAHD 341

Query: 408 FSFPENEGFTVFCLT------VMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
           +           CL+      V    G + I+G  F+  +  VFDR ++K
Sbjct: 342 YVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFLGTYVAVFDRGDMK 391


>sp|Q05744|CATD_CHICK Cathepsin D OS=Gallus gallus GN=CTSD PE=1 SV=1
          Length = 398

 Score = 48.9 bits (115), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 91/373 (24%), Positives = 150/373 (40%), Gaps = 76/373 (20%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           +Y  I IGTP   F V  D GS+ LWVP   C  +  A L             +YD S S
Sbjct: 78  YYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLH----------HKYDASKS 127

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+                   +++   +   Y T   S SG+L  D + L +        
Sbjct: 128 ST------------------YVENGTEFAIHYGT--GSLSGFLSQDTVTLGNLKI----- 162

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLIQ-NSFSI 279
               + I G   KQ G     A  DG++G+    +SV  +      + +  LI+ N FS 
Sbjct: 163 ---KNQIFGEAVKQPGITFIAAKFDGILGMAFPRISVDKVTPFFDNVMQQKLIEKNIFSF 219

Query: 280 CFDENDS----GSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNS-CLTQSGF 333
             + + +    G +  G   P    S  F  +     AY+ V ++S  + N   L + G 
Sbjct: 220 YLNRDPTAQPGGELLLGGTDPK-YYSGDFSWVNVTRKAYWQVHMDSVDVANGLTLCKGGC 278

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
           +A+VD+G S    PT+   E+       + +K + ++G      Y  S +++  +P + L
Sbjct: 279 EAIVDTGTSLITGPTKEVKEL----QTAIGAKPL-IKGQ-----YVISCDKISSLPVVTL 328

Query: 394 IF-SKNQSFVVRNHIFSFPENEGFTVFCLT------VMSTDGDYGIIGQNFMMGHRIVFD 446
           +   K        ++F     +G T+ CL+      V    G   I+G  F+  +  VFD
Sbjct: 329 MLGGKPYQLTGEQYVFKV-SAQGETI-CLSGFSGLDVPPPGGPLWILGDVFIGPYYTVFD 386

Query: 447 RENLKLAWSHSKC 459
           R+N  + +  +KC
Sbjct: 387 RDNDSVGF--AKC 397


>sp|P00792|PEPA_BOVIN Pepsin A OS=Bos taurus GN=PGA PE=1 SV=2
          Length = 372

 Score = 48.5 bits (114), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 88/362 (24%), Positives = 151/362 (41%), Gaps = 72/362 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   I C+  + +       N + ++P  SS+ 
Sbjct: 60  YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SIYCSSEACT-------NHNRFNPQDSSTY 110

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           +  S +  +     S                     +G L  D + +   S         
Sbjct: 111 EATSETLSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 142

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL        G   V   +   GL+ Q+ FS+   
Sbjct: 143 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLS 202

Query: 283 EN-DSGS-VFFGD-QGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
            N +SGS V FGD        S +++P+  E Y  + + V+S  + G S     G QA+V
Sbjct: 203 SNEESGSVVIFGDIDSSYYSGSLNWVPVSVEGY--WQITVDSITMNGESIACSDGCQAIV 260

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           D+G S    PT   +         + S   + + +S +   + SS + L  PD+    + 
Sbjct: 261 DTGTSLLAGPTTAISN--------IQSYIGASEDSSGEVVISCSSIDSL--PDIVFTING 310

Query: 398 NQ------SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            Q      +++++++       EG     + + ++ GD  I+G  F+  +  VFDR N +
Sbjct: 311 VQYPVPPSAYILQSNGICSSGFEG-----MDISTSSGDLWILGDVFIRQYFTVFDRGNNQ 365

Query: 452 LA 453
           + 
Sbjct: 366 IG 367


>sp|P07267|CARP_YEAST Saccharopepsin OS=Saccharomyces cerevisiae (strain ATCC 204508 /
           S288c) GN=PEP4 PE=1 SV=1
          Length = 405

 Score = 48.1 bits (113), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/382 (23%), Positives = 149/382 (39%), Gaps = 71/382 (18%)

Query: 86  SRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAP 145
           SR    F +EG       N     +YT I +GTP  +F V LD GS+ LWVP    +C  
Sbjct: 68  SREHPFF-TEGGHDVPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSN--ECGS 124

Query: 146 LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS 205
           L+   +       S+YD  +SSS K       +     S +           Y ++DT S
Sbjct: 125 LACFLH-------SKYDHEASSSYKANGTEFAIQYGTGSLEG----------YISQDTLS 167

Query: 206 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 265
            G L       A       +++ +  +    G+            DG++GLG   +SV  
Sbjct: 168 IGDLTIPKQDFA-------EATSEPGLTFAFGKF-----------DGILGLGYDTISVDK 209

Query: 266 L-------LAKAGLIQNSFSICFD------ENDSGSVFFGDQGPATQQSTSFLPIGEKYD 312
           +       + +  L +  F+          EN   + F G      +   ++LP+  K  
Sbjct: 210 VVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRK-- 267

Query: 313 AYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
           AY+ V  E   +G+         A +D+G S   LP+ + AE++   +  + +K+     
Sbjct: 268 AYWEVKFEGIGLGDEYAELESHGAAIDTGTSLITLPSGL-AEMI---NAEIGAKK----- 318

Query: 372 NSWKYCYNASSEEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEGFTVFCLTVMSTD--- 427
             W   Y         +PD  LIF+ N  +F +  + ++  E  G  +  +T M      
Sbjct: 319 -GWTGQYTLDCNTRDNLPD--LIFNFNGYNFTIGPYDYTL-EVSGSCISAITPMDFPEPV 374

Query: 428 GDYGIIGQNFMMGHRIVFDREN 449
           G   I+G  F+  +  ++D  N
Sbjct: 375 GPLAIVGDAFLRKYYSIYDLGN 396


>sp|P80209|CATD_BOVIN Cathepsin D OS=Bos taurus GN=CTSD PE=1 SV=2
          Length = 390

 Score = 47.8 bits (112), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 85/373 (22%), Positives = 143/373 (38%), Gaps = 69/373 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I IGTP   F V  D GS  LWVP   I C  L  + +T    N            
Sbjct: 59  YYGEIGIGTPPQCFTVVFDTGSANLWVP--SIHCKLLDIACWTHRKYN------------ 104

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA-SFSKHAPQSSV 228
                      S  S   +K+   +   Y +   S SGYL  D + +  + S  +P    
Sbjct: 105 -----------SDKSSTYVKNGTTFDIHYGS--GSLSGYLSQDTVSVPCNPSSSSPGGVT 151

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSG 287
                 G   KQ G     A  DG++G+    +SV ++L     L+Q       D+N   
Sbjct: 152 VQRQTFGEAIKQPGVVFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKL---VDKNVFS 208

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVG----------------VESYCIGNS-CLTQ 330
             FF ++ P  Q     + +G     Y+ G                ++   +G+S  + +
Sbjct: 209 --FFLNRDPKAQPGGELM-LGGTDSKYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLTVCK 265

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
            G +A+VD+G S    P     E V +  K + +  + +QG     C   SS     +P+
Sbjct: 266 GGCEAIVDTGTSLIVGPV----EEVRELQKAIGAVPL-IQGEYMIPCEKVSS-----LPE 315

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHRIV 444
           + +     + + +    ++   ++  T  CL+  M  D     G   I+G  F+  +  V
Sbjct: 316 VTVKLG-GKDYALSPEDYALKVSQAETTVCLSGFMGMDIPPPGGPLWILGDVFIGRYYTV 374

Query: 445 FDRENLKLAWSHS 457
           FDR+  ++  + +
Sbjct: 375 FDRDQNRVGLAEA 387


>sp|P11489|PEPA_MACMU Pepsin A OS=Macaca mulatta GN=PGA PE=2 SV=1
          Length = 388

 Score = 47.4 bits (111), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/257 (26%), Positives = 109/257 (42%), Gaps = 51/257 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   + C+ L+ +     + NL  ++P  SS+ 
Sbjct: 76  YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SVYCSSLACT-----NHNL--FNPQDSSTY 126

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           ++ S +  +     S                     +G L  D + +   S         
Sbjct: 127 QSTSGTLSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL        G   V   +   GL+ Q+ FS+   
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLS 218

Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
            +D SGSV  F G        S +++P+  E Y  + + V+S  + G +     G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGY--WQISVDSITMNGEAIACAEGCQAIV 276

Query: 338 DSGASFTFLPTEIYAEV 354
           D+G S    PT   A +
Sbjct: 277 DTGTSLLTGPTSPIANI 293


>sp|P03954|PEPA1_MACFU Pepsin A-1 OS=Macaca fuscata fuscata GN=PGA PE=1 SV=2
          Length = 388

 Score = 47.4 bits (111), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/257 (26%), Positives = 109/257 (42%), Gaps = 51/257 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   + C+ L+ +     + NL  ++P  SS+ 
Sbjct: 76  YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SVYCSSLACT-----NHNL--FNPQDSSTY 126

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           ++ S +  +     S                     +G L  D + +   S         
Sbjct: 127 QSTSGTLSITYGTGSM--------------------TGILGYDTVQVGGISD-------- 158

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDVSVPSLLAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL        G   V   +   GL+ Q+ FS+   
Sbjct: 159 TNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLS 218

Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALV 337
            +D SGSV  F G        S +++P+  E Y  + + V+S  + G +     G QA+V
Sbjct: 219 ADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGY--WQISVDSITMNGEAIACAEGCQAIV 276

Query: 338 DSGASFTFLPTEIYAEV 354
           D+G S    PT   A +
Sbjct: 277 DTGTSLLTGPTSPIANI 293


>sp|Q9GMY6|PEPA_CANFA Pepsin A OS=Canis familiaris GN=PGA PE=2 SV=1
          Length = 386

 Score = 47.0 bits (110), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 98/401 (24%), Positives = 152/401 (37%), Gaps = 93/401 (23%)

Query: 77  VKLQSNNNSSRNQLLFPSEGS--QTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
           +K QS N +S+    FP E +   T    N     ++  I IGTP   F V  D GS+ L
Sbjct: 42  LKNQSPNPASK---YFPQEPTVLATQSLKNYMDMEYFGTIGIGTPPQEFTVIFDTGSSNL 98

Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
           WVP   + C+  + S       N + ++P  SS+ +  +                   P 
Sbjct: 99  WVP--SVYCSSPACS-------NHNRFNPQESSTYQGTN------------------RPV 131

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
              Y T   S +G L  D + +   +         ++ I G    + GS+L  A  DG++
Sbjct: 132 SIAYGT--GSMTGILGYDTVQVGGIAD--------TNQIFGLSETEPGSFLYYAPFDGIL 181

Query: 255 GLGLGDVSVPSL------LAKAGLI-QNSFSICFDEND-SGSV--FFGDQGPATQQSTSF 304
           GL    +S          +   GL+ Q+ FS+    +D SGSV  F G        + ++
Sbjct: 182 GLAYPQISASGATPVFDNMWNEGLVSQDLFSVYLSSDDQSGSVVMFGGIDSSYYSGNLNW 241

Query: 305 LPIG-EKYDAYFVGVESYCI-GNSCLTQSGFQALVDSGASFTFLPTEI------------ 350
           +P+  E Y  + + V+S  + G +     G QA+VD+G S    PT              
Sbjct: 242 VPVSVEGY--WQITVDSVTMNGQAIACSDGCQAIVDTGTSLLAGPTNAIANIQSYIGASQ 299

Query: 351 --YAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
             Y ++V+    + S   I    N  +Y           +P    I    Q  V      
Sbjct: 300 NSYGQMVISCSAINSLPDIVFTINGIQY----------PLPPSAYILQSQQGCV------ 343

Query: 409 SFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
                 GF    L   S  G+  I+G  F+  +  VFDR N
Sbjct: 344 -----SGFQGMNLPTAS--GELWILGDVFIRQYFAVFDRAN 377


>sp|O65390|APA1_ARATH Aspartic proteinase A1 OS=Arabidopsis thaliana GN=APA1 PE=1 SV=1
          Length = 506

 Score = 47.0 bits (110), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 101/254 (39%), Gaps = 49/254 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I IGTP   F V  D GS+ LWVP         S+  Y SL   L           
Sbjct: 82  YYGEIAIGTPPQKFTVVFDTGSSNLWVP---------SSKCYFSLACLL----------- 121

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
                 HP  KS  S    K+       Y T   + +G+  +D + +        Q  ++
Sbjct: 122 ------HPKYKSSRSSTYEKNGKAAAIHYGT--GAIAGFFSNDAVTVGDLVVK-DQEFIE 172

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLIQNS-FSICF- 281
           ++       K+ G     A  DG++GLG  ++SV         + K GLI+   FS    
Sbjct: 173 AT-------KEPGITFVVAKFDGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLN 225

Query: 282 ---DENDSGSVFFGDQGPAT-QQSTSFLPIGEK-YDAYFVGVESYCIGNSCLTQSGFQAL 336
              DE + G + FG   P   +   +++P+ +K Y  + +G        +   +SG  A+
Sbjct: 226 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAI 285

Query: 337 VDSGASFTFLPTEI 350
            DSG S    PT I
Sbjct: 286 ADSGTSLLAGPTTI 299


>sp|P03955|PEPC_MACFU Gastricsin (Fragment) OS=Macaca fuscata fuscata GN=PGC PE=1 SV=2
          Length = 377

 Score = 46.2 bits (108), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 66/257 (25%), Positives = 100/257 (38%), Gaps = 59/257 (22%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           ++  I IGTP  +FLV  D GS+ LWVP   CQ   C             + S ++PS S
Sbjct: 62  YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACT------------SHSRFNPSES 109

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+      +  L     S                     +G+   D L + S     P  
Sbjct: 110 STYSTNGQTFSLQYGSGSL--------------------TGFFGYDTLTVQSI--QVPNQ 147

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNS------FSI 279
                   G    + G+    A  DG+MGL    +SV  +  A  G++Q        FS+
Sbjct: 148 E------FGLSENEPGTNFVYAQFDGIMGLAYPTLSVDGATTAMQGMVQEGALTSPIFSV 201

Query: 280 CFDENDS---GSVFFG--DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN--SCLTQSG 332
              +      G+V FG  D    T Q   + P+ ++   + +G+E + IG   S     G
Sbjct: 202 YLSDQQGSSGGAVVFGGVDSSLYTGQ-IYWAPVTQEL-YWQIGIEEFLIGGQASGWCSEG 259

Query: 333 FQALVDSGASFTFLPTE 349
            QA+VD+G S   +P +
Sbjct: 260 CQAIVDTGTSLLTVPQQ 276


>sp|P27822|PEPA3_RABIT Pepsin-3 OS=Oryctolagus cuniculus PE=2 SV=1
          Length = 387

 Score = 46.2 bits (108), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 82/360 (22%), Positives = 145/360 (40%), Gaps = 68/360 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I IGTP   F V  D GS+ LWVP   + C+  + S +       ++++P  SS+ 
Sbjct: 75  YFGTIGIGTPAQDFTVIFDTGSSNLWVP--SVYCSSAACSVH-------NQFNPEDSSTF 125

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           +  S S  +     S                     +G+L  D + + +           
Sbjct: 126 QATSESLSITYGTGSM--------------------TGFLGYDTVKVGNIE--------D 157

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS------VPSLLAKAGLI-QNSFSICFD 282
           ++ I G    + GS+L  A  DG++GL    +S      V   +   GL+ ++ FS+   
Sbjct: 158 TNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATPVFDNMWNEGLVSEDLFSVYLS 217

Query: 283 END-SGSV--FFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI-GNSCLTQSGFQALV 337
            +D SGSV  F G        S +++P+   Y+ Y+ + ++S  + G +       QA+V
Sbjct: 218 SDDESGSVVMFGGIDSSYYTGSLNWVPV--SYEGYWQITLDSITMDGETIACADSCQAIV 275

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           D+G S    PT   + +        +S    +           S   M  +P++    + 
Sbjct: 276 DTGTSLLAGPTSAISNIQSYIGASENSDGEMI----------VSCSSMYSLPNIVFTING 325

Query: 398 NQSFVVRNHIFSFPENE----GFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            Q + V    +   E++    GF    L   +  G+  I+G  F+  +  VFDR N +L 
Sbjct: 326 VQ-YPVPASAYILEEDDACISGFEGMNLDTYT--GELWILGDVFIRQYFTVFDRANNQLG 382


>sp|P0CY27|CARP1_CANAL Candidapepsin-1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
           GN=SAP1 PE=1 SV=1
          Length = 391

 Score = 46.2 bits (108), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 76/361 (21%), Positives = 139/361 (38%), Gaps = 64/361 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I IG+    F V +D GS+ LWVP   + C            +    Y P SS++S+N+ 
Sbjct: 68  ITIGSNKQKFNVIVDTGSSDLWVPDASVTCDKPRPGQSADFCKGKGIYTPKSSTTSQNLG 127

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQSSVQSS 231
                              P+   Y  + +SS G L  D +    AS +K       ++S
Sbjct: 128 T------------------PFYIGYG-DGSSSQGTLYKDTVGFGGASITKQVFADITKTS 168

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGL------GDV-SVPSLLAKAGLI-QNSFSICFDE 283
           +                 P G++G+G       GD  +VP  L   G+I +N++S+  + 
Sbjct: 169 I-----------------PQGILGIGYKTNEAAGDYDNVPVTLKNQGVIAKNAYSLYLNS 211

Query: 284 ND--SGSVFFGDQGPATQQSTSF-LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 340
            +  +G + FG    A    +   +P+    +          +G +         L+DSG
Sbjct: 212 PNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI--NGNIDVLLDSG 269

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
            + T+L  ++  +++  F   + S     QG+++ Y  +  +   +        F  N  
Sbjct: 270 TTITYLQQDVAQDIIDAFQAELKSDG---QGHTF-YVTDCQTSGTVD-----FNFDNNAK 320

Query: 401 FVVRNHIFSFP---ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
             V    F+ P    N      C  ++    D  I+G NF+    +V+D ++ K++ +  
Sbjct: 321 ISVPASEFTAPLSYANGQPYPKCQLLLGIS-DANILGDNFLRSAYLVYDLDDDKISLAQV 379

Query: 458 K 458
           K
Sbjct: 380 K 380


>sp|C4YSF6|CARP1_CANAW Candidapepsin-1 OS=Candida albicans (strain WO-1) GN=SAP1 PE=1 SV=1
          Length = 391

 Score = 45.8 bits (107), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 89/426 (20%), Positives = 160/426 (37%), Gaps = 76/426 (17%)

Query: 49  VADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYW 108
           + D+ P K S  ++ L    D+   KT V         + Q L P   +  H        
Sbjct: 15  LVDASPAKRSPGFVTL----DFDVIKTPVNATGQEGKVKRQAL-PVTLNNEHVS------ 63

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
            +   I IG+    F V +D GS+ LWVP   + C            +    Y P SS++
Sbjct: 64  -YAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDKPRPGQSADFCKGKGIYTPKSSTT 122

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQS 226
           S+N+                    P+   Y  + +SS G L  D +    AS +K     
Sbjct: 123 SQNLGT------------------PFYIGYG-DGSSSQGTLYKDTVGFGGASITKQVFAD 163

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL------GDV-SVPSLLAKAGLI-QNSFS 278
             ++S+                 P G++G+G       GD  +VP  L   G+I +N++S
Sbjct: 164 ITKTSI-----------------PQGILGIGYKTNEAAGDYDNVPVTLKNQGVIAKNAYS 206

Query: 279 ICFDEND--SGSVFFGDQGPATQQSTSF-LPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 335
           +  +  +  +G + FG    A    +   +P+    +          +G +         
Sbjct: 207 LYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI--NGNIDV 264

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           L+DSG + T+L  ++  +++  F   + S     QG+++ Y  +  +   +        F
Sbjct: 265 LLDSGTTITYLQQDVAQDIIDAFQAELKSDG---QGHTF-YVTDCQTSGTVD-----FNF 315

Query: 396 SKNQSFVVRNHIFSFP---ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             N    V    F+ P    N      C  ++    D  I+G NF+    +V+D ++ K+
Sbjct: 316 DNNVKISVPASEFTAPLSYANGQPYPKCQLLLGIS-DANILGDNFLRSAYLVYDLDDDKI 374

Query: 453 AWSHSK 458
           + +  K
Sbjct: 375 SLAQVK 380


>sp|P20142|PEPC_HUMAN Gastricsin OS=Homo sapiens GN=PGC PE=1 SV=1
          Length = 388

 Score = 45.8 bits (107), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 66/257 (25%), Positives = 99/257 (38%), Gaps = 59/257 (22%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           ++  I IGTP  +FLV  D GS+ LWVP   CQ   C             + S ++PS S
Sbjct: 73  YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACT------------SHSRFNPSES 120

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+      +  L     S                     +G+   D L + S     P  
Sbjct: 121 STYSTNGQTFSLQYGSGSL--------------------TGFFGYDTLTVQSI--QVPNQ 158

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-PSLLAKAGLIQNS------FSI 279
                   G    + G+    A  DG+MGL    +SV  +  A  G++Q        FS+
Sbjct: 159 E------FGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQGMVQEGALTSPVFSV 212

Query: 280 CFDEND---SGSVFFG--DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN--SCLTQSG 332
                     G+V FG  D    T Q   + P+ ++   + +G+E + IG   S     G
Sbjct: 213 YLSNQQGSSGGAVVFGGVDSSLYTGQ-IYWAPVTQEL-YWQIGIEEFLIGGQASGWCSEG 270

Query: 333 FQALVDSGASFTFLPTE 349
            QA+VD+G S   +P +
Sbjct: 271 CQAIVDTGTSLLTVPQQ 287


>sp|P55956|ASP3_CAEEL Aspartic protease 3 OS=Caenorhabditis elegans GN=asp-3 PE=1 SV=2
          Length = 398

 Score = 45.8 bits (107), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 76/373 (20%), Positives = 137/373 (36%), Gaps = 70/373 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  + IGTP  +F V  D GS+ LWVPC       ++   +   D              
Sbjct: 69  YYGPVTIGTPPQNFQVLFDTGSSNLWVPCANCPFGDIACRMHNRFD-------------- 114

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
                    CK  SSC +      +   Y T   S  G + +D++       H       
Sbjct: 115 ---------CKKSSSCTATG--ASFEIQYGT--GSMKGTVDNDVVCFG----HDTTYCTD 157

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQN---SFSI 279
            +  + C   + G     A  DG+ G+G   +SV         + A + + +N   +F +
Sbjct: 158 KNQGLACATSEPGITFVAAKFDGIFGMGWDTISVNKISQPMDQIFANSAICKNQLFAFWL 217

Query: 280 CFDEND---SGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 335
             D ND    G +   +  P     + ++ P+  + D + + + S  I  +  T     +
Sbjct: 218 SRDANDITNGGEITLCETDPNHYVGNIAWEPLVSE-DYWRIKLASVVIDGTTYTSGPIDS 276

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDM-RL 393
           +VD+G S    PT++  ++  K   +                +N   E E  K+P +  +
Sbjct: 277 IVDTGTSLLTGPTDVIKKIQHKIGGIP--------------LFNGEYEVECSKIPSLPNI 322

Query: 394 IFS---KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNFMMGHRIVF 445
            F+   +N     +++I       G +      M  D     G   I+G  F+     VF
Sbjct: 323 TFNLGGQNFDLQGKDYILQMSNGNGGSTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVF 382

Query: 446 DRENLKLAWSHSK 458
           D  N ++ ++ S+
Sbjct: 383 DHGNKRVGFATSR 395


>sp|P42211|ASPRX_ORYSJ Aspartic proteinase OS=Oryza sativa subsp. japonica GN=RAP PE=2
           SV=2
          Length = 496

 Score = 45.4 bits (106), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/262 (25%), Positives = 109/262 (41%), Gaps = 57/262 (21%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSS 168
           +Y  I +G+P  +F V  D GS+ LWVP         SA  Y S+   L S Y+   SSS
Sbjct: 77  YYGVIGLGSPPQNFTVIFDTGSSNLWVP---------SAKCYFSIACYLHSRYNSKKSSS 127

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            K             +CK +      I+ + ++D    G LV                 V
Sbjct: 128 YK---------ADGETCK-ITYGSGAISGFFSKDNVLVGDLV-----------------V 160

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV-------PSLLAKAGLIQNSFSICF 281
           ++   I   R+ + +++ G   DG++GLG  ++SV        S+  +  L  + FS   
Sbjct: 161 KNQKFIEATRETSVTFIIGKF-DGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWL 219

Query: 282 ----DENDSGSVFFGDQGPATQQST-SFLPIGEK-YDAYFVG---VESYCIGNSCLTQSG 332
               D +  G + FG   P   +   +++P+  K Y  + +G   ++ +  G       G
Sbjct: 220 NRDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTG---FCAKG 276

Query: 333 FQALVDSGASFTFLPTEIYAEV 354
             A+VDSG S    PT I A+V
Sbjct: 277 CAAIVDSGTSLLAGPTAIVAQV 298


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.131    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 199,400,780
Number of Sequences: 539616
Number of extensions: 8448450
Number of successful extensions: 30679
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 63
Number of HSP's successfully gapped in prelim test: 131
Number of HSP's that attempted gapping in prelim test: 30261
Number of HSP's gapped (non-prelim): 380
length of query: 538
length of database: 191,569,459
effective HSP length: 122
effective length of query: 416
effective length of database: 125,736,307
effective search space: 52306303712
effective search space used: 52306303712
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (29.3 bits)