BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 039412
(433 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1
PE=1 SV=1
Length = 437
Score = 168 bits (425), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 172/353 (48%), Gaps = 30/353 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTPAQ MDT +D W PCT C S+ +FN S++F L C +
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVP 211
C+ + +PTC C + YG S ++ +T++ + +P TFGC + G
Sbjct: 155 CQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGN 214
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI------GQPKRIK 265
GL+G+GRG LSL +Q L + FSYC+ + S +L LG + G P
Sbjct: 215 GAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGS-STPSNLLLGSLANSVTAGSPN--- 267
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAP 324
T L+++ + + YY+ L + VG + I P A N G G IIDSGT T V
Sbjct: 268 -TTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNN 326
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLL 379
AY +VR F ++ + S GFD C+ P + PT + F G ++ LP +N
Sbjct: 327 AYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYF 386
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
I + G I CLAM ++ +++ N+QQQN ++YD NS + A C
Sbjct: 387 ISPSNGLI-CLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2
PE=1 SV=1
Length = 438
Score = 164 bits (414), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 183/389 (47%), Gaps = 32/389 (8%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQ 108
K L+ E + + + + R++ ++++ + + P+ +G Y++ IGTP
Sbjct: 53 KNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDG-----EYLMNVAIGTPDS 107
Query: 109 TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
+ MDT +D W PCT C + +FN S++F L C++ C+ +P+ TC
Sbjct: 108 SFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN 167
Query: 166 ACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQK----ATGNSVPPQGLLGLGR 220
C + YG ST ++ +T + T VP FGC + GN GL+G+G
Sbjct: 168 ECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGA---GLIGMGW 224
Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI--GQPKRIKYTPLLKNPRRSSL 278
G LSL +Q L FSYC+ S+ + S S +L LG G P+ T L+ + +
Sbjct: 225 GPLSLPSQ---LGVGQFSYCMTSYGSSSPS-TLALGSAASGVPEGSPSTTLIHSSLNPTY 280
Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
YY+ L I VG + IP Q G IIDSGT T L AY AV F ++
Sbjct: 281 YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN 340
Query: 339 SNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
S G TC+ P + P I++ F G + L + N+LI G I CLAM
Sbjct: 341 LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAEGVI-CLAMG 399
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPN 422
++ +++ N+QQQ ++LYD+ N
Sbjct: 400 SSS---QLGISIFGNIQQQETQVLYDLQN 425
>sp|Q9LHE3|ASPG2_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana
GN=ASPG2 PE=2 SV=1
Length = 470
Score = 161 bits (407), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 171/348 (49%), Gaps = 14/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR +G+P + M +D+ +D WV PC C S VF+ A+S ++ + C
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
++ C ++ N C G C + + YG S L+ +T++ A +V GC + G
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
+ GLLG+G GS+S + Q F YCL S + +GSL G P + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 306
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NPR S YYV L + VG + +P G T G ++D+GT TRL AY A
Sbjct: 307 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 366
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
RD F+ + + + + FDTCY V + PT++ F+ G +TLP N L+
Sbjct: 367 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 426
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A AA+P + L++I N+QQ+ ++ +D N +G +C
Sbjct: 427 SGTYCFAFAASP----TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
>sp|Q9LS40|ASPG1_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana
GN=ASPG1 PE=1 SV=1
Length = 500
Score = 153 bits (386), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 165/366 (45%), Gaps = 32/366 (8%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG S Y R +GTPA+ + + +DT +D W+ C C C S VFN
Sbjct: 150 PVVSGAS-QGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTS 208
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
S+T+K+L C A QC + C C + ++YG + LATD V TFG
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVG------ELATDTV---TFG 259
Query: 201 CIQK----ATGNSVPPQGLL----GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
K A G +GL GL +L+ T + ++FSYCL + S S
Sbjct: 260 NSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVD-RDSGKSSS 318
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
L + PLL+N + + YYV L VG V +P + + G I+
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378
Query: 313 DSGTVFTRLVAPAYTAVRDVF-RRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
D GT TRL AY ++RD F + V +S+ FDTCY + PT+ F+
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFT 438
Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G ++ LP N LI C A A +S L++I N+QQQ RI YD+ + +G
Sbjct: 439 GGKSLDLPAKNYLIPVDDSGTFCFAFAP----TSSSLSIIGNVQQQGTRITYDLSKNVIG 494
Query: 427 VARELC 432
++ C
Sbjct: 495 LSGNKC 500
>sp|Q9LZL3|PCS1L_ARATH Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
Length = 453
Score = 108 bits (271), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 161/368 (43%), Gaps = 48/368 (13%)
Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGCSS-TVFNSAQSTTFKNLGCQAAQCKQ------VP 158
P Q + M +DT ++ +W+ C + F+ +S+++ + C + C+ +P
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141
Query: 159 NPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQ--- 213
C L+Y +S+ NL+ + FGC+ +G S P +
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSG-SDPEEDTK 200
Query: 214 --GLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGSLRLGPIGQPKRIK 265
GLLG+ RGSLS ++Q + FSYC+ P F L S L P+ I+
Sbjct: 201 TTGLLGMNRGSLSFISQ---MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIR 257
Query: 266 Y-TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
TPL R + Y V L I+V +++ IP L + T T++DSGT FT L+ P
Sbjct: 258 ISTPLPYFDRVA--YTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGP 315
Query: 325 AYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA---------PTITLMFSGM 369
YTA+R F R LTV G D CY + V PT++L+F G
Sbjct: 316 VYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGA 375
Query: 370 NVTLPQDNLLI---HSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
+ + LL H T G S+ C + D + VI + QQN I +D+ SR
Sbjct: 376 EIAVSGQPLLYRVPHLTVGNDSVYCFTFGNS-DLMGMEAYVIGHHHQQNMWIEFDLQRSR 434
Query: 425 LGVARELC 432
+G+A C
Sbjct: 435 IGLAPVEC 442
>sp|Q6XBF8|CDR1_ARATH Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
Length = 437
Score = 102 bits (254), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 103/405 (25%), Positives = 167/405 (41%), Gaps = 33/405 (8%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
T + H SP SPF P+ L F K P +
Sbjct: 32 TADLIHRDSPKSPFY--NPMETSSQRLRNAIHRSVNRVF----HFTEKDNTPQPQIDLTS 85
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGC 149
S Y++ IGTP ++ DT +D W C C C + V F+ S+T+K++ C
Sbjct: 86 NSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSC 145
Query: 150 QAAQCKQVPNP---TCGGGACAFNLTYG-SSTIAANLSQDTISL-ATDIVP----GYTFG 200
++QC + N + C+++L+YG +S N++ DT++L ++D P G
Sbjct: 146 SSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIG 205
Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLG-- 256
C G + G++GLG G +SL+ Q + FSYCL P + + G
Sbjct: 206 CGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTN 265
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
I + TPL+ + + YY+ L +I VG + + + + ++ IIDSGT
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
T L Y+ + D + + G CYS + P IT+ F G +V L
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLD 382
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
N + + + C A +P ++ N+ Q N + YD
Sbjct: 383 SSNAFVQ-VSEDLVCFAFRGSPS-----FSIYGNVAQMNFLVGYD 421
>sp|Q3EBM5|ASPR1_ARATH Probable aspartic protease At2g35615 OS=Arabidopsis thaliana
GN=At2g35615 PE=3 SV=1
Length = 447
Score = 92.8 bits (229), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 119/473 (25%), Positives = 197/473 (41%), Gaps = 69/473 (14%)
Query: 1 MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVL 59
M Q++ F +LS +P + ++++ H SP SP + P
Sbjct: 1 MATQILLCFFLFFSVTLSSSGHP------KNFSVELIHRDSPLSPIYNP----------- 43
Query: 60 EMLAKDQARLQFLSSLAVARK-----SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
++ D+ FL S++ +R+ S + SG I + + IGTP +
Sbjct: 44 QITVTDRLNAAFLRSVSRSRRFNHQLSQTDLQSGL-IGADGEFFMSITIGTPPIKVFAIA 102
Query: 115 DTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG----GGAC 167
DT +D WV C C C +F+ +S+T+K+ C + C+ + + G C
Sbjct: 103 DTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNIC 162
Query: 168 AFNLTYGSSTIA-ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGR 220
+ +YG + + +++ +T+S+ + PG FGC G G++GLG
Sbjct: 163 KYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGG 222
Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS--LRLGPIGQPKRIKY------TPLL-K 271
G LSL++Q + FSYCL S K+ + +G+ + LG P + TPL+ K
Sbjct: 223 GHLSLISQLGSSISKKFSYCL-SHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDK 281
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT-------TGAGTIIDSGTVFTRLVAP 324
P + YY+ L AI VG++ IP +NP T IIDSGT T L A
Sbjct: 282 EPL--TYYYLTLEAISVGKK--KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAG 337
Query: 325 AYTAVRDVFRRRV-GSNLTVTSLGGFDTCY---SVPIVAPTITLMFSGMNVTLPQDNLLI 380
+ V G+ G C+ S I P IT+ F+G +V L N +
Sbjct: 338 FFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFV 397
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ + CL+M + + + N Q + + YD+ + C+
Sbjct: 398 K-LSEDMVCLSMVPTTE-----VAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444
>sp|Q9LX20|ASPL1_ARATH Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana
GN=At5g10080 PE=1 SV=1
Length = 528
Score = 86.3 bits (212), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 97/413 (23%), Positives = 167/413 (40%), Gaps = 61/413 (14%)
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTY----IVRAKIGTPAQTLLMAMDT 116
+LA+ R Q ++ L +S+VP + I+ + IGTP+ + L+A+DT
Sbjct: 61 LLAESDFRRQRMN-LGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDT 119
Query: 117 SNDAAWVPCTGCVGCSS--------------TVFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
++ W+PC CV C+ +N + S+T K C C +
Sbjct: 120 GSNLLWIPCN-CVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCES 178
Query: 163 GGGACAFNLTY--GSSTIAANLSQDTISL-----------ATDIVPGYTFGCIQKATG-- 207
C + + Y G+++ + L +D + L ++ + GC +K +G
Sbjct: 179 PKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDY 238
Query: 208 -NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
+ V P GL+GLG +S+ L +++FS C SG + G +G P
Sbjct: 239 LDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEED----SGRIYFGDMG-PSIQ 293
Query: 265 KYTPLLK-NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ TP L+ + + S Y V + A +G + T T IDSG FT L
Sbjct: 294 QSTPFLQLDNNKYSGYIVGVEACCIGNSCLK----------QTSFTTFIDSGQSFTYLPE 343
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGG--FDTCY--SVPIVAPTITLMFSGMNVTLPQDNLL 379
Y V R + N T + G ++ CY S P I L FS N + L
Sbjct: 344 EIYRKVALEIDRHI--NATSKNFEGVSWEYCYESSAEPKVPAIKLKFSHNNTFVIHKPLF 401
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + + + +P + ++ N + +R+++D N +LG + C
Sbjct: 402 VFQQSQGLVQFCLPISPSGQEGIGSIGQNY-MRGYRMVFDRENMKLGWSPSKC 453
>sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana
GN=At1g65240 PE=1 SV=2
Length = 475
Score = 84.3 bits (207), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/425 (22%), Positives = 174/425 (40%), Gaps = 61/425 (14%)
Query: 46 FKPSKPLSWEESVLEMLAKDQARLQ--FLSSLAVARKSVVPIASGRQITQSPTYIVRAKI 103
FK + ++ LE R L+S+ + P+ ++ Y + K+
Sbjct: 27 FKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDL------PLGGDSRVDSVGLYFTKIKL 80
Query: 104 GTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLGCQAAQCK 155
G+P + + +DT +D W+ C C C + ++F+ S+T K +GC C
Sbjct: 81 GSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCS 140
Query: 156 QVP-----NPTCGGGACAFNLTYG-SSTIAANLSQDTISLAT---DIVPG-----YTFGC 201
+ P G C++++ Y ST +D ++L D+ G FGC
Sbjct: 141 FISQSDSCQPALG---CSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGC 197
Query: 202 IQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRL 255
+G G++G G+ + S+L+Q + FS+CL + K G +
Sbjct: 198 GSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKG---GGIFAV 254
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
G + PK +K TP++ N Y V L+ + V +D+P ++ GTI+DSG
Sbjct: 255 GVVDSPK-VKTTPMVPNQMH---YNVMLMGMDVDGTSLDLPRSIVR-----NGGTIVDSG 305
Query: 316 TVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVT 372
T Y ++ + R+ V ++ + F +V P ++ F + +T
Sbjct: 306 TTLAYFPKVLYDSLIETILARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLT 365
Query: 373 L-PQDNLLIHSTAGSITCLAMAAA---PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ P D L + + C A D + V+ ++ ++ N ++YD+ N +G A
Sbjct: 366 VYPHDYLF--TLEEELYCFGWQAGGLTTDERSEVI-LLGDLVLSNKLVVYDLDNEVIGWA 422
Query: 429 RELCT 433
C+
Sbjct: 423 DHNCS 427
>sp|P22929|CARP_SACFI Acid protease OS=Saccharomycopsis fibuligera GN=PEP1 PE=3 SV=1
Length = 390
Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 143/357 (40%), Gaps = 68/357 (19%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
Y+ +IGTP Q L + +DT + WVP G T ++ +ST++K
Sbjct: 75 YLTTIEIGTPGQKLQVDVDTGSSDLWVPGQGTSSLYGT-YDHTKSTSYKK---------- 123
Query: 157 VPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGL 215
F+++YG + A + +Q+T+S+ + G FG AT V QGL
Sbjct: 124 --------DRSGFSISYGDGSSARGDWAQETVSIGGASITGLEFG---DATSQDV-GQGL 171
Query: 216 LGLGRGSLSLLAQTQNLY----------------QSTFSYCLPSFKALS----FSGSLRL 255
LG+G AQ+ N + ++ +S L S A S F GS
Sbjct: 172 LGIGLKGNEASAQSSNSFTYDNLPLKLKDQGLIDKAAYSLYLNSEDATSGSILFGGSDSS 231
Query: 256 GPIGQPKRIKYTPLLKNPRRSS---LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
G + + +S ++V L I G + T ++
Sbjct: 232 KYSGSLATLDLVNIDDEGDSTSGAVAFFVELEGIEAGSSSI----------TKTTYPALL 281
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVT 372
DSGT T + AP +++ R G+ S GG+ T S P F+G +T
Sbjct: 282 DSGT--TLIYAP--SSIASSIGREYGT--YSYSYGGYVT--SCDATGPDFKFSFNGKTIT 333
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+P NLL ++ G CL + S ++ + ++ + YD+ NS++G+A+
Sbjct: 334 VPFSNLLFQNSEGDSECLVGVLSS---GSNYYILGDAFLRSAYVYYDIDNSQVGIAQ 387
>sp|P10977|CARPV_CANAX Vacuolar aspartic protease OS=Candida albicans GN=APR1 PE=3 SV=3
Length = 419
Score = 58.9 bits (141), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 72/351 (20%), Positives = 138/351 (39%), Gaps = 58/351 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQ 153
Y +IGTP Q + +DT + WVP C + ++ S+T+K G +
Sbjct: 104 YFTEIQIGTPGQPFKVILDTGSSNLWVPSQDCTSLACFLHAKYDHDASSTYKVNGSE--- 160
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV--- 210
F++ YGS ++ +SQD +++ ++PG F G +
Sbjct: 161 ---------------FSIQYGSGSMEGYISQDVLTIGDLVIPGQDFAEATSEPGLAFAFG 205
Query: 211 PPQGLLGLGRGSLSL--------LAQTQNLYQS-TFSYCLPSFKALSFSGSL-RLGPIGQ 260
G+LGL ++S+ A Q L + F + L S G L G
Sbjct: 206 KFDGILGLAYDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDGGLATFGGYDA 265
Query: 261 P---KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+I + P+ RR + + V+ I +G ++ G ID+GT
Sbjct: 266 SLFQGKITWLPI----RRKAYWEVSFEGIGLGDEYAELHK----------TGAAIDTGTS 311
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDN 377
L +++ ++ ++G+ T + G + + P +TL F+G N TL +
Sbjct: 312 LITLP----SSLAEIINAKIGA--TKSWSGQYQVDCAKRDSLPDLTLTFAGYNFTLTPYD 365
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
++ + I+ P + L ++ + + + +YD+ + +G+A
Sbjct: 366 YILEVSGSCISVFTPMDFPQPIGD-LAIVGDAFLRKYYSIYDLDKNAVGLA 415
>sp|Q01294|CARP_NEUCR Vacuolar protease A OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=pep-4
PE=3 SV=2
Length = 396
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 142/358 (39%), Gaps = 76/358 (21%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCS-STVFNSAQSTTFKNLGCQAAQ 153
Y IGTP QT + +DT + WVP + C + C + S++S+T+K G
Sbjct: 85 YFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCGSIACYLHNKYESSESSTYKKNG----- 139
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP- 212
+F + YGS +++ +SQD +++ + F A S P
Sbjct: 140 -------------TSFKIEYGSGSLSGFVSQDRMTIGDITINDQLF-----AEATSEPGL 181
Query: 213 -------QGLLGLGRGSLSLLAQTQNLY---------QSTFSYCLPSFKALSFSGSLRLG 256
G+LGLG +++ T Y + FS+ L S + G
Sbjct: 182 AFAFGRFDGILGLGYDRIAVNGITPPFYKMVEQKLVDEPVFSFYLADQDGES---EVVFG 238
Query: 257 PIGQPK---RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
+ + + +I PL RR + + V+ AI G+ F G G I+D
Sbjct: 239 GVNKDRYTGKITTIPL----RRKAYWEVDFDAIGYGK----------DFAELEGHGVILD 284
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTL 373
+GT L P+ A ++ ++G+ + D + T TL +G N TL
Sbjct: 285 TGTSLIAL--PSQLA--EMLNAQIGAKKSWNGQFTIDCGKKSSLEDVTFTL--AGYNFTL 338
Query: 374 -PQDNLLIHSTAGSITCLAMAAAPDNVNSV--LNVIANMQQQNHRILYDVPNSRLGVA 428
P+D +L S +CL+ D V L ++ + + + +YD+ +G+A
Sbjct: 339 GPEDYIL----EASGSCLSTFMGMDMPAPVGPLAILGDAFLRKYYSIYDLGADTVGIA 392
>sp|P81498|PEPC_SUNMU Gastricsin OS=Suncus murinus GN=PGC PE=1 SV=2
Length = 389
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 88/401 (21%), Positives = 154/401 (38%), Gaps = 76/401 (18%)
Query: 55 EESVLEMLAK----DQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTL 110
E+ +LE K D A+ +VA + P+A +Y IGTP Q
Sbjct: 35 EQGLLEDFLKTNHYDPAQKYHFGDFSVAYE---PMA-----YMDASYFGEISIGTPPQNF 86
Query: 111 LMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGAC 167
L+ DT + WVP C + T FN QS+T+ G
Sbjct: 87 LVLFDTGSSNLWVPSVYCQSQACTGHARFNPNQSSTYSTNG------------------Q 128
Query: 168 AFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPPQ--GLLGLGRGSLS 224
F+L YGS ++ DT+++ VP FG Q G N + Q G++G+ SL+
Sbjct: 129 TFSLQYGSGSLTGFFGYDTMTVQNIKVPHQEFGLSQNEPGTNFIYAQFDGIMGMAYPSLA 188
Query: 225 L---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRR 275
+ + Q L FS+ L + + G++ G G + + P
Sbjct: 189 MGGATTALQGMLQEGALTSPVFSFYLSNQQGSQNGGAVIFG--GVDNSLYTGQIFWAPVT 246
Query: 276 SSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV-------FTRLVAPAYT 327
LY+ + + +G + G Q G I+D+GT F + A
Sbjct: 247 QELYWQIGVEEFLIGGQAT----GWCQ----QGCQAIVDTGTSLLTVPQQFMSALQQATG 298
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
A +D + + + ++ SL PT+T + +G+ LP ++++
Sbjct: 299 AQQDQYGQLAVNCNSIQSL-------------PTLTFIINGVQFPLPPSAYVLNTNGYCF 345
Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ P L ++ ++ +++ +YD+ N+R+G A
Sbjct: 346 LGVEPTYLPSQNGQPLWILGDVFLRSYYSVYDMGNNRVGFA 386
>sp|O42630|CARP_ASPFU Vacuolar protease A OS=Neosartorya fumigata (strain ATCC MYA-4609 /
Af293 / CBS 101355 / FGSC A1100) GN=pep2 PE=2 SV=1
Length = 398
Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/354 (22%), Positives = 138/354 (38%), Gaps = 63/354 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCS-STVFNSAQSTTFKNLGCQAAQ 153
Y +GTP Q + +DT + WVP + C + C ++S+ S+T+K G +
Sbjct: 85 YFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCSSIACFLHNKYDSSASSTYKANGTE--- 141
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP-- 211
F + YGS ++ +SQDT+ + V F G +
Sbjct: 142 ---------------FAIKYGSGELSGFVSQDTLQIGDLKVVKQDFAEATNEPGLAFAFG 186
Query: 212 -PQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
G+LGLG ++S+ + L + F++ L G G
Sbjct: 187 RFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGDTNK---EGDNSEASFGGV 243
Query: 262 KRIKYT-PLLKNP-RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+ YT L K P RR + + V+ AI +G V ++ G I+D+GT
Sbjct: 244 DKNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELE----------NTGIILDTGTSLI 293
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQ 375
L P+ A D+ + +G+ GF YS+ P +T +G N T+
Sbjct: 294 AL--PSTLA--DLLNKEIGAK------KGFTGQYSIECDKRDSLPDLTFTLAGHNFTIGP 343
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ + I+ P+ V L ++ + + +YD+ N+ +G+A+
Sbjct: 344 YDYTLEVQGSCISSFMGMDFPEPVGP-LAILGDAFLRKWYSVYDLGNNAVGLAK 396
>sp|Q689Z7|PEPC_MONDO Gastricsin OS=Monodelphis domestica GN=PGC PE=2 SV=1
Length = 391
Score = 57.4 bits (137), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/351 (21%), Positives = 136/351 (38%), Gaps = 56/351 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-TVFNSAQSTTFKNLGCQAAQ 153
Y IGTP Q L+ DT + WVP T C CS+ F+ +QS+TF N
Sbjct: 75 YFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQSQACSNHNRFSPSQSSTFTN------- 127
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV--- 210
G + L+YGS ++ L DT+++ +V FG + +
Sbjct: 128 -----------GGQTYTLSYGSGSLTVVLGYDTVTVQNIVVSNQEFGLSESEPTSPFYYS 176
Query: 211 PPQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
G+LG+ ++++ + Q L + FS+ + G L LG + P
Sbjct: 177 DFDGILGMAYPAMAVGNSPTVMQGMLQQGQLSEPIFSFYFSRQPTHQYGGELILGGV-DP 235
Query: 262 K----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+ +I +TP+ + + + + +G + + G I+D+GT
Sbjct: 236 QLYSGQITWTPV----TQEVYWQIGIEEFAIGNQATGW--------CSQGCQAIVDTGT- 282
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDN 377
F V Y + F + G+ G F + PTIT + +G LP
Sbjct: 283 FLLAVPQQYMS---AFLQATGAQQAQN--GDFMVNCNYIQDMPTITFVINGSQFPLPPSA 337
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ ++ + P L ++ ++ + + +YD+ N+R+G A
Sbjct: 338 YVFNNNGYCRLGIEATYLPSPNGQPLWILGDVFLKEYYSVYDMANNRVGFA 388
>sp|Q29079|PAG2_PIG Pregnancy-associated glycoprotein 2 OS=Sus scrofa GN=PAG2 PE=2 SV=1
Length = 420
Score = 57.4 bits (137), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 75/332 (22%), Positives = 123/332 (37%), Gaps = 73/332 (21%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG---CSSTVFNSAQSTTFKNLGCQAAQ 153
Y+ IGTP Q + DT + WVP C + FN + S+TF + G
Sbjct: 76 YVGNISIGTPPQQFSVVFDTGSSDLWVPSIYCKSKACVTHRSFNPSHSSTFHDRG----- 130
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP- 212
+ L YGS ++ L QDT+ + G FG ++ TG +
Sbjct: 131 -------------KSIKLEYGSGKMSGFLGQDTVRIGQLTSTGQAFGLSKEETGKAFEHA 177
Query: 213 --QGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKA----LSFSGSLRLGP 257
G+LGL S+++ L + + + F++ L S K + F G +
Sbjct: 178 IFDGILGLAYPSIAIKGTTTVIDNLKKQDQISEPVFAFYLSSDKEEGSVVMFGGVDKKYY 237
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G +K+ PL + +S + + L I RV+ P G I+D+GT
Sbjct: 238 KGD---LKWVPLTQ----TSYWQIALDRITCRGRVIGCP---------RGCQAIVDTGTS 281
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTL 373
+ A + + + F+ Y VP A P I + ++ +
Sbjct: 282 MLHGPSKAVAKIHSLIKH-------------FEKEYVVPCNARKALPDIVFTINNVDYPV 328
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
P + + C A A PD V ++ NV
Sbjct: 329 PAQAYIRKYV---VPCNARKALPDIVFTINNV 357
>sp|P07267|CARP_YEAST Saccharopepsin OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=PEP4 PE=1 SV=1
Length = 405
Score = 56.6 bits (135), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 71/354 (20%), Positives = 138/354 (38%), Gaps = 58/354 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQ 153
Y +GTP Q + +DT + WVP C + + ++ S+++K G +
Sbjct: 91 YYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTE--- 147
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV--- 210
F + YG+ ++ +SQDT+S+ +P F G +
Sbjct: 148 ---------------FAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFG 192
Query: 211 PPQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQ 260
G+LGLG ++S+ Q L + F++ L + K G G I +
Sbjct: 193 KFDGILGLGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDE 252
Query: 261 PK---RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
K I + P+ RR + + V I +G ++ G ID+GT
Sbjct: 253 SKFKGDITWLPV----RRKAYWEVKFEGIGLGDEYAELES----------HGAAIDTGTS 298
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDN 377
L + + ++ +G+ T D C + + P + F+G N T+ +
Sbjct: 299 LITLP----SGLAEMINAEIGAKKGWTGQYTLD-CNTRDNL-PDLIFNFNGYNFTIGPYD 352
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
+ + I+ + P+ V L ++ + + + +YD+ N+ +G+A+ +
Sbjct: 353 YTLEVSGSCISAITPMDFPEPVGP-LAIVGDAFLRKYYSIYDLGNNAVGLAKAI 405
>sp|Q9GMY3|PEPC_RHIFE Gastricsin OS=Rhinolophus ferrumequinum GN=PGC PE=2 SV=1
Length = 389
Score = 55.5 bits (132), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 88/414 (21%), Positives = 157/414 (37%), Gaps = 76/414 (18%)
Query: 42 PCSPFKPSKPLSWEESVLEMLAK----DQARLQFLSSLAVARKSVVPIASGRQITQSPTY 97
P K + E+ +LE K D A+ + +VA + P+A Y
Sbjct: 22 PLKKLKSLRETMKEKGLLEEFLKNHKYDPAQKYRYTDFSVAYE---PMA-----YMDAAY 73
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-TVFNSAQSTTFKNLGCQAAQC 154
IGTP Q L+ DT + WVP C C+ T FN +QS+T+ G
Sbjct: 74 FGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQTQACTGHTRFNPSQSSTYSTNG------ 127
Query: 155 KQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPPQ 213
F+L YGS ++ DT+++ + VP FG + G N V Q
Sbjct: 128 ------------QTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQ 175
Query: 214 --GLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
G++G+ SL++ + Q L FS+ L + + G++ G G
Sbjct: 176 FDGIMGMAYPSLAMGGATTALQGMLQEGALTSPVFSFYLSNQQGSQNGGAVIFG--GVDN 233
Query: 263 RIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV---- 317
+ + P LY+ + + +G + + G I+D+GT
Sbjct: 234 SLYQGQIYWAPVTQELYWQIGIEEFLIGGQASGW--------CSQGCQAIVDTGTSLLTV 285
Query: 318 ---FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLP 374
+ + A A D + + F C + + PT T + +G+ LP
Sbjct: 286 PQQYMSALLQATGAQEDQYGQF------------FVNCNYIQNL-PTFTFIINGVQFPLP 332
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ ++++ + P L ++ ++ +++ +YD+ N+R+G A
Sbjct: 333 PSSYILNNNGYCTVGVEPTYLPSQNGQPLWILGDVFLRSYYSVYDMGNNRVGFA 386
>sp|P24268|CATD_RAT Cathepsin D OS=Rattus norvegicus GN=Ctsd PE=1 SV=1
Length = 407
Score = 54.3 bits (129), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 88/364 (24%), Positives = 149/364 (40%), Gaps = 70/364 (19%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----VGCS-STVFNSAQSTTFKNLGCQA 151
Y IGTP Q + DT + WVP C + C +NS +S+T+
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVHHKYNSDKSSTY------- 131
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISL--ATDI----VPGYTFGCIQKA 205
V N T +F++ YGS +++ LSQDT+S+ +D+ V FG K
Sbjct: 132 -----VKNGT------SFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIFGEATKQ 180
Query: 206 TGN---SVPPQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSL 253
G + G+LG+G +S+ L + + + ++ FS+ L G L
Sbjct: 181 PGVVFIAAKFDGILGMGYPFISVNKVLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGEL 240
Query: 254 RLGPIGQPKRIKYTPL-LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
LG G R + L N R + + V++ + VG + G I+
Sbjct: 241 MLG--GTDSRYYHGELSYLNVTRKAYWQVHMDQLEVGSELTLC---------KGGCEAIV 289
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSG 368
D+GT + LV P V+++ ++ +G+ + Y +P P IT G
Sbjct: 290 DTGT--SLLVGPV-DEVKEL-QKAIGAVPLIQGE------YMIPCEKVSSLPIITFKLGG 339
Query: 369 MNVTL-PQDNLLIHSTAGSITCLA--MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
N L P+ +L S AG CL+ M + L ++ ++ + ++D +R+
Sbjct: 340 QNYELHPEKYILKVSQAGKTICLSGFMGMDIPPPSGPLWILGDVFIGCYYTVFDREYNRV 399
Query: 426 GVAR 429
G A+
Sbjct: 400 GFAK 403
>sp|P56272|PEP2B_GADMO Pepsin-2B OS=Gadus morhua PE=1 SV=1
Length = 324
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 145/341 (42%), Gaps = 54/341 (15%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCV--GCSS-TVFNSAQSTTFKNLGCQAAQCKQVPN 159
IGTP ++ + DT + WV + C CS+ F QS+T+ G K V
Sbjct: 20 IGTPPESFKVIFDTGSSNLWVSSSHCSAQACSNHNKFKPRQSSTYVETG------KTV-- 71
Query: 160 PTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG---NSVPPQGLL 216
+LTYG+ + L QDT+S+ P G Q G + P G+L
Sbjct: 72 ----------DLTYGTGGMRGILGQDTVSVGGGSDPNQELGESQTEPGPFQAAAPFDGIL 121
Query: 217 GLGRGSLSLLAQ--------TQNLYQST-FSYCLPSFKALSFSGSLRLGPIGQPKRIKYT 267
GL S++ +Q+L + FS+ L A +GS + +G YT
Sbjct: 122 GLAYPSIAAAGAVPVFDNMGSQSLVEKDLFSFYLSGGGA---NGSEVM--LGGVDNSHYT 176
Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
S++++ + A + + +D Q G I+D+GT +++VAP +
Sbjct: 177 --------GSIHWIPVTAEKYWQVALDGITVNGQTAACEGCQAIVDTGT--SKIVAPV-S 225
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
A+ ++ + +G++ + G C SV + P IT +G+ LP + A
Sbjct: 226 ALANIM-KDIGASENQGEMMG--NCASVQSL-PDITFTINGVKQPLPPSAYIEGDQAFCT 281
Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ L + P N S L + ++ +N+ +YD N+++G A
Sbjct: 282 SGLGSSGVPSNT-SELWIFGDVFLRNYYTIYDRTNNKVGFA 321
>sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2
SV=2
Length = 410
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 88/382 (23%), Positives = 145/382 (37%), Gaps = 69/382 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-CVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
+ V IG PA+ + +DT + W+ C C+ C+ + + C +C
Sbjct: 38 FFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVKCTEQRCA 97
Query: 156 QVPNP-----TCG-GGACAFNLTY-GSSTIAANLSQDTISLATDIVPGYT---FGCIQKA 205
+ CG C + + Y G S+I L D+ SL T FGC
Sbjct: 98 DLYADLRKPMKCGPKNQCHYGIQYVGGSSIGV-LIVDSFSLPASNGTNPTSIAFGCGYNQ 156
Query: 206 TGNS----VPPQGLLGLGRGSLSLLAQTQN---LYQSTFSYCLPSF-KALSFSGSLRLGP 257
N+ P G+LGLGRG ++LL+Q ++ + + +C+ S K F G ++
Sbjct: 157 GKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPT 216
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN----PTTGA--GTI 311
G + ++P+ + + S R+ G LQFN P + A I
Sbjct: 217 SG----VTWSPMNREHKHYS-----------PRQ------GTLQFNSNSKPISAAPMEVI 255
Query: 312 IDSGTVFTRLVAPAYTAVRDVFR-------------RRVGSNLTVTSLG--GFDTCYSVP 356
DSG +T Y A V + + LTV G T V
Sbjct: 256 FDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVK 315
Query: 357 IVAPTITLMFSGMN----VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV--LNVIANMQ 410
+++L F+ + + +P ++ LI S G + CL + S+ N+I +
Sbjct: 316 KCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHV-CLGILDGSKEHPSLAGTNLIGGIT 374
Query: 411 QQNHRILYDVPNSRLGVARELC 432
+ ++YD S LG C
Sbjct: 375 MLDQMVIYDSERSLLGWVNYQC 396
>sp|P18242|CATD_MOUSE Cathepsin D OS=Mus musculus GN=Ctsd PE=1 SV=1
Length = 410
Score = 53.5 bits (127), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 157/389 (40%), Gaps = 73/389 (18%)
Query: 74 SLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----V 129
S+ + K+ P++ + Y IGTP Q + DT + WVP C +
Sbjct: 56 SMQSSPKTTEPVSELLKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDI 115
Query: 130 GCS-STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTIS 188
C +NS +S+T+ V N T +F++ YGS +++ LSQDT+S
Sbjct: 116 ACWVHHKYNSDKSSTY------------VKNGT------SFDIHYGSGSLSGYLSQDTVS 157
Query: 189 L--------ATDI-VPGYTFGCIQKATG---NSVPPQGLLGLGRGSLSL---------LA 227
+ A I V FG K G + G+LG+G +S+ L
Sbjct: 158 VPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMGYPHISVNNVLPVFDNLM 217
Query: 228 QTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL-LKNPRRSSLYYVNLLAI 286
Q + + ++ FS+ L G L LG G + + L N R + + V++ +
Sbjct: 218 QQKLVDKNIFSFYLNRDPEGQPGGELMLG--GTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 287 RVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL 346
VG + G I+D+GT + LV P V+++ ++ +G+ +
Sbjct: 276 EVGNELTLC---------KGGCEAIVDTGT--SLLVGPV-EEVKEL-QKAIGAVPLIQGE 322
Query: 347 GGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIH-STAGSITCLA--MAAAPDNV 399
Y +P PT+ L G N L D ++ S G CL+ M
Sbjct: 323 ------YMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIPPP 376
Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ L ++ ++ ++ ++D N+R+G A
Sbjct: 377 SGPLWILGDVFIGSYYTVFDRDNNRVGFA 405
>sp|Q9N2D3|PEPC_CALJA Gastricsin OS=Callithrix jacchus GN=PGC PE=1 SV=1
Length = 388
Score = 53.5 bits (127), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 95/398 (23%), Positives = 162/398 (40%), Gaps = 65/398 (16%)
Query: 49 SKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQ 108
K L WE L+ D AR +S L+V+ + + + + Y IGTP Q
Sbjct: 35 EKGLLWE--FLKTHKHDPARKYRVSDLSVSYEPMDYMDA--------AYFGEISIGTPPQ 84
Query: 109 TLLMAMDTSNDAAWVPCTGC--VGCSS-TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
L+ DT + WVP C C+S + FN + S+T+ + G
Sbjct: 85 NFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSASSTYSSNG----------------- 127
Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPPQ--GLLGLGRGS 222
F+L YGS ++ DT+++ + VP FG + G N V Q G++GL +
Sbjct: 128 -QTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPA 186
Query: 223 LSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNP 273
LS+ + Q L FS+ L + + S G++ G G + + P
Sbjct: 187 LSMGGATTAMQGMLQEGALTSPVFSFYLSNQQGSS-GGAVIFG--GVDSSLYTGQIYWAP 243
Query: 274 RRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
LY+ + + +G G + G I+D+GT + L P
Sbjct: 244 VTQELYWQIGIEEFLIG--------GQASGWCSEGCQAIVDTGT--SLLTVPQ--QYMSA 291
Query: 333 FRRRVGSNLTVTSLGGF-DTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLA 391
F G+ G F C S+ + PT+T + +G+ LP + ++ S G T
Sbjct: 292 FLEATGAQ--EDEYGQFLVNCDSIQNL-PTLTFIINGVEFPLPPSSYIL-SNNGYCTVGV 347
Query: 392 MAAAPDNVNSV-LNVIANMQQQNHRILYDVPNSRLGVA 428
+ NS L ++ ++ +++ ++D+ N+R+G A
Sbjct: 348 EPTYLSSQNSQPLWILGDVFLRSYYSVFDLGNNRVGFA 385
>sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1
PE=2 SV=1
Length = 410
Score = 53.5 bits (127), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 84/382 (21%), Positives = 141/382 (36%), Gaps = 69/382 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-CVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
+ + IG PA++ + +DT + W+ C C C+ + T K + C + C
Sbjct: 38 FFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTCADSLCT 97
Query: 156 QV------PNPTCGGGACAFNLTYGSSTIAANLSQDTISLA-------TDIVPGYTFGCI 202
+ P C + + Y S+ L D SL+ T I G +
Sbjct: 98 DLYTDLGKPKRCGSQKQCDYVIQYVDSSSMGVLVIDRFSLSASNGTNPTTIAFGCGYDQG 157
Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQN---LYQSTFSYCLPSFKALSFSGSLRLGPIG 259
+K +P +LGL RG ++LL+Q ++ + + +C+ S G L G
Sbjct: 158 KKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISS----KGGGFLFFGDAQ 213
Query: 260 QPKR-IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA------GTII 312
P + +TP+ R YY G L F+ + A I
Sbjct: 214 VPTSGVTWTPM----NREHKYY-------------SPGHGTLHFDSNSKAISAAPMAVIF 256
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSN-------------LTVTSLGGFDTCYSVPIVA 359
DSG +T A Y A V + + S LTV G D ++ V
Sbjct: 257 DSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTV-CWKGKDKIVTIDEVK 315
Query: 360 P---TITLMFSGMN----VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV--LNVIANMQ 410
+++L F+ + + +P ++ LI S G + CL + S+ N+I +
Sbjct: 316 KCFRSLSLEFADGDKKATLEIPPEHYLIISQEGHV-CLGILDGSKEHLSLAGTNLIGGIT 374
Query: 411 QQNHRILYDVPNSRLGVARELC 432
+ ++YD S LG C
Sbjct: 375 MLDQMVIYDSERSLLGWVNYQC 396
>sp|Q8SQ41|PEPB_CANFA Pepsin B OS=Canis familiaris GN=PGB PE=1 SV=1
Length = 390
Score = 53.1 bits (126), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 75/349 (21%), Positives = 137/349 (39%), Gaps = 52/349 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-TVFNSAQSTTFKNLGCQAAQ 153
Y IGTP Q L+ DT + WVP T C CS+ FN ++S+T+++
Sbjct: 74 YFGEISIGTPPQNFLILFDTGSSNLWVPSTYCQSQACSNHNRFNPSRSSTYQS------- 126
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP- 212
+ L YG ++ L DT+++ ++ FG +
Sbjct: 127 -----------SEQTYTLAYGFGSLTVLLGYDTVTVQNIVIHNQLFGMSENEPNYPFYYS 175
Query: 213 --QGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
G+LG+ +L++ + Q L Q FS+ + G L LG G
Sbjct: 176 YFDGILGMAYSNLAVDNGPTVLQNMMQQGQLTQPIFSFYFSPQPTYEYGGELILG--GVD 233
Query: 262 KRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
+ ++ P +Y+ V + +G + + + G I+D+GT F
Sbjct: 234 TQFYSGEIVWAPVTREMYWQVAIDEFLIGNQATGL--------CSQGCQGIVDTGT-FPL 284
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGF-DTCYSVPIVAPTITLMFSGMNVTLPQDNLL 379
V Y D F + G+ + G F C S+ + PTIT + SG + LP +
Sbjct: 285 TVPQQYL---DSFVKATGAQQDQS--GNFVVNCNSIQSM-PTITFVISGSPLPLPPSTYV 338
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+++ + + P L ++ ++ + + ++D+ +R+G A
Sbjct: 339 LNNNGYCTLGIEVTYLPSPNGQPLWILGDVFLREYYTVFDMAANRVGFA 387
>sp|P07339|CATD_HUMAN Cathepsin D OS=Homo sapiens GN=CTSD PE=1 SV=1
Length = 412
Score = 52.8 bits (125), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 143/368 (38%), Gaps = 73/368 (19%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----VGCS-STVFNSAQSTTFKNLGCQA 151
Y IGTP Q + DT + WVP C + C +NS +S+T+
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTY------- 131
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTIS-----------LATDIVPGYTFG 200
V N T +F++ YGS +++ LSQDT+S L V FG
Sbjct: 132 -----VKNGT------SFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFG 180
Query: 201 CIQKATGNSVPP---QGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALS 248
K G + G+LG+ +S+ L Q + + Q+ FS+ L
Sbjct: 181 EATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQ 240
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G L LG K K + N R + + V+L + V + G
Sbjct: 241 PGGELMLGGT-DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLC---------KEGC 290
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI----VAPTITL 364
I+D+GT + +V P VR++ ++ +G+ + Y +P P ITL
Sbjct: 291 EAIVDTGT--SLMVGPV-DEVREL-QKAIGAVPLIQGE------YMIPCEKVSTLPAITL 340
Query: 365 MFSGMNVTL-PQDNLLIHSTAGSITCLA--MAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
G L P+D L S AG CL+ M + L ++ ++ + ++D
Sbjct: 341 KLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGRYYTVFDRD 400
Query: 422 NSRLGVAR 429
N+R+G A
Sbjct: 401 NNRVGFAE 408
>sp|D4DEN7|CARP_TRIVH Probable vacuolar protease A OS=Trichophyton verrucosum (strain HKI
0517) GN=PEP2 PE=3 SV=1
Length = 400
Score = 52.8 bits (125), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 81/360 (22%), Positives = 135/360 (37%), Gaps = 75/360 (20%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGC--SSTVFNSAQSTTFKNLGCQAA 152
Y IGTP QT + +DT + WVP C + C ST +SA ST KN
Sbjct: 87 YFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCSSIACFLHSTYDSSASSTYSKN------ 140
Query: 153 QCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP 212
F + YGS ++ +SQD++ + + F A S P
Sbjct: 141 -------------GTKFAIRYGSGSLEGFVSQDSVKIGDMTIKNQLF-----AEATSEPG 182
Query: 213 --------QGLLGLGRGSLSLLAQTQNLY---------QSTFSYCLPSFK------ALSF 249
G++G+G S+S+ T Y + FS+ L ++F
Sbjct: 183 LAFAFGRFDGIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKEGDQSVVTF 242
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
GS G I PL RR + + V+ AI +G AL+ G
Sbjct: 243 GGSDTKHFTGDMTTI---PL----RRKAYWEVDFDAISLGEDTA-----ALE-----NTG 285
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGM 369
I+D+GT L T + ++ ++G+ + D + P +T SG
Sbjct: 286 IILDTGTSLIALP----TTLAEMINTQIGATKSWNGQYTLDCAKRDSL--PDVTFTVSGH 339
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
N T+ + + + I+ P+ V L ++ + + + +YD+ +G+A+
Sbjct: 340 NFTIGPHDYTLEVSGTCISSFMGMDFPEPVGP-LAILGDSFLRRYYSVYDLGKGTVGLAK 398
>sp|Q28057|PAG2_BOVIN Pregnancy-associated glycoprotein 2 OS=Bos taurus GN=PAG2 PE=2 SV=1
Length = 376
Score = 52.0 bits (123), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 69/175 (39%), Gaps = 26/175 (14%)
Query: 51 PLSWEESVLEMLAKDQARLQFLSSLAV---ARKSVVPIASGRQITQSPTYIVRAKIGTPA 107
PL +++ E L + FL A S + I R + Y+ IGTP
Sbjct: 20 PLKKMKTLRETLREKNLLNNFLEEQAYRLSKNDSKITIHPLRNYLDT-AYVGNITIGTPP 78
Query: 108 QTLLMAMDTSNDAAWVPCTGCV--GC-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
Q + DT + WVPC C C + FN S++F+ +G
Sbjct: 79 QEFRVVFDTGSANLWVPCITCTSPACYTHKTFNPQNSSSFREVG---------------- 122
Query: 165 GACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPPQGLLGL 218
+ YGS I L DT+ + + P +FG + G +S+P G+LGL
Sbjct: 123 --SPITIFYGSGIIQGFLGSDTVRIGNLVSPEQSFGLSLEEYGFDSLPFDGILGL 175
>sp|P81214|CARP_SYNRA Syncephapepsin OS=Syncephalastrum racemosum GN=SPSR PE=1 SV=1
Length = 395
Score = 51.6 bits (122), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 72/179 (40%), Gaps = 32/179 (17%)
Query: 68 RLQFLSSLAVARKSVVPIASGRQITQSPT----------------YIVRAKIGTPAQTLL 111
R F + AR + +P G+ I +S Y +GTPAQ++
Sbjct: 45 RAIFRAEKKYARHTAIP-EQGKTIVKSAASGTGSVPMTDVDYDVEYYATVSVGTPAQSIK 103
Query: 112 MAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNL 171
+ DT + W T C C S F+ +S+T+K +G ++ Q + G N+
Sbjct: 104 LDFDTGSSDLWFSSTLCTSCGSKSFDPTKSSTYKKVG-KSWQISYGDGSSASGITATDNV 162
Query: 172 TYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
G I TI LAT ++ G I G+LGLG ++S +A T+
Sbjct: 163 ELGGLKITGQ----TIELATRESSSFSSGAI----------DGILGLGFDTISTVAGTK 207
>sp|P16476|PEPE_CHICK Embryonic pepsinogen OS=Gallus gallus PE=2 SV=1
Length = 383
Score = 51.6 bits (122), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 79/353 (22%), Positives = 144/353 (40%), Gaps = 63/353 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GC-SSTVFNSAQSTTFKNLGCQAAQ 153
Y IGTP Q + DT + WVP C C S +FN +QS+T+K+ G
Sbjct: 76 YYGTISIGTPPQDFTVVFDTGSSNLWVPSVSCTSPACQSHQMFNPSQSSTYKSTGQN--- 132
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN---SV 210
++ YG+ + + DT+++A+ + FG G V
Sbjct: 133 ---------------LSIHYGTGDMEGTVGCDTVTVASLMDTNQLFGLSTSEPGQFFVYV 177
Query: 211 PPQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCL---PSFKALSFSGSLRLGPI 258
G+LGLG SL+ + L Q+ FS L P + F G
Sbjct: 178 KFDGILGLGYPSLAADGITPVFDNMVNESLLEQNLFSVYLSREPMGSMVVFGGIDESYFT 237
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
G I + P+ + +++ +I V ++ + ++G IID+G
Sbjct: 238 G---SINWIPV----SYQGYWQISMDSIIVNKQ---------EIACSSGCQAIIDTG--- 278
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
T LVA + + D+ + VG+N + G + S + P + + G+ +P L
Sbjct: 279 TSLVAGPASDINDI-QSAVGANQ--NTYGEYSVNCSHILAMPDVVFVIGGIQYPVPA--L 333
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
G TC+ ++ N ++ L ++ ++ + + ++D N+R+G+A+ +
Sbjct: 334 AYTEQNGQGTCM---SSFQNSSADLWILGDVFIRVYYSIFDRANNRVGLAKAI 383
>sp|D4B385|CARP_ARTBC Probable vacuolar protease A OS=Arthroderma benhamiae (strain ATCC
MYA-4681 / CBS 112371) GN=PEP2 PE=3 SV=1
Length = 400
Score = 50.8 bits (120), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/360 (22%), Positives = 136/360 (37%), Gaps = 75/360 (20%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGC--SSTVFNSAQSTTFKNLGCQAA 152
Y IGTP QT + +DT + WVP C + C ST +SA ST KN
Sbjct: 87 YFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCSSIACFLHSTYDSSASSTYSKN------ 140
Query: 153 QCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP 212
F + YGS ++ +S+D++ + + F A S P
Sbjct: 141 -------------GTKFAIRYGSGSLEGFVSRDSVKIGDMTIKKQLF-----AEATSEPG 182
Query: 213 --------QGLLGLGRGSLSLLAQTQNLY---------QSTFSYCLPSFK------ALSF 249
G++G+G S+S+ T Y + FS+ L ++F
Sbjct: 183 LAFAFGRFDGIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKDGDQSVVTF 242
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
GS G I PL RR + + V+ AI +G+ AL+ G
Sbjct: 243 GGSDTNHFTGDMTTI---PL----RRKAYWEVDFDAISLGKDTA-----ALE-----NTG 285
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGM 369
I+D+GT L T + ++ ++G+ + D + P +T SG
Sbjct: 286 IILDTGTSLIALP----TTLAEMINTQIGATKSWNGQYTLDCAKRDSL--PDVTFTLSGH 339
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
N T+ + + + I+ P+ V L ++ + + + +YD+ +G+A+
Sbjct: 340 NFTIGPHDYTLEVSGTCISSFMGMDFPEPVGP-LAILGDSFLRRYYSVYDLGKGTVGLAK 398
>sp|P80209|CATD_BOVIN Cathepsin D OS=Bos taurus GN=CTSD PE=1 SV=2
Length = 390
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 83/366 (22%), Positives = 147/366 (40%), Gaps = 71/366 (19%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----VGC-SSTVFNSAQSTTFKNLGCQA 151
Y IGTP Q + DT + WVP C + C + +NS +S+T+
Sbjct: 59 YYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSSTY------- 111
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD---------IVPGYTFGCI 202
V N T F++ YGS +++ LSQDT+S+ + V TFG
Sbjct: 112 -----VKNGT------TFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEA 160
Query: 203 QKATGN---SVPPQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFS 250
K G + G+LG+ +S+ L Q + + ++ FS+ L
Sbjct: 161 IKQPGVVFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPG 220
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
G L LG K + + + N R + + +++ + VG + G
Sbjct: 221 GELMLGGT-DSKYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLTVC---------KGGCEA 270
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMF 366
I+D+GT + +V P VR++ ++ +G+ + Y +P P +T+
Sbjct: 271 IVDTGT--SLIVGPV-EEVREL-QKAIGAVPLIQGE------YMIPCEKVSSLPEVTVKL 320
Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLA--MAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G + L P+D L S A + CL+ M L ++ ++ + ++D +
Sbjct: 321 GGKDYALSPEDYALKVSQAETTVCLSGFMGMDIPPPGGPLWILGDVFIGRYYTVFDRDQN 380
Query: 424 RLGVAR 429
R+G+A
Sbjct: 381 RVGLAE 386
>sp|Q9GMY4|PEPC_SORUN Gastricsin OS=Sorex unguiculatus GN=PGC PE=2 SV=1
Length = 389
Score = 50.4 bits (119), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 72/348 (20%), Positives = 139/348 (39%), Gaps = 50/348 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y IGTP Q L+ DT + WVP C + T FN ++S+T+ G
Sbjct: 73 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTGHARFNPSKSSTYSTNG----- 127
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPP 212
F+L YGS ++ DT++L VP FG Q G N V
Sbjct: 128 -------------QTFSLQYGSGSLTGFFGYDTMTLQNIKVPHQEFGLSQNEPGENFVYA 174
Query: 213 Q--GLLGLGRGSLSLLAQT---QNLYQS------TFSYCLPSFKALSFSGSLRLGPIGQP 261
Q G++G+ +L++ T Q + Q+ FS+ L + ++ G++ G G
Sbjct: 175 QFDGIMGMAYPTLAMGGATTALQGMLQAGALDSPVFSFYLSNQQSSKDGGAVVFG--GVD 232
Query: 262 KRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
+ + P LY+ + + +G + + G I+D+GT
Sbjct: 233 NSLYTGQIFWTPVTQELYWQIGVEQFLIGGQATGW--------CSQGCQAIVDTGTSLLT 284
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLI 380
+ +A++ ++ + + C ++ + PT+T + +G+ L ++
Sbjct: 285 VPQQYLSALQQATGAQLDQDGQMVV-----NCNNIQNL-PTLTFVINGVQFPLLPSAYVL 338
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
++ + P L ++ ++ +++ +YD+ N+R+G A
Sbjct: 339 NNNGYCTLGVEPTYLPSPTGQPLWILGDVFLRSYYSVYDMGNNRVGFA 386
>sp|Q9GMY2|PEPC_RABIT Gastricsin OS=Oryctolagus cuniculus GN=PGC PE=2 SV=1
Length = 388
Score = 50.4 bits (119), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 75/347 (21%), Positives = 130/347 (37%), Gaps = 49/347 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y IGTP+Q L+ DT + WVP C + T FN ++S+TF
Sbjct: 73 YFGEISIGTPSQNFLVLFDTGSSNLWVPSVYCQSEACTTHNRFNPSKSSTFYTYD----- 127
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNS---V 210
F+L YGS ++ DT ++ VP FG + G +
Sbjct: 128 -------------QTFSLEYGSGSLTGFFGYDTFTIQNIEVPNQEFGLSETEPGTNFLYA 174
Query: 211 PPQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
G++GL SLS+ + Q + S FS+ L S + G+L LG G
Sbjct: 175 EFDGIMGLAYPSLSVGDATPALQGMVQDGTISSSVFSFYLSSQQGTD-GGALVLG--GVD 231
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ + P LY+ ++G I A + + G I+D+GT + L
Sbjct: 232 SSLYTGDIYWAPVTRELYW------QIGIDEFLISSEASGW-CSQGCQAIVDTGT--SLL 282
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIH 381
P + D+ + G F PT T + +G+ L +++
Sbjct: 283 TVPQ-EYMSDLLE---ATGAQENEYGEFLVDCDSTESLPTFTFVINGVEFPLSPSAYILN 338
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ + + L ++ ++ + + ++D+ N+R+G A
Sbjct: 339 TDGQCMVGVEATYLSSQDGEPLWILGDVFLRAYYSVFDMANNRVGFA 385
>sp|Q9DEX3|CATD_CLUHA Cathepsin D OS=Clupea harengus GN=ctsd PE=1 SV=1
Length = 396
Score = 50.1 bits (118), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 75/356 (21%), Positives = 138/356 (38%), Gaps = 60/356 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----VGC-SSTVFNSAQSTTFKNLGCQA 151
Y +GTP Q + DT + W+P C + C +N A+S+T+ G +
Sbjct: 76 YYGEIGLGTPVQMFTVVFDTGSSNLWLPSIHCSFTDIACLLHHKYNGAKSSTYVKNGTE- 134
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
F + YGS +++ LSQD+ ++ +V FG K G +
Sbjct: 135 -----------------FAIQYGSGSLSGYLSQDSCTIGDIVVEKQLFGEAIKQPGVAFI 177
Query: 212 P---QGLLGLGRGSLS---------LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG--- 256
G+LG+ +S ++ + + Q+ FS+ L G L LG
Sbjct: 178 AAKFDGILGMAYPRISVDGVPPVFDMMMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTD 237
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
P Y P+ R + + +++ + +G ++ G I+D+G
Sbjct: 238 PKYYTGDFNYVPV----TRQAYWQIHMDGMSIGSQLTLC---------KDGCEAIVDTG- 283
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQD 376
T L+ VR ++ +G+ + D C VP + PTI+ G +L +
Sbjct: 284 --TSLITGPPAEVR-ALQKAIGAIPLIQGEYMID-CKKVPTL-PTISFNVGGKTYSLTGE 338
Query: 377 N-LLIHSTAGSITCLA--MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+L S G CL+ M L ++ ++ + ++D ++R+G A+
Sbjct: 339 QYVLKESQGGKTICLSGLMGLEIPPPAGPLWILGDVFIGQYYTVFDRESNRVGFAK 394
>sp|P39898|PLM1_PLAFA Plasmepsin-1 OS=Plasmodium falciparum PE=1 SV=2
Length = 452
Score = 50.1 bits (118), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 92/247 (37%), Gaps = 58/247 (23%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGC-SSTVFNSAQSTTFKNLGCQAAQ 153
Y A+IG Q DT + WVP C +GC + +++S +S T++ G +
Sbjct: 139 YYGEAQIGDNKQKFAFIFDTGSANLWVPSAQCNTIGCKTKNLYDSNKSKTYEKDGTKVE- 197
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP- 212
+ Y S T++ S+D +++A P Y F I+ N P
Sbjct: 198 -----------------MNYVSGTVSGFFSKDIVTIANLSFP-YKF--IEVTDTNGFEPA 237
Query: 213 ------QGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
G++GLG LS+ L + Q+ F++ LP G L +G
Sbjct: 238 YTLGQFDGIVGLGWKDLSIGSVDPVVVELKNQNKIEQAVFTFYLPF--DDKHKGYLTIG- 294
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G R L LY+ L + G V+ A I+DSGT
Sbjct: 295 -GIEDRFYEGQLTYEKLNHDLYWQVDLDLHFGNLTVE------------KATAIVDSGT- 340
Query: 318 FTRLVAP 324
+ + AP
Sbjct: 341 -SSITAP 346
>sp|Q7KQM4|PLM1_PLAF7 Plasmepsin-1 OS=Plasmodium falciparum (isolate 3D7) GN=PF14_0076
PE=2 SV=1
Length = 452
Score = 50.1 bits (118), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 92/247 (37%), Gaps = 58/247 (23%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGC-SSTVFNSAQSTTFKNLGCQAAQ 153
Y A+IG Q DT + WVP C +GC + +++S +S T++ G +
Sbjct: 139 YYGEAQIGDNKQKFAFIFDTGSANLWVPSAQCNTIGCKTKNLYDSNKSKTYEKDGTKVE- 197
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP- 212
+ Y S T++ S+D +++A P Y F I+ N P
Sbjct: 198 -----------------MNYVSGTVSGFFSKDIVTIANLSFP-YKF--IEVTDTNGFEPA 237
Query: 213 ------QGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
G++GLG LS+ L + Q+ F++ LP G L +G
Sbjct: 238 YTLGQFDGIVGLGWKDLSIGSVDPVVVELKNQNKIEQAVFTFYLPF--DDKHKGYLTIG- 294
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G R L LY+ L + G V+ A I+DSGT
Sbjct: 295 -GIEDRFYEGQLTYEKLNHDLYWQVDLDLHFGNLTVE------------KATAIVDSGT- 340
Query: 318 FTRLVAP 324
+ + AP
Sbjct: 341 -SSITAP 346
>sp|Q64411|PEPC_CAVPO Gastricsin OS=Cavia porcellus GN=PGC PE=2 SV=1
Length = 394
Score = 49.3 bits (116), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 81/350 (23%), Positives = 143/350 (40%), Gaps = 55/350 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-TVFNSAQSTTFKNLGCQAAQ 153
Y + +GTP Q+ + DT + WVP C + C++ T FN S+T+
Sbjct: 79 YFGQISLGTPPQSFQVLFDTGSSNLWVPSVYCSSLACTTHTRFNPRDSSTY-------VA 131
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNS---V 210
Q +F+L YG+ ++ DT+++ VP FG + G+
Sbjct: 132 TDQ-----------SFSLEYGTGSLTGVFGYDTMTIQDIQVPKQEFGLSETEPGSDFVYA 180
Query: 211 PPQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
G+LGLG LS L + L QS FS L S + S G L LG + +
Sbjct: 181 EFDGILGLGYPGLSEGGATTAMQGLLREGALSQSLFSVYLGSQQG-SDEGQLILGGVDES 239
Query: 262 ---KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
I +TP+ + LY+ I + ++D G+ + G I+D+GT
Sbjct: 240 LYTGDIYWTPVTQE-----LYW----QIGIEGFLID---GSASGWCSRGCQGIVDTGT-- 285
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
+ L P + + +G+ F +C S+ + PT+T + SG+ L
Sbjct: 286 SLLTVP--SDYLSTLVQAIGAEENEYGE-YFVSCSSIQDL-PTLTFVISGVEFPLSPSAY 341
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
++ + L + ++ ++ +++ +YD+ N+R+G A
Sbjct: 342 ILSGENYCMVGLESTYVSPGGGEPVWILGDVFLRSYYSVYDLANNRVGFA 391
>sp|P46925|PLM2_PLAFA Plasmepsin-2 OS=Plasmodium falciparum PE=1 SV=1
Length = 453
Score = 49.3 bits (116), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 94/243 (38%), Gaps = 56/243 (23%)
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GC-SSTVFNSAQSTTFKNLGC 149
Q+ + A++G Q +DT + WVP C GC + +++S++S T++ G
Sbjct: 136 QNIMFYGDAEVGDNQQPFTFILDTGSANLWVPSVKCTTAGCLTKHLYDSSKSRTYEKDGT 195
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+ + Y S T++ S+D +++ +P Y F I+ N
Sbjct: 196 KVE------------------MNYVSGTVSGFFSKDLVTVGNLSLP-YKF--IEVIDTNG 234
Query: 210 VPP-------QGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSL 253
P G+LGLG LS+ L + + F++ LP +G L
Sbjct: 235 FEPTYTASTFDGILGLGWKDLSIGSVDPIVVELKNQNKIENALFTFYLPVHD--KHTGFL 292
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
+G G +R PL LY+ L VG +++ A I+D
Sbjct: 293 TIG--GIEERFYEGPLTYEKLNHDLYWQITLDAHVGNIMLE------------KANCIVD 338
Query: 314 SGT 316
SGT
Sbjct: 339 SGT 341
>sp|Q4LAL9|CATD_CANFA Cathepsin D OS=Canis familiaris GN=CTSD PE=2 SV=1
Length = 410
Score = 48.5 bits (114), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 145/367 (39%), Gaps = 73/367 (19%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----VGCS-STVFNSAQSTTFKNLGCQA 151
Y IGTP Q + DT + WVP C + C +NS +S+T+
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSSTY------- 131
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTIS---------LATDIVPGYTFGCI 202
V N T +F++ YGS +++ LSQDT+S LA V TFG
Sbjct: 132 -----VKNGT------SFDIHYGSGSLSGYLSQDTVSVPCKSALSGLAGIKVERQTFGEA 180
Query: 203 QKATGNSVPP---QGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFS 250
K G + G+LG+ +S+ L Q + + ++ FS+ L
Sbjct: 181 TKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIFSFYLNRDPNAQPG 240
Query: 251 GSLRLGPIGQPKRIKYTPL-LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
G L LG G + PL N R + + V++ + VG + G
Sbjct: 241 GELMLG--GTDSKYYKGPLSYLNVTRKAYWQVHMEQVDVGSSLTLC---------KGGCE 289
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI----VAPTITLM 365
I+D+GT + +V P VR++ ++ +G+ + Y +P P +TL
Sbjct: 290 AIVDTGT--SLIVGPV-DEVREL-QKAIGAVPLIQGE------YMIPCEKVSTLPDVTLK 339
Query: 366 FSGMNVTL-PQDNLLIHSTAGSITCLA--MAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
G L +D L S G CL+ M L ++ ++ + ++D
Sbjct: 340 LGGKLYKLSSEDYTLKVSQGGKTICLSGFMGMDIPPPGGPLWILGDVFIGCYYTVFDRDQ 399
Query: 423 SRLGVAR 429
+R+G+A+
Sbjct: 400 NRVGLAQ 406
>sp|P03955|PEPC_MACFU Gastricsin (Fragment) OS=Macaca fuscata fuscata GN=PGC PE=1 SV=2
Length = 377
Score = 48.5 bits (114), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 75/355 (21%), Positives = 140/355 (39%), Gaps = 65/355 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-TVFNSAQSTTFKNLGCQAAQ 153
Y IGTP Q L+ DT + WVP C C+S + FN ++S+T+ G
Sbjct: 62 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSESSTYSTNG----- 116
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPP 212
F+L YGS ++ DT+++ + VP FG + G N V
Sbjct: 117 -------------QTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYA 163
Query: 213 Q--GLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
Q G++GL +LS+ + Q L FS L + S G++ G G
Sbjct: 164 QFDGIMGLAYPTLSVDGATTAMQGMVQEGALTSPIFSVYLSDQQGSS-GGAVVFG--GVD 220
Query: 262 KRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV--- 317
+ + P LY+ + + +G + + G I+D+GT
Sbjct: 221 SSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGW--------CSEGCQAIVDTGTSLLT 272
Query: 318 ----FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTL 373
+ + A A D + + + + ++ +L PT+T + +G+ L
Sbjct: 273 VPQQYMSALLQATGAQEDEYGQFLVNCNSIQNL-------------PTLTFIINGVEFPL 319
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
P + ++++ + + L ++ ++ +++ +YD+ N+R+G A
Sbjct: 320 PPSSYILNNNGYCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFA 374
>sp|Q9D7R7|PEPC_MOUSE Gastricsin OS=Mus musculus GN=Pgc PE=2 SV=1
Length = 392
Score = 48.5 bits (114), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 80/349 (22%), Positives = 142/349 (40%), Gaps = 50/349 (14%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-TVFNSAQSTTFKNLGCQAA 152
+Y IGTP Q L+ DT + WV C C++ T +N ++S+T+ G
Sbjct: 75 SYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQSEACTTHTRYNPSKSSTYYTQG---- 130
Query: 153 QCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVP 211
F+L YG+ ++ DT+ + + VP FG + G N V
Sbjct: 131 --------------QTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTNFVY 176
Query: 212 PQ--GLLGLGRGSLSLLAQTQNLY---------QSTFSYCLPSFKALSFSGSLRLGPIGQ 260
Q G++GL LS T L Q F L S + S G + G G
Sbjct: 177 AQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQQG-SNGGQIVFG--GV 233
Query: 261 PKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+ + L P LY+ + + +G + A + ++G I+D+GT +
Sbjct: 234 DENLYTGELTWIPVTQELYWQITIDDFLIGNQ-------ASGWCSSSGCQGIVDTGT--S 284
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLL 379
LV PA + + +G+ F +C SV + PT+T + +G+ L + +
Sbjct: 285 LLVMPA--QYLNELLQTIGAQEGEYGQ-YFVSCDSVSSL-PTLTFVLNGVQFPLSPSSYI 340
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
I + L + L ++ ++ +++ ++D+ N+R+G+A
Sbjct: 341 IQEEGSCMVGLESLSLNAESGQPLWILGDVFLRSYYAVFDMGNNRVGLA 389
>sp|C5FS55|CARP_ARTOC Vacuolar protease A OS=Arthroderma otae (strain ATCC MYA-4605 / CBS
113480) GN=PEP2 PE=3 SV=1
Length = 395
Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 74/355 (20%), Positives = 129/355 (36%), Gaps = 70/355 (19%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCS-STVFNSAQSTTFKNLGCQAAQ 153
Y IGTP QT + +DT + WVP C + C + ++S+ S+TF G
Sbjct: 87 YFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCSSIACFLHSTYDSSASSTFTRNG----- 141
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP- 212
+F + YGS ++ +SQD + + + F A S P
Sbjct: 142 -------------TSFAIRYGSGSLEGFVSQDNVQIGDMKIKNQLF-----AEATSEPGL 183
Query: 213 -------QGLLGLGRGSLSLLAQTQNLY---------QSTFSYCLPSFKALSFSGSLRLG 256
G+LG+G ++S+ T Y + FS+ L G +
Sbjct: 184 AFAFGRFDGILGMGYDTISVNKITPPFYKMVEQGLVDEPVFSFYLGDTNK---DGDQSVV 240
Query: 257 PIGQPKRIKYTPLLKNP--RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
G + YT + RR + + V AI +G+ + G I+D+
Sbjct: 241 TFGGADKSHYTGDITTIPLRRKAYWEVEFNAITLGKDTATLD----------NTGIILDT 290
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLP 374
G T L+A T + + T+ C + P +T SG N T+
Sbjct: 291 G---TSLIALPTTYAEMIISKSWNGQYTI-------DCAKRDSL-PDLTFTLSGHNFTIG 339
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ + + I+ P+ V L ++ + + +YD+ +G+A+
Sbjct: 340 PYDYTLEVSGTCISSFMGMDFPEPVGP-LAILGDSFLRRWYSVYDLGKGTVGLAK 393
>sp|P20142|PEPC_HUMAN Gastricsin OS=Homo sapiens GN=PGC PE=1 SV=1
Length = 388
Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 60/135 (44%), Gaps = 24/135 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-TVFNSAQSTTFKNLGCQAAQ 153
Y IGTP Q L+ DT + WVP C C+S + FN ++S+T+ G
Sbjct: 73 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSESSTYSTNG----- 127
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPP 212
F+L YGS ++ DT+++ + VP FG + G N V
Sbjct: 128 -------------QTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYA 174
Query: 213 Q--GLLGLGRGSLSL 225
Q G++GL +LS+
Sbjct: 175 QFDGIMGLAYPALSV 189
>sp|O60020|ASPR1_PHARH Aspartic protease OS=Phaffia rhodozyma GN=pr1 PE=1 SV=1
Length = 405
Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/386 (23%), Positives = 161/386 (41%), Gaps = 49/386 (12%)
Query: 59 LEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT-QSPTYIVRAKIGTPAQTLLMAMDTS 117
L+ L K +A+ Q+ A AR +T Q + I Q+ + DT
Sbjct: 56 LDWLKKTKAQAQYKHKQANARLHSKRATGASVLTDQGSESLWTGPITIGGQSFTVDWDTG 115
Query: 118 NDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSST 177
+ WVP + CSS N+ T + G + + + + G G+ A Y +
Sbjct: 116 SSDLWVPSS---ACSSAACNAHHKYTLTSTGKKQSGTFSI---SYGDGSSASGPVYKDNV 169
Query: 178 IAANLSQDTISLATDIVPGYTFGCI--QKATGNSVPPQGLLGLGRGSLSLLAQTQNLY-- 233
+A+ L AT V FG + + ++ +S P G+ GLG +L+ L+ T +
Sbjct: 170 VASGLQ------ATSQV----FGAVTSESSSFSSDPSDGISGLGWPALAQLSGTSYFWSL 219
Query: 234 --QSTFSYCLPSFKALSFSGSLRLGPIGQPK---RIKYTPLLKNPRRSSLYYVNLLAIRV 288
Q T + + SF+ + + L LG I I YTP+ + + + + L + V
Sbjct: 220 INQGTVTSPVFSFRLATTNSELYLGGINSAHYTGAITYTPVTQK----AYWTIALGGVSV 275
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
+ NP+ + IID+GT LV V ++ + GS + G
Sbjct: 276 NGAAI---------NPSV-SSAIIDTGTT---LVYGPTAGVAALYAKIPGSASMADTYGS 322
Query: 349 -FDTCYSVPIVA-PTITLMFSGMNVTLPQDNLLIHS-TAGSITCLAMAAAPDNVNSVLNV 405
+ Y+ P A PT+ L F G + ++P + + ++GS C+ + + +
Sbjct: 323 DYQGYYTFPCSAVPTVALTFGGSSFSVPTSAFNLGTVSSGSKQCVGGIVGQGDGSW---L 379
Query: 406 IANMQQQNHRILYDVPNSRLGVAREL 431
+ ++ Q +YDV N+R+G A+ +
Sbjct: 380 VGDVFLQGVYSIYDVGNARVGFAKTV 405
>sp|O93428|CATD_CHIHA Cathepsin D OS=Chionodraco hamatus GN=ctsd PE=1 SV=2
Length = 396
Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 80/353 (22%), Positives = 138/353 (39%), Gaps = 54/353 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----VGCS-STVFNSAQSTTFKNLGCQA 151
Y +GTP Q + DT + WVP C + C +NS +S+T+
Sbjct: 76 YYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIACLLHHKYNSGKSSTY------- 128
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
V N T AF + YGS +++ LSQDT ++ + FG K G +
Sbjct: 129 -----VKNGT------AFAIQYGSGSLSGYLSQDTCTIGDLAIDSQLFGEAIKQPGVAFI 177
Query: 212 P---QGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
G+LG+ +S+ + + + Q+ FS+ L G L LG
Sbjct: 178 AAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGT- 236
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
PK N R + + + + ++ VG ++ T G I+DSG T
Sbjct: 237 DPKYYTGDFNYVNVTRQAYWQIRVDSMAVGDQLSLC---------TGGCEAIVDSG---T 284
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLL 379
L+ V+ ++ +G+ + C +VP + P I+ G TL + +
Sbjct: 285 SLITGPSVEVK-ALQKAIGA-FPLIQGEYMVNCDTVPSL-PVISFTVGGQVYTLTGEQYI 341
Query: 380 IHST-AGSITCLAMAAAPD--NVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ T AG CL+ D L ++ ++ + ++D +R+G A+
Sbjct: 342 LKVTQAGKTMCLSGFMGLDIPAPAGPLWILGDVFMGQYYTVFDRDANRVGFAK 394
>sp|Q9MZS8|CATD_SHEEP Cathepsin D (Fragment) OS=Ovis aries GN=CTSD PE=1 SV=1
Length = 365
Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 128/326 (39%), Gaps = 69/326 (21%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----VGCS-STVFNSAQSTTFKNLGCQA 151
Y IGTP Q + DT + WVP C + C +NS +S+T+
Sbjct: 54 YYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWVHHKYNSDKSSTY------- 106
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD---------IVPGYTFGCI 202
V N T F++ YGS +++ LSQDT+S+ + V TFG
Sbjct: 107 -----VKNGT------TFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEA 155
Query: 203 QKATGN---SVPPQGLLGLGRGSLSL---------LAQTQNLYQSTFSYCLPSFKALSFS 250
K G + G+LG+ +S+ L + + + ++ FS+ L
Sbjct: 156 IKQPGVVFIAAKFDGILGMAYPRISVNNVLPVFDNLMRQKLVDKNVFSFFLNRDPKAQPG 215
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
L LG K + + N R + + +++ + VG + G
Sbjct: 216 EELMLGGT-DSKYYRGSLTYHNVTRQAYWQIHMDQLDVGSSLTVC---------KGGCEA 265
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMF 366
I+D+GT + +V P VR++ + + ++ Y +P P +TL
Sbjct: 266 IVDTGT--SLMVGPV-DEVRELHK-------AIGAVPLIQGEYMIPCEKVSSLPQVTLKL 315
Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLA 391
G + TL P+D L S AG+ CL+
Sbjct: 316 GGKDYTLSPEDYTLKVSQAGTTVCLS 341
>sp|Q03168|ASPP_AEDAE Lysosomal aspartic protease OS=Aedes aegypti GN=AAEL006169 PE=1
SV=2
Length = 387
Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 57/131 (43%), Gaps = 26/131 (19%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGC----VGC-SSTVFNSAQSTTFKNLGCQAAQCKQV 157
IGTP Q+ + DT + WVP C + C +N+ +S+TF+ G
Sbjct: 74 IGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSSTFEKNG--------- 124
Query: 158 PNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG---NSVPPQG 214
AF++ YGS +++ LS DT+ L V TF G + G
Sbjct: 125 ---------TAFHIQYGSGSLSGYLSTDTVGLGGVSVTKQTFAEAINEPGLVFVAAKFDG 175
Query: 215 LLGLGRGSLSL 225
+LGLG S+S+
Sbjct: 176 ILGLGYSSISV 186
>sp|Q28755|PAG1_SHEEP Pregnancy-associated glycoprotein 1 OS=Ovis aries PE=2 SV=1
Length = 382
Score = 47.0 bits (110), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 79/356 (22%), Positives = 143/356 (40%), Gaps = 69/356 (19%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-----CSS-TVFNSAQSTTFKNLGCQ 150
Y+ IGTP Q + DT + VP C+ CS F QS+TF+
Sbjct: 71 YVGNITIGTPPQEFQVVFDTGSSDLLVPSINCLSPTKRPCSKQDKFKHHQSSTFR----- 125
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV 210
N T F + +GS T+ ++ DT+ + + FG I + +
Sbjct: 126 ------FTNDT-------FRIYFGSGTMRGFVAHDTVRIGDLVSTDQPFGLIFLESWLDI 172
Query: 211 PPQGLLGLG------RGSLSLLAQTQN---LYQSTFSYCLPSFK----ALSFSGSLRLGP 257
P G+LGL G++ + + +N + F++ L K + F G
Sbjct: 173 PFDGILGLNYPKISFSGAIPIFDKLKNEGAFSEPVFAFYLNKDKQEGSVVMFGGVDHRYY 232
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G+ + + PL+ +P S + L I + R+V+ + G ++ +G
Sbjct: 233 KGE---LNWVPLI-HPGEWS---IPLDRISMRRKVIAC---------SGGCEALVGTG-- 274
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDN 377
T L+ T V ++ ++ +G+ T F +C +V P+I +G+N +P
Sbjct: 275 -TSLILGPRTVVENI-QKHIGA--TQQCFEYFVSCSAV-YALPSIVFTINGINYPVPPQA 329
Query: 378 LLIHSTAGSITCLA----MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
L+ + G C + A P N +L ++ + + ++D N R+G+AR
Sbjct: 330 YLVKDSRGQ--CYSPFQVNRANPSAENWIL---GDVFLRRYFSVFDRGNDRIGLAR 380
>sp|P16228|CATE_RAT Cathepsin E OS=Rattus norvegicus GN=Ctse PE=1 SV=3
Length = 398
Score = 46.6 bits (109), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 24/135 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCSST-VFNSAQSTTFKNLGCQAAQ 153
Y IG+P+Q + DT + WVP C C + VF+ +QS+T+ +G
Sbjct: 80 YFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCTSPACKAHPVFHPSQSSTYMEVGNH--- 136
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNS---V 210
F++ YG+ ++ + D +S+ V G FG K G +
Sbjct: 137 ---------------FSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNA 181
Query: 211 PPQGLLGLGRGSLSL 225
G+LGLG SL++
Sbjct: 182 EFDGILGLGYPSLAV 196
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.321 0.134 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 151,149,241
Number of Sequences: 539616
Number of extensions: 6012316
Number of successful extensions: 14735
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 31
Number of HSP's successfully gapped in prelim test: 123
Number of HSP's that attempted gapping in prelim test: 14553
Number of HSP's gapped (non-prelim): 173
length of query: 433
length of database: 191,569,459
effective HSP length: 120
effective length of query: 313
effective length of database: 126,815,539
effective search space: 39693263707
effective search space used: 39693263707
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.9 bits)
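
Note on the statistics block above: the printed effective lengths, search space, and bit scores can be reproduced from the other values in the report. The sketch below is a minimal check, not BLAST's own code. It assumes that the second "Lambda K H" line (0.267, 0.0410) is the gapped parameter set and the first (0.321, 0.134) the ungapped one; the report does not label them, but this reading is consistent with the printed bit scores. It uses the standard Karlin-Altschul conversions; the function and constant names are illustrative only.

```python
import math

# Values copied from the report (query/database summary and statistics block).
QUERY_LEN = 433            # "length of query"
DB_LETTERS = 191_569_459   # "length of database"
DB_SEQS = 539_616          # "Number of sequences in database"
HSP_LEN = 120              # "effective HSP length"

# Assumption: first Lambda/K pair is ungapped, second is gapped.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.321, 0.134
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits: search_space * 2^(-bits)."""
    return search_space * 2.0 ** (-bits)


# Effective lengths: subtract the HSP length once from the query
# and once per database sequence.
eff_query = QUERY_LEN - HSP_LEN
eff_db = DB_LETTERS - DB_SEQS * HSP_LEN
search_space = eff_query * eff_db

if __name__ == "__main__":
    print("effective length of query:   ", eff_query)
    print("effective length of database:", eff_db)
    print("effective search space:      ", search_space)
    # Raw score 112 is the first hit shown in this section.
    bits = bit_score(112, LAMBDA_GAPPED, K_GAPPED)
    print("bits for raw score 112:       %.1f" % bits)
    print("approximate Expect:           %.0e" % expect(bits, search_space))
```

Because the hits were scored with compositional matrix adjustment, the Expect value computed this way should be read as an approximation of the reported one rather than an exact reproduction.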