BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 012359
(465 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1
PE=1 SV=1
Length = 437
Score = 150 bits (378), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/385 (31%), Positives = 176/385 (45%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++LS GTP Q I+DTGS L+W C C C + P F P+ SSS L
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ---PCTQCFNQSTPIFNPQGSSSFSTLP 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + +S C+ Y YG G T+G +ETL +
Sbjct: 150 CSSQLCQAL----------------SSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS 193
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
IPN GC AG+ G GRG SLPSQL++ KFSYC+ ++ S
Sbjct: 194 VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG---SSTPS 250
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L+L S ++ T G +P N ++ + + +YY+ L ++VG R+ +
Sbjct: 251 NLLL---GSLANSVTAG---SP---NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAF 301
Query: 322 TLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L+ +G GG I+DSGTT T+ ++ + EF+SQ+ N G+ + G CF
Sbjct: 302 ALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI----NLPVVNGSSS--GFDLCF 355
Query: 381 DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
P + + P +HF GG ++ LP ENYF G +CL + + + I G
Sbjct: 356 QTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNG-LICLAMGSSSQGMS----IFG 409
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q QN V YD N + F C
Sbjct: 410 NIQQQNMLVVYDTGNSVVSFASAQC 434
>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2
PE=1 SV=1
Length = 438
Score = 149 bits (376), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 119/384 (30%), Positives = 171/384 (44%), Gaps = 48/384 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++++ GTP I+DTGS L+W C C C S P F P+ SSS L
Sbjct: 94 GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE---PCTQCFSQPTPIFNPQDSSSFSTLP 150
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C++ C + E+ +C Y YG G T+G +ET
Sbjct: 151 CESQYCQDLPSETCNNNECQ----------------YTYGYGDGSTTQGYMATETFTFET 194
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+PN GC AG+ G G G SLPSQL + +FSYC+ S+ ++ S
Sbjct: 195 SSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYG---SSSPS 251
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L L + +S + + T NP+ YYY+ L+ ITVGG + +
Sbjct: 252 TLALGSAASGVPEGSPSTTLIHSSLNPT---------YYYITLQGITVGGDNLGIPSSTF 302
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
L DG GG I+DSGTT T++ + + +A F Q+ N E+ +GL CF
Sbjct: 303 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI----NLPTV--DESSSGLSTCFQ 356
Query: 382 VPGE-KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
P + T PE+ + F GG + L +N EG +CL + + + G S I GN
Sbjct: 357 QPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEG-VICLAMGSSSQL--GIS-IFGN 411
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q Q V YDL+N + F C
Sbjct: 412 IQQQETQVLYDLQNLAVSFVPTQC 435
>sp|Q9LS40|ASPG1_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana
GN=ASPG1 PE=1 SV=1
Length = 500
Score = 129 bits (325), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 117/412 (28%), Positives = 171/412 (41%), Gaps = 55/412 (13%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
+ N T+ T TT + +S G Y + GTP + + +LDTGS + W C
Sbjct: 135 VYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE--- 191
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
C C P F P SS+ + L C P+CS + E+ CR S C
Sbjct: 192 PCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLL--ETSACR---------SNKCL---- 236
Query: 182 SYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRG 232
Y V YG G T G ++T+ N I N +GC G+ G G G
Sbjct: 237 -YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCG----HDNEGLFTGAAGLLGLGGG 291
Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
S+ +Q+ FSYCL+ D+ ++SSL +S + G P + N +
Sbjct: 292 VLSITNQMKATSFSYCLVDR---DSGKSSSLDF-----NSVQLGGGDATAPLLRNKKI-- 341
Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
+YYVGL +VGG++V + +D G+GG I+D GT T + + + L D
Sbjct: 342 ----DTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRD 397
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
F+ V + G+ +++ C+D T P + HF GG + LP +NY
Sbjct: 398 AFLKLTVNLKK-----GSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLI 452
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + C S SII GN Q Q + YDL +G C
Sbjct: 453 PVDDSGTFCFAFA---PTSSSLSII-GNVQQQGTRITYDLSKNVIGLSGNKC 500
>sp|Q6XBF8|CDR1_ARATH Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
Length = 437
Score = 109 bits (273), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 124/442 (28%), Positives = 197/442 (44%), Gaps = 70/442 (15%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NP + S Q L + + S+ R H T T +++S+S G Y +++S GTP
Sbjct: 47 NPMETSSQRLRNAIHRSVNRVFHF------TEKDNTPQPQIDLTSNS-GEYLMNVSIGTP 99
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P I I DTGS L+W C C C + P F PK SS+ + + C + +C+ + ++
Sbjct: 100 PFPIMAIADTGSDLLWTQCA---PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQ 156
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRI-----IPNFL 212
A+ C SY + YG + T+G +TL L + + N +
Sbjct: 157 ------------ASCSTNDNTC-SYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203
Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQL--NLD-KFSYCL--LSHKFDDTTRTSSL 263
+GC ++ ++ +GI G G G SL QL ++D KFSYCL L+ K D T++
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKI--- 260
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
N +++ +G+ TP + + + +YY+ L+ I+VG ++++
Sbjct: 261 ---NFGTNAIVSGSGVVSTPLI------AKASQETFYYLTLKSISVGSKQIQYSGSDSES 311
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
G I+DSGTT T + E + L D S + + + +GL C+
Sbjct: 312 ---SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKK------QDPQSGLSLCYSAT 362
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQ 442
G+ P + +HF GA+V L N F V E VC G PS I GN
Sbjct: 363 GDL--KVPVITMHFD-GADVKLDSSNAFVQVSE-DLVCFAF------RGSPSFSIYGNVA 412
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
N+ V YD ++ + FK C
Sbjct: 413 QMNFLVGYDTVSKTVSFKPTDC 434
>sp|Q3EBM5|ASPR1_ARATH Probable aspartic protease At2g35615 OS=Arabidopsis thaliana
GN=At2g35615 PE=3 SV=1
Length = 447
Score = 107 bits (266), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 113/409 (27%), Positives = 166/409 (40%), Gaps = 79/409 (19%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + +S++ GTPP + I DTGS L W C C+ C P F K SS+ +
Sbjct: 83 GEFFMSITIGTPPIKVFAIADTGSDLTWVQCK---PCQQCYKENGPIFDKKKSSTYKSEP 139
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + C + N+ IC Y YG ++G +ET+++ +
Sbjct: 140 CDSRNCQALSSTERGCDESNN-----------IC-KYRYSYGDQSFSKGDVATETVSIDS 187
Query: 206 R-----IIPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLN--- 241
P + GC G+ G T SL SQL
Sbjct: 188 ASGSPVSFPGTVFGC------------GYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI 235
Query: 242 LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAERNAFSVYY 300
KFSYC LSHK T TS + L S S K +G+ TP V+ + YY
Sbjct: 236 SKKFSYC-LSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-------TYY 287
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDG-----NGGTIVDSGTTFTFMAPELFEPLADEFV 355
Y+ L I+VG +++ + DG +G I+DSGTT T + F+ +
Sbjct: 288 YLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVE 347
Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
+ + + G L CF + G PE+ +HF GA+V L N F +
Sbjct: 348 ESVTGAKRVSDPQGL-----LSHCFKSGSAEIG-LPEITVHFT-GADVRLSPINAFVKLS 400
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E VCL++V E + I GNF ++ V YDL + + F+ C
Sbjct: 401 E-DMVCLSMVPTTEVA-----IYGNFAQMDFLVGYDLETRTVSFQHMDC 443
>sp|Q9LHE3|ASPG2_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana
GN=ASPG2 PE=2 SV=1
Length = 470
Score = 104 bits (259), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 155/385 (40%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS +VW C CK C P F P S S +
Sbjct: 129 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ---PCKLCYKQSDPVFDPAKSGSYTGVS 185
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C I + C Y V+YG G T+G ETL
Sbjct: 186 CGSSVCDRIENSGCHSGGCR----------------YEVMYGDGSYTKGTLALETLTFAK 229
Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
++ N +GC + AG+ G G G S QL+ F YCL+S D T
Sbjct: 230 TVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST-- 287
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL+ + G ++ P V NP +YYVGL+ + VGG R+ +
Sbjct: 288 -GSLVFGR-----EALPVGASWVPLVRNPRAPS------FYYVGLKGLGVGGVRIPLPDG 335
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G+GG ++D+GT T + + D F SQ N RA G C
Sbjct: 336 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTA---NLPRASGVSIFD---TC 389
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F G +TLP N+ V + C AS I+G
Sbjct: 390 YDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFA----ASPTGLSIIG 445
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + V +D N +GF +C
Sbjct: 446 NIQQEGIQVSFDGANGFVGFGPNVC 470
>sp|Q9LZL3|PCS1L_ARATH Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
Length = 453
Score = 91.7 bits (226), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 156/386 (40%), Gaps = 42/386 (10%)
Query: 98 PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
PPQ I ++DTGS L W C + + + + +F P SSS + C +P C
Sbjct: 82 PPQNISMVIDTGSELSWLRCN-----RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTR 136
Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI-IPNFLVGC- 215
+ + C+ + ++C + L + +EG +E + N N + GC
Sbjct: 137 DFLIPASCDSD---------KLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCM 187
Query: 216 SVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
+S P G+ G RG S SQ+ KFSYC+ T +L S
Sbjct: 188 GSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-----SGTDDFPGFLLLGDS 242
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
+ + T L YTP + S V Y V L I V G+ + + L D G G
Sbjct: 243 NFT--WLTPLNYTPLIRI-STPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAG 299
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
T+VDSGT FTF+ ++ L F+++ + C+ + + S
Sbjct: 300 QTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRS 359
Query: 390 -----FPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILG 439
P + L F+G AE+ + + VG S C T + + G + ++G
Sbjct: 360 GILHRLPTVSLVFEG-AEIAVSGQPLLYRVPHLTVGNDSVYCFTF-GNSDLMGMEAYVIG 417
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ QN ++E+DL+ R+G C
Sbjct: 418 HHHQQNMWIEFDLQRSRIGLAPVECD 443
>sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana
GN=At1g65240 PE=1 SV=2
Length = 475
Score = 76.3 bits (186), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/404 (23%), Positives = 158/404 (39%), Gaps = 71/404 (17%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSS 142
S G Y + G+PP+ +DTGS ++W C +C K + ++ F SS+S
Sbjct: 70 SVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTS 129
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI--CPSYLVLYGSGLTEGIALSET 200
+ +GC + CS+I + S +C C ++V ++G + +
Sbjct: 130 KKVGCDDDFCSFI---------------SQSDSCQPALGCSYHIVYADESTSDGKFIRDM 174
Query: 201 LNLPNR--------IIPNFLVGCSVLSSRQPA-------GIAGFGRGKTSLPSQLNLDKF 245
L L + + GC S Q G+ GFG+ TS+ SQL
Sbjct: 175 LTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGD 234
Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
+ + SH D+ I G S K T TP V N ++Y V L
Sbjct: 235 AKRVFSHCLDNVKGGG--IFAVGVVDSPKVKT----TPMVPN---------QMHYNVMLM 279
Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
+ V G + + + NGGTIVDSGTT + L++ L + +++
Sbjct: 280 GMDVDGTSLDLPRSIVR-----NGGTIVDSGTTLAYFPKVLYDSLIETILAR-------- 326
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL--- 422
+ + + CF +FP + F+ ++T+ +Y + E C
Sbjct: 327 QPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTL-EEELYCFGWQ 385
Query: 423 --TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ TD + I+LG+ + N V YDL N+ +G+ C
Sbjct: 386 AGGLTTDERSE---VILLGDLVLSNKLVVYDLDNEVIGWADHNC 426
>sp|P18242|CATD_MOUSE Cathepsin D OS=Mus musculus GN=Ctsd PE=1 SV=1
Length = 410
Score = 62.8 bits (151), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 99/422 (23%), Positives = 163/422 (38%), Gaps = 85/422 (20%)
Query: 60 LHIKNPQTKTTTTTTTTTTTNIS----SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
L +K P TK + ++ TT +S ++ Y + GTPPQ + DTGS +W
Sbjct: 46 LILKGPITKYSMQSSPKTTEPVSELLKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWV 105
Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
P + CK + C W+HH + +D+ KN
Sbjct: 106 PSIH---CKILD-----------------IAC------WVHH-----KYNSDKSSTYVKN 134
Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSV------LSSRQPA----- 224
T S+ + YGSG G +T+++P + + G V +++QP
Sbjct: 135 GT----SFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 225 ----GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
GI G G S+ + L + F + D + L D + G T
Sbjct: 191 AKFDGILGMGYPHISVNNVLPV--FDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGT 248
Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
+ + + Y+ V + ++ VG + LTL + G IVD+GT+
Sbjct: 249 DSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE--------LTLCK-GGCEAIVDTGTSL- 298
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
V + + + +A+GA L ++ + +P EK S P + L GG
Sbjct: 299 -------------LVGPVEEVKELQKAIGAVPL--IQGEYMIPCEKVSSLPTVYLKL-GG 342
Query: 401 AEVTLPVENYFAVVGE-GSAVCLT--VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
L + Y V + G +CL+ + D GP ILG+ + +YY +D N R+
Sbjct: 343 KNYELHPDKYILKVSQGGKTICLSGFMGMDIPPPSGPLWILGDVFIGSYYTVFDRDNNRV 402
Query: 458 GF 459
GF
Sbjct: 403 GF 404
>sp|Q4LAL9|CATD_CANFA Cathepsin D OS=Canis familiaris GN=CTSD PE=2 SV=1
Length = 410
Score = 62.0 bits (149), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 148/391 (37%), Gaps = 81/391 (20%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + DTGS +W P + CK + C
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIH---CKLLD-----------------IAC- 117
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
WIHH+ + KN T S+ + YGSG G +T+++P +
Sbjct: 118 -----WIHHKYNSGKSST-----YVKNGT----SFDIHYGSGSLSGYLSQDTVSVPCKSA 163
Query: 209 PNFLVGCSV------LSSRQPA---------GIAGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ L G V +++QP GI G + S+ + L + F +
Sbjct: 164 LSGLAGIKVERQTFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPV--FDNLMQQKL 221
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
+ + L D + + G T + + P Y+ V + ++ VG
Sbjct: 222 VEKNIFSFYLNRDPNAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEQVDVGSS- 280
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
LTL + G IVD+GT+ V + + R +A+GA L
Sbjct: 281 -------LTLCK-GGCEAIVDTGTSL--------------IVGPVDEVRELQKAIGAVPL 318
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA-VCLT--VVTDREA 430
++ + +P EK + P++ L GG L E+Y V +G +CL+ + D
Sbjct: 319 --IQGEYMIPCEKVSTLPDVTLKL-GGKLYKLSSEDYTLKVSQGGKTICLSGFMGMDIPP 375
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
GGP ILG+ + YY +D R+G Q
Sbjct: 376 PGGPLWILGDVFIGCYYTVFDRDQNRVGLAQ 406
>sp|Q9LX20|ASPL1_ARATH Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana
GN=At5g10080 PE=1 SV=1
Length = 528
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 93/402 (23%), Positives = 148/402 (36%), Gaps = 80/402 (19%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS--------KIPSFIPKLSSSSRL 144
+ GTP LDTGS+L+W PC N QC +S+ + + P SS+S++
Sbjct: 104 IDIGTPSVSFLVALDTGSNLLWIPC-NCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKV 162
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTEGIAL 197
C + C DC ++ + CP Y V Y SG L E I L
Sbjct: 163 FLCSHKLCD-------SASDC--------ESPKEQCP-YTVNYLSGNTSSSGLLVEDI-L 205
Query: 198 SETLNLPNRII-------PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDK 244
T N NR++ ++GC L P G+ G G + S+PS L+
Sbjct: 206 HLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAG 265
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
S FD+ D G S TPF+ + N +S Y VG+
Sbjct: 266 LMRNSFSLCFDEEDSGRIYFGDMGPSIQQS-------TPFLQ----LDNNKYSG-YIVGV 313
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
+G + + + T +DSG +FT++ E++ +A E +
Sbjct: 314 EACCIGNSCL----------KQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKN 363
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG-SAVCLT 423
+ E C++ E P +KL F + + +G CL
Sbjct: 364 FEGVSWEY------CYESSAEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLP 415
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ + G +G M+ Y + +D N +LG+ C+
Sbjct: 416 ISPSGQEGIGS---IGQNYMRGYRMVFDRENMKLGWSPSKCQ 454
>sp|P07339|CATD_HUMAN Cathepsin D OS=Homo sapiens GN=CTSD PE=1 SV=1
Length = 412
Score = 56.6 bits (135), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 93/393 (23%), Positives = 148/393 (37%), Gaps = 83/393 (21%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + DTGS +W P + CK + C
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIH---CKLLD-----------------IAC- 117
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
WIHH + +D+ KN T S+ + YGSG G +T+++P +
Sbjct: 118 -----WIHH-----KYNSDKSSTYVKNGT----SFDIHYGSGSLSGYLSQDTVSVPCQSA 163
Query: 209 --PNFLVGCSVL------SSRQPA---------GIAGFGRGKTSLPSQLNLDKFSYCLLS 251
+ L G V +++QP GI G + S+ + L + F +
Sbjct: 164 SSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPV--FDNLMQQ 221
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
D + L D + + G T + + Y+ V L ++ V
Sbjct: 222 KLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVAS 281
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
LTL ++G IVD+GT+ V + + R +A+GA
Sbjct: 282 G--------LTLCKEGCE-AIVDTGTSL--------------MVGPVDEVRELQKAIGAV 318
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLT--VVTDR 428
L ++ + +P EK + P + L GG L E+Y V G +CL+ + D
Sbjct: 319 PL--IQGEYMIPCEKVSTLPAITLKL-GGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDI 375
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
GP ILG+ + YY +D N R+GF +
Sbjct: 376 PPPSGPLWILGDVFIGRYYTVFDRDNNRVGFAE 408
>sp|Q8RVH5|7SBG2_SOYBN Basic 7S globulin 2 OS=Glycine max PE=1 SV=1
Length = 433
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/398 (21%), Positives = 147/398 (36%), Gaps = 58/398 (14%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH-----YQCKYCSSSKIPSFIPKLS 139
S G + +L TP +P ++D + +W C H YQ +C S++ +
Sbjct: 50 STGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCEQHYSSKTYQAPFCHSTQC-----SRA 104
Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALS 198
++ + L C H + C + P+ ++ L ++ + G T+ +
Sbjct: 105 NTHQCLSCPAASRPGCHKNT--CGLMSTNPITQQTGLGELGQDVLAIHATQGSTQQLG-- 160
Query: 199 ETLNLPNRIIPNFLVGCS---VLSSRQP---AGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
P +P FL C+ +L P G+AG G SLP+QL S+ L H
Sbjct: 161 -----PLVTVPQFLFSCAPSFLLQKGLPRNIQGVAGLGHAPISLPNQLA----SHFGLQH 211
Query: 253 KFDD-----TTRTSSLILDNGSS-----HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
+F T +LI + + H+ L +TP P Y V
Sbjct: 212 QFTTCLSRYPTSKGALIFGDAPNNMQQFHNQDIFHDLAFTPLTVTPQGE--------YNV 263
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
+ I + V +K + +GGT++ + T + L++ F Q+ K
Sbjct: 264 RVSSIRINQHSVFPPNKISSTIVGSSGGTMISTSTPHMVLQQSLYQAFTQVFAQQLEKQA 323
Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF-KGGAEVTLPVENYFAVVGEGSAVC 421
+ + A GL CF+ K ++P + L K V V + C
Sbjct: 324 Q----VKSVAPFGL--CFN--SNKINAYPSVDLVMDKPNGPVWRISGEDLMVQAQPGVTC 375
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
L V+ + LG Q++ + +DL R+GF
Sbjct: 376 LGVMNGGMQPRA-EVTLGTRQLEEKLMVFDLARSRVGF 412
>sp|Q9DEX3|CATD_CLUHA Cathepsin D OS=Clupea harengus GN=ctsd PE=1 SV=1
Length = 396
Score = 53.5 bits (127), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 98/472 (20%), Positives = 173/472 (36%), Gaps = 104/472 (22%)
Query: 12 FIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTT 71
++F F + + +I + R DS N+ L++ T +L +
Sbjct: 5 YLFLFAVFAWTSDAIVRIPLKKFRSIRRTLSDSGLNVEQLLAG--TNSLQHNQGFPSSNA 62
Query: 72 TTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI 131
T T + + YG + GTP Q+ + DTGS +W P +CS + I
Sbjct: 63 PTPETLKNYMDAQYYG----EIGLGTPVQMFTVVFDTGSSNLWLPSI------HCSFTDI 112
Query: 132 PSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL 191
+ HH+ + KN T+ + + YGSG
Sbjct: 113 ACLL--------------------HHKYNGAKSST-----YVKNGTE----FAIQYGSGS 143
Query: 192 TEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA---------GIAGFGRGKTS------- 235
G ++ + + ++ L G ++ +QP GI G + S
Sbjct: 144 LSGYLSQDSCTIGDIVVEKQLFGEAI---KQPGVAFIAAKFDGILGMAYPRISVDGVPPV 200
Query: 236 ---LPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
+ SQ +++ + ++ DT L+L G + T Y P
Sbjct: 201 FDMMMSQKKVEQNVFSFYLNRNPDTEPGGELLL--GGTDPKYYTGDFNYVP-------VT 251
Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
R A Y+ + + +++G Q LTL +DG IVD+GT+ P
Sbjct: 252 RQA---YWQIHMDGMSIGSQ--------LTLCKDGCE-AIVDTGTSLITGPP-------- 291
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
+ R +A+GA L ++ + + +K + P + + GG +L E Y
Sbjct: 292 ------AEVRALQKAIGAIPL--IQGEYMIDCKKVPTLPTISFNV-GGKTYSLTGEQYVL 342
Query: 413 VVGEGSA-VCLTVVTDRE--ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
+G +CL+ + E GP ILG+ + YY +D + R+GF +
Sbjct: 343 KESQGGKTICLSGLMGLEIPPPAGPLWILGDVFIGQYYTVFDRESNRVGFAK 394
>sp|P24268|CATD_RAT Cathepsin D OS=Rattus norvegicus GN=Ctsd PE=1 SV=1
Length = 407
Score = 53.1 bits (126), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 97/424 (22%), Positives = 160/424 (37%), Gaps = 88/424 (20%)
Query: 60 LHIKNPQTKTTTTTTTTTTTNIS----SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
L +K P TK + ++ T +S ++ Y + GTPPQ + DTGS +W
Sbjct: 46 LILKGPITKYSMQSSPRTKEPVSELLKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWV 105
Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
P + CK + C W+HH + +D+ KN
Sbjct: 106 PSIH---CKLLD-----------------IAC------WVHH-----KYNSDKSSTYVKN 134
Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVL------SSRQPA----- 224
T S+ + YGSG G +T+++P + + L G V +++QP
Sbjct: 135 GT----SFDIHYGSGSLSGYLSQDTVSVPCK---SDLGGIKVEKQIFGEATKQPGVVFIA 187
Query: 225 ----GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
GI G G S+ L + F + + + L D + G T
Sbjct: 188 AKFDGILGMGYPFISVNKVLPV--FDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGT 245
Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
+ + + Y+ V + ++ VG + LTL + G IVD+GT+
Sbjct: 246 DSRYYHGELSYLNVTRKAYWQVHMDQLEVGSE--------LTLCK-GGCEAIVDTGTSL- 295
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
V + + + +A+GA L ++ + +P EK S P + GG
Sbjct: 296 -------------LVGPVDEVKELQKAIGAVPL--IQGEYMIPCEKVSSLPIITFKL-GG 339
Query: 401 AEVTLPVENYFAVVGE-GSAVCLT--VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
L E Y V + G +CL+ + D GP ILG+ + YY +D R+
Sbjct: 340 QNYELHPEKYILKVSQAGKTICLSGFMGMDIPPPSGPLWILGDVFIGCYYTVFDREYNRV 399
Query: 458 GFKQ 461
GF +
Sbjct: 400 GFAK 403
>sp|P80209|CATD_BOVIN Cathepsin D OS=Bos taurus GN=CTSD PE=1 SV=2
Length = 390
Score = 52.0 bits (123), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 147/392 (37%), Gaps = 83/392 (21%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + DTGS +W P + CK + C
Sbjct: 59 YYGEIGIGTPPQCFTVVFDTGSANLWVPSIH---CKLLD-----------------IAC- 97
Query: 149 NPKCSWIHHESIQCRDCNDEPLAT-SKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI 207
W H R N + +T KN T ++ + YGSG G +T+++P
Sbjct: 98 -----WTH------RKYNSDKSSTYVKNGT----TFDIHYGSGSLSGYLSQDTVSVPCNP 142
Query: 208 IPNFLVGCSVLSS------RQPA---------GIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
+ G +V +QP GI G + S+ + L + F +
Sbjct: 143 SSSSPGGVTVQRQTFGEAIKQPGVVFIAAKFDGILGMAYPRISVNNVLPV--FDNLMQQK 200
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D + L D + + G T + + + Y+ + + ++ VG
Sbjct: 201 LVDKNVFSFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQLDVGSS 260
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
LT+ + G IVD+GT+ V + + R +A+GA
Sbjct: 261 --------LTVCK-GGCEAIVDTGTSL--------------IVGPVEEVRELQKAIGAVP 297
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG-SAVCLT--VVTDRE 429
L ++ + +P EK S PE+ + GG + L E+Y V + + VCL+ + D
Sbjct: 298 L--IQGEYMIPCEKVSSLPEVTVKL-GGKDYALSPEDYALKVSQAETTVCLSGFMGMDIP 354
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
GGP ILG+ + YY +D R+G +
Sbjct: 355 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAE 386
>sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2
SV=2
Length = 410
Score = 51.6 bits (122), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 86/400 (21%), Positives = 148/400 (37%), Gaps = 61/400 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + ++++ G P + +DTGS L W C Y C C+ + P+L + +
Sbjct: 36 GHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCD--YPCINCNKVPHGLYKPELKYAVK--- 90
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP-- 204
C +C+ ++ + + C + C Y + Y G + G+ + ++ +LP
Sbjct: 91 CTEQRCADLYADLRKPMKCGPK-----NQC-----HYGIQYVGGSSIGVLIVDSFSLPAS 140
Query: 205 NRIIPNFLV-GCSVLSSRQ------PA-GIAGFGRGKTSLPSQLNLDK-FSYCLLSHKFD 255
N P + GC + P GI G GRGK +L SQL + +L H
Sbjct: 141 NGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCIS 200
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ D + T+G+T++P ++ + + I+ V
Sbjct: 201 SKGKGFLFFGD-----AKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEV- 254
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
I DSG T+T+ A + + S + K + + E
Sbjct: 255 ----------------IFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEV-KEKDRA 297
Query: 376 LRPCFD------VPGEKTGSFPELKLHFKGG---AEVTLPVENYFAVVGEGSAVCLTVV- 425
L C+ E F L L F G A + +P E+Y + EG VCL ++
Sbjct: 298 LTVCWKGKDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGH-VCLGILD 356
Query: 426 -TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S + ++G M + V YD LG+ C
Sbjct: 357 GSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQC 396
>sp|Q9MZS8|CATD_SHEEP Cathepsin D (Fragment) OS=Ovis aries GN=CTSD PE=1 SV=1
Length = 365
Score = 49.7 bits (117), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 85/370 (22%), Positives = 140/370 (37%), Gaps = 81/370 (21%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + DTGS +W P + CK + C
Sbjct: 54 YYGEIGIGTPPQCFTVVFDTGSANLWVPSIH---CKLLD-----------------IAC- 92
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
W+HH + +D+ KN T ++ + YGSG G +T+++P
Sbjct: 93 -----WVHH-----KYNSDKSSTYVKNGT----TFDIHYGSGSLSGYLSQDTVSVPCNPS 138
Query: 209 PNFLVGCSVLSS------RQPA---------GIAGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ G +V +QP GI G + S+ + L + F +
Sbjct: 139 SSSPGGVTVQRQTFGEAIKQPGVVFIAAKFDGILGMAYPRISVNNVLPV--FDNLMRQKL 196
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D + L D + ++ G T + + Y+ + + ++ VG
Sbjct: 197 VDKNVFSFFLNRDPKAQPGEELMLGGTDSKYYRGSLTYHNVTRQAYWQIHMDQLDVGSS- 255
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
LT+ + G IVD+GT+ V + + R +A+GA L
Sbjct: 256 -------LTVCK-GGCEAIVDTGTSL--------------MVGPVDEVRELHKAIGAVPL 293
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLT--VVTDREA 430
++ + +P EK S P++ L GG + TL E+Y V G+ VCL+ + D
Sbjct: 294 --IQGEYMIPCEKVSSLPQVTLKL-GGKDYTLSPEDYTLKVSQAGTTVCLSGFMGMDIPP 350
Query: 431 SGGPSIILGN 440
GGP ILG+
Sbjct: 351 PGGPLWILGD 360
>sp|P00795|CATD_PIG Cathepsin D OS=Sus scrofa GN=CTSD PE=1 SV=2
Length = 345
Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 73/166 (43%), Gaps = 29/166 (17%)
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
Y+ + + ++ VG LTL + G IVD+GT+ PE
Sbjct: 204 YWQIHMNQVAVGSS--------LTLCK-GGCEAIVDTGTSLIVGQPE------------- 241
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEG 417
+ R +A+GA L ++ + +P EK S P++ + GG + L ENY V G
Sbjct: 242 -EVRELGKAIGAVPL--IQGEYMIPCEKVPSLPDVTVTL-GGKKYKLSSENYTLKVSQAG 297
Query: 418 SAVCLT--VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
+CL+ + D GGP ILG+ + YY +D R+G +
Sbjct: 298 QTICLSGFMGMDIPPPGGPLWILGDVFIGRYYTVFDRDLNRVGLAE 343
>sp|P22929|CARP_SACFI Acid protease OS=Saccharomycopsis fibuligera GN=PEP1 PE=3 SV=1
Length = 390
Score = 46.2 bits (108), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 138/386 (35%), Gaps = 76/386 (19%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-------HYQCKYCSSSKIPS-FI 135
+ Y Y ++ GTP Q + +DTGS +W P + K S K S F
Sbjct: 70 NEYSFYLTTIEIGTPGQKLQVDVDTGSSDLWVPGQGTSSLYGTYDHTKSTSYKKDRSGFS 129
Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGI 195
S G + I SI + D ATS++ Q L G GL
Sbjct: 130 ISYGDGSSARGDWAQETVSIGGASITGLEFGD---ATSQDVGQ------GLLGIGLKGNE 180
Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
A +++ N S P L Q +DK +Y L + +
Sbjct: 181 ASAQSSN-------------SFTYDNLP----------LKLKDQGLIDKAAYSLYLNS-E 216
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
D T S L G S S K + L VN + + +V ++V L I G +
Sbjct: 217 DATSGSILF---GGSDSSKYSGSLATLDLVNIDDEGDSTSGAVAFFVELEGIEAGSSSI- 272
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
Y L +DSGTT + + + E+ Y+ + G G
Sbjct: 273 TKTTYPAL---------LDSGTTLIYAPSSIASSIGREY-------GTYSYSYG-----G 311
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
D G P+ K F G +T+P N EG + CL V +SG
Sbjct: 312 YVTSCDATG------PDFKFSFNG-KTITVPFSNLLFQNSEGDSECLVGVL---SSGSNY 361
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQ 461
ILG+ +++ YV YD+ N ++G Q
Sbjct: 362 YILGDAFLRSAYVYYDIDNSQVGIAQ 387
>sp|P00793|PEPA_CHICK Pepsin A OS=Gallus gallus GN=PGA PE=1 SV=1
Length = 367
Score = 45.8 bits (107), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 102/426 (23%), Positives = 152/426 (35%), Gaps = 119/426 (27%)
Query: 61 HIKNPQTK----TTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
H NP +K T T + TN SY G ++S GTP Q I DTGS +W P
Sbjct: 30 HPYNPASKYHPVLTATESYEPMTNYMDASYYG---TISIGTPQQDFSVIFDTGSSNLWVP 86
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
YC SS CS N + SK+
Sbjct: 87 SI------YCKSS---------------------ACS------------NHKRFDPSKSS 107
Query: 177 TQIC--PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA---------G 225
T + + + YG+G GI +T+ + + + N + G LS +P G
Sbjct: 108 TYVSTNETVYIAYGTGSMSGILGYDTVAVSSIDVQNQIFG---LSETEPGSFFYYCNFDG 164
Query: 226 IAGFG------RGKTSL-PSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
I G G T + + ++ + L S T S +L G + T G
Sbjct: 165 ILGLAFPSISSSGATPVFDNMMSQHLVAQDLFSVYLSKDGETGSFVL-FGGIDPNYTTKG 223
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
+ + P + Y+ + + R+TVG + V + T IVD+GT+
Sbjct: 224 IYWVPL----------SAETYWQITMDRVTVGNKYVAC---FFTCQ------AIVDTGTS 264
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
M Q NR + LG + G C D+ P++ H
Sbjct: 265 LLVMP-------------QGAYNR-IIKDLGVSS-DGEISCDDI-----SKLPDVTFHIN 304
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS-----IILGNFQMQNYYVEYDLR 453
G A TLP Y V+ E + L E G P+ ILG+ ++ YYV +D
Sbjct: 305 GHA-FTLPASAY--VLNEDGSCMLGF----ENMGTPTELGEQWILGDVFIREYYVIFDRA 357
Query: 454 NQRLGF 459
N ++G
Sbjct: 358 NNKVGL 363
>sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1
PE=2 SV=1
Length = 410
Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 86/406 (21%), Positives = 150/406 (36%), Gaps = 73/406 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + I+++ G P + +DTGS L W C C++ I + + +L+
Sbjct: 36 GHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAP-----CTNCNIVPHVLYKPTPKKLVT 90
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE--TLNLP 204
C + C+ ++ + + + C + K C Y++ Y + G+ + + +L+
Sbjct: 91 CADSLCTDLYTDLGKPKRCGSQ-----KQC-----DYVIQYVDSSSMGVLVIDRFSLSAS 140
Query: 205 NRIIPNFLV-GCSVLSSRQ------PA-GIAGFGRGKTSLPSQLNLDK-FSYCLLSHKFD 255
N P + GC ++ P I G RGK +L SQL + +L H
Sbjct: 141 NGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCI- 199
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+++ + + T+G+T+TP N YY G
Sbjct: 200 -SSKGGGFLF---FGDAQVPTSGVTWTPM---------NREHKYYSPG------------ 234
Query: 316 VWHKYLTLDRDGNG------GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
H L D + I DSG T+T+ A + ++ S + + +
Sbjct: 235 --HGTLHFDSNSKAISAAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEV- 291
Query: 370 AEALTGLRPCFD------VPGEKTGSFPELKLHFKGG---AEVTLPVENYFAVVGEGSAV 420
E L C+ E F L L F G A + +P E+Y + EG V
Sbjct: 292 TEKDRALTVCWKGKDKIVTIDEVKKCFRSLSLEFADGDKKATLEIPPEHYLIISQEGH-V 350
Query: 421 CLTVV--TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL ++ + S + ++G M + V YD LG+ C
Sbjct: 351 CLGILDGSKEHLSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQC 396
>sp|P55956|ASP3_CAEEL Aspartic protease 3 OS=Caenorhabditis elegans GN=asp-3 PE=1 SV=2
Length = 398
Score = 45.1 bits (105), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 83/405 (20%), Positives = 139/405 (34%), Gaps = 100/405 (24%)
Query: 81 ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
+S +S Y ++ GTPPQ + DTGS +W PC N C +
Sbjct: 61 LSDYSNAQYYGPVTIGTPPQNFQVLFDTGSSNLWVPCAN---CPF--------------- 102
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
I CR N S +CT S+ + YG+G +G ++
Sbjct: 103 -----------------GDIACRMHNRFDCKKSSSCTATGASFEIQYGTGSMKGTVDNDV 145
Query: 201 LNLPNRII----PNFLVGCS------VLSSRQPAGIAGFGR-----GKTSLPSQLNLDKF 245
+ + N + C+ + + GI G G K S P
Sbjct: 146 VCFGHDTTYCTDKNQGLACATSEPGITFVAAKFDGIFGMGWDTISVNKISQPMDQIFANS 205
Query: 246 SYC-------LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
+ C LS +D T + L + + + + P V+
Sbjct: 206 AICKNQLFAFWLSRDANDITNGGEITL--CETDPNHYVGNIAWEPLVSED---------- 253
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
Y+ + L + + G T G +IVD+GT+ ++ + + +
Sbjct: 254 YWRIKLASVVIDG----------TTYTSGPIDSIVDTGTSLLTGPTDVIKKIQHKIGGIP 303
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--GE 416
+ N Y +V K S P + + GG L ++Y + G
Sbjct: 304 LFNGEY----------------EVECSKIPSLPNITFNL-GGQNFDLQGKDYILQMSNGN 346
Query: 417 GSAVCLT--VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
G + CL+ + D A GP ILG+ + +Y +D N+R+GF
Sbjct: 347 GGSTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHGNKRVGF 391
>sp|Q12303|YPS3_YEAST Aspartic proteinase yapsin-3 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=YPS3 PE=1 SV=1
Length = 508
Score = 45.1 bits (105), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 95/409 (23%), Positives = 147/409 (35%), Gaps = 112/409 (27%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
YS+ L+ GTP Q + +LDTGS +W P G
Sbjct: 63 YSVELAIGTPSQNLTVLLDTGSADLWVP-----------------------------GKG 93
Query: 149 NPKCSWIHHESIQCRDCN-----DEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
NP C + DC+ D+ +++ + P Y YG G A +
Sbjct: 94 NPYCGSV-------MDCDQYGVFDKTKSSTFKANKSSPFYAA-YGDGTYAEGAFGQDKLK 145
Query: 204 PNRIIPNFLVGCSVLSSRQPAGIAGFG--------RGKTSLPSQLNLDKFSY------CL 249
N + + L S G+ G G GK ++ +DK SY
Sbjct: 146 YNELDLSGLSFAVANESNSTFGVLGIGLSTLEVTYSGKVAI-----MDKRSYEYDNFPLF 200
Query: 250 LSHK-----------FDDTTRTSSLILDNGSSHSDKKTTGLTYT-PFVNNPSVAERNAFS 297
L H +D +++S IL HS K G YT P VN
Sbjct: 201 LKHSGAIDATAYSLFLNDESQSSGSILFGAVDHS--KYEGQLYTIPLVN----------- 247
Query: 298 VYYYVGLRR-----ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
+Y G + +T+ G ++ + +TL ++DSGTT T++ + LA
Sbjct: 248 LYKSQGYQHPVAFDVTLQGLGLQTDKRNITLTTT-KLPALLDSGTTLTYLPSQAVALLAK 306
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
N +Y++ LG T P D KT + F GG + P+ ++
Sbjct: 307 SL------NASYSKTLGYYEYTC--PSSD---NKT----SVAFDF-GGFRINAPLSDFTM 350
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
G+ V + +G + ILG+ ++N YV YDL N + Q
Sbjct: 351 QTSVGTCVLAII----PQAGNATAILGDSFLRNAYVVYDLDNYEISLAQ 395
>sp|Q05744|CATD_CHICK Cathepsin D OS=Gallus gallus GN=CTSD PE=1 SV=1
Length = 398
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 86/396 (21%), Positives = 138/396 (34%), Gaps = 100/396 (25%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + DTGS +W P +C
Sbjct: 78 YYGEIGIGTPPQKFTVVFDTGSSNLWVPSV------HC---------------------- 109
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
H I C + + S + + + YG+G G +T+ L N I
Sbjct: 110 -------HLLDIACLLHHKYDASKSSTYVENGTEFAIHYGTGSLSGFLSQDTVTLGNLKI 162
Query: 209 PNFLVGCSVLSSRQPA---------GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
N + G +V +QP GI G + P ++++DK + F D
Sbjct: 163 KNQIFGEAV---KQPGITFIAAKFDGILGM-----AFP-RISVDKVT------PFFDNVM 207
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV-------YYYVGLRRITVGGQ 312
LI N ++ ++N A+ + YY + V
Sbjct: 208 QQKLIEKN------------IFSFYLNRDPTAQPGGELLLGGTDPKYYSGDFSWVNV--T 253
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV----KNRNYTRAL 368
R W ++ NG T+ G E + D S + + + A+
Sbjct: 254 RKAYWQVHMDSVDVANGLTLCKGGC----------EAIVDTGTSLITGPTKEVKELQTAI 303
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVT- 426
GA+ L ++ + + +K S P + L GG L E Y F V +G +CL+ +
Sbjct: 304 GAKPL--IKGQYVISCDKISSLPVVTLML-GGKPYQLTGEQYVFKVSAQGETICLSGFSG 360
Query: 427 -DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
D GGP ILG+ + YY +D N +GF +
Sbjct: 361 LDVPPPGGPLWILGDVFIGPYYTVFDRDNDSVGFAK 396
>sp|P13917|7SB1_SOYBN Basic 7S globulin OS=Glycine max GN=BG PE=1 SV=2
Length = 427
Score = 43.5 bits (101), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 85/403 (21%), Positives = 148/403 (36%), Gaps = 67/403 (16%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY-----QCKYCSSSKIPSFIPKLS 139
S G + +L TP +P ++D + +W C Y Q +C S++ +
Sbjct: 43 STGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCEQQYSSKTYQAPFCHSTQC-----SRA 97
Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALS 198
++ + L C H + C + P+ ++ L ++ + G T+ +
Sbjct: 98 NTHQCLSCPAASRPGCHKNT--CGLMSTNPITQQTGLGELGEDVLAIHATQGSTQQLG-- 153
Query: 199 ETLNLPNRIIPNFLVGC--SVLSS----RQPAGIAGFGRGKTSLPSQLN-----LDKFSY 247
P +P FL C S L R G+AG G SLP+QL +F+
Sbjct: 154 -----PLVTVPQFLFSCAPSFLVQKGLPRNTQGVAGLGHAPISLPNQLASHFGLQRQFTT 208
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTT-----GLTYTPFVNNPSVAERNAFSVYYYV 302
CL + T ++I + ++ + L +TP Y V
Sbjct: 209 CLSRYP----TSKGAIIFGDAPNNMRQFQNQDIFHDLAFTPLTI--------TLQGEYNV 256
Query: 303 GLRRITVGGQRVRVWHKYL-TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
+ I + V +K T+ +GGT++ + T + +++ F Q+ K
Sbjct: 257 RVNSIRINQHSVFPLNKISSTIVGSTSGGTMISTSTPHMVLQQSVYQAFTQVFAQQLPKQ 316
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF-KGGAEVTLPVENYFAVVGEGSAV 420
+ + A GL CF+ K ++P + L K V V +
Sbjct: 317 AQ----VKSVAPFGL--CFN--SNKINAYPSVDLVMDKPNGPVWRISGEDLMVQAQPGVT 368
Query: 421 CLTVVTDREASGG----PSIILGNFQMQNYYVEYDLRNQRLGF 459
CL V+ +GG I LG Q++ V +DL R+GF
Sbjct: 369 CLGVM-----NGGMQPRAEITLGARQLEENLVVFDLARSRVGF 406
>sp|P43093|CARP4_CANAW Candidapepsin-4 OS=Candida albicans (strain WO-1) GN=SAP4 PE=3 SV=1
Length = 417
Score = 43.5 bits (101), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 81/390 (20%), Positives = 144/390 (36%), Gaps = 91/390 (23%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
YS ++ G+ Q + I+DTGS +W P +N C IPK
Sbjct: 89 YSADITIGSNNQKLSVIVDTGSSDLWVPDSN----AVC--------IPK----------- 125
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
W C++ A S + + + Y G + +G +T+ +
Sbjct: 126 -----WPGDRGDFCKNNGSYSPAASSTSKNLNTPFEIKYADGSVAQGNLYQDTVGIGGVS 180
Query: 208 IPNFLVGCSVLSSRQPAGIAGFGRGKT------------SLPSQLNLDKFSYCLLSHKFD 255
+ + L +V S+ GI G G +L Q + K +Y L F
Sbjct: 181 VRDQLF-ANVRSTSAHKGILGIGFQSNEATRTPYDNLPITLKKQGIISKNAYSL----FL 235
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++ SS + G K + L P ++ +++ VGLR + V GQ V
Sbjct: 236 NSPEASSGQIIFGGIDKAKYSGSLVDLPITSDRTLS----------VGLRSVNVMGQNVN 285
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
V N G ++DSGTT ++ P + + Q+ + + A A+ T
Sbjct: 286 V-----------NAGVLLDSGTTISYFTPNIARSIIYALGGQVHYDSSGNEAYVADCKT- 333
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY----FAVVGEGSAVCLTVVTDREAS 431
+G+ + F ++++P + + GE C V + E +
Sbjct: 334 -----------SGT---VDFQFDRNLKISVPASEFLYQLYYTNGEPYPKCEIRVRESEDN 379
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
ILG+ M++ Y+ YDL ++++ Q
Sbjct: 380 -----ILGDNFMRSAYIVYDLDDRKISMAQ 404
>sp|O93428|CATD_CHIHA Cathepsin D OS=Chionodraco hamatus GN=ctsd PE=1 SV=2
Length = 396
Score = 42.7 bits (99), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 95/411 (23%), Positives = 148/411 (36%), Gaps = 102/411 (24%)
Query: 73 TTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP 132
T T + + YG + GTPPQ + DTGS +W P +CS I
Sbjct: 64 TPETLKNYLDAQYYG----EIGLGTPPQPFTVVFDTGSSNLWVPSI------HCSLLDIA 113
Query: 133 SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT 192
+ HH+ + KN T ++ + YGSG
Sbjct: 114 CLL--------------------HHKYNSGKSST-----YVKNGT----AFAIQYGSGSL 144
Query: 193 EGIALSETLNLPNRIIPNFLVGCSVLSSRQPA---------GIAGFGRGKTSLP------ 237
G +T + + I + L G ++ +QP GI G + S+
Sbjct: 145 SGYLSQDTCTIGDLAIDSQLFGEAI---KQPGVAFIAAKFDGILGMAYPRISVDGVAPVF 201
Query: 238 ----SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER 293
SQ +++ + ++ DT L+L +D K YT N +V +
Sbjct: 202 DNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGG----TDPKY----YTGDFNYVNVTRQ 253
Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
Y+ + + + VG Q L+L G IVDSGT+
Sbjct: 254 ----AYWQIRVDSMAVGDQ--------LSL-CTGGCEAIVDSGTSL-------------- 286
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FA 412
V+ + +A+GA L ++ + V + S P + GG TL E Y
Sbjct: 287 ITGPSVEVKALQKAIGAFPL--IQGEYMVNCDTVPSLPVISFTV-GGQVYTLTGEQYILK 343
Query: 413 VVGEGSAVCLT--VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
V G +CL+ + D A GP ILG+ M YY +D R+GF +
Sbjct: 344 VTQAGKTMCLSGFMGLDIPAPAGPLWILGDVFMGQYYTVFDRDANRVGFAK 394
>sp|Q689Z7|PEPC_MONDO Gastricsin OS=Monodelphis domestica GN=PGC PE=2 SV=1
Length = 391
Score = 41.6 bits (96), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 83/387 (21%), Positives = 129/387 (33%), Gaps = 90/387 (23%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +S GTPPQ + DTGS +W P T YC S
Sbjct: 75 YFGEISIGTPPQNFLVLFDTGSSNLWVPST------YCQSQA------------------ 110
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
C + N + S T +Y + YGSG + +T+ + N ++
Sbjct: 111 -------------CSNHNRFSPSQSSTFTNGGQTYTLSYGSGSLTVVLGYDTVTVQNIVV 157
Query: 209 PNFLVGCSVLSSRQP------AGIAGF-------GRGKTSLPSQLNLDKFSYCLLSHKFD 255
N G S P GI G G T + L + S + S F
Sbjct: 158 SNQEFGLSESEPTSPFYYSDFDGILGMAYPAMAVGNSPTVMQGMLQQGQLSEPIFSFYFS 217
Query: 256 DT---TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
LIL G + +T+TP VY+ +G+ +G Q
Sbjct: 218 RQPTHQYGGELIL--GGVDPQLYSGQITWTPVTQ----------EVYWQIGIEEFAIGNQ 265
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ IVD+GT F P+ +++S ++ +A +
Sbjct: 266 ATGWCSQ--------GCQAIVDTGT-FLLAVPQ-------QYMSAFLQATGAQQAQNGDF 309
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
+ D+P T +F G++ LP Y + + +G
Sbjct: 310 MVNCNYIQDMP---TITF------VINGSQFPLPPSAYVFNNNGYCRLGIEATYLPSPNG 360
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGF 459
P ILG+ ++ YY YD+ N R+GF
Sbjct: 361 QPLWILGDVFLKEYYSVYDMANNRVGF 387
>sp|P28713|PEPA4_RABIT Pepsin II-4 OS=Oryctolagus cuniculus PE=2 SV=1
Length = 387
Score = 41.6 bits (96), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 35/70 (50%), Gaps = 8/70 (11%)
Query: 61 HIKNPQTK--TTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT 118
H NP TK T T +T ++ ++ Y ++S GTPPQ I DTGS +W P T
Sbjct: 45 HTPNPATKYFPKETFATVSTESLENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPST 104
Query: 119 NHYQCKYCSS 128
YCSS
Sbjct: 105 ------YCSS 108
>sp|P27821|PEPA2_RABIT Pepsin II-2/3 OS=Oryctolagus cuniculus PE=2 SV=1
Length = 387
Score = 41.6 bits (96), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 35/70 (50%), Gaps = 8/70 (11%)
Query: 61 HIKNPQTK--TTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT 118
H NP TK T T +T ++ ++ Y ++S GTPPQ I DTGS +W P T
Sbjct: 45 HTPNPATKYFPKETFATVSTESMENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPST 104
Query: 119 NHYQCKYCSS 128
YCSS
Sbjct: 105 ------YCSS 108
>sp|P43095|CARP6_CANAX Candidapepsin-6 OS=Candida albicans GN=SAP6 PE=3 SV=1
Length = 418
Score = 41.6 bits (96), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 83/395 (21%), Positives = 141/395 (35%), Gaps = 101/395 (25%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
YS ++ G+ Q + I+DTGS +W P + C IPK
Sbjct: 90 YSADITVGSNNQKLSVIVDTGSSDLWIPDSKAI----C--------IPK----------- 126
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
W C++ A S + + + Y G +G +T+ +
Sbjct: 127 -----WRGDCGDFCKNNGSYSPAASSTSKNLNTRFEIKYADGSYAKGNLYQDTVGIGGAS 181
Query: 208 IPNFLVGCSVLSSRQPAGIAGFG------------RGKTSLPSQLNLDKFSYCLLSHKFD 255
+ N L +V S+ GI G G SL Q + K +Y L F
Sbjct: 182 VKNQLF-ANVWSTSAHKGILGIGFQTNEATRTPYDNLPISLKKQGIIAKNAYSL----FL 236
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++ SS + G K + L P ++ +++ VGLR + V G+ V
Sbjct: 237 NSPEASSGQIIFGGIDKAKYSGSLVELPITSDRTLS----------VGLRSVNVMGRNVN 286
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
V N G ++DSGTT ++ P + R+ ALG +
Sbjct: 287 V-----------NAGVLLDSGTTISYFTPSIA--------------RSIIYALGGQV--- 318
Query: 376 LRPCFDVPGEKT-----GSFPELKLHFKGGAEVTLPVENY----FAVVGEGSAVCLTVVT 426
FD G K + + F ++++P + + G+ C V
Sbjct: 319 ---HFDSAGNKAYVADCKTSGTVDFQFDKNLKISVPASEFLYQLYYTNGKPYPKCEIRVR 375
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
+ E + ILG+ M++ Y+ YDL ++++ Q
Sbjct: 376 ESEDN-----ILGDNFMRSAYIVYDLDDKKISMAQ 405
>sp|O04057|ASPR_CUCPE Aspartic proteinase OS=Cucurbita pepo PE=2 SV=1
Length = 513
Score = 40.8 bits (94), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 51/123 (41%), Gaps = 11/123 (8%)
Query: 1 MASYISA---LCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLT 57
MASY S LCL + F ++S S+ L L + +P + S + L
Sbjct: 1 MASYHSKAAFLCLFLLVSFNIVS-SASNDGLLRVGLKKIKLDPENRLAARVESKDAEILK 59
Query: 58 RALHIKNPQT---KTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
A NP+ +++ T + + YG ++ GTPPQ I DTGS +W
Sbjct: 60 AAFRKYNPKGNLGESSDTDIVALKNYLDAQYYG----EIAIGTPPQKFTVIFDTGSSNLW 115
Query: 115 FPC 117
C
Sbjct: 116 VLC 118
>sp|P53379|MKC7_YEAST Aspartic proteinase MKC7 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=MKC7 PE=1 SV=2
Length = 596
Score = 40.4 bits (93), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 58/130 (44%), Gaps = 18/130 (13%)
Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
++DSGTT ++M EL + LAD+ Y+ A G + C E++
Sbjct: 358 LLDSGTTISYMPTELVKMLADQV------GATYSSAYGYY----IMDCIKEMEEESSIIF 407
Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
+ GG ++ + ++ V S +C+ + + P+IILG+ + N YV YD
Sbjct: 408 DF-----GGFYLSNWLSDFQLVTDSRSNICILGIAPQS---DPTIILGDNFLANTYVVYD 459
Query: 452 LRNQRLGFKQ 461
L N + Q
Sbjct: 460 LDNMEISMAQ 469
Score = 38.9 bits (89), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK 130
YS+ L GTPPQ + ++DTGS +W +++ YCS+ K
Sbjct: 81 YSVELDIGTPPQKVTVLVDTGSSDLWVTGSDN---PYCSTKK 119
>sp|P17576|CARP_IRPLA Polyporopepsin OS=Irpex lacteus PE=1 SV=1
Length = 340
Score = 40.4 bits (93), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 117/300 (39%), Gaps = 56/300 (18%)
Query: 185 VLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA-----GIAGFG--------- 230
V YGSG G ++T+ L + IP +G ++SR GI G G
Sbjct: 61 VTYGSGSFSGTEYTDTVTLGSLTIPKQSIG---VASRDSGFDGVDGILGVGPVDLTVGTL 117
Query: 231 --RGKTSLPSQLNLDKFSYC-----LLSHKFDDTTRTSSL--ILDNGSSHSDKKTTGLTY 281
TS+P+ + + FS LL+ F+ TT SS L G++ S K T +TY
Sbjct: 118 SPHTSTSIPTVTD-NLFSQGTIPTNLLAVSFEPTTSESSTNGELTFGATDSSKYTGSITY 176
Query: 282 TPFVN-NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
TP + +P+ A Y G+ Q +R L IVD+GTT T
Sbjct: 177 TPITSTSPASA---------YWGIN------QTIRYGSSTSILSSTAG---IVDTGTTLT 218
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
+A + F + N R A+ L+ F G +T
Sbjct: 219 LIASDAFAKYKKATGAVADNNTGLLRLTTAQ-YANLQSLFFTIGGQT-------FELTAN 270
Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASG-GPSIILGNFQMQNYYVEYDLRNQRLGF 459
A++ P A+ G S+V L V SG G I G ++ +Y YD N+RLG
Sbjct: 271 AQI-WPRNLNTAIGGSASSVYLIVGDLGSDSGEGLDFINGLTFLERFYSVYDTTNKRLGL 329
>sp|P42210|ASPR_HORVU Phytepsin OS=Hordeum vulgare PE=1 SV=1
Length = 508
Score = 40.0 bits (92), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 39/77 (50%), Gaps = 4/77 (5%)
Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVT--DREASGGPSIILGNFQMQ 444
GS P+++ GG + L E Y VGEG+A C++ T D GP ILG+ M
Sbjct: 431 GSMPDIEFTI-GGKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMG 489
Query: 445 NYYVEYDLRNQRLGFKQ 461
Y+ +D R+GF +
Sbjct: 490 PYHTVFDYGKLRIGFAK 506
>sp|P60016|RENI_PANTR Renin OS=Pan troglodytes GN=REN PE=3 SV=1
Length = 406
Score = 39.3 bits (90), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 43/94 (45%), Gaps = 6/94 (6%)
Query: 371 EALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV-- 425
EAL + FD V + + P++ H GG E TL +Y F +C +
Sbjct: 310 EALGAKKRLFDYVVKCNEGPTLPDISFHL-GGKEYTLTSADYVFQESYSSKKLCTLAIHA 368
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
D GP+ LG ++ +Y E+D RN R+GF
Sbjct: 369 MDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGF 402
Score = 35.0 bits (79), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 8/59 (13%)
Query: 71 TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP---CTNHY-QCKY 125
TT++ T + + YG + GTPPQ + DTGS VW P C+ Y C Y
Sbjct: 72 TTSSVILTNYMDTQYYG----EIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVY 126
>sp|P00797|RENI_HUMAN Renin OS=Homo sapiens GN=REN PE=1 SV=1
Length = 406
Score = 39.3 bits (90), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 43/94 (45%), Gaps = 6/94 (6%)
Query: 371 EALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV-- 425
EAL + FD V + + P++ H GG E TL +Y F +C +
Sbjct: 310 EALGAKKRLFDYVVKCNEGPTLPDISFHL-GGKEYTLTSADYVFQESYSSKKLCTLAIHA 368
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
D GP+ LG ++ +Y E+D RN R+GF
Sbjct: 369 MDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGF 402
Score = 35.0 bits (79), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 8/59 (13%)
Query: 71 TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP---CTNHY-QCKY 125
TT++ T + + YG + GTPPQ + DTGS VW P C+ Y C Y
Sbjct: 72 TTSSVILTNYMDTQYYG----EIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVY 126
>sp|Q6DLS0|RENI_MACFA Renin OS=Macaca fascicularis GN=REN PE=2 SV=1
Length = 406
Score = 39.3 bits (90), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 43/94 (45%), Gaps = 6/94 (6%)
Query: 371 EALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV-- 425
EAL + FD V + + P++ H GG E TL +Y F +C +
Sbjct: 310 EALGAKKRLFDYVVKCNEGPTLPDISFHL-GGKEYTLTSADYVFQESYSSKKLCTLAIHA 368
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
D GP+ LG ++ +Y E+D RN R+GF
Sbjct: 369 MDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGF 402
Score = 35.0 bits (79), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 8/59 (13%)
Query: 71 TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP---CTNHY-QCKY 125
TT++ T + + YG + GTPPQ + DTGS VW P C+ Y C Y
Sbjct: 72 TTSSVILTNYMDTQYYG----EIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVY 126
>sp|Q6DLW5|RENI_MACMU Renin OS=Macaca mulatta GN=REN PE=2 SV=2
Length = 406
Score = 39.3 bits (90), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 43/94 (45%), Gaps = 6/94 (6%)
Query: 371 EALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV-- 425
EAL + FD V + + P++ H GG E TL +Y F +C +
Sbjct: 310 EALGAKKRLFDYVVKCNEGPTLPDISFHL-GGKEYTLTSADYVFQESYSSKKLCTLAIHA 368
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
D GP+ LG ++ +Y E+D RN R+GF
Sbjct: 369 MDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGF 402
Score = 35.0 bits (79), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 8/59 (13%)
Query: 71 TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP---CTNHY-QCKY 125
TT++ T + + YG + GTPPQ + DTGS VW P C+ Y C Y
Sbjct: 72 TTSSVILTNYMDTQYYG----EIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVY 126
>sp|Q42456|ASPR1_ORYSJ Aspartic proteinase oryzasin-1 OS=Oryza sativa subsp. japonica
GN=Os05g0567100 PE=2 SV=2
Length = 509
Score = 39.3 bits (90), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 37/76 (48%), Gaps = 4/76 (5%)
Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVT--DREASGGPSIILGNFQMQN 445
S PE+ GG + L E Y VGEG+A C++ T D GP ILG+ M
Sbjct: 433 SMPEISFTI-GGKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGA 491
Query: 446 YYVEYDLRNQRLGFKQ 461
Y+ +D R+GF +
Sbjct: 492 YHTVFDYGKMRVGFAK 507
>sp|Q28057|PAG2_BOVIN Pregnancy-associated glycoprotein 2 OS=Bos taurus GN=PAG2 PE=2 SV=1
Length = 376
Score = 38.5 bits (88), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 82/386 (21%), Positives = 141/386 (36%), Gaps = 94/386 (24%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
Y +++ GTPPQ + DTGS +W PC C + +F P+ SSS R +G
Sbjct: 67 AYVGNITIGTPPQEFRVVFDTGSANLWVPCIT---CTSPACYTHKTFNPQNSSSFREVG- 122
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI 207
P+ + YGSG+ +G S+T+ + N +
Sbjct: 123 --------------------SPIT-------------IFYGSGIIQGFLGSDTVRIGNLV 149
Query: 208 IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
P Q G++ G SLP + +DT I DN
Sbjct: 150 SP-----------EQSFGLSLEEYGFDSLPFD---GILGLAFPAMGIEDTIP----IFDN 191
Query: 268 GSSHSDKKTTGLTYTPFVNNP--SVAERNAFSVYYYVG-LRRITVGGQRVRVWHKYLTLD 324
SH + N P SV YY G L I V + H ++++
Sbjct: 192 LWSHGAFSEPVFAFYLNTNKPEGSVVMFGGVDHRYYKGELNWIPVS----QTSHWQISMN 247
Query: 325 RDGNGGTI----------VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
GT+ +D+GT+ + +L + + ++ ++N Y + +A+
Sbjct: 248 NISMNGTVTACSCGCEALLDTGTSMIYGPTKLVTNI-HKLMNARLENSEY--VVSCDAVK 304
Query: 375 GLRPC-FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L P F++ G P+ + + ++N V +G E S
Sbjct: 305 TLPPVIFNINGIDYPLRPQAYI---------IKIQNSCRSVFQGGT---------ENSSL 346
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGF 459
+ ILG+ ++ Y+ +D +N+R+G
Sbjct: 347 NTWILGDIFLRQYFSVFDRKNRRIGL 372
>sp|D4DEN7|CARP_TRIVH Probable vacuolar protease A OS=Trichophyton verrucosum (strain HKI
0517) GN=PEP2 PE=3 SV=1
Length = 400
Score = 38.5 bits (88), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 88/390 (22%), Positives = 136/390 (34%), Gaps = 95/390 (24%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +S GTPPQ +LDTGS +W P K CSS I F+ SS
Sbjct: 87 YFSEISIGTPPQTFKVVLDTGSSNLWVP------GKDCSS--IACFLHSTYDSSA----- 133
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
SKN T+ + + YGSG EG +++ + + I
Sbjct: 134 --------------------SSTYSKNGTK----FAIRYGSGSLEGFVSQDSVKIGDMTI 169
Query: 209 PNFLVGCSVLSSRQPA---------GIAGFGRGKTSL-----PSQLNLDK--FSYCLLSH 252
N L ++ +P GI G G S+ P +D+ + S
Sbjct: 170 KNQLF---AEATSEPGLAFAFGRFDGIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFSF 226
Query: 253 KFDDTTRTSSL-ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
DT + ++ G S + T +T P R A Y+ V I++G
Sbjct: 227 YLGDTNKEGDQSVVTFGGSDTKHFTGDMTTIPL-------RRKA---YWEVDFDAISLGE 276
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ N G I+D+GT+ + L E + + + N YT
Sbjct: 277 DTAALE----------NTGIILDTGTSLIALPTTLAEMINTQIGATKSWNGQYT------ 320
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
+ K S P++ G T+ +Y V G+ + + D
Sbjct: 321 ----------LDCAKRDSLPDVTFTVS-GHNFTIGPHDYTLEV-SGTCISSFMGMDFPEP 368
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
GP ILG+ ++ YY YDL +G +
Sbjct: 369 VGPLAILGDSFLRRYYSVYDLGKGTVGLAK 398
>sp|P32329|YPS1_YEAST Aspartic proteinase 3 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=YPS1 PE=1 SV=2
Length = 569
Score = 38.5 bits (88), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS 133
YS+ L GTPPQ + ++DTGS +W +++ YCSS+ + S
Sbjct: 83 YSVDLEVGTPPQNVTVLVDTGSSDLWIMGSDN---PYCSSNSMGS 124
Score = 38.1 bits (87), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/130 (25%), Positives = 59/130 (45%), Gaps = 22/130 (16%)
Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
++DSGTT T++ + +A E +Q Y+ +G L D P + +
Sbjct: 369 LLDSGTTLTYLPQTVVSMIATELGAQ------YSSRIGYYVL-------DCPSDDS---M 412
Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
E+ F GG + P+ ++ ++ G+ L ++ + +G ILG+ + N YV YD
Sbjct: 413 EIVFDF-GGFHINAPLSSF--ILSTGTTCLLGIIPTSDDTG---TILGDSFLTNAYVVYD 466
Query: 452 LRNQRLGFKQ 461
L N + Q
Sbjct: 467 LENLEISMAQ 476
>sp|Q9XFX3|CARDA_CYNCA Procardosin-A OS=Cynara cardunculus GN=cardA PE=1 SV=1
Length = 504
Score = 38.5 bits (88), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 52/129 (40%), Gaps = 11/129 (8%)
Query: 1 MASYISALCLSFIFFFTLLS--IFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTR 58
M + I A L+ +F F LLS +F S L + D + +L+ + +
Sbjct: 1 MGTSIKANVLA-LFLFYLLSPTVFSVSDDGLIRIGLKKRKVDRIDQLRGRRALMEGNARK 59
Query: 59 ALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP-- 116
+ T + + TN SY G + GTPPQ I DTGS ++W P
Sbjct: 60 DFGFRG--TVRDSGSAVVALTNDRDTSYFG---EIGIGTPPQKFTVIFDTGSSVLWVPSS 114
Query: 117 -CTNHYQCK 124
C N C+
Sbjct: 115 KCINSKACR 123
Score = 35.0 bits (79), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 399 GGAEVTLPVENYFAVVGEGSAV-CLTVVTDREASG-GPSIILGNFQMQNYYVEYDLRNQR 456
GG + L E Y VG+G A C++ T +A+ GP ILG+ M+ Y+ +D N
Sbjct: 438 GGKKFGLTPEQYILKVGKGEATQCISGFTAMDATLLGPLWILGDVFMRPYHTVFDYGNLL 497
Query: 457 LGFKQ 461
+GF +
Sbjct: 498 VGFAE 502
>sp|Q64411|PEPC_CAVPO Gastricsin OS=Cavia porcellus GN=PGC PE=2 SV=1
Length = 394
Score = 38.1 bits (87), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 89/386 (23%), Positives = 134/386 (34%), Gaps = 97/386 (25%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
+S GTPPQ + DTGS +W P YCSS L C
Sbjct: 83 ISLGTPPQSFQVLFDTGSSNLWVPSV------YCSS----------------LACTT--- 117
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
H RD + +AT + S+ + YG+G G+ +T+ + + +P
Sbjct: 118 ----HTRFNPRDSSTY-VATDQ-------SFSLEYGTGSLTGVFGYDTMTIQDIQVPKQE 165
Query: 213 VGCSVLSSRQPA---------GIAGFGR-------GKTSLPSQLNLDKFSYCLLSHKFDD 256
G LS +P GI G G T++ L S L S
Sbjct: 166 FG---LSETEPGSDFVYAEFDGILGLGYPGLSEGGATTAMQGLLREGALSQSLFSVYLGS 222
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ L G T + +TP +Y+ +G+ + G
Sbjct: 223 QQGSDEGQLILGGVDESLYTGDIYWTPVTQE----------LYWQIGIEGFLIDGS-ASG 271
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
W R G IVD+GT+ + +++S +V+ A+GAE
Sbjct: 272 W-----CSRGCQG--IVDTGTSLL--------TVPSDYLSTLVQ------AIGAEE--NE 308
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS---GG 433
+ V P L G V P+ A + G C+ + S G
Sbjct: 309 YGEYFVSCSSIQDLPTLTFVISG---VEFPLSPS-AYILSGENYCMVGLESTYVSPGGGE 364
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGF 459
P ILG+ +++YY YDL N R+GF
Sbjct: 365 PVWILGDVFLRSYYSVYDLANNRVGF 390
>sp|Q9N2D2|CHYM_CALJA Chymosin OS=Callithrix jacchus GN=CYM PE=1 SV=1
Length = 381
Score = 38.1 bits (87), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 101/464 (21%), Positives = 163/464 (35%), Gaps = 101/464 (21%)
Query: 15 FFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTT 74
F LL++F S S + H S L+ L H + + +
Sbjct: 4 FVVLLAVFALSQASGIVRIP-LHKGKSLRRALKERGLLEDFLKNHQHAVSRKHSNSREVA 62
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
+ TN Y G + GTPPQ + DTGS +W P YC+S
Sbjct: 63 SEFLTNYLDCQYFG---KIYIGTPPQEFTVVFDTGSSDLWVPSV------YCNS------ 107
Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEG 194
+ CQN HH +P + S + S + YG+G +G
Sbjct: 108 ----------VACQN------HHRF--------DP-SKSSTFQNMDKSLSIQYGTGSMQG 142
Query: 195 IALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGF-------GRGKTSLPSQLNLDKFSY 247
+ +T+ + + + P+ VG LS+++P + + G SL S+ ++ F
Sbjct: 143 LLGYDTVTVSSIVDPHQTVG---LSTQEPGDVFTYSEFDGILGLAYPSLASEYSVPVFDN 199
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
+ H + D S + + G T +PS YY L I
Sbjct: 200 MMDRHL---------VAQDLFSVYMSRNEQGSMLTLGAIDPS---------YYTGSLHWI 241
Query: 308 TVGGQRVRVWH---KYLTLDR-----DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
V Q W +T+D DG I+D+GT+ L P +D F
Sbjct: 242 PVTVQ--EYWQFTVDSVTVDGVVVACDGGCQAILDTGTSM------LVGPGSDIF----- 288
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
N +A+GA G FD+ S P + G + LP Y +
Sbjct: 289 ---NIQQAIGATE--GQYGEFDIDCGTLSSMPTVVFEIN-GKKYPLPPSAY---TNQDQG 339
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
C + ++S ILG+ ++ YY +D + +G + +
Sbjct: 340 FCTSGFQGDDSS--QQWILGDVFIREYYSVFDRASNLVGLAKAI 381
>sp|P40782|CYPR1_CYNCA Cyprosin (Fragment) OS=Cynara cardunculus GN=CYPRO1 PE=1 SV=2
Length = 473
Score = 38.1 bits (87), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 399 GGAEVTLPVENYFAVVGEGS-AVCLTVVTDREAS--GGPSIILGNFQMQNYYVEYDLRNQ 455
GG L E Y VGEG+ A C++ T + + GP ILG+ M Y+ +D N
Sbjct: 406 GGKTFNLSPEQYVLKVGEGATAQCISGFTAMDVAPPHGPLWILGDVFMGQYHTVFDYGNL 465
Query: 456 RLGFKQ 461
R+GF +
Sbjct: 466 RVGFAE 471
Score = 32.3 bits (72), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
Y + GTPPQ I DTGS +W P + Y
Sbjct: 51 YFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCY 83
>sp|P56819|BACE1_RAT Beta-secretase 1 OS=Rattus norvegicus GN=Bace1 PE=2 SV=1
Length = 501
Score = 38.1 bits (87), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 38/174 (21%), Positives = 69/174 (39%), Gaps = 20/174 (11%)
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
YY V + R+ + GQ +++ K D+ +IVDSGTT + ++FE +
Sbjct: 259 YYEVIIVRVEINGQDLKMDCKEYNYDK-----SIVDSGTTNLRLPKKVFEAAVKSIKAAS 313
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGA-----EVTLPVENYFAV 413
+ E L C+ FP + L+ G +T+ + Y
Sbjct: 314 STEKFPDGFWLGEQLV----CWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRP 369
Query: 414 VGEGSAV---CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + + C + ++G ++G M+ +YV +D +R+GF C
Sbjct: 370 VEDVATSQDDCYKFAVSQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 420
>sp|P56818|BACE1_MOUSE Beta-secretase 1 OS=Mus musculus GN=Bace1 PE=1 SV=2
Length = 501
Score = 38.1 bits (87), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 38/174 (21%), Positives = 69/174 (39%), Gaps = 20/174 (11%)
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
YY V + R+ + GQ +++ K D+ +IVDSGTT + ++FE +
Sbjct: 259 YYEVIIVRVEINGQDLKMDCKEYNYDK-----SIVDSGTTNLRLPKKVFEAAVKSIKAAS 313
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGA-----EVTLPVENYFAV 413
+ E L C+ FP + L+ G +T+ + Y
Sbjct: 314 STEKFPDGFWLGEQLV----CWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRP 369
Query: 414 VGEGSAV---CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + + C + ++G ++G M+ +YV +D +R+GF C
Sbjct: 370 VEDVATSQDDCYKFAVSQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 420
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.134 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 174,944,591
Number of Sequences: 539616
Number of extensions: 7561950
Number of successful extensions: 32527
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 139
Number of HSP's successfully gapped in prelim test: 160
Number of HSP's that attempted gapping in prelim test: 29162
Number of HSP's gapped (non-prelim): 2343
length of query: 465
length of database: 191,569,459
effective HSP length: 121
effective length of query: 344
effective length of database: 126,275,923
effective search space: 43438917512
effective search space used: 43438917512
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)
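The length and search-space figures in the footer above can be cross-checked against each other. The following is a minimal sketch, not part of the BLAST output itself. It assumes the second Lambda/K/H row holds the gapped parameters, and that the usual Karlin-Altschul relations apply: bit score = (lambda * raw - ln K) / ln 2, and Expect = search space * 2^(-bits). With compositional matrix adjustment the per-hit parameters may differ slightly, so the score and E-value are only an approximate check. All function names are illustrative.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 465            # "length of query"
DB_LETTERS = 191_569_459   # "length of database"
DB_SEQS = 539_616          # "Number of sequences in database"
HSP_LEN = 121              # "effective HSP length"
# Assumption: the second Lambda/K/H row is the gapped parameter set.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)

# Raw score 107 is the PEPA_CHICK hit, reported as 45.8 bits, Expect = 7e-04.
bits = bit_score(107, LAMBDA_GAPPED, K_GAPPED)
e_value = expect(bits, space)

print(f"effective query length : {eff_q}")
print(f"effective db length    : {eff_db}")
print(f"effective search space : {space}")
print(f"bit score for raw 107  : {bits:.1f}")
print(f"Expect for raw 107     : {e_value:.0e}")
```

The three integer results reproduce the footer values exactly (344, 126,275,923 and 43,438,917,512). The recomputed bit score and Expect for the PEPA_CHICK hit land close to the reported 45.8 bits and 7e-04.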