BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014294
(427 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana
GN=At1g65240 PE=1 SV=2
Length = 475
Score = 457 bits (1176), Expect = e-128, Method: Compositional matrix adjust.
Identities = 216/394 (54%), Positives = 290/394 (73%), Gaps = 10/394 (2%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
NFVF+ ++KF +++ L K HDTRRH RM+ASIDL LGG+ + GLYFTK+
Sbjct: 23 ANFVFKAQHKFAG---KKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIK 79
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P EY+VQVDTGSD+LW+NC C +CPTK++L +L+LFD + SSTS ++ C D+FC
Sbjct: 80 LGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFC 139
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
++ ++ SC P + C Y + Y D S++ G F+RD++ L Q +G+LKT PL V+FGC
Sbjct: 140 --SFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGC 197
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G+ QSG LG+ D+AVDG++GFGQ+N+S+LSQLAA G+ ++ F+HCLD VKGGGIFA+G
Sbjct: 198 GSDQSGQLGNG-DSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGV 256
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
V SPKVKTTPMVPN HYNV+L ++V G LDLP S++ G GTI+DSGTTLAY P
Sbjct: 257 VDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNG---GTIVDSGTTLAYFPK 313
Query: 329 MLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFPTVTFKFKGSLSLTVYPHEYL 388
+LYD ++ IL RQP +K+H VEE F CF FS NVD+AFP V+F+F+ S+ LTVYPH+YL
Sbjct: 314 VLYDSLIETILARQP-VKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYL 372
Query: 389 FQIREDVWCIGWQNGGLQNHDGRQMILLGGTVYS 422
F + E+++C GWQ GGL + ++ILLG V S
Sbjct: 373 FTLEEELYCFGWQAGGLTTDERSEVILLGDLVLS 406
>sp|Q9LX20|ASPL1_ARATH Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana
GN=At5g10080 PE=1 SV=1
Length = 528
Score = 120 bits (301), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 153/322 (47%), Gaps = 29/322 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTK--SDLGIK-LTLFDPSKSS 136
L++T + +GTP+ + V +DTGS+LLW+ NC C+ + S L K L ++PS SS
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSS 158
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNL 195
TS CS C + + SP +C Y V Y G +S+SG V DI+ L + N
Sbjct: 159 TSKVFLCSHKLCDSASDCE----SPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNR 214
Query: 196 ---KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
++ + + V+ GCG +QSGD A DG++G G A S+ S L+ AG +R F+
Sbjct: 215 LMNGSSSVKARVVIGCGKKQSGDYLDG--VAPDGLMGLGPAEISVPSFLSKAGLMRNSFS 272
Query: 253 HCLDVVKGGGIFAIGDVVSPKVKTTPMVP----NMPHYNVILEEVEVGGNPLDLPTSLLG 308
C D G I+ GD+ ++TP + Y V +E +G + L TS
Sbjct: 273 LCFDEEDSGRIY-FGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLK-QTSFT- 329
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFP 368
T IDSG + YLP +Y V +I DR E + + + + P
Sbjct: 330 ------TFIDSGQSFTYLPEEIYRKVALEI-DRHINATSKNFEGVSWEYCYESSAEPKVP 382
Query: 369 TVTFKFKGSLSLTVYPHEYLFQ 390
+ KF + + ++ ++FQ
Sbjct: 383 AIKLKFSHNNTFVIHKPLFVFQ 404
>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2
PE=1 SV=1
Length = 438
Score = 120 bits (301), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 164/383 (42%), Gaps = 66/383 (17%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
++ K G R R+++A+ Q + + A G+G Y V +GTP
Sbjct: 61 IKRAIKRGERRMRSINAMLQSSSGIETPVYA-------GDGE------YLMNVAIGTPDS 107
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR----T 150
+ +DTGSDL+W C C++C ++ +F+P SS+ + C +C+
Sbjct: 108 SFSAIMDTGSDLIWTQCEPCTQCFSQ-----PTPIFNPQDSSSFSTLPCESQYCQDLPSE 162
Query: 151 TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
T NN C+Y YGDGS+T GY + +S P ++ FGCG
Sbjct: 163 TCNNN--------ECQYTYGYGDGSTTQGYMATETFTFETSS-----VP---NIAFGCGE 206
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV--KGGGIFAIGD 268
G G A G++G G SL SQL +F++C+ A+G
Sbjct: 207 DNQG-FGQGNGA---GLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPSTLALGS 257
Query: 269 VVSPKVKTTPMVP------NMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSG 320
S + +P N +Y + L+ + VGG+ L +P+S D+ G IIDSG
Sbjct: 258 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSG 317
Query: 321 TTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFS----CFQF-SKNVDDAFPTVTFKFK 375
TTL YLP Y+ V D+ + + TV+E S CFQ S P ++ +F
Sbjct: 318 TTLTYLPQDAYNAVAQAFTDQ---INLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFD 374
Query: 376 GSLSLTVYPHEYLFQIREDVWCI 398
G + L + L E V C+
Sbjct: 375 GGV-LNLGEQNILISPAEGVICL 396
>sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1
PE=2 SV=1
Length = 410
Score = 106 bits (265), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 162/375 (43%), Gaps = 45/375 (12%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+++ LEL GN +P G +F + +G P Y++ +DTGS L W+ C A C+ C
Sbjct: 22 SAVVLELHGNVYP--IGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNI---- 75
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNN--RYPSCSPGVRCEYVVTYGDGSSTSGYF 181
+ L+ P+ + C+D+ C Y + + C +C+YV+ Y D SS+ G
Sbjct: 76 -VPHVLYKPTPKKL---VTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVL 130
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
V D L+ ++G T P +++ FGCG Q G + VD ILG + +LLSQL
Sbjct: 131 VIDRFSLSASNG---TNP--TTIAFGCGYDQ-GKKNRNVPIPVDSILGLSRGKVTLLSQL 184
Query: 242 AAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVGGN 298
+ G + K HC+ KGGG GD P V TPM +Y+ + N
Sbjct: 185 KSQGVITKHVLGHCIS-SKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSN 243
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQI---LDRQPGLKMHTVEEQFS 355
+ + + I DSG T Y Y LS + L+ + E+ +
Sbjct: 244 SKAISAAPM------AVIFDSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRA 297
Query: 356 ---CFQFSKN------VDDAFPTVTFKFK---GSLSLTVYPHEYLFQIREDVWCIGWQNG 403
C++ V F +++ +F +L + P YL +E C+G +G
Sbjct: 298 LTVCWKGKDKIVTIDEVKKCFRSLSLEFADGDKKATLEIPPEHYLIISQEGHVCLGILDG 357
Query: 404 GLQNHDGRQMILLGG 418
++ L+GG
Sbjct: 358 SKEHLSLAGTNLIGG 372
>sp|Q3EBM5|ASPR1_ARATH Probable aspartic protease At2g35615 OS=Arabidopsis thaliana
GN=At2g35615 PE=3 SV=1
Length = 447
Score = 106 bits (264), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 159/364 (43%), Gaps = 41/364 (11%)
Query: 57 TRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
+RR ++ DL+ G G A G +F + +GTP + + DTGSDL WV C C +
Sbjct: 62 SRRFNHQLSQTDLQSGLIG---ADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQ 118
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
C ++ +FD KSST C C+ + C+Y +YGD S
Sbjct: 119 CYKENG-----PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSF 173
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+ G + + ++ ASG+ + P +FGCG G D GI+G G + S
Sbjct: 174 SKGDVATETVSIDSASGSPVSFP---GTVFGCGYNNGGTF----DETGSGIIGLGGGHLS 226
Query: 237 LLSQLAAAGNVRKEFAHCLD----VVKGGGIFAIGDVVSPK-------VKTTPMVPNMP- 284
L+SQL ++ + K+F++CL G + +G P V +TP+V P
Sbjct: 227 LISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL 284
Query: 285 -HYNVILEEVEVGGNPLDLPTSLLGTGDE-------RGTIIDSGTTLAYLPPMLYDLVLS 336
+Y + LE + VG + S D+ IIDSGTTL L +D S
Sbjct: 285 TYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSS 344
Query: 337 QILDRQPGLKMHTVEEQF--SCFQFSKNVDDAFPTVTFKFKGSLSLTVYPHEYLFQIRED 394
+ + G K + + CF+ S + + P +T F G+ + + P ++ ED
Sbjct: 345 AVEESVTGAKRVSDPQGLLSHCFK-SGSAEIGLPEITVHFTGA-DVRLSPINAFVKLSED 402
Query: 395 VWCI 398
+ C+
Sbjct: 403 MVCL 406
>sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2
SV=2
Length = 410
Score = 102 bits (253), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 161/388 (41%), Gaps = 71/388 (18%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVN----CAGCSRCPTK 120
+++ LEL GN +P G +F + +G P Y++ +DTGS L W+ C C++ P
Sbjct: 22 SAVVLELHGNVYP--IGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPH- 78
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN-RYP-SCSPGVRCEYVVTYGDGSSTS 178
L+ P + C++ C Y + R P C P +C Y + Y GSS
Sbjct: 79 -------GLYKPELKYA---VKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVGGSSI- 127
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G + D L ++G T P +S+ FGCG Q G + V+GILG G+ +LL
Sbjct: 128 GVLIVDSFSLPASNG---TNP--TSIAFGCGYNQ-GKNNHNVPTPVNGILGLGRGKVTLL 181
Query: 239 SQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEV 295
SQL + G + K HC+ KG G GD P V +PM HY+ ++
Sbjct: 182 SQLKSQGVITKHVLGHCIS-SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQF 240
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS--------------QILDR 341
N + + + I DSG T Y Y LS ++ ++
Sbjct: 241 NSNSKPISAAPM------EVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEK 294
Query: 342 QPGL--------KMHTVEEQFSCFQFSKNVDDAFPTVTFKFK---GSLSLTVYPHEYLFQ 390
L K+ T++E CF+ +++ KF +L + P YL
Sbjct: 295 DRALTVCWKGKDKIRTIDEVKKCFR----------SLSLKFADGDKKATLEIPPEHYLII 344
Query: 391 IREDVWCIGWQNGGLQNHDGRQMILLGG 418
+E C+G +G ++ L+GG
Sbjct: 345 SQEGHVCLGILDGSKEHPSLAGTNLIGG 372
>sp|Q9LS40|ASPG1_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana
GN=ASPG1 PE=1 SV=1
Length = 500
Score = 100 bits (250), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 164/391 (41%), Gaps = 55/391 (14%)
Query: 25 GGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYF 84
G++ F VE G L + DTR + + + +G +G YF
Sbjct: 114 AGIVAKIRFAVE------GVDRSDLKPVYNEDTRYQTEDLTTPVV----SGASQGSGEYF 163
Query: 85 TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACS 144
+++G+GTP E Y+ +DTGSD+ W+ C C+ C +SD +F+P+ SST + CS
Sbjct: 164 SRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSD-----PVFNPTSSSTYKSLTCS 218
Query: 145 DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV 204
C + +C +C Y V+YGDGS T G D + SG + ++V
Sbjct: 219 APQCSLLETS---ACRSN-KCLYQVSYGDGSFTVGELATDTVTFGN-SGKI------NNV 267
Query: 205 IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIF 264
GCG+ G + G S+ +Q+ A F++CL V + G
Sbjct: 268 ALGCGHDNEGLFTGAAGLLGLGGGVL-----SITNQMKAT-----SFSYCL-VDRDSGKS 316
Query: 265 AIGDVVSPKV----KTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLL-----GTGDE 312
+ D S ++ T P++ N Y V L VGG + LP ++ G+G
Sbjct: 317 SSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG-- 374
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQF--SCFQFSKNVDDAFPTV 370
G I+D GT + L Y+ + L LK + +C+ FS PTV
Sbjct: 375 -GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTV 433
Query: 371 TFKFKGSLSLTVYPHEYLFQIRED-VWCIGW 400
F F G SL + YL + + +C +
Sbjct: 434 AFHFTGGKSLDLPAKNYLIPVDDSGTFCFAF 464
>sp|Q6XBF8|CDR1_ARATH Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
Length = 437
Score = 100 bits (250), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 152/368 (41%), Gaps = 57/368 (15%)
Query: 55 HDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC 114
H T + IDL S +G Y V +GTP DTGSDLLW CA C
Sbjct: 69 HFTEKDNTPQPQIDLT-------SNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC 121
Query: 115 SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGD 173
C T+ D LFDP SST +++CS + C N SCS C Y ++YGD
Sbjct: 122 DDCYTQVD-----PLFDPKTSSTYKDVSCSSSQCTALENQA--SCSTNDNTCSYSLSYGD 174
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
S T G D + L G+ T P+ ++I GCG+ +G V G G
Sbjct: 175 NSYTKGNIAVDTLTL----GSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIV----GLGG 226
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI------FAIGDVVS-PKVKTTPMVPNMPH 285
SL+ QL ++ +F++CL + F +VS V +TP++
Sbjct: 227 GPVSLIKQL--GDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQ 284
Query: 286 ---YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY----DLVLSQI 338
Y + L+ + VG + + E IIDSGTTL LP Y D V S I
Sbjct: 285 ETFYYLTLKSISVGSKQIQY-SGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSI 343
Query: 339 -----LDRQPGLKMHTVEEQFSCFQFSKNVDDAFPTVTFKFKGSLSLTVYPHEYLFQIRE 393
D Q GL + C +S D P +T F G+ + + Q+ E
Sbjct: 344 DAEKKQDPQSGLSL--------C--YSATGDLKVPVITMHFDGA-DVKLDSSNAFVQVSE 392
Query: 394 DVWCIGWQ 401
D+ C ++
Sbjct: 393 DLVCFAFR 400
>sp|Q9LHE3|ASPG2_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana
GN=ASPG2 PE=2 SV=1
Length = 470
Score = 93.6 bits (231), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/344 (27%), Positives = 145/344 (42%), Gaps = 48/344 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+G+P + Y+ +D+GSD++WV C C C +SD +FDP+
Sbjct: 122 SGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSD-----PVFDPA 176
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS + ++C + C N+ C G C Y V YGDGS T G + +
Sbjct: 177 KSGSYTGVSCGSSVCDRIENS---GCHSG-GCRYEVMYGDGSYTKGTLALETLTF----- 227
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
KT N V GCG+R G + G + S + QL +G F +
Sbjct: 228 -AKTVVRN--VAMGCGHRNRGMFIGAAGLLGI-----GGGSMSFVGQL--SGQTGGAFGY 277
Query: 254 CL---------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPL-DLP 303
CL +V G +G P V+ P P+ + + V PL D
Sbjct: 278 CLVSRGTDSTGSLVFGREALPVGASWVPLVR-NPRAPSFYYVGLKGLGVGGVRIPLPDGV 336
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLY----DLVLSQI--LDRQPGLKMHTVEEQFSCF 357
L TGD G ++D+GT + LP Y D SQ L R G+ + +C+
Sbjct: 337 FDLTETGDG-GVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFD-----TCY 390
Query: 358 QFSKNVDDAFPTVTFKFKGSLSLTVYPHEYLFQIRED-VWCIGW 400
S V PTV+F F LT+ +L + + +C +
Sbjct: 391 DLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAF 434
>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1
PE=1 SV=1
Length = 437
Score = 90.9 bits (224), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 112/427 (26%), Positives = 174/427 (40%), Gaps = 69/427 (16%)
Query: 6 LLALVVVTVAVV-------------HQWAVGGGGVMGNFVFEVEN--KFKAGGERERTLS 50
LLAL +V + V H+ V G +M V +N KF+ ER +
Sbjct: 9 LLALSIVYIFVAPTHSTSRTALNHRHEAKVTGFQIMLEHVDSGKNLTKFQL---LERAI- 64
Query: 51 ALKQHDTRRHGRMMASIDLELGGNGHPSA-TGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
+ +RR R+ A ++ G A G Y + +GTP + +DTGSDL+W
Sbjct: 65 ---ERGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWT 121
Query: 110 NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVV 169
C C++C +S +F+P SS+ + CS C+ + P+CS C+Y
Sbjct: 122 QCQPCTQCFNQST-----PIFNPQGSSSFSTLPCSSQLCQALSS---PTCSNNF-CQYTY 172
Query: 170 TYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILG 229
YGDGS T G + + S P ++ FGCG G G A G++G
Sbjct: 173 GYGDGSETQGSMGTETLTFGSVS-----IP---NITFGCGENNQG-FGQGNGA---GLVG 220
Query: 230 FGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG-----IFAIGDVVSPKVKTTPMVPN-- 282
G+ SL SQL +V K F++C+ + + ++ + V+ T ++ +
Sbjct: 221 MGRGPLSLPSQL----DVTK-FSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQ 275
Query: 283 MP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGT---IIDSGTTLAYLPPMLYDLVLSQI 338
+P Y + L + VG L + S GT IIDSGTTL Y Y V +
Sbjct: 276 IPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEF 335
Query: 339 LDRQPGLKMHTVEEQFS----CFQFSKNVDD-AFPTVTFKFKGSLSLTVYPHEYLFQIRE 393
+ + + + V S CFQ + + PT F G L + Y
Sbjct: 336 ISQ---INLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSN 391
Query: 394 DVWCIGW 400
+ C+
Sbjct: 392 GLICLAM 398
>sp|Q9LZL3|PCS1L_ARATH Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
Length = 453
Score = 85.5 bits (210), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 147/349 (42%), Gaps = 57/349 (16%)
Query: 92 PTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT- 150
P + +DTGS+L W+ C +S + FDP++SS+ I CS CRT
Sbjct: 82 PPQNISMVIDTGSELSWLRC-------NRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTR 134
Query: 151 TYNNRYP-SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
T + P SC C ++Y D SS+ G +I ++ +S++IFGC
Sbjct: 135 TRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN-------DSNLIFGCM 187
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-GGIFAIGD 268
SG D G+LG + + S +SQ+ +F++C+ G +GD
Sbjct: 188 GSVSGS-DPEEDTKTTGLLGMNRGSLSFISQMGFP-----KFSYCISGTDDFPGFLLLGD 241
Query: 269 V---------VSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERG--- 314
+P ++ + +P Y V L ++V G L +P S+L D G
Sbjct: 242 SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVL-VPDHTGAGQ 300
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFS-------CFQFS-----KN 362
T++DSGT +L +Y + S L+R G+ + F C++ S
Sbjct: 301 TMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSG 360
Query: 363 VDDAFPTVTFKFKGSLSLTVYPHEYLFQI------REDVWCIGWQNGGL 405
+ PTV+ F+G+ + V L+++ + V+C + N L
Sbjct: 361 ILHRLPTVSLVFEGA-EIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDL 408
>sp|O42630|CARP_ASPFU Vacuolar protease A OS=Neosartorya fumigata (strain ATCC MYA-4609 /
Af293 / CBS 101355 / FGSC A1100) GN=pep2 PE=2 SV=1
Length = 398
Score = 58.9 bits (141), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/323 (24%), Positives = 130/323 (40%), Gaps = 69/323 (21%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YF+++ LGTP ++ V +DTGS LWV + CS IA
Sbjct: 85 YFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS------------------------SIA 120
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C F Y++ S E+ + YG G SG+ +D +Q+ G+LK +
Sbjct: 121 C---FLHNKYDSSASSTYKANGTEFAIKYGSG-ELSGFVSQDTLQI----GDLKVVKQD- 171
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-------FAHCL 255
F + G + DGILG G S+ + N+ + FA L
Sbjct: 172 ---FAEATNEPG--LAFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYL 226
Query: 256 -DVVKGGGIFAIGDVVSPKVKTTPMVPNMP-----HYNVILEEVEVGGNPLDLPTSLLGT 309
D K G K T + +P ++ V + + +G N +L
Sbjct: 227 GDTNKEGDNSEASFGGVDKNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAEL------- 279
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFPT 369
+ G I+D+GT+L LP L DL+ +I ++ +++E C + D+ P
Sbjct: 280 -ENTGIILDTGTSLIALPSTLADLLNKEIGAKKGFTGQYSIE----C-----DKRDSLPD 329
Query: 370 VTFKFKGSLSLTVYPHEYLFQIR 392
+TF G + T+ P++Y +++
Sbjct: 330 LTFTLAGH-NFTIGPYDYTLEVQ 351
>sp|P07267|CARP_YEAST Saccharopepsin OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=PEP4 PE=1 SV=1
Length = 405
Score = 58.9 bits (141), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 121/326 (37%), Gaps = 76/326 (23%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+T + LGTP + V +DTGS LWV C G +A
Sbjct: 91 YYTDITLGTPPQNFKVILDTGSSNLWVPSNEC------------------------GSLA 126
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDII---QLNQASGNLKTAP 199
C F + Y++ S E+ + YG G S GY +D + L + A
Sbjct: 127 C---FLHSKYDHEASSSYKANGTEFAIQYGTG-SLEGYISQDTLSIGDLTIPKQDFAEAT 182
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV-------RKEFA 252
+ F G DGILG G S+ + N K FA
Sbjct: 183 SEPGLTFAFGK-------------FDGILGLGYDTISVDKVVPPFYNAIQQDLLDEKRFA 229
Query: 253 HCL-DVVK---GGGIFAIGDVVSPKVK---TTPMVPNMPHYNVILEEVEVGGNPLDLPTS 305
L D K GG G + K K T V ++ V E + +G +L
Sbjct: 230 FYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDEYAEL--- 286
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDD 365
+ G ID+GT+L LP L +++ ++I ++ +T++ C N D
Sbjct: 287 -----ESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLD----C-----NTRD 332
Query: 366 AFPTVTFKFKGSLSLTVYPHEYLFQI 391
P + F F G + T+ P++Y ++
Sbjct: 333 NLPDLIFNFNG-YNFTIGPYDYTLEV 357
>sp|P81214|CARP_SYNRA Syncephapepsin OS=Syncephalastrum racemosum GN=SPSR PE=1 SV=1
Length = 395
Score = 56.2 bits (134), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 63/149 (42%), Gaps = 41/149 (27%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ V +GTP + DTGS LW + C+ C +KS FDP+KSST ++
Sbjct: 89 YYATVSVGTPAQSIKLDFDTGSSDLWFSSTLCTSCGSKS--------FDPTKSSTYKKVG 140
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
S + ++YGDGSS SG D ++L G LK +
Sbjct: 141 KS----------------------WQISYGDGSSASGITATDNVEL----GGLKI----T 170
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
R+S S A+DGILG G
Sbjct: 171 GQTIELATRESSSFSS---GAIDGILGLG 196
>sp|C5FS55|CARP_ARTOC Vacuolar protease A OS=Arthroderma otae (strain ATCC MYA-4605 / CBS
113480) GN=PEP2 PE=3 SV=1
Length = 395
Score = 55.1 bits (131), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/317 (24%), Positives = 129/317 (40%), Gaps = 64/317 (20%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YF+++ +GTP + V +DTGS LWV CS IA
Sbjct: 87 YFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS------------------------SIA 122
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C F +TY++ S + + YG G S G+ +D +Q+ G++K
Sbjct: 123 C---FLHSTYDSSASSTFTRNGTSFAIRYGSG-SLEGFVSQDNVQI----GDMKI----K 170
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL------LSQLAAAGNVRKE-FAHCL 255
+ +F + G + DGILG G S+ ++ G V + F+ L
Sbjct: 171 NQLFAEATSEPG--LAFAFGRFDGILGMGYDTISVNKITPPFYKMVEQGLVDEPVFSFYL 228
Query: 256 -DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
D K G + + K T + +P EVE L T+ L D G
Sbjct: 229 GDTNKDGDQSVVTFGGADKSHYTGDITTIPLRRKAYWEVEFNAITLGKDTATL---DNTG 285
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFPTVTFKF 374
I+D+GT+L LP ++++S+ + Q +T++ C + D+ P +TF
Sbjct: 286 IILDTGTSLIALPTTYAEMIISKSWNGQ-----YTID----CAK-----RDSLPDLTFTL 331
Query: 375 KGSLSLTVYPHEYLFQI 391
G + T+ P++Y ++
Sbjct: 332 SGH-NFTIGPYDYTLEV 347
>sp|Q8RVH5|7SBG2_SOYBN Basic 7S globulin 2 OS=Glycine max PE=1 SV=1
Length = 433
Score = 55.1 bits (131), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 143/384 (37%), Gaps = 63/384 (16%)
Query: 58 RRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC 117
+ H I+L + + ++TGL++ + TP + V VD + LWVNC
Sbjct: 29 QHHTNPTKPINLLVLPVQNDASTGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCE----- 83
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-------CEYVVT 170
S T C C ++ SC R C + T
Sbjct: 84 -------------QHYSSKTYQAPFCHSTQCSRANTHQCLSCPAASRPGCHKNTCGLMST 130
Query: 171 YGDGSSTS-GYFVRDIIQLN------QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
T G +D++ ++ Q G L T P +F C S L
Sbjct: 131 NPITQQTGLGELGQDVLAIHATQGSTQQLGPLVTVP---QFLFSCA--PSFLLQKGLPRN 185
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD---VVKGGGIFAIG----------DVV 270
+ G+ G G A SL +QLA+ ++ +F CL KG IF D+
Sbjct: 186 IQGVAGLGHAPISLPNQLASHFGLQHQFTTCLSRYPTSKGALIFGDAPNNMQQFHNQDIF 245
Query: 271 SPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT--GDERGTIIDSGTTLAYLPP 328
T V YNV + + + + + P + T G GT+I + T L
Sbjct: 246 HDLAFTPLTVTPQGEYNVRVSSIRINQHSVFPPNKISSTIVGSSGGTMISTSTPHMVLQQ 305
Query: 329 MLYDL---VLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFPTVTF---KFKGSLSLTV 382
LY V +Q L++Q +K +V C F+ N +A+P+V K G + +
Sbjct: 306 SLYQAFTQVFAQQLEKQAQVK--SVAPFGLC--FNSNKINAYPSVDLVMDKPNGPV-WRI 360
Query: 383 YPHEYLFQIREDVWCIGWQNGGLQ 406
+ + Q + V C+G NGG+Q
Sbjct: 361 SGEDLMVQAQPGVTCLGVMNGGMQ 384
>sp|D4B385|CARP_ARTBC Probable vacuolar protease A OS=Arthroderma benhamiae (strain ATCC
MYA-4681 / CBS 112371) GN=PEP2 PE=3 SV=1
Length = 400
Score = 52.8 bits (125), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 75/320 (23%), Positives = 124/320 (38%), Gaps = 65/320 (20%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YF+++ +GTP + V +DTGS LWV CS IA
Sbjct: 87 YFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS------------------------SIA 122
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN---LKTAP 199
C F +TY++ S ++ + YG G S G+ RD +++ + A
Sbjct: 123 C---FLHSTYDSSASSTYSKNGTKFAIRYGSG-SLEGFVSRDSVKIGDMTIKKQLFAEAT 178
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-------FA 252
+ F G DGI+G G ++ S+ N+ + F+
Sbjct: 179 SEPGLAFAFGR-------------FDGIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFS 225
Query: 253 HCL-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
L D K G + S T + +P EV+ L T+ L +
Sbjct: 226 FYLGDTNKDGDQSVVTFGGSDTNHFTGDMTTIPLRRKAYWEVDFDAISLGKDTAAL---E 282
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFPTVT 371
G I+D+GT+L LP L +++ +QI + +T++ C + D+ P VT
Sbjct: 283 NTGIILDTGTSLIALPTTLAEMINTQIGATKSWNGQYTLD----CAK-----RDSLPDVT 333
Query: 372 FKFKGSLSLTVYPHEYLFQI 391
F G + T+ PH+Y ++
Sbjct: 334 FTLSGH-NFTIGPHDYTLEV 352
>sp|D4DEN7|CARP_TRIVH Probable vacuolar protease A OS=Trichophyton verrucosum (strain HKI
0517) GN=PEP2 PE=3 SV=1
Length = 400
Score = 52.0 bits (123), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 133/324 (41%), Gaps = 73/324 (22%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YF+++ +GTP + V +DTGS LWV CS IA
Sbjct: 87 YFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS------------------------SIA 122
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL------NQASGNLK 196
C F +TY++ S ++ + YG G S G+ +D +++ NQ
Sbjct: 123 C---FLHSTYDSSASSTYSKNGTKFAIRYGSG-SLEGFVSQDSVKIGDMTIKNQLFAEAT 178
Query: 197 TAPLNSSVIFGCGNRQSGDLGSS-TDAAVDGILG--FGQANSSLLSQLAAA---GNVRKE 250
+ P + F G R G +G + +V+GI + + L+ + + G+ KE
Sbjct: 179 SEP---GLAFAFG-RFDGIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKE 234
Query: 251 FAHCLDVVKGGG---IFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL 307
VV GG GD+ TT + ++ V + + +G + L
Sbjct: 235 GDQS--VVTFGGSDTKHFTGDM------TTIPLRRKAYWEVDFDAISLGEDTAAL----- 281
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAF 367
+ G I+D+GT+L LP L +++ +QI + +T++ C + D+
Sbjct: 282 ---ENTGIILDTGTSLIALPTTLAEMINTQIGATKSWNGQYTLD----CAK-----RDSL 329
Query: 368 PTVTFKFKGSLSLTVYPHEYLFQI 391
P VTF G + T+ PH+Y ++
Sbjct: 330 PDVTFTVSGH-NFTIGPHDYTLEV 352
>sp|P13917|7SB1_SOYBN Basic 7S globulin OS=Glycine max GN=BG PE=1 SV=2
Length = 427
Score = 49.3 bits (116), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 142/378 (37%), Gaps = 71/378 (18%)
Query: 79 ATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTS 138
+TGL++ + TP + V VD + LWVNC S T
Sbjct: 43 STGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCE------------------QQYSSKTY 84
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVR-------CEYVVTYGDGSSTS-GYFVRDIIQLN- 189
C C ++ SC R C + T T G D++ ++
Sbjct: 85 QAPFCHSTQCSRANTHQCLSCPAASRPGCHKNTCGLMSTNPITQQTGLGELGEDVLAIHA 144
Query: 190 -----QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
Q G L T P +F C S + G+ G G A SL +QLA+
Sbjct: 145 TQGSTQQLGPLVTVP---QFLFSCA--PSFLVQKGLPRNTQGVAGLGHAPISLPNQLASH 199
Query: 245 GNVRKEFAHCLD---VVKGGGIFAIGDV-----------VSPKVKTTPMVPNMP-HYNVI 289
++++F CL KG IF GD + + TP+ + YNV
Sbjct: 200 FGLQRQFTTCLSRYPTSKGAIIF--GDAPNNMRQFQNQDIFHDLAFTPLTITLQGEYNVR 257
Query: 290 LEEVEVGGN---PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDL---VLSQILDRQP 343
+ + + + PL+ +S + GT+I + T L +Y V +Q L +Q
Sbjct: 258 VNSIRINQHSVFPLNKISSTIVGSTSGGTMISTSTPHMVLQQSVYQAFTQVFAQQLPKQA 317
Query: 344 GLKMHTVEEQFSCFQFSKNVDDAFPTVTF---KFKGSLSLTVYPHEYLFQIREDVWCIGW 400
+K +V C F+ N +A+P+V K G + + + + Q + V C+G
Sbjct: 318 QVK--SVAPFGLC--FNSNKINAYPSVDLVMDKPNGPV-WRISGEDLMVQAQPGVTCLGV 372
Query: 401 QNGGLQNHDGRQMILLGG 418
NGG+Q R I LG
Sbjct: 373 MNGGMQ---PRAEITLGA 387
>sp|Q01294|CARP_NEUCR Vacuolar protease A OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=pep-4
PE=3 SV=2
Length = 396
Score = 48.9 bits (115), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 138/346 (39%), Gaps = 65/346 (18%)
Query: 57 TRRHGR-MMASIDLELGGNGHPSA-----TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVN 110
T H + M + D ++ GN HP YF+++ +GTP + V +DTGS LWV
Sbjct: 54 TESHTQAMFKATDAQVSGN-HPVPITNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVP 112
Query: 111 CAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVT 170
+ C G IAC + Y + S + +
Sbjct: 113 SSQC------------------------GSIAC---YLHNKYESSESSTYKKNGTSFKIE 145
Query: 171 YGDGSSTSGYFVRDIIQLNQASGN---LKTAPLNSSVIFGCGNRQSGDLGSSTD-AAVDG 226
YG G S SG+ +D + + + N A + F G R G LG D AV+G
Sbjct: 146 YGSG-SLSGFVSQDRMTIGDITINDQLFAEATSEPGLAFAFG-RFDGILGLGYDRIAVNG 203
Query: 227 ILG--FGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMP 284
I + L+ + + + + +VV GG D + K+ T P+
Sbjct: 204 ITPPFYKMVEQKLVDEPVFSFYLADQDGES-EVVFGG---VNKDRYTGKITTIPL-RRKA 258
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPG 344
++ V + + G + +L + G I+D+GT+L LP L +++ +QI ++
Sbjct: 259 YWEVDFDAIGYGKDFAEL--------EGHGVILDTGTSLIALPSQLAEMLNAQIGAKKSW 310
Query: 345 LKMHTVEEQFSCFQFSKNVDDAFPTVTFKFKGSLSLTVYPHEYLFQ 390
T++ C + S D VTF G + T+ P +Y+ +
Sbjct: 311 NGQFTID----CGKKSSLED-----VTFTLAG-YNFTLGPEDYILE 346
>sp|P04073|PEPC_RAT Gastricsin OS=Rattus norvegicus GN=Pgc PE=1 SV=1
Length = 392
Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/351 (23%), Positives = 144/351 (41%), Gaps = 74/351 (21%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR--CPTKSDLGIKLTLFDPSKSSTSGE 140
YF ++ +GTP + V DTGS LWV+ C C T + F+PSKSS
Sbjct: 76 YFGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQSEACTTHAR-------FNPSKSS---- 124
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
TY + S + YG GS T G+F D + + P
Sbjct: 125 ----------TYYTEGQTFS--------LQYGTGSLT-GFFGYDTLTVQSIQ-----VPN 160
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGF-------GQANSSLLSQLAAAGNVRKEFAH 253
FG + G + A DGI+G G A ++L L + F
Sbjct: 161 QE---FGLSENEPGT--NFVYAQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGV 215
Query: 254 CLDVVKG--GGIFAIG----DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL 307
L +G GG G ++ + ++ P+ + ++ + +++ +G D +
Sbjct: 216 YLGSQQGSNGGQIVFGGVDKNLYTGEITWVPVTQEL-YWQITIDDFLIG----DQASGWC 270
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAF 367
+ +G I+D+GT+L +P LS++L + E ++ + S + +
Sbjct: 271 SSQGCQG-IVDTGTSLLVMPAQ----YLSELLQ-----TIGAQEGEYGEYFVSCDSVSSL 320
Query: 368 PTVTFKFKGSLSLTVYPHEYLFQIREDVWC-IGWQNGGLQNHDGRQMILLG 417
PT++F G + + P Y+ I+ED +C +G ++ L + G+ + +LG
Sbjct: 321 PTLSFVLNG-VQFPLSPSSYI--IQEDNFCMVGLESISLTSESGQPLWILG 368
>sp|P32951|CARP1_CANPA Candidapepsin-1 OS=Candida parapsilosis GN=SAPP1 PE=1 SV=1
Length = 402
Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 133/340 (39%), Gaps = 70/340 (20%)
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGI 125
SI L L G PS Y +KV +G+ + V +DTGS WV ++C D
Sbjct: 64 SISLSLINEG-PS----YASKVSVGSNKQQQTVIIDTGSSDFWV-VDSNAQCGKGVDCKS 117
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDI 185
T F PS SS+ Y N + + + YGDGS++ G + +D
Sbjct: 118 SGT-FTPSSSSS--------------YKN--------LGAAFTIRYGDGSTSQGTWGKDT 154
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD-GILGFGQANSSLLSQLAAA 244
+ +N S I G +Q D+ T +VD GILG G ++ + +
Sbjct: 155 VTINGVS------------ITG---QQIADV---TQTSVDQGILGIGYTSNEAVYDTSGR 196
Query: 245 GNVRKEFAHCLDVVKGGGIFAIGDVV---SPKVKTTPM----VPNMPHYNVILEEVEVGG 297
+ + K G I + SP +T + V N + ++ E
Sbjct: 197 QTTPNYDNVPVTLKKQGKIRTNAYSLYLNSPSAETGTIIFGGVDNAKYSGKLVAEQVTSS 256
Query: 298 NPL-------DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTV 350
PL +L S GD G ++DSGTTL Y P +Q+ D+ +
Sbjct: 257 QPLTISLASVNLKGSSFSFGD--GALLDSGTTLTYFPSDF----AAQLADKAGARLVQVA 310
Query: 351 EEQFSCFQFSKNVDDAFPTVTFKFKGSLSLTVYPHEYLFQ 390
+Q+ F N D + TV F F +TV EY++Q
Sbjct: 311 RDQYLYF-IDCNTDTSGTTV-FNFGNGAKITVPNTEYVYQ 348
>sp|P06026|CARP_RHICH Rhizopuspepsin OS=Rhizopus chinensis PE=1 SV=2
Length = 393
Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 28/106 (26%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ +V +GTP ++ + DTGS LW+ C+ C G + T +DP +SST
Sbjct: 85 YYGQVTIGTPGKKFNLDFDTGSSDLWIASTLCTNC------GSRQTKYDPKQSSTYQADG 138
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL 188
RT + ++YGDGSS SG +D + L
Sbjct: 139 ------RT----------------WSISYGDGSSASGILAKDNVNL 162
>sp|P10977|CARPV_CANAX Vacuolar aspartic protease OS=Candida albicans GN=APR1 PE=3 SV=3
Length = 419
Score = 47.0 bits (110), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 129/323 (39%), Gaps = 70/323 (21%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YFT++ +GTP + V +DTGS LWV PS+ TS +A
Sbjct: 104 YFTEIQIGTPGQPFKVILDTGSSNLWV----------------------PSQDCTS--LA 139
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ---ASGNLKTAP 199
C F Y++ S E+ + YG G S GY +D++ + + A
Sbjct: 140 C---FLHAKYDHDASSTYKVNGSEFSIQYGSG-SMEGYISQDVLTIGDLVIPGQDFAEAT 195
Query: 200 LNSSVIFGCGNRQSGDLGSSTDA-AVDGILG--FGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ F G + G LG + D +V+ I+ + N LL + +F L
Sbjct: 196 SEPGLAFAFG-KFDGILGLAYDTISVNHIVPPIYNAINQGLLE--------KPQFGFYLG 246
Query: 257 VV----KGGGIFAIG----DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
GG+ G + K+ P + ++ V E + +G +L
Sbjct: 247 STDKDENDGGLATFGGYDASLFQGKITWLP-IRRKAYWEVSFEGIGLGDEYAEL------ 299
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFP 368
+ G ID+GT+L LP L +++ ++I K + + Q C + D+ P
Sbjct: 300 --HKTGAAIDTGTSLITLPSSLAEIINAKI----GATKSWSGQYQVDCAK-----RDSLP 348
Query: 369 TVTFKFKGSLSLTVYPHEYLFQI 391
+T F G + T+ P++Y+ ++
Sbjct: 349 DLTLTFAG-YNFTLTPYDYILEV 370
>sp|D4DFT3|OPSB_TRIVH Probable aspartic-type endopeptidase OPSB OS=Trichophyton
verrucosum (strain HKI 0517) GN=OPSB PE=3 SV=1
Length = 495
Score = 46.6 bits (109), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 81/339 (23%), Positives = 128/339 (37%), Gaps = 98/339 (28%)
Query: 33 FEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTP 92
F++E +F+A R + A +R S DL+ N LYF + LGTP
Sbjct: 37 FDIE-RFQAAKPVHRDIIA------KRASTKTISQDLDNQKN-------LYFCNLTLGTP 82
Query: 93 TDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
+DTGS LWVN A C ++ CS TY
Sbjct: 83 PQTIRAHIDTGSSDLWVNTAESRFCSSR-------------------RAPCSQG---GTY 120
Query: 153 NNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ 212
++ S V ++ ++Y DGS +G +V D+I + G +K F G+
Sbjct: 121 DSSSSSTYQLVNNDFNISYVDGSGATGDYVTDVINV----GGIKLKDFQ----FAIGHTS 172
Query: 213 SGDLGSSTDAAVDGILGFG-------------QANSSLLSQLAAAGNVRKE----FAHCL 255
S L G+LG G Q+ +L + L AG++R + + L
Sbjct: 173 SSPL---------GVLGIGYEAGEAQVTRSGDQSYPNLPAALVKAGHIRSNAYSLWLNDL 223
Query: 256 DVVKGGGIFAIGDV--VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
+G +F D K++T P++ H + G+ L +L G G
Sbjct: 224 SASRGQILFGGIDTGKFQGKLQTVPVL----HTSR--------GDYTSLVVALTGVGIRT 271
Query: 314 GT--------------IIDSGTTLAYLPPMLYDLVLSQI 338
G+ +DSG++L+YLP L V + +
Sbjct: 272 GSDGSIDTFPSQPVAVAMDSGSSLSYLPDALAAKVYNSV 310
>sp|Q05744|CATD_CHICK Cathepsin D OS=Gallus gallus GN=CTSD PE=1 SV=1
Length = 398
Score = 46.2 bits (108), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 62/149 (41%), Gaps = 36/149 (24%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ ++G+GTP ++ V DTGS LWV C L K +D SKSST E
Sbjct: 78 YYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHK---YDASKSSTYVENG 134
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
E+ + YG G S SG+ +D + L GNLK
Sbjct: 135 ----------------------TEFAIHYGTG-SLSGFLSQDTVTL----GNLKI----K 163
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
+ IFG +Q G + A DGILG
Sbjct: 164 NQIFGEAVKQPGI--TFIAAKFDGILGMA 190
>sp|C4YMJ3|CARP2_CANAW Candidapepsin-2 OS=Candida albicans (strain WO-1) GN=SAP2 PE=1 SV=1
Length = 398
Score = 46.2 bits (108), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 115/308 (37%), Gaps = 60/308 (19%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +G+ + V VDTGS LWV T SD
Sbjct: 70 YAADITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQ------------------- 110
Query: 143 CSDNFCRT--TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL------NQASGN 194
+ +FC+ TY+ S S + + + YGDGSS+ G +D + NQ +
Sbjct: 111 -TADFCKQKGTYDPSGSSASQDLNTPFKIGYGDGSSSQGTLYKDTVGFGGVSIKNQVLAD 169
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ + ++ I G G + + GS + V +L Q A N + +
Sbjct: 170 VDSTSIDQG-ILGVGYKTNEAGGSYDNVPV-----------TLKKQGVIAKNAYSLYLNS 217
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPM---VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
D G IF G V + K + + V + + L VEV G ++ D
Sbjct: 218 PDAATGQIIF--GGVDNAKYSGSLIALPVTSDRELRISLGSVEVSGKTINT--------D 267
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLS----QILDRQPGLKMHTVEEQFS---CFQFSKNVD 364
++DSGTT+ YL L D ++ ++ G + V+ S F FSKN
Sbjct: 268 NVDVLLDSGTTITYLQQDLADQIIKAFNGKLTQDSNGNSFYEVDCNLSGDVVFNFSKNAK 327
Query: 365 DAFPTVTF 372
+ P F
Sbjct: 328 ISVPASEF 335
>sp|P0DJ06|CARP2_CANAL Candidapepsin-2 OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
GN=SAP2 PE=1 SV=1
Length = 398
Score = 46.2 bits (108), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 115/308 (37%), Gaps = 60/308 (19%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +G+ + V VDTGS LWV T SD
Sbjct: 70 YAADITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQ------------------- 110
Query: 143 CSDNFCRT--TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL------NQASGN 194
+ +FC+ TY+ S S + + + YGDGSS+ G +D + NQ +
Sbjct: 111 -TADFCKQKGTYDPSGSSASQDLNTPFKIGYGDGSSSQGTLYKDTVGFGGVSIKNQVLAD 169
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ + ++ I G G + + GS + V +L Q A N + +
Sbjct: 170 VDSTSIDQG-ILGVGYKTNEAGGSYDNVPV-----------TLKKQGVIAKNAYSLYLNS 217
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPM---VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
D G IF G V + K + + V + + L VEV G ++ D
Sbjct: 218 PDAATGQIIF--GGVDNAKYSGSLIALPVTSDRELRISLGSVEVSGKTINT--------D 267
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLS----QILDRQPGLKMHTVEEQFS---CFQFSKNVD 364
++DSGTT+ YL L D ++ ++ G + V+ S F FSKN
Sbjct: 268 NVDVLVDSGTTITYLQQDLADQIIKAFNGKLTQDSNGNSFYEVDCNLSGDVVFNFSKNAK 327
Query: 365 DAFPTVTF 372
+ P F
Sbjct: 328 ISVPASEF 335
>sp|P0CS83|CARP2_CANAX Candidapepsin-2 OS=Candida albicans GN=SAP2 PE=1 SV=1
Length = 398
Score = 45.8 bits (107), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 115/308 (37%), Gaps = 60/308 (19%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +G+ + V VDTGS LWV T SD
Sbjct: 70 YAADITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQ------------------- 110
Query: 143 CSDNFCRT--TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL------NQASGN 194
+ +FC+ TY+ S S + + + YGDGSS+ G +D + NQ +
Sbjct: 111 -TADFCKQKGTYDPSGSSASQDLNTPFKIGYGDGSSSQGTLYKDTVGFGGVSIKNQVLAD 169
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ + ++ I G G + + GS + V +L Q A N + +
Sbjct: 170 VDSTSIDQG-ILGVGYKTNEAGGSYDNVPV-----------TLKKQGVIAKNAYSLYLNS 217
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPM---VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
D G IF G V + K + + V + + L VEV G ++ D
Sbjct: 218 PDAATGQIIF--GGVDNAKYSGSLIALPVTSDRELRISLGSVEVSGKTINT--------D 267
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLS----QILDRQPGLKMHTVEEQFS---CFQFSKNVD 364
++DSGTT+ YL L D ++ ++ G + V+ S F FSKN
Sbjct: 268 NVDVLLDSGTTITYLQQDLADQIIKAFNGKLTQDSNGNSFYEVDCNLSGDVVFNFSKNAK 327
Query: 365 DAFPTVTF 372
+ P F
Sbjct: 328 ISVPASEF 335
>sp|Q03700|CARP4_RHINI Rhizopuspepsin-4 OS=Rhizopus niveus PE=3 SV=1
Length = 398
Score = 45.4 bits (106), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 62/149 (41%), Gaps = 39/149 (26%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ +V +GTP + + DTGS LW C+ C G T +DPS+SST +
Sbjct: 90 YYGEVTVGTPGIKLKLDFDTGSSDLWFASTLCTNC------GSSQTKYDPSQSSTYAKDG 143
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
RT + ++YGDGSS SG +D + L G LK
Sbjct: 144 ------RT----------------WSISYGDGSSASGILGKDTVNL----GGLKI----K 173
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
+ I R++ S DG+LG G
Sbjct: 174 NQIIELAKREASSFSS---GPSDGLLGLG 199
>sp|P24268|CATD_RAT Cathepsin D OS=Rattus norvegicus GN=Ctsd PE=1 SV=1
Length = 407
Score = 45.4 bits (106), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 131/329 (39%), Gaps = 70/329 (21%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ ++G+GTP + V DTGS LWV C L D IA
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCK-------------LLD---------IA 116
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS--GNLKTAPL 200
C + YN+ S + + YG G S SGY +D + + S G +K
Sbjct: 117 C---WVHHKYNSDKSSTYVKNGTSFDIHYGSG-SLSGYLSQDTVSVPCKSDLGGIKV--- 169
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-------FAH 253
IFG +Q G + + A DGILG G S+ L N+ K+ F+
Sbjct: 170 -EKQIFGEATKQPGVVFIA--AKFDGILGMGYPFISVNKVLPVFDNLMKQKLVEKNIFSF 226
Query: 254 CLD---VVKGGGIFAIGDVVSPKVK---TTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL 307
L+ + GG +G S + V ++ V ++++EVG +L
Sbjct: 227 YLNRDPTGQPGGELMLGGTDSRYYHGELSYLNVTRKAYWQVHMDQLEVGSE-----LTLC 281
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTV---EEQFSCFQFSKNVD 364
G E I+D+GT+L P + ++ + Q + + E C + S
Sbjct: 282 KGGCE--AIVDTGTSLLVGP-------VDEVKELQKAIGAVPLIQGEYMIPCEKVS---- 328
Query: 365 DAFPTVTFKFKGSLSLTVYPHEYLFQIRE 393
+ P +TFK G + ++P +Y+ ++ +
Sbjct: 329 -SLPIITFKLGGQ-NYELHPEKYILKVSQ 355
>sp|Q9D7R7|PEPC_MOUSE Gastricsin OS=Mus musculus GN=Pgc PE=2 SV=1
Length = 392
Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 79/348 (22%), Positives = 140/348 (40%), Gaps = 68/348 (19%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ ++ +GTP + V DTGS LWV+ C +S+ T ++PSKSS
Sbjct: 76 YYGEISIGTPPQNFLVLFDTGSSNLWVSSVYC-----QSEACTTHTRYNPSKSS------ 124
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
TY + + S + YG GS T G+F D +++ P
Sbjct: 125 --------TYYTQGQTFS--------LQYGTGSLT-GFFGYDTLRVQSIQ-----VPNQE 162
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGF-------GQANSSLLSQLAAAGNVRKEFAHCL 255
FG + G + A DGI+G G A ++L L + F L
Sbjct: 163 ---FGLSENEPGT--NFVYAQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYL 217
Query: 256 DVVKG--GGIFAIG----DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
+G GG G ++ + ++ P+ + ++ + +++ +G +S
Sbjct: 218 GSQQGSNGGQIVFGGVDENLYTGELTWIPVTQEL-YWQITIDDFLIGNQASGWCSSSGCQ 276
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFPT 369
G I+D+GT+L +P + +L Q + Q G ++ + S + + PT
Sbjct: 277 G-----IVDTGTSLLVMPAQYLNELL-QTIGAQEG--------EYGQYFVSCDSVSSLPT 322
Query: 370 VTFKFKGSLSLTVYPHEYLFQIREDVWCIGWQNGGLQNHDGRQMILLG 417
+TF G + + P Y+ Q E +G ++ L G+ + +LG
Sbjct: 323 LTFVLNG-VQFPLSPSSYIIQ-EEGSCMVGLESLSLNAESGQPLWILG 368
>sp|D4AIS3|OPSB_ARTBC Probable aspartic-type endopeptidase OPSB OS=Arthroderma benhamiae
(strain ATCC MYA-4681 / CBS 112371) GN=OPSB PE=3 SV=1
Length = 495
Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/339 (23%), Positives = 127/339 (37%), Gaps = 98/339 (28%)
Query: 33 FEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTP 92
F++E +F+ R + A +R S DL+ N LYF + LGTP
Sbjct: 37 FDIE-RFQVAKPIHRDIIA------KRASTKTISQDLDNQKN-------LYFCNLTLGTP 82
Query: 93 TDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
+DTGS LWVN A C ++ CS TY
Sbjct: 83 PQTIRAHIDTGSSDLWVNTAESRFCSSR-------------------RAPCSQG---GTY 120
Query: 153 NNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ 212
++ S V ++ ++Y DGS +G +V D+I + G +K F G+
Sbjct: 121 DSSSSSTYQLVNNDFNISYVDGSGATGDYVTDVINV----GGIKLKDFQ----FAIGHTS 172
Query: 213 SGDLGSSTDAAVDGILGFG-------------QANSSLLSQLAAAGNVRKE----FAHCL 255
S L G+LG G Q+ +L + L AG++R + + L
Sbjct: 173 SSPL---------GVLGIGYEAGEAQVTRSGDQSYPNLPAALVKAGHIRSNAYSLWLNDL 223
Query: 256 DVVKGGGIFAIGDV--VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
+G +F D K++T P++ H + G+ L +L G G
Sbjct: 224 SASRGQILFGGIDTGKFQGKLQTVPVL----HTS--------RGDYTSLVVALTGVGIRT 271
Query: 314 GT--------------IIDSGTTLAYLPPMLYDLVLSQI 338
G+ +DSG++L+YLP L V + +
Sbjct: 272 GSDGSIDTFPSQPVAVAMDSGSSLSYLPDALAAKVYNSV 310
>sp|Q9GMY3|PEPC_RHIFE Gastricsin OS=Rhinolophus ferrumequinum GN=PGC PE=2 SV=1
Length = 389
Score = 44.3 bits (103), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 136/350 (38%), Gaps = 72/350 (20%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YF ++ +GTP + V DTGS LWV C T++ G T F+PS+SST
Sbjct: 73 YFGEISIGTPPQNFLVLFDTGSSNLWVPSV---YCQTQACTG--HTRFNPSQSSTYSTNG 127
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
+ + + YG GS T G+F D + + P
Sbjct: 128 QT----------------------FSLQYGSGSLT-GFFGYDTLTVQSIQ-----VPNQE 159
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILG-------FGQANSSLLSQLAAAGNVRKEFAHCL 255
FG + G + A DGI+G G A ++L L F+ L
Sbjct: 160 ---FGLSENEPGT--NFVYAQFDGIMGMAYPSLAMGGATTALQGMLQEGALTSPVFSFYL 214
Query: 256 DVVKG---GGIFAIGDV----VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
+G GG G V ++ P+ + ++ + +EE +GG +
Sbjct: 215 SNQQGSQNGGAVIFGGVDNSLYQGQIYWAPVTQEL-YWQIGIEEFLIGGQ----ASGWCS 269
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFP 368
G + I+D+GT+L +P Y L Q Q E+Q+ F + N P
Sbjct: 270 QGCQ--AIVDTGTSLLTVPQQ-YMSALLQATGAQ--------EDQYGQFFVNCNYIQNLP 318
Query: 369 TVTFKFKGSLSLTVYPHEYLFQIREDVWC-IGWQNGGLQNHDGRQMILLG 417
T TF G + + P Y+ + + +C +G + L + +G+ + +LG
Sbjct: 319 TFTFIING-VQFPLPPSSYI--LNNNGYCTVGVEPTYLPSQNGQPLWILG 365
>sp|P09177|CARP_RHIPU Mucorpepsin OS=Rhizomucor pusillus PE=1 SV=2
Length = 427
Score = 44.3 bits (103), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 63/159 (39%), Gaps = 38/159 (23%)
Query: 50 SALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLY-------FTKVGLGTPTDEYYV 98
S +++ +HG+ A I G+G GLY V +GTP ++Y+
Sbjct: 42 SVNRKYSQTKHGQQAAEKLGGIKAFAEGDGSVDTPGLYDFDLEEYAIPVSIGTPGQDFYL 101
Query: 99 QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
DTGS WV GC S+ + FDPS SST F T YN
Sbjct: 102 LFDTGSSDTWVPHKGCDN----SEGCVGKRFFDPSSSST---------FKETDYNLN--- 145
Query: 159 CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+TYG G + YF RD I + A+ +T
Sbjct: 146 ----------ITYGTGGANGIYF-RDSITVGGATVKQQT 173
>sp|P18242|CATD_MOUSE Cathepsin D OS=Mus musculus GN=Ctsd PE=1 SV=1
Length = 410
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 67/169 (39%), Gaps = 29/169 (17%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ +G+GTP + V DTGS LWV C +IA
Sbjct: 79 YYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKIL----------------------DIA 116
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN- 201
C + YN+ S + + YG G S SGY +D + + S K +
Sbjct: 117 C---WVHHKYNSDKSSTYVKNGTSFDIHYGSG-SLSGYLSQDTVSVPCKSDQSKARGIKV 172
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
IFG +Q G + + A DGILG G + S+ + L N+ ++
Sbjct: 173 EKQIFGEATKQPGIVFVA--AKFDGILGMGYPHISVNNVLPVFDNLMQQ 219
>sp|P00793|PEPA_CHICK Pepsin A OS=Gallus gallus GN=PGA PE=1 SV=1
Length = 367
Score = 43.5 bits (101), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 146/383 (38%), Gaps = 86/383 (22%)
Query: 52 LKQHD----TRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLL 107
LK+H ++ H + A+ E N ++ Y+ + +GTP ++ V DTGS L
Sbjct: 27 LKKHPYNPASKYHPVLTATESYEPMTNYMDAS---YYGTISIGTPQQDFSVIFDTGSSNL 83
Query: 108 WVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEY 167
WV C KS FDPSKSST Y S + V
Sbjct: 84 WVPSIYC-----KSSACSNHKRFDPSKSST------------------YVSTNETVY--- 117
Query: 168 VVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
+ YG GS SG D + ++ + IFG + G + DGI
Sbjct: 118 -IAYGTGS-MSGILGYDTVAVSSIDVQNQ--------IFGLSETEPGSFFYYCN--FDGI 165
Query: 228 LG--FGQANSS--------LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT 277
LG F +SS ++SQ A ++ F+ L G F + + P T
Sbjct: 166 LGLAFPSISSSGATPVFDNMMSQHLVAQDL---FSVYLSKDGETGSFVLFGGIDPNYTTK 222
Query: 278 PM--VP--NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDL 333
+ VP ++ + ++ V VG + I+D+GT+L +P Y+
Sbjct: 223 GIYWVPLSAETYWQITMDRVTVGNK-------YVACFFTCQAIVDTGTSLLVMPQGAYNR 275
Query: 334 VLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFPTVTFKFKGSLSLTVYPHEYLFQIRE 393
++ + G + SC SK P VTF G + T+ Y+ + E
Sbjct: 276 IIKDLGVSSDG--------EISCDDISK-----LPDVTFHINGH-AFTLPASAYV--LNE 319
Query: 394 DVWC-IGWQNGGLQNHDGRQMIL 415
D C +G++N G G Q IL
Sbjct: 320 DGSCMLGFENMGTPTELGEQWIL 342
>sp|D4AZK1|CTSD_ARTBC Probable aspartic-type endopeptidase CTSD OS=Arthroderma benhamiae
(strain ATCC MYA-4681 / CBS 112371) GN=CTSD PE=3 SV=1
Length = 509
Score = 43.1 bits (100), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 108/283 (38%), Gaps = 65/283 (22%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR--CPTKSDLGIKLTLFDPSKSSTSGE 140
YF++V +G+ + ++ +DTG+ WV + C+ C + G S
Sbjct: 102 YFSEVKVGSEGQKMWMLIDTGASGTWVFGSDCTSKACGRHNTFG----------KEDSKT 151
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I +D ++ VTYG G SG V D + A L T
Sbjct: 152 IKVTDE-------------------KWGVTYGTGK-VSGVIVNDTMSF--AGFELVTP-- 187
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFG----QANSSLLSQLAAAGNVRKEFAHCLD 256
FG + S D + +DGILG G A + + QL + K ++
Sbjct: 188 -----FGSASTASDDF---LNYPMDGILGIGPQDPNAKTPTVVQLLMQQKLLKSNVIGIN 239
Query: 257 VVKGG-----GIFAIGDVVSPKVK----TTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL 307
+ + G GD+ K + +VPN + + ++++ + G L+L
Sbjct: 240 LQRASEGATDGQITFGDIDKSKFSGELIYSNVVPNGYQWEIAMDDLIMDGKSLNL----- 294
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTV 350
R IID+GT+ LPP DL+ S I G +T+
Sbjct: 295 ---KGRTGIIDTGTSFLILPPADADLIHSMIPQADKGSGFYTL 334
>sp|Q9GMY4|PEPC_SORUN Gastricsin OS=Sorex unguiculatus GN=PGC PE=2 SV=1
Length = 389
Score = 43.1 bits (100), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 103/259 (39%), Gaps = 59/259 (22%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YF ++ +GTP + V DTGS LWV C +S F+PSKSS
Sbjct: 73 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC-----QSQACTGHARFNPSKSS------ 121
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
TY+ + S + YG GS T G+F D + L N+K
Sbjct: 122 --------TYSTNGQTFS--------LQYGSGSLT-GFFGYDTMTLQ----NIKVPHQE- 159
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILG-------FGQANSSLLSQLAAAGNVRKEFAHCL 255
FG + G+ + A DGI+G G A ++L L A F+ L
Sbjct: 160 ---FGLSQNEPGE--NFVYAQFDGIMGMAYPTLAMGGATTALQGMLQAGALDSPVFSFYL 214
Query: 256 ---DVVKGGGIFAIGDV----VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
K GG G V + ++ TP+ + ++ + +E+ +GG T
Sbjct: 215 SNQQSSKDGGAVVFGGVDNSLYTGQIFWTPVTQEL-YWQIGVEQFLIGGQ----ATGWCS 269
Query: 309 TGDERGTIIDSGTTLAYLP 327
G + I+D+GT+L +P
Sbjct: 270 QGCQ--AIVDTGTSLLTVP 286
>sp|Q9N2D2|CHYM_CALJA Chymosin OS=Callithrix jacchus GN=CYM PE=1 SV=1
Length = 381
Score = 42.7 bits (99), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 5/55 (9%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
YF K+ +GTP E+ V DTGS LWV C+ ++ FDPSKSST
Sbjct: 74 YFGKIYIGTPPQEFTVVFDTGSSDLWVPSVYCNSVACQNH-----HRFDPSKSST 123
>sp|Q29079|PAG2_PIG Pregnancy-associated glycoprotein 2 OS=Sus scrofa GN=PAG2 PE=2 SV=1
Length = 420
Score = 42.0 bits (97), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 60/260 (23%), Positives = 102/260 (39%), Gaps = 62/260 (23%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
+Y + +GTP ++ V DTGS LWV C KS + F+PS SS
Sbjct: 75 VYVGNISIGTPPQQFSVVFDTGSSDLWVPSIYC-----KSKACVTHRSFNPSHSS----- 124
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
T+++R S + YG G SG+ +D +++ Q + +
Sbjct: 125 ---------TFHDRGKSIK--------LEYGSG-KMSGFLGQDTVRIGQLT--------S 158
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-------FAHC 254
+ FG ++G + A DGILG + ++ N++K+ FA
Sbjct: 159 TGQAFGLSKEETGK--AFEHAIFDGILGLAYPSIAIKGTTTVIDNLKKQDQISEPVFAFY 216
Query: 255 LDVVKGGGIFAIGDVVSPK-----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
L K G + V K +K P+ ++ + L+ + G + P
Sbjct: 217 LSSDKEEGSVVMFGGVDKKYYKGDLKWVPLT-QTSYWQIALDRITCRGRVIGCP------ 269
Query: 310 GDERG--TIIDSGTTLAYLP 327
RG I+D+GT++ + P
Sbjct: 270 ---RGCQAIVDTGTSMLHGP 286
>sp|Q9N2D3|PEPC_CALJA Gastricsin OS=Callithrix jacchus GN=PGC PE=1 SV=1
Length = 388
Score = 42.0 bits (97), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 89/391 (22%), Positives = 151/391 (38%), Gaps = 78/391 (19%)
Query: 54 QHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG 113
+HD R R+ DL + YF ++ +GTP + V DTGS LWV
Sbjct: 47 KHDPARKYRVS---DLSVSYEPMDYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVY 103
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGD 173
C +S + F+PS SS TY++ + S + YG
Sbjct: 104 C-----QSQACTSHSRFNPSASS--------------TYSSNGQTFS--------LQYGS 136
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGF--- 230
GS T G+F D + + P FG + G + A DGI+G
Sbjct: 137 GSLT-GFFGYDTLTVQSIQ-----VPNQE---FGLSENEPGT--NFVYAQFDGIMGLAYP 185
Query: 231 ----GQANSSLLSQLAAAGNVRKEFAHCLDVVKG--GGIFAIGDVVSP----KVKTTPMV 280
G A +++ L F+ L +G GG G V S ++ P+
Sbjct: 186 ALSMGGATTAMQGMLQEGALTSPVFSFYLSNQQGSSGGAVIFGGVDSSLYTGQIYWAPVT 245
Query: 281 PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQILD 340
+ ++ + +EE +GG + G + I+D+GT+L +P +S L+
Sbjct: 246 QEL-YWQIGIEEFLIGGQ----ASGWCSEGCQ--AIVDTGTSLLTVP----QQYMSAFLE 294
Query: 341 RQPGLKMHTVEEQFSCFQFSKNVDDAFPTVTFKFKGSLSLTVYPHEYLFQIREDVWC-IG 399
E+++ F + + PT+TF G + + P Y+ + + +C +G
Sbjct: 295 -----ATGAQEDEYGQFLVNCDSIQNLPTLTFIING-VEFPLPPSSYI--LSNNGYCTVG 346
Query: 400 WQNGGLQNHDGRQMILLGG----TVYSCFML 426
+ L + + + + +LG + YS F L
Sbjct: 347 VEPTYLSSQNSQPLWILGDVFLRSYYSVFDL 377
>sp|P81498|PEPC_SUNMU Gastricsin OS=Suncus murinus GN=PGC PE=1 SV=2
Length = 389
Score = 42.0 bits (97), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 82/350 (23%), Positives = 138/350 (39%), Gaps = 72/350 (20%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YF ++ +GTP + V DTGS LWV C +S F+P++SS
Sbjct: 73 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC-----QSQACTGHARFNPNQSS------ 121
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
TY+ + S + YG GS T G+F D + + N+K
Sbjct: 122 --------TYSTNGQTFS--------LQYGSGSLT-GFFGYDTMTVQ----NIKVPHQE- 159
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILG-------FGQANSSLLSQLAAAGNVRKEFAHCL 255
FG + G + A DGI+G G A ++L L F+ L
Sbjct: 160 ---FGLSQNEPGT--NFIYAQFDGIMGMAYPSLAMGGATTALQGMLQEGALTSPVFSFYL 214
Query: 256 DVVKG---GGIFAIGDV----VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
+G GG G V + ++ P+ + ++ + +EE +GG T
Sbjct: 215 SNQQGSQNGGAVIFGGVDNSLYTGQIFWAPVTQEL-YWQIGVEEFLIGGQ----ATGWCQ 269
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTVEEQFSCFQFSKNVDDAFP 368
G + I+D+GT+L +P L Q Q ++Q+ + N + P
Sbjct: 270 QGCQ--AIVDTGTSLLTVPQQFMS-ALQQATGAQ--------QDQYGQLAVNCNSIQSLP 318
Query: 369 TVTFKFKGSLSLTVYPHEYLFQIREDVWC-IGWQNGGLQNHDGRQMILLG 417
T+TF G + + P Y+ + + +C +G + L + +G+ + +LG
Sbjct: 319 TLTFIING-VQFPLPPSAYV--LNTNGYCFLGVEPTYLPSQNGQPLWILG 365
>sp|P10602|CARP1_RHINI Rhizopuspepsin-1 OS=Rhizopus niveus GN=RNAP PE=1 SV=1
Length = 389
Score = 42.0 bits (97), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 59/149 (39%), Gaps = 39/149 (26%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ +V +GTP + + DTGS +W CS C T +DP KSST
Sbjct: 82 YYGEVTVGTPGIKLKLDFDTGSSDMWFASTLCSSCSNSH------TKYDPKKSSTYAADG 135
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
RT + ++YGDGSS SG D + L +T L
Sbjct: 136 ------RT----------------WSISYGDGSSASGILATDNVNLGGLLIKKQTIEL-- 171
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
R+S +TD +DG+LG G
Sbjct: 172 ------AKRESSAF--ATD-VIDGLLGLG 191
>sp|P00791|PEPA_PIG Pepsin A OS=Sus scrofa GN=PGA PE=1 SV=3
Length = 385
Score = 42.0 bits (97), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 56/149 (37%), Gaps = 38/149 (25%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YF +G+GTP ++ V DTGS LWV CS +A
Sbjct: 73 YFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCSS------------------------LA 108
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CSD +N S E +TYG GS T G D +Q+ S ++
Sbjct: 109 CSD---HNQFNPDDSSTFEATSQELSITYGTGSMT-GILGYDTVQVGGIS--------DT 156
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
+ IFG + G A DGILG
Sbjct: 157 NQIFGLSETEPGSFLYY--APFDGILGLA 183
>sp|P00794|CHYM_BOVIN Chymosin OS=Bos taurus GN=CYM PE=1 SV=3
Length = 381
Score = 41.6 bits (96), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
YF K+ LGTP E+ V DTGS WV C KS+ FDP KSST
Sbjct: 74 YFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC-----KSNACKNHQRFDPRKSST 123
>sp|P18276|CHYM_SHEEP Chymosin OS=Ovis aries GN=CYM PE=2 SV=1
Length = 381
Score = 41.6 bits (96), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
YF K+ LGTP E+ V DTGS WV C KS+ FDP KSST
Sbjct: 74 YFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC-----KSNACKNHQRFDPRKSST 123
>sp|D4D8U6|CTSD_TRIVH Probable aspartic-type endopeptidase CTSD OS=Trichophyton
verrucosum (strain HKI 0517) GN=CTSD PE=3 SV=1
Length = 509
Score = 41.2 bits (95), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 108/283 (38%), Gaps = 65/283 (22%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR--CPTKSDLGIKLTLFDPSKSSTSGE 140
YF++V +G+ + ++ +DTG+ WV + C+ C + G S
Sbjct: 102 YFSEVKVGSEGQKMWMLIDTGASGTWVFGSDCTSKACGRHNTFG----------KEDSKT 151
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I +D ++ VTYG G SG V D + A L T
Sbjct: 152 IKVTDE-------------------KWGVTYGTGK-VSGVIVNDTMSF--AGFELVTP-- 187
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFG----QANSSLLSQLAAAGNVRKEFAHCLD 256
FG + S D + +DGILG G A + + QL + K ++
Sbjct: 188 -----FGSASTASDDF---LNYPMDGILGIGPQDPNAKTPTVVQLLMQQKLLKSNVIGIN 239
Query: 257 VVKGG-----GIFAIGDVVSPKVK----TTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL 307
+ + G GD+ K + +VP+ + + ++++ + G L+L
Sbjct: 240 LQRASEGATDGQITFGDIDKSKFSGELIYSNVVPDGYQWEIAMDDLIMDGKSLNL----- 294
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQILDRQPGLKMHTV 350
R IID+GT+ LPP DL+ S I G +T+
Sbjct: 295 ---KGRTGIIDTGTSFLILPPADADLIHSMIPQANKGSGFYTL 334
>sp|P81497|PEPA_SUNMU Pepsin A OS=Suncus murinus GN=PGA PE=1 SV=2
Length = 387
Score = 41.2 bits (95), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 30/55 (54%), Gaps = 5/55 (9%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
YF +G+GTP E+ V DTGS LWV CS P S+ F+P KSST
Sbjct: 75 YFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCSS-PACSN----HNRFNPQKSST 124
>sp|P07339|CATD_HUMAN Cathepsin D OS=Homo sapiens GN=CTSD PE=1 SV=1
Length = 412
Score = 41.2 bits (95), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 68/261 (26%), Positives = 103/261 (39%), Gaps = 51/261 (19%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ ++G+GTP + V DTGS LWV C L D IA
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCK-------------LLD---------IA 116
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C + YN+ S + + YG G S SGY +D + + S + +A
Sbjct: 117 C---WIHHKYNSDKSSTYVKNGTSFDIHYGSG-SLSGYLSQDTVSVPCQSASSASALGGV 172
Query: 203 SV---IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-------FA 252
V +FG +Q G + A DGILG S+ + L N+ ++ F+
Sbjct: 173 KVERQVFGEATKQPGI--TFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFS 230
Query: 253 HCLDV---VKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSL 306
L + GG +G S K + N+ ++ V L++VEV +L
Sbjct: 231 FYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASG-----LTL 285
Query: 307 LGTGDERGTIIDSGTTLAYLP 327
G E I+D+GT+L P
Sbjct: 286 CKEGCE--AIVDTGTSLMVGP 304
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.138 0.421
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 169,437,955
Number of Sequences: 539616
Number of extensions: 7572505
Number of successful extensions: 16350
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 111
Number of HSP's successfully gapped in prelim test: 62
Number of HSP's that attempted gapping in prelim test: 16093
Number of HSP's gapped (non-prelim): 211
length of query: 427
length of database: 191,569,459
effective HSP length: 120
effective length of query: 307
effective length of database: 126,815,539
effective search space: 38932370473
effective search space used: 38932370473
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)