BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017265
(374 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6XBF8|CDR1_ARATH Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
Length = 437
Score = 219 bits (558), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 120/217 (55%), Positives = 152/217 (70%), Gaps = 8/217 (3%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+ +LIHRDSPKSPFYN ET QRLR+A+ RS+NR+ HF + + + Q D+ N
Sbjct: 30 GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ +SIGTPP +A+ADTGSDL+WTQC PC CY Q PLFDPK SSTYK +
Sbjct: 87 SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144
Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CSSSQC +L NQ SCS + C YS+SYGD S++ GN+A +T+TLGS+ + + L I
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
GCG NN G FN K +GIVGLGGG +SLI Q+ +I G
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG 241
Score = 136 bits (342), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 69/123 (56%), Positives = 85/123 (69%), Gaps = 1/123 (0%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
+I+IDSGTTLT LP + S L ++S I+A+ DP L LCYS +VP +T+HF
Sbjct: 315 NIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHF 374
Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
GADVKL SN FV+VSED+VC F+G + S IYGN+ Q NFLVGYD +TVSFKPTD
Sbjct: 375 DGADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTD 433
Query: 371 CTK 373
C K
Sbjct: 434 CAK 436
>sp|Q3EBM5|ASPR1_ARATH Probable aspartic protease At2g35615 OS=Arabidopsis thaliana
GN=At2g35615 PE=3 SV=1
Length = 447
Score = 199 bits (505), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 113/237 (47%), Positives = 149/237 (62%), Gaps = 9/237 (3%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
+ + FFL F V FSVELIHRDSP SP YN T RL A RS++R
Sbjct: 5 ILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 64
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
FN S + Q+ +I + + + I+IGTPP + A+ADTGSDL W QC+PC QC
Sbjct: 65 FNHQLSQTDL---QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC--QQC 119
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLA 183
Y ++ P+FD K SSTYKS PC S C +L+ ++ C N C+Y SYGD SFS G++A
Sbjct: 120 YKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVA 179
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
TETV++ S +G V+ PG FGCG NNGG F+ +GI+GLGGG +SLISQ+ ++I+
Sbjct: 180 TETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSIS 236
Score = 110 bits (275), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/130 (48%), Positives = 84/130 (64%), Gaps = 3/130 (2%)
Query: 245 LGVSTPDIVIDSGTTLTFLPQGYNSNLLS-VMSSMIEAQPVADPTGSLELCYSFNSLS-Q 302
L ++ +I+IDSGTTLT L G+ S V S+ A+ V+DP G L C+ S
Sbjct: 316 LSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIG 375
Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
+PE+T+HF GADV+LS N FVK+SED+VC + T V IYGN Q +FLVGYD+E +
Sbjct: 376 LPEITVHFTGADVRLSPINAFVKLSEDMVC-LSMVPTTEVAIYGNFAQMDFLVGYDLETR 434
Query: 363 TVSFKPTDCT 372
TVSF+ DC+
Sbjct: 435 TVSFQHMDCS 444
>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2
PE=1 SV=1
Length = 438
Score = 170 bits (431), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/406 (30%), Positives = 185/406 (45%), Gaps = 73/406 (17%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G V+L DS K+ T Y+ ++ A+ R R+ N + + SS + +
Sbjct: 41 GLRVDLEQVDSGKN------LTKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAG 92
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ ++IGTP + A+ DTGSDLIWTQCEPC +QC+ Q +P+F+P+ SS++ +LP
Sbjct: 93 DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLP 150
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-----------------G 190
C S C L ++C+ CQY+ YGDGS + G +ATET T G
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQG 210
Query: 191 STTGQAVALPGITFG---------------CGTNNGG------LFNSKTTGIVGLGGGDI 229
G L G+ +G C T+ G S +G V G
Sbjct: 211 FGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASG-VPEGSPST 269
Query: 230 SLISQ----------MRTTIAGNQRLGV----------STPDIVIDSGTTLTFLPQGYNS 269
+LI ++ G LG+ T ++IDSGTTLT+LPQ +
Sbjct: 270 TLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYN 329
Query: 270 NLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGADVKLSRSNFFVKV 326
+ + I V + + L C+ S S QVPE+++ F G + L N +
Sbjct: 330 AVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISP 389
Query: 327 SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+E ++C + + I+GNI Q V YD++ VSF PT C
Sbjct: 390 AEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1
PE=1 SV=1
Length = 437
Score = 167 bits (422), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/216 (41%), Positives = 128/216 (59%), Gaps = 15/216 (6%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
EA+ GF + L H DS K+ T +Q L A+ R RL + ++ +
Sbjct: 35 EAKVTGFQIMLEHVDSGKN------LTKFQLLERAIERGSRRLQRLE--AMLNGPSGVET 86
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + YL+ +SIGTP A+ DTGSDLIWTQC+PC +QC+ Q +P+F+P+ SS+
Sbjct: 87 SVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ +LPCSS C +L+ +CS CQY+ YGDGS + G++ TET+T GS V++P I
Sbjct: 145 FSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
TFGCG NN G G+VG+G G +SL SQ+ T
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT 235
Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 60/126 (47%), Gaps = 3/126 (2%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPE 305
T I+IDSGTTLT+ ++ S I V + +LC+ S S Q+P
Sbjct: 309 TGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPT 368
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+HF G D++L N+F+ S ++C + + I+GNI Q N LV YD VS
Sbjct: 369 FVMHFDGGDLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVS 428
Query: 366 FKPTDC 371
F C
Sbjct: 429 FASAQC 434
>sp|Q9LS40|ASPG1_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana
GN=ASPG1 PE=1 SV=1
Length = 500
Score = 153 bits (387), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 127/407 (31%), Positives = 175/407 (42%), Gaps = 92/407 (22%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
F+VE + R K P YN +T YQ + LT + S ASQ +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y RI +GTP E V DTGSD+ W QCEPC + CY Q P+F+P SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S+ QC+ L +C C Y VSYGDGSF+ G LAT+TVT G++ + + GCG
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273
Query: 209 NNGGLFNS------------------KTTGI-------------------VGLGGGDISL 231
+N GLF K T V LGGGD +
Sbjct: 274 DNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATA 333
Query: 232 ISQMRTTIAGNQRLGVS----------TPD------------IVIDSGTTLTFL-PQGYN 268
I +G+S PD +++D GT +T L Q YN
Sbjct: 334 PLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYN 393
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVK 325
S + + + + + + CY F+SLS +VP V HF G + L N+ +
Sbjct: 394 SLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIP 453
Query: 326 VSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V + C F ++S+ I GN+ Q + YD+ + + C
Sbjct: 454 VDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>sp|Q9LHE3|ASPG2_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana
GN=ASPG2 PE=2 SV=1
Length = 470
Score = 129 bits (324), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 69/196 (35%), Positives = 105/196 (53%), Gaps = 17/196 (8%)
Query: 29 FSVELIHRDS-PKSPFYNSSETPYQRLR---DALTRSLNRLNHFNQNSSISSSKASQ--A 82
+++ L+HRD P + N + R+R D ++ L R++ SS S + + +
Sbjct: 59 YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGS 118
Query: 83 DIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
DI+ + Y +RI +G+PP ++ V D+GSD++W QC+PC CY Q P+FDP
Sbjct: 119 DIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDPA 176
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S +Y + C SS C + C C+Y V YGDGS++ G LA ET+T T + VA
Sbjct: 177 KSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVA 236
Query: 199 LPGITFGCGTNNGGLF 214
+ GCG N G+F
Sbjct: 237 M-----GCGHRNRGMF 247
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 48/124 (38%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+V+D+GT +T LP S P A + CY + +VP V+ +
Sbjct: 347 VVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 406
Query: 310 F-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G + L NF + V + C F + I GNI Q V +D V F
Sbjct: 407 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 466
Query: 368 PTDC 371
P C
Sbjct: 467 PNVC 470
>sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana
GN=At1g65240 PE=1 SV=2
Length = 475
Score = 97.1 bits (240), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/392 (24%), Positives = 163/392 (41%), Gaps = 88/392 (22%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
L HF + + S+ + +P + Y +I +G+PP E DTGSD++W
Sbjct: 40 LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99
Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
C+PCP P++ + LFD SST K + C C+ ++Q SC + C Y +
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
Y D S S+G + +TL TG P + FGCG++ G +S G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219
Query: 226 GGDISLISQMRTT-------------IAGNQRLGVSTPD--------------------- 251
+ S++SQ+ T + G V D
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLM 279
Query: 252 --------------IVIDSG------TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 291
IV + G TTL + P+ +L+ +++ QPV
Sbjct: 280 GMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEE 336
Query: 292 EL-CYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----N 340
C+SF N P V+ F + VKL+ ++ + E++ C ++ G+T +
Sbjct: 337 TFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 395
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V + G+++ +N LV YD++ + + + +C+
Sbjct: 396 EVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427
>sp|P69477|NEP2_NEPDI Aspartic proteinase nepenthesin-2 (Fragments) OS=Nepenthes
distillatoria PE=1 SV=1
Length = 178
Score = 84.0 bits (206), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 93/216 (43%), Gaps = 69/216 (31%)
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
DLIWTQCEPC +QC+ QD SS++ +LPC S C L ++C +CQY+ Y
Sbjct: 20 DLIWTQCEPC--TQCFSQD--------SSSFSTLPCESQYCQDLPSETC---DCQYTYGY 66
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
GDGS + G +A E ++P I FGCG N
Sbjct: 67 GDGSSTQGYMAXE---------DGSSVPNIAFGCGDN----------------------- 94
Query: 233 SQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
+ IDSGTTLT+LPQ + + + I V + + L
Sbjct: 95 -------------------LQIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLS 135
Query: 293 LCY---SFNSLSQVPEVTIHFRGA--DVKLSRSNFF 323
C+ S S QVPE+++ G D++ +FF
Sbjct: 136 TCFQEPSDGSTVQVPEISMQDGGVLNDLQNLAVSFF 171
>sp|Q9LZL3|PCS1L_ARATH Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
Length = 453
Score = 74.3 bits (181), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 74/149 (49%), Gaps = 25/149 (16%)
Query: 100 PPTERLAVADTGSDLIWTQC----EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
PP V DTGS+L W +C P P + FDP SS+Y +PCSS C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNN--------FDPTRSSSYSPIPCSSPTCRT 133
Query: 156 -----LNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
L SC S C ++SY D S S GNLA E G++T + + FGC +
Sbjct: 134 RTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGS 189
Query: 210 NGG---LFNSKTTGIVGLGGGDISLISQM 235
G ++KTTG++G+ G +S ISQM
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQM 218
>sp|Q9LX20|ASPL1_ARATH Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana
GN=At5g10080 PE=1 SV=1
Length = 528
Score = 70.1 bits (170), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 111/460 (24%), Positives = 165/460 (35%), Gaps = 104/460 (22%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHR------DSPKSP-----FYNSSETPYQRLRD 56
F+LF C ++ E FS LIHR S K+P N Y RL
Sbjct: 6 AFLLF--CVLFLATEETLASLFSSRLIHRFSDEGRASIKTPSSSDSLPNKQSLEYYRLLA 63
Query: 57 ALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDL 114
R+N + S+ S+ S+ N+ +L I IGTP L DTGS+L
Sbjct: 64 ESDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNL 123
Query: 115 IWTQCE--PCPP------SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNC 166
+W C C P S +D ++P SST K CS C S + C
Sbjct: 124 LWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQC 183
Query: 167 QYSVSYGDGSFSNGNLATETVTLGS-------TTGQAVALPGITFGCGTNNGG--LFNSK 217
Y+V+Y G+ S+ L E + + G + + GCG G L
Sbjct: 184 PYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVA 243
Query: 218 TTGIVGLGGGDISLIS------QMRTTIA------------------------------- 240
G++GLG +IS+ S MR + +
Sbjct: 244 PDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDN 303
Query: 241 ---------------GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
GN L ++ IDSG + T+LP+ + + I A
Sbjct: 304 NKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKN 363
Query: 286 DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPI 344
S E CY ++ +VP + + F S +N FV V +G+ +PI
Sbjct: 364 FEGVSWEYCYESSAEPKVPAIKLKF-------SHNNTFVIHKPLFVFQQSQGLVQFCLPI 416
Query: 345 -------YGNIMQTNFLVGY----DIEQQTVSFKPTDCTK 373
G+I Q N++ GY D E + + P+ C +
Sbjct: 417 SPSGQEGIGSIGQ-NYMRGYRMVFDRENMKLGWSPSKCQE 455
>sp|P69476|NEP1_NEPDI Aspartic proteinase nepenthesin-1 (Fragments) OS=Nepenthes
distillatoria PE=1 SV=1
Length = 164
Score = 66.6 bits (161), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 62/121 (51%), Gaps = 34/121 (28%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ YL+ +SIGTP A+ DTGSDLIWTQ +P +Q + Q DP+ SS++ +L
Sbjct: 13 GDGEYLMXLSIGTPAQPFSAIMDTGSDLIWTQXQPX--TQXFXQS----DPQGSSSFSTL 66
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC YGD S + G++ TET T GS V++P ITFG
Sbjct: 67 PC----------------------GYGD-SETQGSMGTETFTFGS-----VSIPNITFGX 98
Query: 207 G 207
G
Sbjct: 99 G 99
>sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2
SV=2
Length = 410
Score = 64.3 bits (155), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 76/159 (47%), Gaps = 18/159 (11%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
++ + ++IG P DTGS L W QC+ PC C L+ P++ K
Sbjct: 36 GHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPC--INCNKVPHGLYKPELKYAVK--- 90
Query: 148 CSSSQCASL-----NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C+ +CA L C N C Y + Y GS S G L ++ +L ++ G
Sbjct: 91 CTEQRCADLYADLRKPMKCGPKNQCHYGIQYVGGS-SIGVLIVDSFSLPASNGTNPT--S 147
Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRT 237
I FGCG N G ++ T GI+GLG G ++L+SQ+++
Sbjct: 148 IAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKS 186
>sp|P81214|CARP_SYNRA Syncephapepsin OS=Syncephalastrum racemosum GN=SPSR PE=1 SV=1
Length = 395
Score = 59.7 bits (143), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 93/230 (40%), Gaps = 47/230 (20%)
Query: 19 VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK 78
+P+E Q G +L+ K+P Y ++ T + + A + Q +I S
Sbjct: 19 AAPVEKQVAGKPFQLV-----KNPHYQANAT--RAIFRAEKKYARHTAIPEQGKTIVKSA 71
Query: 79 ASQADIIP-----NNANYLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQ 130
AS +P + Y +S+GTP DTGS +W T C C
Sbjct: 72 ASGTGSVPMTDVDYDVEYYATVSVGTPAQSIKLDFDTGSSDLWFSSTLCTSC-------- 123
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
S FDP SSTYK V + +SYGDGS ++G AT+ V LG
Sbjct: 124 GSKSFDPTKSSTYKK------------------VGKSWQISYGDGSSASGITATDNVELG 165
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTT-GIVGLGGGDISLISQMRTTI 239
+ + G T T F+S GI+GLG IS ++ +T +
Sbjct: 166 -----GLKITGQTIELATRESSSFSSGAIDGILGLGFDTISTVAGTKTPV 210
>sp|P07267|CARP_YEAST Saccharopepsin OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=PEP4 PE=1 SV=1
Length = 405
Score = 57.4 bits (137), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/341 (21%), Positives = 128/341 (37%), Gaps = 93/341 (27%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
NA Y I++GTPP + DTGS +W C C++ +D + SS+YK+
Sbjct: 88 NAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSK--YDHEASSSYKA-- 143
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALPG 201
++++ YG GS G ++ +T+++G T +A + PG
Sbjct: 144 ----------------NGTEFAIQYGTGSLE-GYISQDTLSIGDLTIPKQDFAEATSEPG 186
Query: 202 ITFGCGTNNGGL----------------FN-------------------SKTT---GIVG 223
+TF G +G L +N SK T G
Sbjct: 187 LTFAFGKFDGILGLGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEAT 246
Query: 224 LGG-------GDISLISQMRTTIA---------GNQRLGVSTPDIVIDSGTTLTFLPQGY 267
GG GDI+ + R G++ + + ID+GT+L LP G
Sbjct: 247 FGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDEYAELESHGAAIDTGTSLITLPSG- 305
Query: 268 NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVS 327
++ MI A+ A + + N+ +P++ +F G + + ++ ++VS
Sbjct: 306 -------LAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLEVS 358
Query: 328 EDIVCSV----FKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+ ++ F + I G+ + YD+ V
Sbjct: 359 GSCISAITPMDFPEPVGPLAIVGDAFLRKYYSIYDLGNNAV 399
>sp|P10977|CARPV_CANAX Vacuolar aspartic protease OS=Candida albicans GN=APR1 PE=3 SV=3
Length = 419
Score = 57.4 bits (137), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/348 (22%), Positives = 126/348 (36%), Gaps = 97/348 (27%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
NA Y I IGTP + DTGS +W + C C++ +D SSTYK
Sbjct: 101 NAQYFTEIQIGTPGQPFKVILDTGSSNLWVPSQDCTSLACFLHAK--YDHDASSTYK--- 155
Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALP 200
VN ++S+ YG GS G ++ + +T+G +A + P
Sbjct: 156 ----------------VNGSEFSIQYGSGSM-EGYISQDVLTIGDLVIPGQDFAEATSEP 198
Query: 201 GITFGCGTNNG------------------------GLF--------------NSKTTGIV 222
G+ F G +G GL + G+
Sbjct: 199 GLAFAFGKFDGILGLAYDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDGGLA 258
Query: 223 GLGGGDISLISQMRTTI----------------AGNQRLGVSTPDIVIDSGTTLTFLPQG 266
GG D SL T + G++ + ID+GT+L LP
Sbjct: 259 TFGGYDASLFQGKITWLPIRRKAYWEVSFEGIGLGDEYAELHKTGAAIDTGTSLITLP-- 316
Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKV 326
S ++ +I A+ A + S + +P++T+ F G + L+ ++ ++V
Sbjct: 317 ------SSLAEIINAKIGATKSWSGQYQVDCAKRDSLPDLTLTFAGYNFTLTPYDYILEV 370
Query: 327 SEDIVCSVFKGITNSVP-----IYGNIMQTNFLVGYDIEQQTVSFKPT 369
S + SVF + P I G+ + YD+++ V PT
Sbjct: 371 SGSCI-SVFTPMDFPQPIGDLAIVGDAFLRKYYSIYDLDKNAVGLAPT 417
>sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1
PE=2 SV=1
Length = 410
Score = 56.6 bits (135), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 72/158 (45%), Gaps = 16/158 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
++ I ++IG P DTGS L W QC+ P + C + L+ P + K + C
Sbjct: 36 GHFFITMNIGDPAKSYFLDIDTGSTLTWLQCD-APCTNCNIVPHVLYKP---TPKKLVTC 91
Query: 149 SSSQCASL-----NQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ S C L K C S C Y + Y D S S G L + +L ++ G I
Sbjct: 92 ADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVDSS-SMGVLVIDRFSLSASNGTNPTT--I 148
Query: 203 TFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQMRT 237
FGCG + G + I+GL G ++L+SQ+++
Sbjct: 149 AFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKS 186
>sp|D4B385|CARP_ARTBC Probable vacuolar protease A OS=Arthroderma benhamiae (strain ATCC
MYA-4681 / CBS 112371) GN=PEP2 PE=3 SV=1
Length = 400
Score = 55.8 bits (133), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 75/341 (21%), Positives = 123/341 (36%), Gaps = 94/341 (27%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
NA Y ISIGTPP V DTGS +W + C C++ + +D SSTY
Sbjct: 84 NAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCSSIACFLHST--YDSSASSTY---- 137
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALPG 201
S ++++ YG GS G ++ ++V +G T +A + PG
Sbjct: 138 --------------SKNGTKFAIRYGSGSL-EGFVSRDSVKIGDMTIKKQLFAEATSEPG 182
Query: 202 ITFGCGTNNG------------------------GLFNSKT-------------TGIVGL 224
+ F G +G GL + +V
Sbjct: 183 LAFAFGRFDGIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKDGDQSVVTF 242
Query: 225 GGGDISLISQMRTTI----------------AGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
GG D + + TTI G + I++D+GT+L LP
Sbjct: 243 GGSDTNHFTGDMTTIPLRRKAYWEVDFDAISLGKDTAALENTGIILDTGTSLIALP---- 298
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE 328
+ ++ MI Q A + + + +P+VT G + + ++ ++VS
Sbjct: 299 ----TTLAEMINTQIGATKSWNGQYTLDCAKRDSLPDVTFTLSGHNFTIGPHDYTLEVSG 354
Query: 329 DIVCSVFKGITNSVP-----IYGNIMQTNFLVGYDIEQQTV 364
+ S F G+ P I G+ + YD+ + TV
Sbjct: 355 TCISS-FMGMDFPEPVGPLAILGDSFLRRYYSVYDLGKGTV 394
>sp|O42630|CARP_ASPFU Vacuolar protease A OS=Neosartorya fumigata (strain ATCC MYA-4609 /
Af293 / CBS 101355 / FGSC A1100) GN=pep2 PE=2 SV=1
Length = 398
Score = 55.1 bits (131), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 70/161 (43%), Gaps = 39/161 (24%)
Query: 80 SQADIIPNN---ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
S+ D++ +N A Y IS+GTPP + V DTGS +W C C++ + +D
Sbjct: 71 SRHDVLVDNFLNAQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCSSIACFLHNK--YD 128
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT--- 193
SSTYK+ ++++ YG G S G ++ +T+ +G
Sbjct: 129 SSASSTYKA------------------NGTEFAIKYGSGELS-GFVSQDTLQIGDLKVVK 169
Query: 194 ---GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
+A PG+ F G + GI+GLG IS+
Sbjct: 170 QDFAEATNEPGLAFAFG---------RFDGILGLGYDTISV 201
>sp|D4DEN7|CARP_TRIVH Probable vacuolar protease A OS=Trichophyton verrucosum (strain HKI
0517) GN=PEP2 PE=3 SV=1
Length = 400
Score = 55.1 bits (131), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 75/341 (21%), Positives = 122/341 (35%), Gaps = 94/341 (27%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
NA Y ISIGTPP V DTGS +W + C C++ + +D SSTY
Sbjct: 84 NAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCSSIACFLHST--YDSSASSTY---- 137
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALPG 201
S ++++ YG GS G ++ ++V +G T +A + PG
Sbjct: 138 --------------SKNGTKFAIRYGSGSL-EGFVSQDSVKIGDMTIKNQLFAEATSEPG 182
Query: 202 ITFGCGTNNG------------------------GLFNSKT-------------TGIVGL 224
+ F G +G GL + +V
Sbjct: 183 LAFAFGRFDGIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKEGDQSVVTF 242
Query: 225 GGGDISLISQMRTTI----------------AGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
GG D + TTI G + I++D+GT+L LP
Sbjct: 243 GGSDTKHFTGDMTTIPLRRKAYWEVDFDAISLGEDTAALENTGIILDTGTSLIALP---- 298
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE 328
+ ++ MI Q A + + + +P+VT G + + ++ ++VS
Sbjct: 299 ----TTLAEMINTQIGATKSWNGQYTLDCAKRDSLPDVTFTVSGHNFTIGPHDYTLEVSG 354
Query: 329 DIVCSVFKGITNSVP-----IYGNIMQTNFLVGYDIEQQTV 364
+ S F G+ P I G+ + YD+ + TV
Sbjct: 355 TCISS-FMGMDFPEPVGPLAILGDSFLRRYYSVYDLGKGTV 394
>sp|Q28057|PAG2_BOVIN Pregnancy-associated glycoprotein 2 OS=Bos taurus GN=PAG2 PE=2 SV=1
Length = 376
Score = 55.1 bits (131), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 74/179 (41%), Gaps = 32/179 (17%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLI-----RISIGTPPTERLA 106
+ LR+ L R N LN+F + + SK I NYL I+IGTPP E
Sbjct: 25 KTLRETL-REKNLLNNFLEEQAYRLSKNDSKITIHPLRNYLDTAYVGNITIGTPPQEFRV 83
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNC 166
V DTGS +W C C CY + F+P+ SS+++ V
Sbjct: 84 VFDTGSANLWVPCITCTSPACYTHKT--FNPQNSSSFRE------------------VGS 123
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
++ YG G G L ++TV +G+ P +FG G + GI+GL
Sbjct: 124 PITIFYGSGIIQ-GFLGSDTVRIGNLVS-----PEQSFGLSLEEYGFDSLPFDGILGLA 176
>sp|Q01294|CARP_NEUCR Vacuolar protease A OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=pep-4
PE=3 SV=2
Length = 396
Score = 54.3 bits (129), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 61/144 (42%), Gaps = 36/144 (25%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
NA Y I+IGTPP V DTGS +W C CY+ + ++ SSTYK
Sbjct: 82 NAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCGSIACYLHNK--YESSESSTYKKNG 139
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALPG 201
S + + YG GS S G ++ + +T+G T +A + PG
Sbjct: 140 TS------------------FKIEYGSGSLS-GFVSQDRMTIGDITINDQLFAEATSEPG 180
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLG 225
+ F G + GI+GLG
Sbjct: 181 LAFAFG---------RFDGILGLG 195
>sp|P40782|CYPR1_CYNCA Cyprosin (Fragment) OS=Cynara cardunculus GN=CYPRO1 PE=1 SV=2
Length = 473
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 79/194 (40%), Gaps = 51/194 (26%)
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNN----------------ANYLIRISIGTPPTE 103
R +N LNH +++ + + A + + N A Y I IGTPP +
Sbjct: 4 RKVNILNHPGEHAGSNDANARRKYGVRGNFRDSDGELIALKNYMDAQYFGEIGIGTPPQK 63
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
+ DTGS +W P S+CY + LF K ST S N KS
Sbjct: 64 FTVIFDTGSSNLWV-----PSSKCYFSVACLFHSKYRST-------DSTTYKKNGKSA-- 109
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTG------QAVALPGITFGCGTNNGGLFNSK 217
++ YG GS S G + ++V LG +A PGITF +K
Sbjct: 110 -----AIQYGTGSIS-GFFSQDSVKLGDLLVKEQDFIEATKEPGITF---------LAAK 154
Query: 218 TTGIVGLGGGDISL 231
GI+GLG +IS+
Sbjct: 155 FDGILGLGFQEISV 168
>sp|P70269|CATE_MOUSE Cathepsin E OS=Mus musculus GN=Ctse PE=1 SV=2
Length = 397
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 82/190 (43%), Gaps = 45/190 (23%)
Query: 51 YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----------YLIRISIGTP 100
+Q LR L R+ +L+ F ++ ++ ++ S++ + ++ N Y ISIGTP
Sbjct: 30 HQSLRKKL-RAQGQLSEFWRSHNLDMTRLSESCNVYSSVNEPLINYLDMEYFGTISIGTP 88
Query: 101 PTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS 160
P + DTGS +W C C + P+F P S TY
Sbjct: 89 PQNFTVIFDTGSSNLWVPSVYCTSPAC--KAHPVFHPSQSDTYTE--------------- 131
Query: 161 CSGVNCQYSVSYGDGSFSN----GNLATETVTL-GSTTGQAVALPGITFGCGTNNGGLFN 215
V +S+ YG GS + ++ E +T+ G G++V PG TF N
Sbjct: 132 ---VGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTF---------VN 179
Query: 216 SKTTGIVGLG 225
++ GI+GLG
Sbjct: 180 AEFDGILGLG 189
>sp|Q4WZS3|Y5950_ASPFU Putative aspergillopepsin A-like aspartic endopeptidase
AFUA_2G15950 OS=Neosartorya fumigata (strain ATCC
MYA-4609 / Af293 / CBS 101355 / FGSC A1100)
GN=AFUA_2G15950 PE=3 SV=2
Length = 428
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 72/150 (48%), Gaps = 30/150 (20%)
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
A A + N+A ++ ++IG +++ + DTGS W P S +FDP
Sbjct: 98 AVSAQSVQNDAAFVSPVTIGG---QKIVMNFDTGSADFWVMNTELPASAQVGH--TVFDP 152
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG--STTGQ 195
SST+K + ++ + + YGD SF+NG + T+TV +G + TGQ
Sbjct: 153 SKSSTFKKMEGAT-----------------FEIKYGDSSFANGGVGTDTVDIGGATVTGQ 195
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
A+ +P +N + ++ + G+VGLG
Sbjct: 196 AIGIP-----TSVSNSFVEDTYSNGLVGLG 220
>sp|Q05744|CATD_CHICK Cathepsin D OS=Gallus gallus GN=CTSD PE=1 SV=1
Length = 398
Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 95/400 (23%), Positives = 145/400 (36%), Gaps = 114/400 (28%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-------IIPN--NANYLIRISIGT 99
T + R LT + + N + K AD I+ N +A Y I IGT
Sbjct: 27 TKFTSTRRMLTEVGSEIPDMNAITQFLKFKLGFADLAEPTPEILKNYMDAQYYGEIGIGT 86
Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-----SPLFDPKMSSTYKSLPCSSSQCA 154
PP + V DTGS +W P C++ D +D SSTY
Sbjct: 87 PPQKFTVVFDTGSSNLWV-----PSVHCHLLDIACLLHHKYDASKSSTYVE--------- 132
Query: 155 SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALPGITFGCGT 208
++++ YG GS S G L+ +TVTLG+ G+AV PGITF
Sbjct: 133 ---------NGTEFAIHYGTGSLS-GFLSQDTVTLGNLKIKNQIFGEAVKQPGITFIAAK 182
Query: 209 NNGGL---------------------------------FNSKTT----GIVGLGG----- 226
+G L N T G + LGG
Sbjct: 183 FDGILGMAFPRISVDKVTPFFDNVMQQKLIEKNIFSFYLNRDPTAQPGGELLLGGTDPKY 242
Query: 227 --GDISLISQMRTT----------IAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSV 274
GD S ++ R +A L + ++D+GT+L P + +
Sbjct: 243 YSGDFSWVNVTRKAYWQVHMDSVDVANGLTLCKGGCEAIVDTGTSLITGP----TKEVKE 298
Query: 275 MSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVS---EDIV 331
+ + I A+P+ + S + +S +P VT+ G +L+ + KVS E I
Sbjct: 299 LQTAIGAKPLIKG----QYVISCDKISSLPVVTLMLGGKPYQLTGEQYVFKVSAQGETIC 354
Query: 332 CSVFKGITNSVP-----IYGNIMQTNFLVGYDIEQQTVSF 366
S F G+ P I G++ + +D + +V F
Sbjct: 355 LSGFSGLDVPPPGGPLWILGDVFIGPYYTVFDRDNDSVGF 394
>sp|P25796|CATE_CAVPO Cathepsin E OS=Cavia porcellus GN=CTSE PE=1 SV=1
Length = 391
Score = 53.1 bits (126), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 77/179 (43%), Gaps = 37/179 (20%)
Query: 53 RLRDALTRSLNRLN-HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTG 111
R + LT N + +Q S+I S+ + + + Y ISIG+PP + DTG
Sbjct: 37 RAQGQLTELWKSQNLNMDQCSTIQSANEPLINYL--DMEYFGTISIGSPPQNFTVIFDTG 94
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS 171
S +W C C Q P+F P +SSTY+ V +S+
Sbjct: 95 SSNLWVPSVYCTSPAC--QTHPVFHPSLSSTYRE------------------VGNSFSIQ 134
Query: 172 YGDGSFSN----GNLATETVT-LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
YG GS + ++ E +T +G G++V PG TF +++ GI+GLG
Sbjct: 135 YGTGSLTGIIGADQVSVEGLTVVGQQFGESVQEPGKTF---------VHAEFDGILGLG 184
>sp|O65390|APA1_ARATH Aspartic proteinase A1 OS=Arabidopsis thaliana GN=APA1 PE=1 SV=1
Length = 506
Score = 52.0 bits (123), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 68/150 (45%), Gaps = 35/150 (23%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+A Y I+IGTPP + V DTGS +W P S+CY + L PK YKS
Sbjct: 79 DAQYYGEIAIGTPPQKFTVVFDTGSSNLWV-----PSSKCYFSLACLLHPK----YKS-- 127
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG------QAVALPG 201
S S N K+ ++ YG G+ + G + + VT+G +A PG
Sbjct: 128 -SRSSTYEKNGKAA-------AIHYGTGAIA-GFFSNDAVTVGDLVVKDQEFIEATKEPG 178
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
ITF +K GI+GLG +IS+
Sbjct: 179 ITF---------VVAKFDGILGLGFQEISV 199
>sp|P10602|CARP1_RHINI Rhizopuspepsin-1 OS=Rhizopus niveus GN=RNAP PE=1 SV=1
Length = 389
Score = 52.0 bits (123), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 66/144 (45%), Gaps = 34/144 (23%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQADIIP-----NNANYLIRISIGTPPTERLAVAD 109
++AL ++L + N S +++AS + +P N+ Y +++GTP + D
Sbjct: 43 KNALNKALAKYNRRKVGSGGITTEASGS--VPMVDYENDVEYYGEVTVGTPGIKLKLDFD 100
Query: 110 TGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNC 166
TGS +W T C C S +DPK SSTY +
Sbjct: 101 TGSSDMWFASTLCSSCSNSHTK------YDPKKSSTY------------------AADGR 136
Query: 167 QYSVSYGDGSFSNGNLATETVTLG 190
+S+SYGDGS ++G LAT+ V LG
Sbjct: 137 TWSISYGDGSSASGILATDNVNLG 160
>sp|P16476|PEPE_CHICK Embryonic pepsinogen OS=Gallus gallus PE=2 SV=1
Length = 383
Score = 51.6 bits (122), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 80/190 (42%), Gaps = 45/190 (23%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSIS-SSKASQA--DIIPN---------NANYLIRISIGT 99
++LR+ L R L+HF Q+ +K A D++ + Y ISIGT
Sbjct: 26 KKLREIL-REKGLLHHFLQHHRYDIGTKFPHAFPDVLTVVTEPLLNTLDMEYYGTISIGT 84
Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK 159
PP + V DTGS +W C C Q +F+P SSTYKS
Sbjct: 85 PPQDFTVVFDTGSSNLWVPSVSCTSPAC--QSHQMFNPSQSSTYKS-------------- 128
Query: 160 SCSGVNCQYSVSYGDGSFSNGNLATETVTLGS--TTGQAVALPGITFGCGTNNGGLF--N 215
+G N S+ YG G G + +TVT+ S T Q FG T+ G F
Sbjct: 129 --TGQNL--SIHYGTGDM-EGTVGCDTVTVASLMDTNQ-------LFGLSTSEPGQFFVY 176
Query: 216 SKTTGIVGLG 225
K GI+GLG
Sbjct: 177 VKFDGILGLG 186
>sp|Q9N2D2|CHYM_CALJA Chymosin OS=Callithrix jacchus GN=CYM PE=1 SV=1
Length = 381
Score = 50.8 bits (120), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 79/183 (43%), Gaps = 37/183 (20%)
Query: 52 QRLRDALTRSLNRLNHFNQNS------SISSSKASQADIIPN--NANYLIRISIGTPPTE 103
+ LR AL + L F +N S+S+ ++ + N + Y +I IGTPP E
Sbjct: 28 KSLRRAL-KERGLLEDFLKNHQHAVSRKHSNSREVASEFLTNYLDCQYFGKIYIGTPPQE 86
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
V DTGS +W C C Q+ FDP SST++++ S
Sbjct: 87 FTVVFDTGSSDLWVPSVYCNSVAC--QNHHRFDPSKSSTFQNMDKS-------------- 130
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFN-SKTTGI 221
S+ YG GS G L +TVT+ S + P T G T G +F S+ GI
Sbjct: 131 ----LSIQYGTGSM-QGLLGYDTVTVSS-----IVDPHQTVGLSTQEPGDVFTYSEFDGI 180
Query: 222 VGL 224
+GL
Sbjct: 181 LGL 183
>sp|Q805F3|CATEA_XENLA Cathepsin E-A OS=Xenopus laevis GN=ctse-a PE=1 SV=1
Length = 397
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 64/144 (44%), Gaps = 36/144 (25%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y IS+GTPP + DTGS +W C C D F P++SSTY+S
Sbjct: 71 DVEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYCISQACAQHDR--FQPQLSSTYES-- 126
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALPG 201
+G N +S+ YG GS S G + + VT+ G++V+ PG
Sbjct: 127 --------------NGNN--FSLQYGTGSLS-GVIGIDAVTVEGILVQNQQFGESVSEPG 169
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLG 225
TF +++ GI+GLG
Sbjct: 170 STF---------VDAEFDGILGLG 184
>sp|Q03168|ASPP_AEDAE Lysosomal aspartic protease OS=Aedes aegypti GN=AAEL006169 PE=1
SV=2
Length = 387
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 69/152 (45%), Gaps = 38/152 (25%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ--CYMQDSPLFDPKMSSTYKS 145
+A Y I+IGTPP V DTGS +W + C + C M + ++ K SST+
Sbjct: 65 DAQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNK--YNAKKSSTF-- 120
Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS------TTGQAVAL 199
+K+ + + Q YG GS S G L+T+TV LG T +A+
Sbjct: 121 ------------EKNGTAFHIQ----YGSGSLS-GYLSTDTVGLGGVSVTKQTFAEAINE 163
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
PG+ F +K GI+GLG IS+
Sbjct: 164 PGLVF---------VAAKFDGILGLGYSSISV 186
>sp|P06026|CARP_RHICH Rhizopuspepsin OS=Rhizopus chinensis PE=1 SV=2
Length = 393
Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 91/212 (42%), Gaps = 53/212 (25%)
Query: 40 KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIP-----NNANYLI 93
K+P Y S ++A+ +++ + N N+S + +P N+ Y
Sbjct: 34 KNPNYKPSA------KNAIQKAIAKYNKHKINTSTGGIVPDAGVGTVPMTDYGNDVEYYG 87
Query: 94 RISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
+++IGTP + DTGS +W T C C Q +DPK SSTY++ +
Sbjct: 88 QVTIGTPGKKFNLDFDTGSSDLWIASTLCTNCGSRQTK------YDPKQSSTYQADGRT- 140
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS--TTGQAVALP---GITFG 205
+S+SYGDGS ++G LA + V LG GQ + L +F
Sbjct: 141 -----------------WSISYGDGSSASGILAKDNVNLGGLLIKGQTIELAKREAASFA 183
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
G N+ G++GLG I+ + ++T
Sbjct: 184 NGPND---------GLLGLGFDTITTVRGVKT 206
>sp|P55956|ASP3_CAEEL Aspartic protease 3 OS=Caenorhabditis elegans GN=asp-3 PE=1 SV=2
Length = 398
Score = 50.1 bits (118), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
NA Y ++IGTPP + DTGS +W C CP + FD K SS
Sbjct: 66 NAQYYGPVTIGTPPQNFQVLFDTGSSNLWVPCANCPFGDIACRMHNRFDCKKSS------ 119
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT----------GQAV 197
SC+ + + YG GS G + + V G T A
Sbjct: 120 ------------SCTATGASFEIQYGTGSMK-GTVDNDVVCFGHDTTYCTDKNQGLACAT 166
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL--ISQMRTTIAGNQRL 245
+ PGITF +K GI G+G IS+ ISQ I N +
Sbjct: 167 SEPGITF---------VAAKFDGIFGMGWDTISVNKISQPMDQIFANSAI 207
>sp|P28713|PEPA4_RABIT Pepsin II-4 OS=Oryctolagus cuniculus PE=2 SV=1
Length = 387
Score = 49.7 bits (117), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 56/123 (45%), Gaps = 27/123 (21%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+A Y ISIGTPP + + DTGS +W C C + F+P+ SSTY+
Sbjct: 72 DAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCSSLACALHKR--FNPEDSSTYQ--- 126
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST--TGQAVAL----PG 201
G + S++YG GS + G L +TV +GS T Q L PG
Sbjct: 127 ---------------GTSETLSITYGTGSMT-GILGYDTVKVGSIEDTNQIFGLSKTEPG 170
Query: 202 ITF 204
+TF
Sbjct: 171 LTF 173
>sp|P43231|CARP2_RHINI Rhizopuspepsin-2 OS=Rhizopus niveus PE=3 SV=2
Length = 391
Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 86/198 (43%), Gaps = 46/198 (23%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQADIIP-----NNANYLIRISIGTPPTERLAVAD 109
++A+ ++L + + F SS +S+ +P N+ Y ++++GTP D
Sbjct: 43 KNAIQKALAKYHRFRTTSSSNSTSTEGTGSVPVTDYYNDIEYYGKVTVGTPGVTLKLDFD 102
Query: 110 TGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNC 166
TGS +W T C C SQ ++P SSTY
Sbjct: 103 TGSSDLWFASTLCTNCGSSQT------KYNPNQSSTYAK------------------DGR 138
Query: 167 QYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP---GITFGCGTNNGGLFNSKTTGI 221
+S+SYGDGS ++G L T+TVTLG T Q + L +F G + G+
Sbjct: 139 TWSISYGDGSSASGILGTDTVTLGGLKITKQTIELAKREATSFQSG---------PSYGL 189
Query: 222 VGLGGGDISLISQMRTTI 239
+GLG I+ + ++T +
Sbjct: 190 LGLGFDTITTVRGVKTPV 207
>sp|C5FS55|CARP_ARTOC Vacuolar protease A OS=Arthroderma otae (strain ATCC MYA-4605 / CBS
113480) GN=PEP2 PE=3 SV=1
Length = 395
Score = 49.3 bits (116), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 64/150 (42%), Gaps = 36/150 (24%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
NA Y ISIGTPP V DTGS +W + C C++ STY S
Sbjct: 84 NAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCSSIACFLH----------STYDS-- 131
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALPG 201
S+S + N S +++ YG GS G ++ + V +G +A + PG
Sbjct: 132 -SASSTFTRNGTS-------FAIRYGSGSL-EGFVSQDNVQIGDMKIKNQLFAEATSEPG 182
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
+ F G + GI+G+G IS+
Sbjct: 183 LAFAFG---------RFDGILGMGYDTISV 203
>sp|P81498|PEPC_SUNMU Gastricsin OS=Suncus murinus GN=PGC PE=1 SV=2
Length = 389
Score = 49.3 bits (116), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 74/178 (41%), Gaps = 31/178 (17%)
Query: 52 QRLRD-ALTRSLNRLNHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
+ LR+ L + NH++ Q + + +A+Y ISIGTPP L +
Sbjct: 31 ENLREQGLLEDFLKTNHYDPAQKYHFGDFSVAYEPMAYMDASYFGEISIGTPPQNFLVLF 90
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGS +W C C F+P SSTY S N ++ +
Sbjct: 91 DTGSSNLWVPSVYCQSQAC--TGHARFNPNQSSTY-----------STNGQT-------F 130
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTGIVGL 224
S+ YG GS + G +T+T+ Q + +P FG N G ++ GI+G+
Sbjct: 131 SLQYGSGSLT-GFFGYDTMTV-----QNIKVPHQEFGLSQNEPGTNFIYAQFDGIMGM 182
>sp|P28712|PEPA1_RABIT Pepsin II-1 OS=Oryctolagus cuniculus PE=2 SV=1
Length = 387
Score = 48.9 bits (115), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 27/123 (21%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+A Y ISIGTPP E + DTGS +W C C++ F+P SST+++
Sbjct: 72 DAEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCSSLACFLHKR--FNPDDSSTFQA-- 127
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST--TGQAVAL----PG 201
+ S++YG GS + G L +TV +G+ T Q L PG
Sbjct: 128 ----------------TSETLSITYGTGSMT-GILGYDTVKVGNIEDTNQIFGLSKTEPG 170
Query: 202 ITF 204
ITF
Sbjct: 171 ITF 173
>sp|O09043|NAPSA_MOUSE Napsin-A OS=Mus musculus GN=Napsa PE=1 SV=1
Length = 419
Score = 48.5 bits (114), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 77/192 (40%), Gaps = 45/192 (23%)
Query: 48 ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD--IIP----NNANYLIRISIGTPP 101
P QR+ R LN LN + Q + +S + S + +P N Y I +GTPP
Sbjct: 25 RVPLQRIHLG-HRILNPLNGWEQLAELSRTSTSGGNPSFVPLSKFMNTQYFGTIGLGTPP 83
Query: 102 TERLAVADTGSDLIWTQCEPCP--PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK 159
V DTGS +W C C+ F+PK SS+++
Sbjct: 84 QNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHR--FNPKASSSFRP-------------- 127
Query: 160 SCSGVNCQYSVSYGDGSFSNGNLATETVTLGS------TTGQAVALPGITFGCGTNNGGL 213
++++ YG G S G L+ + +T+G T G+A+ P + F
Sbjct: 128 ----NGTKFAIQYGTGRLS-GILSQDNLTIGGIHDAFVTFGEALWEPSLIFAL------- 175
Query: 214 FNSKTTGIVGLG 225
+ GI+GLG
Sbjct: 176 --AHFDGILGLG 185
>sp|Q9XEC4|APA3_ARATH Aspartic proteinase A3 OS=Arabidopsis thaliana GN=APA3 PE=1 SV=1
Length = 508
Score = 48.5 bits (114), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 81/174 (46%), Gaps = 30/174 (17%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
NR +H++ + + AD++P +A Y I+IGTPP + + DTGS +W
Sbjct: 56 NRGSHWSPKHYFRLNDEN-ADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWI- 113
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
P ++CY+ + F K YK +SQ +S +K+ + +Y G FS
Sbjct: 114 ----PSTKCYLSVACYFHSK----YK-----ASQSSSY-RKNGKPASIRYGTGAISGYFS 159
Query: 179 NGNLATETVTLGSTTG-QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
N ++ + + +A + PGITF +K GI+GLG +IS+
Sbjct: 160 NDDVKVGDIVVKEQEFIEATSEPGITF---------LLAKFDGILGLGFKEISV 204
>sp|Q9N2D3|PEPC_CALJA Gastricsin OS=Callithrix jacchus GN=PGC PE=1 SV=1
Length = 388
Score = 48.5 bits (114), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 60/140 (42%), Gaps = 30/140 (21%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+A Y ISIGTPP L + DTGS +W C C F+P SSTY S
Sbjct: 70 DAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSR--FNPSASSTYSS-- 125
Query: 148 CSSSQCASLNQKSCSGVNCQ-YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
N Q +S+ YG GS + G +T+T+ Q++ +P FG
Sbjct: 126 -----------------NGQTFSLQYGSGSLT-GFFGYDTLTV-----QSIQVPNQEFGL 162
Query: 207 GTNNGG--LFNSKTTGIVGL 224
N G ++ GI+GL
Sbjct: 163 SENEPGTNFVYAQFDGIMGL 182
>sp|Q9GMY7|PEPA_RHIFE Pepsin A OS=Rhinolophus ferrumequinum GN=PGA PE=2 SV=1
Length = 386
Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 64/141 (45%), Gaps = 24/141 (17%)
Query: 54 LRDAL-TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
L+D L T S+N + + + ++ S A+Q + Y I IGTPP E + DTGS
Sbjct: 38 LQDYLKTHSINPASKYLKEAA--SMMATQPLENYMDMEYFGTIGIGTPPQEFTVIFDTGS 95
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
+W C C + F+P+ SSTY+ G N + SV+Y
Sbjct: 96 SNLWVPSVYCSSPACSNHNR--FNPQQSSTYQ------------------GTNQKLSVAY 135
Query: 173 GDGSFSNGNLATETVTLGSTT 193
G GS + G L +TV +G T
Sbjct: 136 GTGSMT-GILGYDTVQVGGIT 155
>sp|Q8VYL3|APA2_ARATH Aspartic proteinase A2 OS=Arabidopsis thaliana GN=APA2 PE=1 SV=1
Length = 513
Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/234 (26%), Positives = 95/234 (40%), Gaps = 49/234 (20%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
VF+ F L F S + G F V L K N T + ++ RS L
Sbjct: 12 VFVSFLLFFTAYS--KRNDGTFRVGL---KKLKLDPNNRLATRFGSKQEEALRS--SLRS 64
Query: 68 FNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
+N N + ADI+P +A Y I+IGTPP + + DTGS +W P
Sbjct: 65 YNNNLG---GDSGDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWV-----P 116
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLA 183
+C+ S F K S+ S S + A+++ YG GS S G +
Sbjct: 117 SGKCFFSLSCYFHAKYKSSRSSTYKKSGKRAAIH--------------YGSGSIS-GFFS 161
Query: 184 TETVTLGSTTG------QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
+ VT+G + + PG+TF +K G++GLG +I++
Sbjct: 162 YDAVTVGDLVVKDQEFIETTSEPGLTF---------LVAKFDGLLGLGFQEIAV 206
>sp|P16228|CATE_RAT Cathepsin E OS=Rattus norvegicus GN=Ctse PE=1 SV=3
Length = 398
Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 79/190 (41%), Gaps = 45/190 (23%)
Query: 51 YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----------YLIRISIGTP 100
+Q LR L R+ +L+ F ++ ++ + S++ + N Y +SIG+P
Sbjct: 31 HQSLRKKL-RAQGQLSDFWRSHNLDMIEFSESCNVDKGINEPLINYLDMEYFGTVSIGSP 89
Query: 101 PTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS 160
+ DTGS +W C C + P+F P SSTY
Sbjct: 90 SQNFTVIFDTGSSNLWVPSVYCTSPAC--KAHPVFHPSQSSTYME--------------- 132
Query: 161 CSGVNCQYSVSYGDGSFSN----GNLATETVTL-GSTTGQAVALPGITFGCGTNNGGLFN 215
V +S+ YG GS + ++ E +T+ G G++V PG TF N
Sbjct: 133 ---VGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTF---------VN 180
Query: 216 SKTTGIVGLG 225
++ GI+GLG
Sbjct: 181 AEFDGILGLG 190
>sp|Q805F2|CATEB_XENLA Cathepsin E-B OS=Xenopus laevis GN=ctse-b PE=2 SV=1
Length = 397
Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 64/144 (44%), Gaps = 36/144 (25%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y ISIGTPP + DTGS +W C C + F P++SSTY+S
Sbjct: 71 DVQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYCISPACAQHNR--FQPQLSSTYES-- 126
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------GQAVALPG 201
+G N +S+ YG GS S G + ++VT+ G++V+ PG
Sbjct: 127 --------------NGNN--FSLQYGTGSLS-GVIGIDSVTVEGILVQNQQFGESVSEPG 169
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLG 225
TF ++ GI+GLG
Sbjct: 170 STF---------VDASFDGILGLG 184
>sp|P43159|CATE_RABIT Cathepsin E OS=Oryctolagus cuniculus GN=CTSE PE=2 SV=1
Length = 396
Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/188 (26%), Positives = 77/188 (40%), Gaps = 47/188 (25%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----------YLIRISIGTPPTE 103
LR L R+ +L+ F + + + ++ + +AN Y ISIG+PP
Sbjct: 32 LRKKL-RAQGQLSEFWKAHKVDMVQYTETCTMEQSANEPLINYLDMEYFGTISIGSPPQN 90
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
+ DT S +W C C M P F P S+TY S
Sbjct: 91 FTVIFDTVSSNLWVPSVYCTSPACQMH--PQFRPSQSNTY------------------SE 130
Query: 164 VNCQYSVSYGDGSFSNGNLATETVT------LGSTTGQAVALPGITFGCGTNNGGLFNSK 217
V +S++YG GS + G + + V+ +G G++V PG TF N++
Sbjct: 131 VGTPFSIAYGTGSLT-GIIGADQVSVQGLTVVGQQFGESVKEPGQTF---------VNAE 180
Query: 218 TTGIVGLG 225
GI+GLG
Sbjct: 181 FDGILGLG 188
>sp|P42210|ASPR_HORVU Phytepsin OS=Hordeum vulgare PE=1 SV=1
Length = 508
Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 64/151 (42%), Gaps = 37/151 (24%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSL 146
NA Y I +GTPP + + DTGS +W C S CY+ + SSTYK
Sbjct: 81 NAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHSR--YKAGASSTYKK- 137
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG------QAVALP 200
N K ++ YG GS + G + ++VT+G +A P
Sbjct: 138 ----------NGKPA-------AIQYGTGSIA-GYFSEDSVTVGDLVVKDQEFIEATKEP 179
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
GITF +K GI+GLG +IS+
Sbjct: 180 GITF---------LVAKFDGILGLGFKEISV 201
>sp|Q42456|ASPR1_ORYSJ Aspartic proteinase oryzasin-1 OS=Oryza sativa subsp. japonica
GN=Os05g0567100 PE=2 SV=2
Length = 509
Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 67/150 (44%), Gaps = 35/150 (23%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
NA Y I +GTPP + + DTGS +W P ++CY + F S YKS
Sbjct: 82 NAQYFGEIGVGTPPQKFTVIFDTGSSNLWV-----PSAKCYFSIACFF----HSRYKSGQ 132
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG------QAVALPG 201
S+ Q N K ++ YG GS + G + ++VT+G +A PG
Sbjct: 133 SSTYQ---KNGKPA-------AIQYGTGSIA-GFFSEDSVTVGDLVVKDQEFIEATKEPG 181
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
+TF +K GI+GLG +IS+
Sbjct: 182 LTF---------MVAKFDGILGLGFQEISV 202
>sp|P22929|CARP_SACFI Acid protease OS=Saccharomycopsis fibuligera GN=PEP1 PE=3 SV=1
Length = 390
Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 75/174 (43%), Gaps = 43/174 (24%)
Query: 50 PYQRLRDALTRSLN--RLNHFNQNSSISSS--------KASQADIIPNNAN----YLIRI 95
P ++ LT + R++ +N +++SS AS A + + N YL I
Sbjct: 20 PVEKREKTLTLDFDVKRISSKAKNVTVASSPGFRRNLRAASDAGVTISLENEYSFYLTTI 79
Query: 96 SIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
IGTP + DTGS +W P Q +D S++YK
Sbjct: 80 EIGTPGQKLQVDVDTGSSDLWV------PGQGTSSLYGTYDHTKSTSYK----------- 122
Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
K SG +S+SYGDGS + G+ A ETV++G ++ G+ FG T+
Sbjct: 123 ---KDRSG----FSISYGDGSSARGDWAQETVSIG-----GASITGLEFGDATS 164
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.132 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 137,250,998
Number of Sequences: 539616
Number of extensions: 5822081
Number of successful extensions: 16084
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 95
Number of HSP's successfully gapped in prelim test: 126
Number of HSP's that attempted gapping in prelim test: 15758
Number of HSP's gapped (non-prelim): 340
length of query: 374
length of database: 191,569,459
effective HSP length: 119
effective length of query: 255
effective length of database: 127,355,155
effective search space: 32475564525
effective search space used: 32475564525
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)