BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022584
(295 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224124910|ref|XP_002319454.1| predicted protein [Populus trichocarpa]
gi|222857830|gb|EEE95377.1| predicted protein [Populus trichocarpa]
Length = 507
Score = 473 bits (1216), Expect = e-131, Method: Compositional matrix adjust.
Identities = 216/298 (72%), Positives = 260/298 (87%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LV +EVFSFWLNR+P+A+EGGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ +GD
Sbjct: 210 MIQQDLVDDEVFSFWLNRNPEAKEGGELVFGGVDPKHFKGKHTYVPVTQKGYWQINMGDF 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG STG+CEGGCAAIVDSGTSLLAGPTP++TEINHAIG EG+VSAECK VVS YGDLI
Sbjct: 270 LIGKHSTGLCEGGCAAIVDSGTSLLAGPTPIITEINHAIGAEGLVSAECKEVVSHYGDLI 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
W+L++SG+ P KVC Q+GLC FN A+ TGI++VVEKEN SAG+ C+AC+M V+
Sbjct: 330 WELIISGVQPSKVCTQLGLCIFNEAKSARTGIESVVEKENKEKSSAGNDLPCTACQMLVI 389
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
WVQNQL++K TKE ++Y+++LC+SLP+PMG+S IDC+ I TMPN++FTIGDK F+L+PE
Sbjct: 390 WVQNQLREKATKETAINYLDKLCESLPSPMGQSSIDCNSISTMPNITFTIGDKPFSLTPE 449
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILKTGEGIA+VCISGFMA D+PPPRGPLWILGDVFMG YHT+FD G L +GFAEAA
Sbjct: 450 QYILKTGEGIAQVCISGFMALDVPPPRGPLWILGDVFMGAYHTIFDYGNLEVGFAEAA 507
>gi|171854659|dbj|BAG16519.1| putative aspartic protease [Capsicum chinense]
Length = 506
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 213/297 (71%), Positives = 257/297 (86%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV E VFSFW NRD +EGGE+VFGGVDPKHFKG HTYVP+T+KGYWQF +GD
Sbjct: 210 MVDQGLVKEPVFSFWFNRDASTKEGGELVFGGVDPKHFKGNHTYVPLTQKGYWQFNMGDF 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STG C GGCAAIVDSGTSLLAGPT +VT++NHAIG EGVVSAECK +VSQYG+++
Sbjct: 270 LIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQLNHAIGAEGVVSAECKTIVSQYGEVL 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P++VC Q GLC FNGAE+VS+ I+TVVE+EN S G++ +C+ CEMAVVW
Sbjct: 330 WDLLVSGVRPDQVCSQAGLCFFNGAEHVSSNIRTVVERENEGSSVGEAPLCTVCEMAVVW 389
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQ+ TKE+VL Y+++LC+ LP+PMGES++DC+ I ++PN++FTI DK F L+PEQ
Sbjct: 390 IQNQLKQQGTKERVLEYVDQLCEKLPSPMGESVVDCNSISSLPNITFTIKDKAFVLTPEQ 449
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILKTGEGIA +CISGF AFD+PPPRGPLWILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 450 YILKTGEGIASICISGFAAFDVPPPRGPLWILGDVFMGPYHTVFDYGNSQVGFAEAA 506
>gi|294440430|gb|ADE74632.1| aspartic protease 1 [Nicotiana tabacum]
Length = 506
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 213/296 (71%), Positives = 254/296 (85%), Gaps = 2/296 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E+VFSFW+NRD A+EGGE+VFGGVD HFKG HTYVP+T+KGYWQF +GD
Sbjct: 210 MVGQGLVKEQVFSFWINRDATAKEGGELVFGGVDSNHFKGNHTYVPLTQKGYWQFNMGDF 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STGVC GGCAAIVDSGTSLLAGPT VVT+INHAIG EG+VS ECK +VSQYG++I
Sbjct: 270 LIGNASTGVCAGGCAAIVDSGTSLLAGPTTVVTQINHAIGAEGIVSMECKTIVSQYGEMI 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
W+LLVSG+ P++VC Q GLC FNGA++VS+ I+TVVE+E S G++ +C+ACEMAVVW
Sbjct: 330 WNLLVSGVKPDQVCSQAGLCYFNGAQHVSSNIRTVVERETEGSSVGEAPLCTACEMAVVW 389
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQK+TKE+VL Y+N+LC+ LP+PMGES+IDC I MPN++FTI DK + L+PEQ
Sbjct: 390 MQNQLKQKETKERVLEYVNQLCEKLPSPMGESVIDCSMISAMPNITFTIKDKAYVLTPEQ 449
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
YILKTGEGI +C+SGF A D+PPPRGPLWILGDVFMGVYHTVFD G R+GFAEA
Sbjct: 450 YILKTGEGITTICMSGFAALDVPPPRGPLWILGDVFMGVYHTVFDYGNSRLGFAEA 505
>gi|82623417|gb|ABB87123.1| aspartic protease precursor-like [Solanum tuberosum]
Length = 506
Score = 462 bits (1190), Expect = e-128, Method: Compositional matrix adjust.
Identities = 212/297 (71%), Positives = 254/297 (85%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NRD +A+EGGE+VFGGVDPKHFKG HTYVP+T+KGYWQF +GD
Sbjct: 210 MVGQGLVKESVFSFWFNRDANAKEGGELVFGGVDPKHFKGNHTYVPLTQKGYWQFNMGDF 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STG C GGCAAIVDSGTSLLAGPT +VT+INHAIG EG+VS ECK +VSQYG++I
Sbjct: 270 LIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQINHAIGAEGIVSMECKTIVSQYGEMI 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P++VC Q GLC +GA++VS+ I+TVVE+E S G++ +C+ACEMAVVW
Sbjct: 330 WDLLVSGVRPDQVCSQAGLCFVDGAQHVSSNIRTVVERETEGSSVGEAPLCTACEMAVVW 389
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQ TKEKVL Y+N+LC+ +P+PMGES IDC+ I +MP++SFTI DK F L+PEQ
Sbjct: 390 MQNQLKQAGTKEKVLEYVNQLCEKIPSPMGESTIDCNSISSMPDISFTIKDKAFVLTPEQ 449
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILKTGEG+A +C+SGF A D+PPPRGPLWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 450 YILKTGEGVATICVSGFAALDVPPPRGPLWILGDVFMGPYHTVFDYGKSQVGFAEAA 506
>gi|418731269|gb|AFX67029.1| aspartic protease, partial [Solanum tuberosum]
Length = 372
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 212/297 (71%), Positives = 254/297 (85%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NRD +A+EGGE+VFGGVDPKHFKG HTYVP+T+KGYWQF +GD
Sbjct: 76 MVGQGLVKESVFSFWFNRDANAKEGGELVFGGVDPKHFKGNHTYVPLTQKGYWQFNMGDF 135
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STG C GGCAAIVDSGTSLLAGPT +VT+INHAIG EG+VS ECK +VSQYG++I
Sbjct: 136 LIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQINHAIGAEGIVSMECKTIVSQYGEMI 195
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P++VC Q GLC +GA++VS+ I+TVVE+E S G++ +C+ACEMAVVW
Sbjct: 196 WDLLVSGVRPDQVCSQAGLCFVDGAQHVSSNIRTVVERETEGSSVGEAPLCTACEMAVVW 255
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQ TKEKVL Y+N+LC+ +P+PMGES IDC+ I +MP++SFTI DK F L+PEQ
Sbjct: 256 MQNQLKQAGTKEKVLEYVNQLCEKIPSPMGESTIDCNSISSMPDISFTIKDKAFVLTPEQ 315
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILKTGEG+A +C+SGF A D+PPPRGPLWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 316 YILKTGEGVATICVSGFAALDVPPPRGPLWILGDVFMGPYHTVFDYGKSQVGFAEAA 372
>gi|12231178|dbj|BAB20972.1| aspartic proteinase 4 [Nepenthes alata]
Length = 505
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 215/298 (72%), Positives = 254/298 (85%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLVSE+VFSFW NRDP A+ GGEIVFGG+D KHF G+H YVP+T+KGYWQFE+G+
Sbjct: 209 MVEQGLVSEKVFSFWFNRDPKAKIGGEIVFGGIDEKHFVGEHIYVPITRKGYWQFEMGNF 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STG C GGC AIVDSGTSLLAGP VVTE+NHAIG EG+ S ECK VV QYGD+I
Sbjct: 269 LIGNYSTGFCRGGCDAIVDSGTSLLAGPMHVVTEVNHAIGAEGIASMECKEVVYQYGDMI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE---NVSAGDSAVCSACEMAVV 177
WDLLVSG+ P+K+C Q+ LC FN A+++S GIKTV+E+E N S D +C+ACEMAVV
Sbjct: 329 WDLLVSGVQPDKICSQLALC-FNDAQFLSIGIKTVIERENRKNSSVADDFLCTACEMAVV 387
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL+++ TKEKVL+YINELCDSLP+PMGES+IDCD IP MPNV+FTIG+K F L+PE
Sbjct: 388 WIQNQLRREVTKEKVLNYINELCDSLPSPMGESVIDCDSIPYMPNVTFTIGEKPFKLTPE 447
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+LK GEG A VC+SGF+A D+PPP GPLWILGDVFMGVYHTVFD G L++GFAE+A
Sbjct: 448 QYVLKAGEGDAMVCLSGFIALDVPPPSGPLWILGDVFMGVYHTVFDFGNLKLGFAESA 505
>gi|350535356|ref|NP_001234702.1| aspartic protease precursor [Solanum lycopersicum]
gi|951449|gb|AAB18280.1| aspartic protease precursor [Solanum lycopersicum]
Length = 506
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 210/297 (70%), Positives = 255/297 (85%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NRD +A+EGGE+VFGGVDPKHFKG HT VP+T+KGYWQF +GD
Sbjct: 210 MVGQGLVKEPVFSFWFNRDANAKEGGELVFGGVDPKHFKGNHTCVPLTQKGYWQFNMGDF 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STG C GGCAAIVDSGTSLLAGPT +VT+INHAIG EG+VS ECK +VSQYG++I
Sbjct: 270 LIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQINHAIGAEGIVSMECKTIVSQYGEMI 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P++VC Q GLC +G+++VS+ I+TVVE+E S G++ +C+ACEMAVVW
Sbjct: 330 WDLLVSGIRPDQVCSQAGLCFLDGSQHVSSNIRTVVERETEGSSVGEAPLCTACEMAVVW 389
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQ+QTKEKVL Y+N+LC+ +P+PMGES IDC+RI +MP+++FTI D F L+PEQ
Sbjct: 390 MQNQLKQEQTKEKVLEYVNQLCEKIPSPMGESAIDCNRISSMPDITFTIKDTAFVLTPEQ 449
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILKTGEG+A +C+SGF A D+PPPRGPLWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 450 YILKTGEGVATICVSGFAALDVPPPRGPLWILGDVFMGPYHTVFDYGKSQVGFAEAA 506
>gi|50540937|gb|AAT77954.1| Asp [Solanum tuberosum]
Length = 497
Score = 455 bits (1171), Expect = e-126, Method: Compositional matrix adjust.
Identities = 209/295 (70%), Positives = 252/295 (85%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NRD +A+EGGE+VFGGVDPKHFKG HTYVP+T+KGYWQF +GD
Sbjct: 198 MVGQGLVKEPVFSFWFNRDANAKEGGELVFGGVDPKHFKGNHTYVPLTQKGYWQFNMGDF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STG C GGCAAIVDSGTSLLAGPT +V +INHAIG EG+VS ECK +VSQYG++I
Sbjct: 258 LIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVAQINHAIGAEGIVSMECKTIVSQYGEMI 317
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P++VC Q GLC +GA++VS+ IKTVVE+E S G++ +C+ACEMAVVW
Sbjct: 318 WDLLVSGVRPDQVCSQAGLCFVDGAQHVSSNIKTVVERETEGSSVGEAPLCTACEMAVVW 377
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQ+ TKEKVL Y+N+LC+ +P+PMGES IDC+ I +MP+++FTI DK F L+PEQ
Sbjct: 378 MQNQLKQEGTKEKVLEYVNQLCEKIPSPMGESAIDCNNISSMPDITFTIKDKAFVLTPEQ 437
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 293
YILKTGEG+A +C+SGF A D+PPPRGPLWILGDVFMG YHTVFD GK ++GFAE
Sbjct: 438 YILKTGEGVATICVSGFAALDVPPPRGPLWILGDVFMGPYHTVFDYGKSQVGFAE 492
>gi|357511707|ref|XP_003626142.1| Aspartic proteinase [Medicago truncatula]
gi|355501157|gb|AES82360.1| Aspartic proteinase [Medicago truncatula]
Length = 504
Score = 452 bits (1164), Expect = e-125, Method: Compositional matrix adjust.
Identities = 210/297 (70%), Positives = 253/297 (85%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ L+ E+VFSFWLN +P+A++GGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ E+GD
Sbjct: 208 MLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVTEKGYWQIEMGDF 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQYG+LI
Sbjct: 268 FIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQYGELI 327
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P VC Q+GLC+ G + S GI+ V +KE +SA D+ +CS+C+M V+W
Sbjct: 328 WDLLVSGVKPGDVCSQVGLCSIRGDQSNSAGIEMVTDKEQSELSAKDTPLCSSCQMLVLW 387
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
VQNQLKQK TKE+V +Y+N+LC+SLP+P GES+I C+ I MPN+SFTIG+K F L+PEQ
Sbjct: 388 VQNQLKQKATKERVFNYVNQLCESLPSPSGESVISCNDISKMPNISFTIGNKPFVLTPEQ 447
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YIL+TGEGI +VC+SGF+AFD+PPP+GPLWILGDVFM YHTVFD G L++GFAEAA
Sbjct: 448 YILRTGEGITQVCLSGFIAFDVPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEAA 504
>gi|357511709|ref|XP_003626143.1| Aspartic proteinase [Medicago truncatula]
gi|355501158|gb|AES82361.1| Aspartic proteinase [Medicago truncatula]
Length = 478
Score = 452 bits (1163), Expect = e-125, Method: Compositional matrix adjust.
Identities = 210/297 (70%), Positives = 253/297 (85%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ L+ E+VFSFWLN +P+A++GGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ E+GD
Sbjct: 182 MLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVTEKGYWQIEMGDF 241
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQYG+LI
Sbjct: 242 FIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQYGELI 301
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P VC Q+GLC+ G + S GI+ V +KE +SA D+ +CS+C+M V+W
Sbjct: 302 WDLLVSGVKPGDVCSQVGLCSIRGDQSNSAGIEMVTDKEQSELSAKDTPLCSSCQMLVLW 361
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
VQNQLKQK TKE+V +Y+N+LC+SLP+P GES+I C+ I MPN+SFTIG+K F L+PEQ
Sbjct: 362 VQNQLKQKATKERVFNYVNQLCESLPSPSGESVISCNDISKMPNISFTIGNKPFVLTPEQ 421
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YIL+TGEGI +VC+SGF+AFD+PPP+GPLWILGDVFM YHTVFD G L++GFAEAA
Sbjct: 422 YILRTGEGITQVCLSGFIAFDVPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEAA 478
>gi|114786427|gb|ABI78942.1| aspartic protease [Ipomoea batatas]
Length = 508
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 217/304 (71%), Positives = 249/304 (81%), Gaps = 11/304 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV E VFSFWLNRD +AEEGGE++FGGVDP HFKGKHTYVPVT+KGYWQFE+GD
Sbjct: 207 MVEQGLVDEPVFSFWLNRDTNAEEGGELIFGGVDPNHFKGKHTYVPVTQKGYWQFEMGDF 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPT--------PVVTEINHAIGGEGVVSAECKLV 112
LIGN STG CEGGCAAIVDSGTSLL GPT +VTEINHAIG EGVVS ECK +
Sbjct: 267 LIGNSSTGFCEGGCAAIVDSGTSLLTGPTTIVTEINHAIVTEINHAIGAEGVVSTECKEI 326
Query: 113 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS-AVCSA 171
VSQYG++IWDLLVSG+ P++VC Q+GLC FNGA + I VVEK+N S +C+A
Sbjct: 327 VSQYGNMIWDLLVSGVKPDEVCSQVGLCFFNGA--AGSNIGMVVEKDNEGKSSSDPMCTA 384
Query: 172 CEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKI 231
CEMAVVW+QNQLKQK KEKV Y+N+LC+ +P+PMGES IDC+ I MPNV+F I DK
Sbjct: 385 CEMAVVWMQNQLKQKVVKEKVFDYVNQLCEKIPSPMGESTIDCNSISNMPNVTFKIADKD 444
Query: 232 FNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGF 291
F L+PEQYILKTGEG+A +C+SGF+A D+P PRGPLWILGDVFMGVYHTVFD G L+IGF
Sbjct: 445 FVLTPEQYILKTGEGVATICVSGFLAMDVPAPRGPLWILGDVFMGVYHTVFDYGNLQIGF 504
Query: 292 AEAA 295
AEAA
Sbjct: 505 AEAA 508
>gi|356534977|ref|XP_003536026.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 508
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 212/297 (71%), Positives = 255/297 (85%), Gaps = 3/297 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LVSE+VFSFWLN DP A++GGE++FGG+DPKHFKG H YVPVTKKGYWQ E+GD
Sbjct: 213 MVKQNLVSEQVFSFWLNGDPKAKDGGELIFGGIDPKHFKGDHIYVPVTKKGYWQIEMGDF 272
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
IG STGVCEGGCAAIVDSGTSLLAGPT VVTEINHAIG EGV+S ECK VVS+YG+L+
Sbjct: 273 FIGGLSTGVCEGGCAAIVDSGTSLLAGPTTVVTEINHAIGAEGVLSVECKEVVSEYGELL 332
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK--ENVSAGDSAVCSACEMAVVW 178
WDLLVSG+ P+ VC Q+GLC F A+ S GI+ V EK +SA D+A+C++C+M VVW
Sbjct: 333 WDLLVSGVRPDDVCSQVGLC-FKRAKSESNGIEMVTEKGQRELSAKDTALCTSCQMLVVW 391
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQK+TKE V +Y+N+LC+SLP+P GES++DC+ I +PN++FT+GDK F L+PEQ
Sbjct: 392 IQNQLKQKKTKEIVFNYVNQLCESLPSPNGESVVDCNSIYGLPNITFTVGDKPFTLTPEQ 451
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILKTGEGIAEVC+SGF+AFD+PPPRGPLWILGDVFM VYHTVFD G LR+GFA+AA
Sbjct: 452 YILKTGEGIAEVCLSGFIAFDIPPPRGPLWILGDVFMRVYHTVFDYGNLRVGFAKAA 508
>gi|87241358|gb|ABD33216.1| Peptidase A1, pepsin [Medicago truncatula]
Length = 396
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 210/297 (70%), Positives = 253/297 (85%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ L+ E+VFSFWLN +P+A++GGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ E+GD
Sbjct: 100 MLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVTEKGYWQIEMGDF 159
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQYG+LI
Sbjct: 160 FIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQYGELI 219
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P VC Q+GLC+ G + S GI+ V +KE +SA D+ +CS+C+M V+W
Sbjct: 220 WDLLVSGVKPGDVCSQVGLCSIRGDQSNSAGIEMVTDKEQSELSAKDTPLCSSCQMLVLW 279
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
VQNQLKQK TKE+V +Y+N+LC+SLP+P GES+I C+ I MPN+SFTIG+K F L+PEQ
Sbjct: 280 VQNQLKQKATKERVFNYVNQLCESLPSPSGESVISCNDISKMPNISFTIGNKPFVLTPEQ 339
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YIL+TGEGI +VC+SGF+AFD+PPP+GPLWILGDVFM YHTVFD G L++GFAEAA
Sbjct: 340 YILRTGEGITQVCLSGFIAFDVPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEAA 396
>gi|356575293|ref|XP_003555776.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 450 bits (1158), Expect = e-124, Method: Compositional matrix adjust.
Identities = 212/297 (71%), Positives = 253/297 (85%), Gaps = 3/297 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LVSE+VFSFWLN DP + GGE+VFGGVDPKHFKG+H YVPVTKKGYWQ E+GD
Sbjct: 212 MVKQNLVSEQVFSFWLNGDPKVKNGGELVFGGVDPKHFKGEHIYVPVTKKGYWQIEMGDF 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
IG STGVCEGGCAAIVDSGTSLLAGPT VVTEINHAIG EGV+S ECK VVS+YG+L+
Sbjct: 272 FIGGLSTGVCEGGCAAIVDSGTSLLAGPTTVVTEINHAIGAEGVLSVECKEVVSEYGELL 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P+ VC Q+GLC F + S GI+ V EKE +S D+A+C++C+M VVW
Sbjct: 332 WDLLVSGVRPDDVCSQVGLC-FKRTKSESNGIEMVTEKEQRELSTKDTALCTSCQMLVVW 390
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQK+TKE V +Y+N+LC+SLP+P GES++DC+ I +PN++FT+GDK F L+PEQ
Sbjct: 391 IQNQLKQKKTKEIVFNYVNQLCESLPSPNGESVVDCNSIYGLPNITFTVGDKPFTLTPEQ 450
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILKTGEGIAEVC+SGF+AFD+PPPRGPLWILGDVFM VYHTVFD G LR+GFA+AA
Sbjct: 451 YILKTGEGIAEVCLSGFIAFDIPPPRGPLWILGDVFMRVYHTVFDYGNLRVGFAKAA 507
>gi|359487589|ref|XP_003633616.1| PREDICTED: aspartic proteinase-like [Vitis vinifera]
Length = 510
Score = 450 bits (1157), Expect = e-124, Method: Compositional matrix adjust.
Identities = 202/297 (68%), Positives = 249/297 (83%), Gaps = 3/297 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QGLV E++FSFWLNRDP A +GGEIVFGGVD +HFKG+HTY +T+KGYWQFE+G+
Sbjct: 212 LLRQGLVQEDIFSFWLNRDPQATDGGEIVFGGVDKRHFKGQHTYASITQKGYWQFEMGEF 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG QSTG CE GCAAIVDSGTSL+AGPT +VTEINHAIG EG+VS ECK VVSQYG++I
Sbjct: 272 LIGYQSTGFCEAGCAAIVDSGTSLIAGPTAIVTEINHAIGAEGIVSQECKEVVSQYGNMI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
WDLL+S + P+ VC QIGLC FNG++ S IKTVVE+E+ G+ C+ACEM V+
Sbjct: 332 WDLLISRVQPDAVCSQIGLCNFNGSQIESPRIKTVVEEEDARGTKVGNEVWCTACEMTVI 391
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQ++TKE + SY+ ELC SLP+PMGES++DC R+P MP+V+FTI DK F L+P+
Sbjct: 392 WIQNQLKQRKTKEIIFSYVTELCQSLPSPMGESVVDCGRVPYMPDVTFTIADKHFTLTPK 451
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+Y+LKTGEGI VC+SGF+A D+PPPRGPLWILGD+FMGVYHTVFD G L++GFAEA
Sbjct: 452 EYVLKTGEGITTVCLSGFIALDVPPPRGPLWILGDIFMGVYHTVFDYGNLQVGFAEA 508
>gi|296089849|emb|CBI39668.3| unnamed protein product [Vitis vinifera]
Length = 430
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 202/297 (68%), Positives = 249/297 (83%), Gaps = 3/297 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QGLV E++FSFWLNRDP A +GGEIVFGGVD +HFKG+HTY +T+KGYWQFE+G+
Sbjct: 132 LLRQGLVQEDIFSFWLNRDPQATDGGEIVFGGVDKRHFKGQHTYASITQKGYWQFEMGEF 191
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG QSTG CE GCAAIVDSGTSL+AGPT +VTEINHAIG EG+VS ECK VVSQYG++I
Sbjct: 192 LIGYQSTGFCEAGCAAIVDSGTSLIAGPTAIVTEINHAIGAEGIVSQECKEVVSQYGNMI 251
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
WDLL+S + P+ VC QIGLC FNG++ S IKTVVE+E+ G+ C+ACEM V+
Sbjct: 252 WDLLISRVQPDAVCSQIGLCNFNGSQIESPRIKTVVEEEDARGTKVGNEVWCTACEMTVI 311
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQ++TKE + SY+ ELC SLP+PMGES++DC R+P MP+V+FTI DK F L+P+
Sbjct: 312 WIQNQLKQRKTKEIIFSYVTELCQSLPSPMGESVVDCGRVPYMPDVTFTIADKHFTLTPK 371
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+Y+LKTGEGI VC+SGF+A D+PPPRGPLWILGD+FMGVYHTVFD G L++GFAEA
Sbjct: 372 EYVLKTGEGITTVCLSGFIALDVPPPRGPLWILGDIFMGVYHTVFDYGNLQVGFAEA 428
>gi|356505735|ref|XP_003521645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 508
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 207/296 (69%), Positives = 252/296 (85%), Gaps = 2/296 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQ L+SE+VFSFWLN DP+A++GGE+VFGGVDPKHFKG HTYVP+T+KGYWQ E+GD
Sbjct: 212 MVEQKLISEKVFSFWLNGDPNAKKGGELVFGGVDPKHFKGNHTYVPITEKGYWQIEIGDF 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQYG+LI
Sbjct: 272 FIGGVSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQYGELI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P+ +C Q+GLC+ E S GI+ V EKE ++A D+ +CS+C+M V+W
Sbjct: 332 WDLLVSGVKPDDICSQVGLCSSKRHESKSAGIEMVTEKEQGELTARDNPLCSSCQMLVLW 391
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQK TK++V +Y+N+LC+SLP+P GES+I C+ + MPN++FTIG+K F L+PEQ
Sbjct: 392 IQNQLKQKATKDRVFNYVNQLCESLPSPSGESVISCNSLSKMPNITFTIGNKPFVLTPEQ 451
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
YILKTGEGI EVC+SGF+AFD+PPP+GPLWILGDVFM YHTVFD G L++GFAEA
Sbjct: 452 YILKTGEGITEVCLSGFIAFDVPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEA 507
>gi|13897888|gb|AAK48494.1|AF259982_1 putative aspartic protease [Ipomoea batatas]
Length = 504
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 215/298 (72%), Positives = 253/298 (84%), Gaps = 5/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV E VFSFWLNRDP AE GGE+VFGGVDPKHFKG+HTYVPVT+KGYWQ +LGD
Sbjct: 209 MVEQGLVDEPVFSFWLNRDPKAEVGGELVFGGVDPKHFKGEHTYVPVTQKGYWQIDLGDF 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STG CEGGCA IVDSGTSLL GPT VVTEIN+AIG EGVV AECK VVS+YG++I
Sbjct: 269 LIGNSSTGYCEGGCAVIVDSGTSLLTGPTAVVTEINYAIGPEGVVCAECKEVVSEYGEMI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE---NVSAGDSAVCSACEMAVV 177
WDLLVSGL ++VC ++GLC NGA + S+ IKTVVEKE N+++ + +C+ CEMAV+
Sbjct: 329 WDLLVSGLRADQVCSELGLCFLNGAWHESSIIKTVVEKEAEGNLTS--NPLCTTCEMAVI 386
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQK KEKV Y+++LC+ LP+P GES+IDC+ I +MPNV+F IGDK F L+PE
Sbjct: 387 WLQNQLKQKGIKEKVFEYVDQLCEKLPSPDGESVIDCNSISSMPNVTFVIGDKDFVLTPE 446
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILKTGEGIA VC+SGF+A D+PPP+GPLWILGDVFMG YHTVFD G L++GFAEAA
Sbjct: 447 QYILKTGEGIAAVCVSGFLALDVPPPQGPLWILGDVFMGAYHTVFDYGNLQVGFAEAA 504
>gi|351725345|ref|NP_001237345.1| aspartic proteinase 2 [Glycine max]
gi|15425751|dbj|BAB64296.1| aspartic proteinase 2 [Glycine max]
Length = 508
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 204/296 (68%), Positives = 252/296 (85%), Gaps = 2/296 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQ L+SE+VFSFWLN DP+A++GGE+VFGGVDPKHFKG HTYVP+T+KGYWQ E+GD
Sbjct: 212 MVEQKLISEKVFSFWLNGDPNAKKGGELVFGGVDPKHFKGNHTYVPITEKGYWQIEMGDF 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+G STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQYG+LI
Sbjct: 272 FVGGVSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQYGELI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P+ +C Q+GLC+ + S GI+ V EKE ++A D+ +CS+C+M V+W
Sbjct: 332 WDLLVSGVKPDDICSQVGLCSSKRHQSKSAGIEMVTEKEQEELAARDTPLCSSCQMLVLW 391
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQK TK++V +Y+N+LC+SLP+P GES+I C+ + MPN++FTIG+K F L+PEQ
Sbjct: 392 IQNQLKQKATKDRVFNYVNQLCESLPSPSGESVISCNSLSKMPNITFTIGNKPFVLTPEQ 451
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
YIL+TGEGI EVC+SGF+AFD+PPP+GPLWILGDVFM YHTVFD G L++GFAEA
Sbjct: 452 YILRTGEGITEVCLSGFIAFDVPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEA 507
>gi|359487701|ref|XP_002276363.2| PREDICTED: aspartic proteinase oryzasin-1-like [Vitis vinifera]
gi|296089851|emb|CBI39670.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 446 bits (1147), Expect = e-123, Method: Compositional matrix adjust.
Identities = 206/299 (68%), Positives = 250/299 (83%), Gaps = 10/299 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ EE+FSFWLNR+P+A EGGEIVFGGVD +HF+GKHT+VPVT+ GYWQF +GD
Sbjct: 210 MLQQGLLHEELFSFWLNRNPNANEGGEIVFGGVDKRHFRGKHTFVPVTQAGYWQFRMGDF 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI NQ+TGVCEGGC+AIVDSGTSL+AGPT VVT+INHAIG EG+VS ECK VVSQYG+++
Sbjct: 270 LISNQTTGVCEGGCSAIVDSGTSLIAGPTLVVTQINHAIGAEGIVSMECKEVVSQYGNMM 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA----GDSAVCSACEMAV 176
WDLLVSG+LP KVC QIGLC S GI+TVVEKE + + GD C+ACEM
Sbjct: 330 WDLLVSGVLPSKVCSQIGLCM------ASPGIRTVVEKEKMESVEEVGDVVFCNACEMIA 383
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+Q+QLKQ +TK+KVL Y+ ELC SLP+PMGES+IDC + MPN++F IGDK F+L+P
Sbjct: 384 VWIQSQLKQMKTKDKVLRYVTELCGSLPSPMGESVIDCTSVANMPNITFIIGDKAFDLTP 443
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+QYIL+TG+G A VC+SGF A D+PPP+GPLWILG++FMGVYHTVFD G LRIGFAEAA
Sbjct: 444 DQYILRTGDGSATVCLSGFTALDVPPPKGPLWILGEIFMGVYHTVFDFGDLRIGFAEAA 502
>gi|255543036|ref|XP_002512581.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223548542|gb|EEF50033.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 494
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 212/298 (71%), Positives = 243/298 (81%), Gaps = 14/298 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + VFSFWLN DPDA+EGGE+VFGGVD KH++GKHTYVPVT+KGYWQF +GD
Sbjct: 208 MVQQGLVGDPVFSFWLNNDPDAKEGGELVFGGVDEKHYRGKHTYVPVTQKGYWQFNMGDF 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+IGN ST DSGTSLLAGPTP+V EINHAIG EG+VSAECK VVSQYG+LI
Sbjct: 268 IIGNHST-----------DSGTSLLAGPTPIVAEINHAIGAEGIVSAECKEVVSQYGNLI 316
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENV---SAGDSAVCSACEMAVV 177
WDLL+SG+ P KVC Q+GLC F G Y S I++VVE+EN+ S GD +C+ACEM V+
Sbjct: 317 WDLLISGVQPGKVCSQLGLCTFRGDRYESNVIESVVEEENMEGSSVGDDVLCTACEMLVI 376
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
WVQNQLK KQTKE L Y+N+LC+SLP+PMGESIIDC MPN+ FTIGDK F L+PE
Sbjct: 377 WVQNQLKHKQTKEAALEYVNKLCESLPSPMGESIIDCASTTGMPNIIFTIGDKQFQLTPE 436
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILKTGEGIA VCISGFMA D+PPPRGPLWILGDVFM VYHTVFD G L++GFAEAA
Sbjct: 437 QYILKTGEGIASVCISGFMALDVPPPRGPLWILGDVFMRVYHTVFDFGDLQVGFAEAA 494
>gi|449454758|ref|XP_004145121.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449472326|ref|XP_004153558.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 514
Score = 442 bits (1137), Expect = e-122, Method: Compositional matrix adjust.
Identities = 204/298 (68%), Positives = 245/298 (82%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV E VFSFWLNR+ + EEGGEIVFGGVDPKH+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 217 MVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI + TG CEGGC+AI DSGTSLLAGPT +VT INHAIG +GV+S ECK VV QYG I
Sbjct: 277 LIDGKPTGYCEGGCSAIADSGTSLLAGPTTIVTMINHAIGAKGVMSQECKAVVQQYGQTI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL+S P+K+C QI LC F+G VS GI++VV++ + D +CS CEM VV
Sbjct: 337 MDLLLSEADPKKICSQIKLCTFDGTRGVSMGIESVVDENAGKSSDGLRDGMCSVCEMTVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL+Q QTKE++++YINELCD +P+PMG+S +DC + +MP+VSFTIGDK+F+L+PE
Sbjct: 397 WMQNQLRQNQTKERIINYINELCDRMPSPMGQSAVDCGTLSSMPSVSFTIGDKVFDLAPE 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG A CISGF AFD+PPPRGPLWILGDVFMG YHTVFD GKLR+GFAEAA
Sbjct: 457 EYILKVGEGAAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDFGKLRVGFAEAA 514
>gi|147780252|emb|CAN65745.1| hypothetical protein VITISV_037763 [Vitis vinifera]
Length = 504
Score = 439 bits (1130), Expect = e-121, Method: Compositional matrix adjust.
Identities = 197/291 (67%), Positives = 243/291 (83%), Gaps = 3/291 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QGLV E++FSFWLNRDP A +GGEIVFGGVD +HFKG+HTY +T+KGYWQFE+G+
Sbjct: 183 LLRQGLVQEDIFSFWLNRDPQATDGGEIVFGGVDKRHFKGQHTYASITQKGYWQFEMGEF 242
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG QSTG CE GCAAIVDSGTSL+AGPT +VTEINHAIG EG+VS ECK VVSQYG++I
Sbjct: 243 LIGYQSTGFCEAGCAAIVDSGTSLIAGPTAIVTEINHAIGAEGIVSQECKEVVSQYGNMI 302
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
WDLL+S + P+ VC QIGLC FNG++ S IKTVVE+E+ G+ C+ACEM V+
Sbjct: 303 WDLLISRVQPDAVCSQIGLCNFNGSQIESPRIKTVVEEEDARGTKVGNEVWCTACEMTVI 362
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQ++TKE + SY+ ELC SLP+PMGES++DC R+P MP+V+FTI DK F L+P+
Sbjct: 363 WIQNQLKQRKTKEIIFSYVTELCQSLPSPMGESVVDCGRVPYMPDVTFTIADKHFTLTPK 422
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLR 288
+Y+LKTGEGI VC+SGF+A D+PPPRGPLWILGD+FMGVYHTVFD G L+
Sbjct: 423 EYVLKTGEGITTVCLSGFIALDVPPPRGPLWILGDIFMGVYHTVFDYGNLQ 473
>gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 203/299 (67%), Positives = 243/299 (81%), Gaps = 5/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NRDP+ + GGE+VFGG+DPKHFKG HTYVP+T+KGYWQF++GD+
Sbjct: 217 MVNQGLVKEPVFSFWFNRDPEDDIGGEVVFGGMDPKHFKGDHTYVPITRKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGNQ+TG+C GGC+AI DSGTSL+ GPT ++ ++NHAIG GVVS ECK VVSQYG+ I
Sbjct: 277 LIGNQTTGLCAGGCSAIADSGTSLITGPTAIIAQVNHAIGASGVVSQECKTVVSQYGETI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENV--SAGD--SAVCSACEMAV 176
D+L+S P K+C QIGLC F+G VSTGI++VV ENV + GD A+CS CEM V
Sbjct: 337 IDMLLSKDQPLKICSQIGLCTFDGTRGVSTGIESVVH-ENVGKATGDLHDAMCSTCEMTV 395
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
+W+QNQLKQ QT+E++L YINELCD LP+PMGES +DC + TMPNVSFTIG KIF LSP
Sbjct: 396 IWMQNQLKQNQTQERILEYINELCDRLPSPMGESAVDCSSLSTMPNVSFTIGGKIFELSP 455
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQY+LK GEG C+SGF A D+PPPRGPLWILGDVFMG +HTVFD G L++GFAEAA
Sbjct: 456 EQYVLKVGEGDVAQCLSGFTALDVPPPRGPLWILGDVFMGQFHTVFDYGNLQVGFAEAA 514
>gi|224056377|ref|XP_002298827.1| predicted protein [Populus trichocarpa]
gi|222846085|gb|EEE83632.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 199/298 (66%), Positives = 241/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV E VFSFW NR+ D +EGGEIVFGGVDP H+KG+HTYVPVT+KGYWQF++GD+
Sbjct: 197 MVEQGLVKEPVFSFWFNRNADEKEGGEIVFGGVDPDHYKGEHTYVPVTQKGYWQFDMGDV 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG Q++G C GCAAI DSGTSLLAGPT ++TE+NHAIG GVVS ECK VV+QYGD I
Sbjct: 257 LIGGQTSGFCASGCAAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECKAVVAQYGDTI 316
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
++L++ P+K+C QIGLC F+G VS GI++VV + A D A+CS CEMAVV
Sbjct: 317 MEMLLAKDQPQKICAQIGLCTFDGTRGVSMGIESVVNEHAQKASDGFHDAMCSTCEMAVV 376
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQ QT+E++L Y+NELC+ LP+PMGES +DCD + +MPNVSFTIG ++F LSPE
Sbjct: 377 WMQNQLKQNQTQERILDYVNELCERLPSPMGESAVDCDGLSSMPNVSFTIGGRVFELSPE 436
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+LK GEG CISGF A D+PPPRGPLWILGDVFMG +HTVFD G +R+GFAEA
Sbjct: 437 QYVLKVGEGDVAQCISGFTALDVPPPRGPLWILGDVFMGSFHTVFDYGNMRVGFAEAT 494
>gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 436 bits (1121), Expect = e-120, Method: Compositional matrix adjust.
Identities = 201/298 (67%), Positives = 244/298 (81%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ D E GGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 217 MIKQGLIKEPVFSFWLNRNVDEEAGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI ++ TG C G CAAI DSGTSLLAGP+ V+T INHAIG GVVS ECK VV QYG I
Sbjct: 277 LIADKPTGYCAGSCAAIADSGTSLLAGPSTVITMINHAIGATGVVSQECKAVVQQYGRTI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN-VSAG--DSAVCSACEMAVV 177
DLL++ P+K+C QIGLC FNGA VSTGI++VV++ N S+G A+C ACEMAVV
Sbjct: 337 IDLLIAEAQPQKICSQIGLCTFNGAHGVSTGIESVVDESNGKSSGVLRDAMCPACEMAVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQ++Q QT++++LSY+NELCD +PNPMGES +DC + +MP +SFTIG K+F+L+PE
Sbjct: 397 WMQNQVRQNQTQDRILSYVNELCDRVPNPMGESAVDCGSLSSMPTISFTIGGKVFDLTPE 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG CISGF A D+PPPRGPLWILGD+FMG YHTVFD GKLR+GFAEAA
Sbjct: 457 EYILKVGEGSEAQCISGFTALDIPPPRGPLWILGDIFMGRYHTVFDFGKLRVGFAEAA 514
>gi|297848226|ref|XP_002891994.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
gi|297337836|gb|EFH68253.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
Length = 504
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 206/295 (69%), Positives = 242/295 (82%), Gaps = 8/295 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV ++VFSFWLNRD +AE GGEIVFGGVDP HFKGKHTYVPVT+KGYWQF +GDI
Sbjct: 218 MVDQGLVRDKVFSFWLNRDTEAEVGGEIVFGGVDPAHFKGKHTYVPVTRKGYWQFNMGDI 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+G+ STG CE GC AI+DSGTSLLAGPT V+ +INHAIG EG+VSAECK VVSQYG++I
Sbjct: 278 FVGSNSTGFCEQGCDAIMDSGTSLLAGPTTVIAQINHAIGAEGIVSAECKDVVSQYGEMI 337
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
W+LLV +LP +VC+++GLC F G E TGIKTVV+KE S +C CEMAVVWVQ
Sbjct: 338 WNLLVKRVLPRQVCKELGLCVF-GQE---TGIKTVVDKER----SSVLCEVCEMAVVWVQ 389
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
+LK +TKEKV Y+N+LC+SLP+P GESIIDC+ I MP+V+FTIG F+LSP+QYI
Sbjct: 390 TKLKVNETKEKVFEYVNQLCESLPSPAGESIIDCNNIKNMPSVTFTIGGNPFSLSPQQYI 449
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LKTG G AE+CISGF AFDLPPP GPLWI+GDVFMG YHTVFDS L+IG AEA
Sbjct: 450 LKTGVGNAEMCISGFSAFDLPPPTGPLWIIGDVFMGAYHTVFDSDNLQIGIAEAT 504
>gi|357131833|ref|XP_003567538.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 503
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 195/295 (66%), Positives = 242/295 (82%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L+++++FSFWLNRDPDA GGE+VFGGVD KH+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 211 MQEQKLIAKDIFSFWLNRDPDAPTGGELVFGGVDQKHYKGKHTYVPVTRKGYWQFDMGDL 270
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG QSTG C GGCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VV +YG++I
Sbjct: 271 LIGGQSTGFCAGGCAAIVDSGTSLLAGPTTIVAQVNHAIGAEGIISMECKEVVREYGEMI 330
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+LLV+ P+KVC QIGLC F+G + VS I++VVEKEN G +C+ACEMAVVW+Q
Sbjct: 331 LELLVAQTRPQKVCSQIGLCVFDGTKSVSNQIESVVEKEN--RGSDLLCTACEMAVVWIQ 388
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL+Q QTKE +L Y N+LC+ LP+P GES +DC +I MPN++FTI +K F L+PEQYI
Sbjct: 389 NQLRQNQTKELILQYANQLCERLPSPNGESTVDCHQISKMPNLAFTIANKTFTLTPEQYI 448
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K + +CISGFMAFD+PPPRGPLWILGDVFMG YHTVFD G +IGFA++A
Sbjct: 449 VKLEQSGQTICISGFMAFDIPPPRGPLWILGDVFMGAYHTVFDFGDSKIGFAKSA 503
>gi|122890420|emb|CAM12780.1| aspartic proteinase [Fagopyrum esculentum]
Length = 506
Score = 429 bits (1103), Expect = e-118, Method: Compositional matrix adjust.
Identities = 201/298 (67%), Positives = 241/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV+E VFSFWLNR+ D E GGEIVFGG+DP H KG+HTY+PVT+KGYWQF+L D+
Sbjct: 209 MVNQGLVNEPVFSFWLNRNADEEVGGEIVFGGIDPAHHKGEHTYLPVTQKGYWQFDLDDV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G +STG C GGC+AI DSGTSLLAGPTPVV +INHAIG GVVS ECK VVSQYG I
Sbjct: 269 LVGGESTGFCSGGCSAIADSGTSLLAGPTPVVAQINHAIGASGVVSQECKTVVSQYGKQI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-NVSAGD--SAVCSACEMAVV 177
DLLVS P K+C QIGLC F+G VS GI++VV+K + S+G+ A CSACEMAVV
Sbjct: 329 LDLLVSQTQPRKICSQIGLCTFDGTRGVSMGIESVVDKNVDKSSGNLKDATCSACEMAVV 388
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQ QT++++L Y N+LC+ LP+PMGES +DC + T+P VSFT+G K F L+PE
Sbjct: 389 WMQNQLKQNQTEDRILDYANQLCERLPSPMGESAVDCGSLSTLPTVSFTLGGKTFALAPE 448
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYIL+ GEG A CISGF+A D+PPPRGPLWILGD+FMG YHTVFD G +++GFAEAA
Sbjct: 449 QYILQVGEGPATQCISGFIALDVPPPRGPLWILGDIFMGQYHTVFDHGNMQVGFAEAA 506
>gi|224115794|ref|XP_002317126.1| predicted protein [Populus trichocarpa]
gi|222860191|gb|EEE97738.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 429 bits (1103), Expect = e-118, Method: Compositional matrix adjust.
Identities = 195/299 (65%), Positives = 247/299 (82%), Gaps = 5/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ D EEGGEIVFGG+DP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 215 MIKQGLIKEPVFSFWLNRNADDEEGGEIVFGGMDPNHYKGKHTYVPVTQKGYWQFDMGDV 274
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
++G++STG C GGCAAI DSGTSLLAGPT ++T INHAIG GVVS +CK VVSQYG++I
Sbjct: 275 IVGDKSTGYCAGGCAAIADSGTSLLAGPTAIITMINHAIGASGVVSQQCKAVVSQYGEVI 334
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA----GDSAVCSACEMAV 176
DLL+S + P+K+C QIGLC F+G +S GI++VV++ N + GD A+CSACEMAV
Sbjct: 335 MDLLLSEVQPKKICSQIGLCTFDGTRGISMGIQSVVDEGNDKSSGVLGD-AMCSACEMAV 393
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
W+++QL+Q QT+++VL Y N+LC+ +PNP G+S +DC + +MP ++FTIG K F L+P
Sbjct: 394 FWMRSQLQQNQTQDRVLDYANQLCERVPNPTGQSTVDCGSVLSMPRIAFTIGGKEFELAP 453
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
E+YILK G+G A CISGF A D+PPPRGPLWILGDVFMG YHTVFDSGKLR+GFAEAA
Sbjct: 454 EEYILKVGQGSAAQCISGFTALDIPPPRGPLWILGDVFMGRYHTVFDSGKLRVGFAEAA 512
>gi|359483345|ref|XP_003632941.1| PREDICTED: aspartic proteinase isoform 2 [Vitis vinifera]
gi|359483347|ref|XP_002262915.2| PREDICTED: aspartic proteinase isoform 1 [Vitis vinifera]
Length = 514
Score = 429 bits (1103), Expect = e-118, Method: Compositional matrix adjust.
Identities = 200/298 (67%), Positives = 241/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV E VFSFWLNR D +EGGE+VFGGVDP HFKG+HTYVPVT+KGYWQF++G++
Sbjct: 217 MVKQGLVKEPVFSFWLNRKTDDDEGGELVFGGVDPDHFKGEHTYVPVTQKGYWQFDMGEV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI ++TG C GGCAAI DSGTSLLAGPT VV INHAIG GVVS ECK VV+QYG+ I
Sbjct: 277 LIDGETTGYCAGGCAAIADSGTSLLAGPTAVVAMINHAIGATGVVSQECKTVVAQYGETI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
DLL+S P+K+C QIGLC F+G V GI++VV+++N S A CSACEMAVV
Sbjct: 337 MDLLLSEASPQKICSQIGLCTFDGTRGVGMGIESVVDEKNGDKSSGVHDAGCSACEMAVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q QTKE++L Y+NELCD LP+PMGES +DC ++ +MPNVS TIG K+F+LS
Sbjct: 397 WMQSQLRQNQTKERILEYVNELCDRLPSPMGESAVDCLQLSSMPNVSLTIGGKVFDLSAN 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+Y+LK GEG A CISGF+A D+PPPRGPLWILGDVFMG YHTVFD G +R+GFAEAA
Sbjct: 457 EYVLKVGEGAAAQCISGFIAMDVPPPRGPLWILGDVFMGRYHTVFDYGNMRVGFAEAA 514
>gi|77808107|gb|AAV84085.2| aspartic proteinase 9 [Fagopyrum esculentum]
Length = 506
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 201/298 (67%), Positives = 241/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV+E VFSFWLNR+ D E GGEIVFGG+DP H KG+HTY+PVT+KGYWQF+L D+
Sbjct: 209 MVNQGLVNEPVFSFWLNRNADEEIGGEIVFGGIDPAHHKGEHTYLPVTQKGYWQFDLDDV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G +STG C GGC+AI DSGTSLLAGPTPVV +INHAIG GVVS ECK VVSQYG I
Sbjct: 269 LVGGESTGFCSGGCSAIADSGTSLLAGPTPVVAQINHAIGASGVVSQECKTVVSQYGKQI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-NVSAGD--SAVCSACEMAVV 177
DLLVS P K+C QIGLC F+G VS GI++VV+K + S+G+ A CSACEMAVV
Sbjct: 329 LDLLVSQTQPRKICSQIGLCTFDGTRGVSMGIESVVDKNVDKSSGNLKDATCSACEMAVV 388
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQ QT++++L Y N+LC+ LP+PMGES +DC + T+P VSFT+G K F L+PE
Sbjct: 389 WMQNQLKQNQTEDRILDYANQLCERLPSPMGESAVDCGSLSTLPTVSFTLGGKTFALAPE 448
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYIL+ GEG A CISGF+A D+PPPRGPLWILGD+FMG YHTVFD G +++GFAEAA
Sbjct: 449 QYILQVGEGPATQCISGFIALDVPPPRGPLWILGDIFMGQYHTVFDHGNMQVGFAEAA 506
>gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata]
Length = 514
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 194/298 (65%), Positives = 244/298 (81%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV+E VFSFWLNR + EEGGEIVFGGVDP H+KG+HTYVPVT+KGYWQF++ D+
Sbjct: 217 MIDQGLVNEPVFSFWLNRKSEEEEGGEIVFGGVDPNHYKGEHTYVPVTRKGYWQFDMDDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G ++TG C GGC+AI DSGTSLLAGPT ++ +INHAIG G+VS ECK VVSQYG I
Sbjct: 277 LVGGETTGYCSGGCSAIADSGTSLLAGPTTIIVQINHAIGASGLVSQECKAVVSQYGKAI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
D LV+ P+K+C QIGLC F+G VS GI++VVEK ++ D A+C+ACEMAVV
Sbjct: 337 LDALVAEAQPQKICSQIGLCTFDGKRGVSMGIESVVEKNPGNSSDGLQDAMCTACEMAVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL+Q +T+E++L+Y+NELC+ LP+PMGES +DC + +MPNVS TIG K+F+LSPE
Sbjct: 397 WMQNQLRQNRTEEQILNYVNELCNRLPSPMGESSVDCGSLSSMPNVSLTIGGKVFDLSPE 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+Y+LK GEG+A CISGF+A D+ PPRGPLWILGD+FMG YHTVFD G L +GFAEAA
Sbjct: 457 KYVLKVGEGVAAQCISGFIALDIAPPRGPLWILGDIFMGQYHTVFDYGNLSVGFAEAA 514
>gi|255554815|ref|XP_002518445.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223542290|gb|EEF43832.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 511
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/298 (65%), Positives = 240/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ EEGGEIVFGGVD H+KGKHTYVPVT+KGYWQFE+GD+
Sbjct: 214 MIKQGLIKEPVFSFWLNRNTQGEEGGEIVFGGVDLNHYKGKHTYVPVTQKGYWQFEMGDV 273
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG++ T C GGC+AI DSGTSLLAGPT VVT IN AIG GV S ECK V++QYG+ I
Sbjct: 274 LIGHKPTEYCAGGCSAIADSGTSLLAGPTTVVTLINEAIGATGVASQECKTVIAQYGETI 333
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG---DSAVCSACEMAVV 177
DLL++ P+K+C QIGLC F+G VS GI++VV+ N + A+CSACEM VV
Sbjct: 334 MDLLIAEAQPKKICSQIGLCTFDGTRGVSMGIQSVVDDNNDKSSGIVRDAMCSACEMTVV 393
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL++ QT++++L+Y+NELCD +PNP+GESI+DC I +MP VSFTIG K+F+LSP+
Sbjct: 394 WMQNQLRENQTQDRILNYVNELCDRIPNPLGESIVDCGSISSMPVVSFTIGGKVFDLSPQ 453
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG CISGFMA D+PPPRGPLWILGD+FMG YHTVFD G LR+GFAEAA
Sbjct: 454 EYILKVGEGAQAQCISGFMALDVPPPRGPLWILGDIFMGRYHTVFDYGNLRVGFAEAA 511
>gi|15221141|ref|NP_172655.1| aspartic proteinase A1 [Arabidopsis thaliana]
gi|75318541|sp|O65390.1|APA1_ARATH RecName: Full=Aspartic proteinase A1; Flags: Precursor
gi|3157937|gb|AAC17620.1| Identical to aspartic proteinase cDNA gb|U51036 from A. thaliana.
ESTs gb|N96313, gb|T21893, gb|R30158, gb|T21482,
gb|T43650, gb|R64749, gb|R65157, gb|T88269, gb|T44552,
gb|T22542, gb|T76533, gb|T44350, gb|Z34591, gb|AA728734,
gb|T46003, gb|R65157, gb|N38290, gb|AA395468, gb|T20815
and gb|Z34173 come from this gene [Arabidopsis thaliana]
gi|15912219|gb|AAL08243.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|15912251|gb|AAL08259.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|17381036|gb|AAL36330.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|21617929|gb|AAM66979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|25055040|gb|AAN71979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|332190692|gb|AEE28813.1| aspartic proteinase A1 [Arabidopsis thaliana]
Length = 506
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/298 (65%), Positives = 239/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+
Sbjct: 209 MLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK VV QYG I
Sbjct: 269 LIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL+S P+K+C QIGLC F+G VS GI++VV+KEN + A CSACEMAVV
Sbjct: 329 LDLLLSETQPKKICSQIGLCTFDGTRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVV 388
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q T+E++L+Y+NELC+ LP+PMGES +DC ++ TMP VS TIG K+F+L+PE
Sbjct: 389 WIQSQLRQNMTQERILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPE 448
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+Y+LK GEG CISGF+A D+ PPRGPLWILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 449 EYVLKVGEGPVAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 506
>gi|1326165|gb|AAB03108.1| aspartic protease [Brassica napus]
Length = 506
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 199/299 (66%), Positives = 244/299 (81%), Gaps = 5/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ + EEGGE+VFGGVDP HFKG+HTYVPVT+KGYWQF++GD+
Sbjct: 209 MLKQGLIKEPVFSFWLNRNAEDEEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFDMGDV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG TG CE GC+AI DSGTSLLAGPT V+T INHAIG GVVS +CK+VV QYG I
Sbjct: 269 LIGGAPTGYCESGCSAIADSGTSLLAGPTTVITMINHAIGAAGVVSQQCKIVVDQYGQTI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA----GDSAVCSACEMAV 176
DLL+S P+K+C QIGLC F+G VS GI++VV+KEN + GD+A CSACEMAV
Sbjct: 329 LDLLLSETQPKKICSQIGLCTFDGKRGVSMGIESVVDKENAKSSSGVGDAA-CSACEMAV 387
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+Q+QL+Q T+E++L YIN+LC+ LP+PMGES +DC ++ TMP VS TIG K+F+L+P
Sbjct: 388 VWIQSQLRQNMTQERILDYINDLCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAP 447
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
E+Y+LK GEG A CISGF+A D+ PPRGPLWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 448 EEYVLKVGEGPAAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGKEQVGFAEAA 506
>gi|1354272|gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]
Length = 486
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/298 (65%), Positives = 239/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+
Sbjct: 189 MLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDV 248
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK VV QYG I
Sbjct: 249 LIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTI 308
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL+S P+K+C QIGLC F+G VS GI++VV+KEN + A CSACEMAVV
Sbjct: 309 LDLLLSETQPKKICSQIGLCTFDGTRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVV 368
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q T+E++L+Y+NELC+ LP+PMGES +DC ++ TMP VS TIG K+F+L+PE
Sbjct: 369 WIQSQLRQNMTQERILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPE 428
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+Y+LK GEG CISGF+A D+ PPRGPLWILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 429 EYVLKVGEGPVAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 486
>gi|312282703|dbj|BAJ34217.1| unnamed protein product [Thellungiella halophila]
Length = 506
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/297 (65%), Positives = 239/297 (80%), Gaps = 3/297 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ + +EGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+
Sbjct: 209 MLKQGLIKEPVFSFWLNRNAEDDEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK VV QYG I
Sbjct: 269 LIGNAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGRTI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
+LL+S P+K+C QIGLC FNG VS GI++VV+KEN + A CSACEMAVV
Sbjct: 329 LELLLSETQPKKICSQIGLCTFNGKRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVV 388
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q T+E++L Y NELC+ LP+PMGES +DC ++ TMP VS TIG K+F+L+PE
Sbjct: 389 WIQSQLRQNMTQERILDYANELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPE 448
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+Y+LK GEG A CISGF+A D+ PPRGPLWILGDVFMG YHTVFD GK ++GFAEA
Sbjct: 449 EYVLKVGEGPAAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGKEQVGFAEA 505
>gi|302144105|emb|CBI23210.3| unnamed protein product [Vitis vinifera]
Length = 429
Score = 426 bits (1096), Expect = e-117, Method: Compositional matrix adjust.
Identities = 200/298 (67%), Positives = 241/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV E VFSFWLNR D +EGGE+VFGGVDP HFKG+HTYVPVT+KGYWQF++G++
Sbjct: 132 MVKQGLVKEPVFSFWLNRKTDDDEGGELVFGGVDPDHFKGEHTYVPVTQKGYWQFDMGEV 191
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI ++TG C GGCAAI DSGTSLLAGPT VV INHAIG GVVS ECK VV+QYG+ I
Sbjct: 192 LIDGETTGYCAGGCAAIADSGTSLLAGPTAVVAMINHAIGATGVVSQECKTVVAQYGETI 251
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
DLL+S P+K+C QIGLC F+G V GI++VV+++N S A CSACEMAVV
Sbjct: 252 MDLLLSEASPQKICSQIGLCTFDGTRGVGMGIESVVDEKNGDKSSGVHDAGCSACEMAVV 311
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q QTKE++L Y+NELCD LP+PMGES +DC ++ +MPNVS TIG K+F+LS
Sbjct: 312 WMQSQLRQNQTKERILEYVNELCDRLPSPMGESAVDCLQLSSMPNVSLTIGGKVFDLSAN 371
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+Y+LK GEG A CISGF+A D+PPPRGPLWILGDVFMG YHTVFD G +R+GFAEAA
Sbjct: 372 EYVLKVGEGAAAQCISGFIAMDVPPPRGPLWILGDVFMGRYHTVFDYGNMRVGFAEAA 429
>gi|297849560|ref|XP_002892661.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338503|gb|EFH68920.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 506
Score = 426 bits (1096), Expect = e-117, Method: Compositional matrix adjust.
Identities = 195/298 (65%), Positives = 238/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFW NR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+
Sbjct: 209 MLKQGLIKEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK VV QYG I
Sbjct: 269 LIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL+S P+K+C QIGLC F+G VS GI++VV+KEN + A CSACEMAVV
Sbjct: 329 LDLLLSETQPKKICSQIGLCTFDGTRGVSMGIESVVDKENSKLSNGVGDAACSACEMAVV 388
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q T+E++L+Y+NELC+ LP+PMGES +DC ++ TMP VS TIG K+F+L+PE
Sbjct: 389 WIQSQLRQNMTQERILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPE 448
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+Y+LK GEG CISGF+A D+ PPRGPLWILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 449 EYVLKVGEGPVAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 506
>gi|222424506|dbj|BAH20208.1| AT1G11910 [Arabidopsis thaliana]
Length = 389
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/298 (65%), Positives = 239/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+
Sbjct: 92 MLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDV 151
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK VV QYG I
Sbjct: 152 LIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTI 211
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL+S P+K+C QIGLC F+G VS GI++VV+KEN + A CSACEMAVV
Sbjct: 212 LDLLLSETQPKKICSQIGLCTFDGTRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVV 271
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q T+E++L+Y+NELC+ LP+PMGES +DC ++ TMP VS TIG K+F+L+PE
Sbjct: 272 WIQSQLRQNMTQERILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPE 331
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+Y+LK GEG CISGF+A D+ PPRGPLWILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 332 EYVLKVGEGPVAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 389
>gi|351724625|ref|NP_001237064.1| aspartic proteinase 1 precursor [Glycine max]
gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max]
Length = 514
Score = 426 bits (1094), Expect = e-117, Method: Compositional matrix adjust.
Identities = 194/298 (65%), Positives = 240/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+ E VFSFW NR+P+ EEGGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 217 MVDQGLLKEPVFSFWFNRNPEEEEGGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG + TG C GC+AI DSGTSLLAGPT V+T INHAIG GV+S ECK +V++YG I
Sbjct: 277 LIGGKPTGYCANGCSAIADSGTSLLAGPTTVITMINHAIGASGVMSQECKTIVAEYGQTI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV---EKENVSAGDSAVCSACEMAVV 177
DLL++ P+K+C +IGLCAF+G V GIK+VV E++++ A C ACEMAVV
Sbjct: 337 LDLLLAETQPKKICSRIGLCAFDGTHGVDVGIKSVVDENERKSLGGHHGAACPACEMAVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL + QT++++LSYIN+LCD +P+PMGES +DC I ++P VSFTIG + F+LSPE
Sbjct: 397 WMQNQLSRNQTQDQILSYINQLCDKMPSPMGESAVDCGNISSLPVVSFTIGGRTFDLSPE 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+Y+LK GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD GKLR+GFA+AA
Sbjct: 457 EYVLKVGEGPVAQCISGFTAIDIPPPRGPLWILGDVFMGRYHTVFDFGKLRVGFADAA 514
>gi|224118038|ref|XP_002331542.1| predicted protein [Populus trichocarpa]
gi|222873766|gb|EEF10897.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 192/299 (64%), Positives = 243/299 (81%), Gaps = 5/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++ GL+ E VFSFWLNR+ + EEGGEIVFGG+DP H+KGKHT+VPVT+KGYWQF +GD+
Sbjct: 215 MIKHGLIKEPVFSFWLNRNAEDEEGGEIVFGGMDPNHYKGKHTFVPVTRKGYWQFNMGDV 274
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
IG++ TG C GCAAI DSGTSLLAGPT ++T IN AIG GVVS +CK VVSQYG+ I
Sbjct: 275 HIGDKPTGYCASGCAAIADSGTSLLAGPTTIITMINQAIGASGVVSQQCKAVVSQYGEAI 334
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA----GDSAVCSACEMAV 176
DLL+S P+++C QIGLC F+G +S GI++VV++ N + GD A+C ACEMAV
Sbjct: 335 MDLLLSQAQPKRICSQIGLCTFDGTRGISIGIQSVVDEGNDKSSGFLGD-AMCPACEMAV 393
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+++QLKQ QT++++L Y+N+LC+ +PNPMGES +DC+ +P+MP V+FTIG K F L+P
Sbjct: 394 VWMRSQLKQNQTQDRILDYVNQLCERMPNPMGESAVDCESVPSMPTVAFTIGGKEFELAP 453
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
E+YILK G+G A CISGF A D+PPPRGPLWILGD+FMG YHTVFDSGKLR+GFAEAA
Sbjct: 454 EEYILKVGQGSAAQCISGFTALDIPPPRGPLWILGDIFMGRYHTVFDSGKLRVGFAEAA 512
>gi|218143|dbj|BAA02242.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 496
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 191/295 (64%), Positives = 242/295 (82%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L++++VFSFWLNRDPDA GGE+VFGG+DPKH+KG HTYVPV++KGYWQF +GD+
Sbjct: 204 MQEQELLADDVFSFWLNRDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++I
Sbjct: 264 LIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKEVVSEYGEMI 323
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+LL++ P+KVC Q+GLC F+G VS GI++VV+KEN+ G A+CS CEMAVVW++
Sbjct: 324 LNLLIAQTDPQKVCSQVGLCMFDGKRSVSNGIESVVDKENL--GSDAMCSVCEMAVVWIE 381
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +TKE +L+Y N+LC+ LP+P GES + C +I MPN++FTI +K F L+PEQYI
Sbjct: 382 NQLRENKTKELILNYANQLCERLPSPNGESTVSCHQISKMPNLAFTIANKTFILTPEQYI 441
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K +G VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD GK RIGFA++A
Sbjct: 442 VKLEQGGQTVCISGFMAFDIPPPRGPLWILGDVFMGAYHTVFDFGKDRIGFAKSA 496
>gi|115461973|ref|NP_001054586.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|78099760|sp|P42211.2|ASPRX_ORYSJ RecName: Full=Aspartic proteinase; Flags: Precursor
gi|46485798|gb|AAS98423.1| aspartic proteinase [Oryza sativa Japonica Group]
gi|113578137|dbj|BAF16500.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|215694423|dbj|BAG89416.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 496
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 191/295 (64%), Positives = 242/295 (82%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L++++VFSFWLNRDPDA GGE+VFGG+DPKH+KG HTYVPV++KGYWQF +GD+
Sbjct: 204 MQEQELLADDVFSFWLNRDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++I
Sbjct: 264 LIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKEVVSEYGEMI 323
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+LL++ P+KVC Q+GLC F+G VS GI++VV+KEN+ G A+CS CEMAVVW++
Sbjct: 324 LNLLIAQTDPQKVCSQVGLCMFDGKRSVSNGIESVVDKENL--GSDAMCSVCEMAVVWIE 381
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +TKE +L+Y N+LC+ LP+P GES + C +I MPN++FTI +K F L+PEQYI
Sbjct: 382 NQLRENKTKELILNYANQLCERLPSPNGESTVSCHQISKMPNLAFTIANKTFILTPEQYI 441
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K +G VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD GK RIGFA++A
Sbjct: 442 VKLEQGGQTVCISGFMAFDIPPPRGPLWILGDVFMGAYHTVFDFGKDRIGFAKSA 496
>gi|226497182|ref|NP_001152501.1| retrotransposon protein SINE subclass precursor [Zea mays]
gi|195624058|gb|ACG33859.1| retrotransposon protein SINE subclass [Zea mays]
gi|195656921|gb|ACG47928.1| retrotransposon protein SINE subclass [Zea mays]
gi|413946824|gb|AFW79473.1| retrotransposon protein SINE subclass isoform 1 [Zea mays]
gi|413946825|gb|AFW79474.1| retrotransposon protein SINE subclass isoform 2 [Zea mays]
gi|413946826|gb|AFW79475.1| retrotransposon protein SINE subclass isoform 3 [Zea mays]
Length = 504
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 193/296 (65%), Positives = 242/296 (81%), Gaps = 3/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGG-EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M +Q LV+++VFSFWLNRDPDA GG E+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD
Sbjct: 211 MKQQKLVAKDVFSFWLNRDPDASSGGGELVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGD 270
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
++IG STG C GGCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++
Sbjct: 271 LIIGGHSTGFCAGGCAAIVDSGTSLLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEM 330
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL+S P+KVC QIGLC F+GA VS I++VVEK+ G C+ACEMAVVW+
Sbjct: 331 ILELLISQTSPQKVCTQIGLCVFDGAHSVSNPIESVVEKQK--RGSDLFCTACEMAVVWI 388
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL++ +TKE +L+Y N+LC+ LP+P GES +DC +I MPN++FTI +K F L+PEQY
Sbjct: 389 QNQLRENKTKELILNYANQLCERLPSPNGESTVDCHQISKMPNLAFTIANKTFTLTPEQY 448
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
I+K + +CISGFMAFD+PPPRGPLWILGDVFMG YHTVFD G+ RIGFA++A
Sbjct: 449 IVKLEQAGQTICISGFMAFDVPPPRGPLWILGDVFMGAYHTVFDFGENRIGFAKSA 504
>gi|413946821|gb|AFW79470.1| retrotransposon protein SINE subclass isoform 1 [Zea mays]
gi|413946822|gb|AFW79471.1| retrotransposon protein SINE subclass isoform 2 [Zea mays]
Length = 545
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 193/296 (65%), Positives = 242/296 (81%), Gaps = 3/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGG-EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M +Q LV+++VFSFWLNRDPDA GG E+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD
Sbjct: 252 MKQQKLVAKDVFSFWLNRDPDASSGGGELVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGD 311
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
++IG STG C GGCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++
Sbjct: 312 LIIGGHSTGFCAGGCAAIVDSGTSLLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEM 371
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL+S P+KVC QIGLC F+GA VS I++VVEK+ G C+ACEMAVVW+
Sbjct: 372 ILELLISQTSPQKVCTQIGLCVFDGAHSVSNPIESVVEKQK--RGSDLFCTACEMAVVWI 429
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL++ +TKE +L+Y N+LC+ LP+P GES +DC +I MPN++FTI +K F L+PEQY
Sbjct: 430 QNQLRENKTKELILNYANQLCERLPSPNGESTVDCHQISKMPNLAFTIANKTFTLTPEQY 489
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
I+K + +CISGFMAFD+PPPRGPLWILGDVFMG YHTVFD G+ RIGFA++A
Sbjct: 490 IVKLEQAGQTICISGFMAFDVPPPRGPLWILGDVFMGAYHTVFDFGENRIGFAKSA 545
>gi|255578112|ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223530603|gb|EEF32480.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 514
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 194/298 (65%), Positives = 242/298 (81%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NR+ D +EGGEIVFGG+DP H+KG+HTYVPVT+KGYWQF++GD+
Sbjct: 217 MVNQGLVKEPVFSFWFNRNADEDEGGEIVFGGMDPNHYKGEHTYVPVTQKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI ++TG+C GCAAI DSGTSLLAGPT ++TE+NHAIG GVVS ECK VV+QYG+ I
Sbjct: 277 LIDGKTTGICSSGCAAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECKAVVAQYGETI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK--ENVSAG-DSAVCSACEMAVV 177
+L++ P+K+C QIGLC F+G+ VS GI++VV + + V+ G A+CS CEMAVV
Sbjct: 337 IAMLLAKDQPQKICSQIGLCTFDGSRGVSMGIESVVNEKIQEVAGGLHDAMCSTCEMAVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQ QT+E +L+Y+NELC+ LP+PMGES +DC + TMPNVSFTIG ++F+L+PE
Sbjct: 397 WMQNQLKQNQTQEHILNYVNELCERLPSPMGESAVDCGSLSTMPNVSFTIGGRVFDLAPE 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+LK G+G A CISGF A D+PPPRGPLWILGDVFMG +HTVFD G R+GFAE A
Sbjct: 457 QYVLKVGDGEAAQCISGFTALDVPPPRGPLWILGDVFMGPFHTVFDYGNKRVGFAEVA 514
>gi|2811025|sp|O04057.1|ASPR_CUCPE RecName: Full=Aspartic proteinase; Flags: Precursor
gi|1944181|dbj|BAA19607.1| aspartic endopeptidase [Cucurbita pepo]
Length = 513
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 202/298 (67%), Positives = 245/298 (82%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV E VFSFWLNR+ + EEGGEIVFGGVDPKH++GKHTYVPVT+KGYWQF++GD+
Sbjct: 216 MVEQGLVKEPVFSFWLNRNVEEEEGGEIVFGGVDPKHYRGKHTYVPVTQKGYWQFDMGDV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI + TG C+GGC+AI DSGTSLLAGPTPV+T INHAIG +GVVS +CK VV+QYG I
Sbjct: 276 LIDGEPTGFCDGGCSAIADSGTSLLAGPTPVITMINHAIGAKGVVSQQCKAVVAQYGQTI 335
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVV 177
DLL+S P+K+C QI LC F+G VS GI++VV++ + DS +CS CEM VV
Sbjct: 336 MDLLLSEADPKKICSQINLCTFDGTRGVSMGIESVVDENAGKSSDSLHDGMCSVCEMTVV 395
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL+Q QTKE++++YINELCD +P+PMG+S +DC ++ +MP VSFTIG KIF+L+PE
Sbjct: 396 WMQNQLRQNQTKERIINYINELCDRMPSPMGQSAVDCGQLSSMPTVSFTIGGKIFDLAPE 455
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG CISGF AFD+PPPRGPLWILGDVFMG YHTVFD GKLR+G AEAA
Sbjct: 456 EYILKVGEGPVAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDFGKLRVGSAEAA 513
>gi|356522015|ref|XP_003529645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 514
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 199/298 (66%), Positives = 237/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+ E VFSFW NR P+ EEGGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 217 MVDQGLIKEPVFSFWFNRKPEEEEGGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG + TG C GC+AI DSGTSLLAGPT V+T INHAIG GV+S ECK VV++YG I
Sbjct: 277 LIGGKPTGYCADGCSAIADSGTSLLAGPTTVITMINHAIGASGVMSQECKTVVAEYGQTI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV-EKENVSAG--DSAVCSACEMAVV 177
DLL+S P+K+C +IGLCAF+G V GIK+VV E E S+G A C ACEMAVV
Sbjct: 337 LDLLLSETQPKKICSRIGLCAFDGTRGVDVGIKSVVDENERKSSGGHHGAACPACEMAVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL + QT++++LSYIN+LCD +P+PMGES +DC I ++P VSFTIG + F LSPE
Sbjct: 397 WMQNQLSRNQTQDQILSYINQLCDKMPSPMGESAVDCGNISSLPVVSFTIGGRTFELSPE 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD GK R+GFA+AA
Sbjct: 457 EYILKVGEGPVAQCISGFTAIDIPPPRGPLWILGDVFMGRYHTVFDFGKQRVGFADAA 514
>gi|218196057|gb|EEC78484.1| hypothetical protein OsI_18377 [Oryza sativa Indica Group]
Length = 389
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 191/295 (64%), Positives = 242/295 (82%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L++++VFSFWLNRDPDA GGE+VFGG+DPKH+KG HTYVPV++KGYWQF +GD+
Sbjct: 97 MQEQELLADDVFSFWLNRDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDL 156
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++I
Sbjct: 157 LIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKEVVSEYGEMI 216
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+LL++ P+KVC Q+GLC F+G VS GI++VV+KEN+ G A+CS CEMAVVW++
Sbjct: 217 LNLLIAQTDPQKVCSQVGLCMFDGKRSVSNGIESVVDKENL--GSDAMCSVCEMAVVWIE 274
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +TKE +L+Y N+LC+ LP+P GES + C +I MPN++FTI +K F L+PEQYI
Sbjct: 275 NQLRENKTKELILNYANQLCERLPSPNGESTVSCHQISKMPNLAFTIANKTFILTPEQYI 334
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K +G VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD GK RIGFA++A
Sbjct: 335 VKLEQGGQTVCISGFMAFDIPPPRGPLWILGDVFMGAYHTVFDFGKDRIGFAKSA 389
>gi|194706186|gb|ACF87177.1| unknown [Zea mays]
Length = 504
Score = 422 bits (1085), Expect = e-116, Method: Compositional matrix adjust.
Identities = 192/296 (64%), Positives = 241/296 (81%), Gaps = 3/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGG-EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M +Q LV+++VFSFWLNRDPDA GG E+VFGGVDPKH+KG HTYVP T+KGYWQF++GD
Sbjct: 211 MKQQKLVAKDVFSFWLNRDPDASSGGGELVFGGVDPKHYKGDHTYVPATRKGYWQFDMGD 270
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
++IG STG C GGCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++
Sbjct: 271 LIIGGHSTGFCAGGCAAIVDSGTSLLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEM 330
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL+S P+KVC QIGLC F+GA VS I++VVEK+ G C+ACEMAVVW+
Sbjct: 331 ILELLISQTSPQKVCTQIGLCVFDGAHSVSNPIESVVEKQK--RGSDLFCTACEMAVVWI 388
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL++ +TKE +L+Y N+LC+ LP+P GES +DC +I MPN++FTI +K F L+PEQY
Sbjct: 389 QNQLRENKTKELILNYANQLCERLPSPNGESTVDCHQISKMPNLAFTIANKTFTLTPEQY 448
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
I+K + +CISGFMAFD+PPPRGPLWILGDVFMG YHTVFD G+ RIGFA++A
Sbjct: 449 IVKLEQAGQTICISGFMAFDVPPPRGPLWILGDVFMGAYHTVFDFGENRIGFAKSA 504
>gi|261264941|gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]
Length = 513
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 193/297 (64%), Positives = 236/297 (79%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 217 MVKQGLVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI Q+TG C GC+AI DSGTSLLAGPT ++TE+NHAIG GVVS ECK VV++YG+ I
Sbjct: 277 LIDGQTTGFCARGCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECKAVVAEYGETI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG--DSAVCSACEMAVVW 178
+L+ P K+C QIGLC F+G VS I++VV+ ++ A+CS CEM VVW
Sbjct: 337 IKMLLEKDQPMKICSQIGLCTFDGVRGVSMDIESVVDNTRKASNGLRDAMCSTCEMTVVW 396
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQ QT++++L+Y+NELCD LP+PMGES +DC + ++PNVS TIG ++F+LSPEQ
Sbjct: 397 MQNQLKQNQTQDRILTYVNELCDRLPSPMGESAVDCGSLSSLPNVSLTIGGRVFDLSPEQ 456
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
Y+LK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 457 YVLKVGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNQRVGFAEAA 513
>gi|219887925|gb|ACL54337.1| unknown [Zea mays]
Length = 504
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 193/296 (65%), Positives = 241/296 (81%), Gaps = 3/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGG-EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M +Q LV+++VFSFWLNRDPDA GG E VFGGVDPKH+KG HTYVPVT+KGYWQF++GD
Sbjct: 211 MKQQKLVAKDVFSFWLNRDPDASSGGGEPVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGD 270
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
++IG STG C GGCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++
Sbjct: 271 LIIGGHSTGFCAGGCAAIVDSGTSLLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEM 330
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL+S P+KVC QIGLC F+GA VS I++VVEK+ G C+ACEMAVVW+
Sbjct: 331 ILELLISQTSPQKVCTQIGLCVFDGAHSVSNPIESVVEKQK--RGSDLFCTACEMAVVWI 388
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL++ +TKE +L+Y N+LC+ LP+P GES +DC +I MPN++FTI +K F L+PEQY
Sbjct: 389 QNQLRENKTKELILNYANQLCERLPSPNGESTVDCHQISKMPNLAFTIANKTFTLTPEQY 448
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
I+K + +CISGFMAFD+PPPRGPLWILGDVFMG YHTVFD G+ RIGFA++A
Sbjct: 449 IVKLEQAGQTICISGFMAFDVPPPRGPLWILGDVFMGAYHTVFDFGENRIGFAKSA 504
>gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata]
Length = 514
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 194/298 (65%), Positives = 238/298 (79%), Gaps = 5/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV+E VFSFWLNR+ EEGGEIVFGGVDP H+KG+HT+VPVT KGYWQF++ D+
Sbjct: 217 MVKQGLVNEPVFSFWLNRNATEEEGGEIVFGGVDPNHYKGEHTFVPVTHKGYWQFDMDDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G ++TG C GGC+AI DSGTSLLAGPT +V +INHAIG GVVS ECK VV+QYG I
Sbjct: 277 LVGGETTGYCSGGCSAIADSGTSLLAGPTTIVAQINHAIGASGVVSQECKAVVAQYGTAI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS----AVCSACEMAV 176
D+L+S P+K+C QIGLC F+G VS GIK+VV+ NV S A C+ACEM V
Sbjct: 337 LDMLISETQPKKICSQIGLCTFDGKRGVSVGIKSVVDM-NVDGSSSGLQDATCTACEMTV 395
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQLKQ QT+E++L+Y+NELC+ LP+PMGES +DC + +MP VSFT+G K+F+L P
Sbjct: 396 VWMQNQLKQNQTEERILNYVNELCNRLPSPMGESAVDCSSLSSMPGVSFTVGGKVFDLLP 455
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
EQYIL+ GEG+A CISGF A D+ PP GPLWILGD+FMG YHTVFD G +R+GFAEA
Sbjct: 456 EQYILQVGEGVATQCISGFTALDVAPPLGPLWILGDIFMGQYHTVFDYGNMRVGFAEA 513
>gi|356532081|ref|XP_003534602.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 196/298 (65%), Positives = 239/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV + VFSFWLNR P+ E GGE+VFGG DP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 210 MVEQGLVKDPVFSFWLNRKPEEENGGELVFGGADPAHYKGKHTYVPVTRKGYWQFDMGDV 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI + TG C C+AI DSGTSLLAGPT VVT IN AIG GVVS EC+ VV+QYG I
Sbjct: 270 LIAGKPTGYCADDCSAIADSGTSLLAGPTTVVTMINQAIGASGVVSKECRSVVNQYGQTI 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK-ENVSAGD--SAVCSACEMAVV 177
+LL++ P+K+C QIGLC F+G VS GI++VV+K E S+G A CSACEMAV+
Sbjct: 330 LELLLAEAKPKKICSQIGLCTFDGTHGVSMGIESVVDKNERKSSGSIRDAGCSACEMAVI 389
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL+Q QT+++++ Y NELCD LPNPMG+S +DC+++ +MP VSFTIG K+F+LSP+
Sbjct: 390 WMQNQLRQNQTEDRIIDYANELCDKLPNPMGQSSVDCEKLSSMPIVSFTIGGKVFDLSPQ 449
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG CISGF A D+PPPRGPLWILGDVFMG YHT+FD GKLR+GFAEAA
Sbjct: 450 EYILKVGEGPEAQCISGFTALDVPPPRGPLWILGDVFMGRYHTIFDYGKLRVGFAEAA 507
>gi|218188020|gb|EEC70447.1| hypothetical protein OsI_01478 [Oryza sativa Indica Group]
Length = 495
Score = 420 bits (1079), Expect = e-115, Method: Compositional matrix adjust.
Identities = 192/295 (65%), Positives = 233/295 (78%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L+ ++VFSFWLNRDPDA GGE++FGGVDP H+KG HTYVPVT+KGYWQFE+GD+
Sbjct: 203 MKEQQLIEKDVFSFWLNRDPDAPTGGELIFGGVDPNHYKGSHTYVPVTRKGYWQFEMGDL 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI + STG C GGCAAI DSGTSLL GPT +V +INHAIG EG+VS ECK VV YGD+I
Sbjct: 263 LIDDYSTGFCSGGCAAIADSGTSLLGGPTTIVAQINHAIGAEGIVSMECKQVVRDYGDMI 322
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
++L++ P K+C QIGLCAF+G V I++VV+KE V G C+ACEMAVVW+Q
Sbjct: 323 LEMLIAQASPMKLCSQIGLCAFDGTRSVRNNIESVVDKEKV--GSDLSCTACEMAVVWIQ 380
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL+ QT+E +L Y ++LC+ LP+P GES +DCD I MPN+SFTI +K F L+PEQY+
Sbjct: 381 NQLRHNQTRELILQYADQLCERLPSPNGESAVDCDEISNMPNLSFTIANKTFTLTPEQYV 440
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K + VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD GK RIGFAE+A
Sbjct: 441 VKLEQQGQTVCISGFMAFDVPPPRGPLWILGDVFMGAYHTVFDFGKNRIGFAESA 495
>gi|261264943|gb|ACX55830.1| aspartic proteinase 2 [Castanea mollissima]
Length = 513
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 192/297 (64%), Positives = 237/297 (79%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 217 MVKQGLVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI Q+TG C C+AI DSGTSLLAGPT ++TE+NHAIG GVVS ECK VV++YG+ I
Sbjct: 277 LIDGQTTGFCVTTCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECKAVVAEYGETI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG--DSAVCSACEMAVVW 178
+L+ P K+C QIGLC F+G + VS I++VV+ + ++ A+CS CEM VVW
Sbjct: 337 IKMLLEKDQPMKICSQIGLCTFDGTQGVSMDIESVVDNTHKASNGLRDAMCSTCEMTVVW 396
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QNQLKQ QT++++L+Y+NELCD LP+PMGES +DC + ++PNVS TIG ++F+LSPEQ
Sbjct: 397 MQNQLKQNQTQDRILTYVNELCDRLPSPMGESAVDCGSLSSLPNVSLTIGGRVFDLSPEQ 456
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
Y+LK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 457 YVLKVGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNQRVGFAEAA 513
>gi|1665867|emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
Length = 509
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 193/298 (64%), Positives = 241/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF +GD+
Sbjct: 212 MVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFNMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI +++TG C GCAAI DSGTSLLAGPT ++T+INHAIG +GV+S +CK +V QYG I
Sbjct: 272 LIEDKTTGFCADGCAAIADSGTSLLAGPTAIITQINHAIGAKGVMSQQCKTLVDQYGKTI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN-VSAG--DSAVCSACEMAVV 177
++L+S P+K+C Q+ LC F+GA VS+ I++VV+K N S+G +C+ CEMAVV
Sbjct: 332 IEMLLSEAQPDKICSQMKLCTFDGARDVSSIIESVVDKNNGKSSGGVHDEMCTFCEMAVV 391
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQ+K+ QT++ +++Y+NELCD LP+PMGES +DC+ + +MPN++FTIG K+F L PE
Sbjct: 392 WMQNQIKRNQTEDNIINYVNELCDRLPSPMGESAVDCNDLSSMPNIAFTIGGKVFELCPE 451
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK GEG A CISGF A D+ PPRGPLWILGDVFMG YHTVFD GKLR+GFAEAA
Sbjct: 452 QYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGQYHTVFDYGKLRVGFAEAA 509
>gi|1030715|dbj|BAA06876.1| aspartic protease [Oryza sativa]
gi|1711289|dbj|BAA06875.1| aspartic protease [Oryza sativa]
Length = 509
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 197/301 (65%), Positives = 236/301 (78%), Gaps = 9/301 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLVSE VFSFW NR D EGGEIVFGG+DP H+KG HTYVPV++KGYWQFE+GD+
Sbjct: 212 MVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQFEMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG ++TG C GC+AI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 272 LIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQECKTVVSQYGQQI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS------AVCSACEM 174
DLL++ P K+C Q+GLC F+G VS GIK+VV+ E AG+S +C+ACEM
Sbjct: 332 LDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSVVDDE---AGESNGLQSGPMCNACEM 388
Query: 175 AVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNL 234
AVVW+QNQL Q +T++ +L+YIN+LCD LP+PMGES +DC + +MP +SFTIG K F L
Sbjct: 389 AVVWMQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMPEISFTIGAKKFAL 448
Query: 235 SPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PE+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD GK+R+GFA++
Sbjct: 449 KPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVFDYGKMRVGFAKS 508
Query: 295 A 295
A
Sbjct: 509 A 509
>gi|115465497|ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|78099759|sp|Q42456.2|ASPR1_ORYSJ RecName: Full=Aspartic proteinase oryzasin-1; Flags: Precursor
gi|51854282|gb|AAU10663.1| aspartic proteinase oryzasin 1 precursor [Oryza sativa Japonica
Group]
gi|113579899|dbj|BAF18262.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|125553350|gb|EAY99059.1| hypothetical protein OsI_21016 [Oryza sativa Indica Group]
gi|169244443|gb|ACA50495.1| aspartic proteinase oryzasin 1 [Oryza sativa Japonica Group]
gi|215695381|dbj|BAG90572.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737145|dbj|BAG96074.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740829|dbj|BAG96985.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632587|gb|EEE64719.1| hypothetical protein OsJ_19575 [Oryza sativa Japonica Group]
Length = 509
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 197/301 (65%), Positives = 236/301 (78%), Gaps = 9/301 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLVSE VFSFW NR D EGGEIVFGG+DP H+KG HTYVPV++KGYWQFE+GD+
Sbjct: 212 MVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQFEMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG ++TG C GC+AI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 272 LIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQECKTVVSQYGQQI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS------AVCSACEM 174
DLL++ P K+C Q+GLC F+G VS GIK+VV+ E AG+S +C+ACEM
Sbjct: 332 LDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSVVDDE---AGESNGLQSGPMCNACEM 388
Query: 175 AVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNL 234
AVVW+QNQL Q +T++ +L+YIN+LCD LP+PMGES +DC + +MP +SFTIG K F L
Sbjct: 389 AVVWMQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMPEISFTIGGKKFAL 448
Query: 235 SPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PE+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD GK+R+GFA++
Sbjct: 449 KPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVFDYGKMRVGFAKS 508
Query: 295 A 295
A
Sbjct: 509 A 509
>gi|115436054|ref|NP_001042785.1| Os01g0290000 [Oryza sativa Japonica Group]
gi|8467954|dbj|BAA96578.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113532316|dbj|BAF04699.1| Os01g0290000 [Oryza sativa Japonica Group]
gi|215694819|dbj|BAG90010.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215701475|dbj|BAG92899.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222618242|gb|EEE54374.1| hypothetical protein OsJ_01384 [Oryza sativa Japonica Group]
Length = 495
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 191/295 (64%), Positives = 232/295 (78%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L+ ++VFSFWLNRDPDA GGE++FGGVDP H+KG HTYVPVT+KGYWQFE+GD+
Sbjct: 203 MKEQQLIEKDVFSFWLNRDPDAPTGGELIFGGVDPNHYKGSHTYVPVTRKGYWQFEMGDL 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI + STG C GGCAAI DSGTSLL GPT +V +INHAIG EG+VS ECK VV YGD+I
Sbjct: 263 LIDDYSTGFCSGGCAAIADSGTSLLGGPTTIVAQINHAIGAEGIVSMECKQVVRDYGDMI 322
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
++L++ P K+C QIGLCAF+G V I++VV+KE V G C+ACEMAVVW+Q
Sbjct: 323 LEMLIAQASPMKLCSQIGLCAFDGTRSVRNNIESVVDKEKV--GSDLSCTACEMAVVWIQ 380
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL+ QT+E +L Y ++LC+ LP+P GES +DCD I MPN+SFTI +K F L+PEQY+
Sbjct: 381 NQLRHNQTRELILQYADQLCERLPSPNGESAVDCDEISNMPNLSFTIANKTFTLTPEQYV 440
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K + VCISGFMAFD+PPPRGPLWILGDVFM YHTVFD GK RIGFAE+A
Sbjct: 441 VKLEQQGQTVCISGFMAFDVPPPRGPLWILGDVFMAAYHTVFDFGKNRIGFAESA 495
>gi|225460913|ref|XP_002279049.1| PREDICTED: aspartic proteinase [Vitis vinifera]
gi|297737462|emb|CBI26663.3| unnamed protein product [Vitis vinifera]
Length = 514
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 188/298 (63%), Positives = 234/298 (78%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+ E +FSFW NR+ + E GGEIVFGG+D H+KGKHTYVPVTKKGYWQF+LGD+
Sbjct: 217 MINQELIKEPIFSFWFNRNSNEEVGGEIVFGGIDSDHYKGKHTYVPVTKKGYWQFDLGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+IG ++TG C GC+AI DSGTSLLAGPT ++TE+NHAIG G VS EC+ VV QYG +I
Sbjct: 277 MIGGKTTGFCASGCSAIADSGTSLLAGPTTIITEVNHAIGASGFVSQECRAVVQQYGQII 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
D+L++ P+K+C QIGLCAFNG VS GI++VV++ N A D +CSAC MAVV
Sbjct: 337 IDMLLTKEQPQKICSQIGLCAFNGIRGVSMGIESVVDENNSKASDGLHDTMCSACSMAVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QN+L Q +T +++L Y+NELCD LP+PMGES +DC + +MPNVS TIG K+F+LSP+
Sbjct: 397 WIQNKLGQNETIDRILKYVNELCDRLPSPMGESAVDCGSLSSMPNVSLTIGGKVFDLSPK 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK GEG CISGF A D+PPP GPLWILGDVFMG YHTVFD G +++GFAEAA
Sbjct: 457 QYILKVGEGEIAQCISGFTALDVPPPHGPLWILGDVFMGQYHTVFDYGNMKVGFAEAA 514
>gi|12231176|dbj|BAB20971.1| aspartic proteinase 3 [Nepenthes alata]
Length = 507
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 202/299 (67%), Positives = 236/299 (78%), Gaps = 4/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFWLNR EEGGEIVFGGVDP HFKG H+YVPVT KGYWQF++GD+
Sbjct: 209 MVNQGLVKEPVFSFWLNRKVGEEEGGEIVFGGVDPNHFKGTHSYVPVTHKGYWQFDMGDV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI ++T CEGGC+AI DSGTSLLAGPT VVT INHAIG GVVS ECK VVSQYG I
Sbjct: 269 LIDGKATEYCEGGCSAIADSGTSLLAGPTSVVTMINHAIGATGVVSEECKAVVSQYGQTI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAG--DSAVCSACEMAV 176
DLL++ + PEK+C QIGLC F+G VS GIK+VV+KEN S+G A+C ACEMAV
Sbjct: 329 MDLLLAEVSPEKICSQIGLCTFDGTRGVSIGIKSVVDKENNGKSSGILRDALCPACEMAV 388
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+++QL+Q QT+ +L+Y+N+LCD LP+PMGES +DC RI +M VS TIG K+F+L P
Sbjct: 389 VWMKSQLEQNQTQNLILNYVNDLCDQLPSPMGESAVDCARISSMATVSSTIGGKVFDLRP 448
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQYIL+ GEG A CISGF A D+PPP GPLWILGD+ MG YHTVFD G LR+GFAEAA
Sbjct: 449 EQYILRVGEGPAAQCISGFTAMDIPPPGGPLWILGDILMGRYHTVFDYGNLRVGFAEAA 507
>gi|556819|emb|CAA57510.1| cyprosin [Cynara cardunculus]
Length = 509
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 190/298 (63%), Positives = 240/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+
Sbjct: 212 MVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTEKGYWQFDMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI +++TG C GCAAI DSGTSLLAGPT ++TEINHAIG +GV+S +CK +VSQYG +
Sbjct: 272 LIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHAIGAKGVMSQQCKTLVSQYGKTM 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
++L+S P+K+C Q+ LC F+GA S+ I++VV++ N S +C+ CEMAVV
Sbjct: 332 IEMLLSEAQPDKICSQMKLCTFDGARDASSIIESVVDENNGKSSSGVHDEMCTFCEMAVV 391
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQ+K+ +T++ +++Y+NELCD LP+PMGES +DC+ + +MPN++FTIG K+F L PE
Sbjct: 392 WMQNQIKRNETEDNIINYVNELCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFELCPE 451
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK GEG A CISGF A D+ PPRGPLWILGDVFMG YHTVFD GKLR+GFAEAA
Sbjct: 452 QYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA 509
>gi|414887123|tpg|DAA63137.1| TPA: hypothetical protein ZEAMMB73_794362 [Zea mays]
Length = 608
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 192/299 (64%), Positives = 233/299 (77%), Gaps = 4/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+
Sbjct: 310 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 369
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ +STG C GGCAA+ DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 370 LVDGKSTGFCAGGCAAVADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQI 429
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAV 176
DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAV
Sbjct: 430 LDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMAV 489
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F L P
Sbjct: 490 VWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKLKP 549
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 550 EQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAESA 608
>gi|223946977|gb|ACN27572.1| unknown [Zea mays]
gi|238014788|gb|ACR38429.1| unknown [Zea mays]
gi|413946556|gb|AFW79205.1| aspartic proteinase oryzasin-1 isoform 1 [Zea mays]
gi|413946557|gb|AFW79206.1| aspartic proteinase oryzasin-1 isoform 2 [Zea mays]
Length = 510
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 193/299 (64%), Positives = 233/299 (77%), Gaps = 4/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+
Sbjct: 212 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 272 LVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAV 176
DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAV
Sbjct: 332 LDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMAV 391
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F L P
Sbjct: 392 VWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKLKP 451
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 452 EQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAESA 510
>gi|425892460|gb|AFB73927.2| preprocirsin [Cirsium vulgare]
Length = 509
Score = 416 bits (1068), Expect = e-114, Method: Compositional matrix adjust.
Identities = 190/298 (63%), Positives = 239/298 (80%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NR+ + EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF +GD+
Sbjct: 212 MVNQGLVQEPVFSFWFNRNANEEEGGELVFGGVDPNHFKGKHTYVPVTEKGYWQFNMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI +++TG C GCAAI DSGTSLLAGPT ++TEINHA G +GV+S +CK +VSQYG I
Sbjct: 272 LIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHASGAKGVMSQQCKTLVSQYGKSI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSA---VCSACEMAVV 177
++L+S P+K+C Q+ LC F+GA VS+ I++VV+K N + A +C+ CEMAVV
Sbjct: 332 IEMLLSEAQPDKICSQMKLCTFDGARDVSSIIESVVDKNNGKSSGGANDEMCTFCEMAVV 391
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQ+K+ +T++ +++Y+NELCD LP+PMGES +DC+ + +MPN++FTIG K+F L PE
Sbjct: 392 WMQNQIKRNETEDNIINYVNELCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFELCPE 451
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK GEG A CISGF A D+ PPRGPLWILGDVFMG YHTVFD GK R+GFAEAA
Sbjct: 452 QYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDYGKSRVGFAEAA 509
>gi|226503984|ref|NP_001148782.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195622118|gb|ACG32889.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 510
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 192/298 (64%), Positives = 232/298 (77%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+
Sbjct: 212 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 272 LVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAV 176
DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAV
Sbjct: 332 LDLLLAETQPTKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMAV 391
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F L P
Sbjct: 392 VWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKLKP 451
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
EQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+
Sbjct: 452 EQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAES 509
>gi|356555682|ref|XP_003546159.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 507
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 195/298 (65%), Positives = 237/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV + VFSFWLNR P+ E GGE+VFGG DP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 210 MVEQGLVKDPVFSFWLNRKPEEENGGELVFGGADPAHYKGKHTYVPVTRKGYWQFDMGDV 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI + TG C C+AI DSGTSLLAGPT V+T IN AIG GVVS EC+ VV+QYG I
Sbjct: 270 LISGKPTGYCTNDCSAIADSGTSLLAGPTTVITMINQAIGAAGVVSKECRSVVNQYGQTI 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK-ENVSAG--DSAVCSACEMAVV 177
+LL++ P+K+C QIGLC F+G VS GI++VV+K E S+G A CSACEMAV+
Sbjct: 330 LELLLAEAKPKKICSQIGLCTFDGTHGVSMGIESVVDKNEKKSSGGIRDAGCSACEMAVI 389
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL+Q QT+++++ Y NELC+ LPNPMG S +DC ++ +MP VSFTIG K+F+LSPE
Sbjct: 390 WMQNQLRQNQTEDRIIDYANELCEKLPNPMGPSSVDCGKLSSMPIVSFTIGGKVFDLSPE 449
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG CISGF A D+PPPRGPLWILGDVFMG YHT+FD GKLR+GFAEAA
Sbjct: 450 EYILKVGEGPEAQCISGFTALDVPPPRGPLWILGDVFMGRYHTIFDYGKLRVGFAEAA 507
>gi|148906206|gb|ABR16259.1| unknown [Picea sitchensis]
Length = 509
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 190/297 (63%), Positives = 233/297 (78%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGL+ E VFSFW+NR EEGGEIVFGGVDP HFKGKHTYVPVT++GYWQF +GD
Sbjct: 213 MVNQGLIKEPVFSFWMNRKVGDEEGGEIVFGGVDPNHFKGKHTYVPVTREGYWQFNMGDF 272
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG QSTG C GGCAAIVDSGTSLLAGP+ +V +IN AIG G+ S ECK VVSQYGDLI
Sbjct: 273 LIGGQSTGFCSGGCAAIVDSGTSLLAGPSGIVAQINEAIGASGLASQECKSVVSQYGDLI 332
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
+LL++ P+KVC QIGLC +G V I +V+EK N S S +C+ACEMAVVW
Sbjct: 333 MELLMAQTNPQKVCSQIGLCLSDGTRDVGMRIASVLEKGNEATSTSSSGMCAACEMAVVW 392
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+NQ+ + +K+++++Y+N+LCD LPNP G++ +DC+ + +MP VSFTIGD+ F+L+P+Q
Sbjct: 393 AKNQIARNASKDQIMTYLNQLCDRLPNPNGQAAVDCNNLSSMPTVSFTIGDRSFDLTPDQ 452
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILK GEG A CISGFM D+PPP GP+WILGDVFMGVYHTVFD G +R+GF EAA
Sbjct: 453 YILKVGEGSAAQCISGFMGLDVPPPMGPIWILGDVFMGVYHTVFDFGNMRVGFTEAA 509
>gi|73912435|dbj|BAE20414.1| aspartic proteinase [Triticum aestivum]
Length = 498
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 188/295 (63%), Positives = 235/295 (79%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L++++VFSFWLNRD DA GGE+VFGG+DP H+KG HTYVPV+++GYWQF +GD+
Sbjct: 206 MQEQKLLADDVFSFWLNRDSDALSGGELVFGGMDPDHYKGNHTYVPVSRRGYWQFNMGDL 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVSQYG++I
Sbjct: 266 LIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKEVVSQYGEMI 325
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+LL++ P+KVC QIGLC F+G VS GI++VV KENV G +C+ACEMAVVW++
Sbjct: 326 LELLIAQTQPQKVCSQIGLCLFDGTHSVSNGIESVVGKENV--GSDVMCTACEMAVVWIE 383
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +TKE +L Y N+LC+ LP+P GES + C + MPN++FTI K F L+PEQY+
Sbjct: 384 NQLRENKTKELILQYANQLCERLPSPNGESTVSCHEMSKMPNLAFTIASKTFVLTPEQYV 443
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K + VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD GK RIGFAE+A
Sbjct: 444 VKLEQSGQTVCISGFMAFDIPPPRGPLWILGDVFMGAYHTVFDFGKDRIGFAESA 498
>gi|413946558|gb|AFW79207.1| hypothetical protein ZEAMMB73_486493 [Zea mays]
Length = 382
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 193/299 (64%), Positives = 233/299 (77%), Gaps = 4/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+
Sbjct: 84 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 143
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 144 LVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQI 203
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAV 176
DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAV
Sbjct: 204 LDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMAV 263
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F L P
Sbjct: 264 VWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKLKP 323
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 324 EQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAESA 382
>gi|357132502|ref|XP_003567869.1| PREDICTED: phytepsin-like [Brachypodium distachyon]
Length = 505
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 192/298 (64%), Positives = 234/298 (78%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+S+ VFSFW NR EGGEIVFGG+DPKH+ G+HTYVPVT+KGYWQF++GD+
Sbjct: 208 MIEQGLISDPVFSFWFNRHAGEGEGGEIVFGGMDPKHYIGEHTYVPVTQKGYWQFDMGDV 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 268 LVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQI 327
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG---DSAVCSACEMAVV 177
DLL++ P+K+C Q+GLC F+G VS GI++VV+ E + +C+ACEMAVV
Sbjct: 328 LDLLLAETQPKKICSQVGLCTFDGTRGVSAGIRSVVDDEAEKSNGLHSDPMCNACEMAVV 387
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL Q +T++ +L+YIN+LCD LP+PMGES +DC + +MP + FTIG K F L PE
Sbjct: 388 WMQNQLSQNKTQDVILNYINQLCDRLPSPMGESSVDCGSLASMPEIEFTIGGKKFALKPE 447
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD GKLR+GFA+AA
Sbjct: 448 EYILKVGEGPAAQCISGFTAMDIPPPRGPLWILGDVFMGPYHTVFDYGKLRVGFAKAA 505
>gi|357134751|ref|XP_003568979.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 498
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 189/295 (64%), Positives = 235/295 (79%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +Q L++++VFSFWLNR+ DA GGE+VFGGVD H+KG HTYVPV++KGYWQF +GD+
Sbjct: 206 MQKQKLLADDVFSFWLNRNADATSGGELVFGGVDSNHYKGNHTYVPVSRKGYWQFNMGDL 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI QSTG C GCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVSQYG++I
Sbjct: 266 LIDGQSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKEVVSQYGEMI 325
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
DLL++ P+KVC Q+GLC F+G VS GI++VV KENV G +C+ACEMAVVW++
Sbjct: 326 LDLLLAQTEPQKVCSQVGLCLFDGTHSVSKGIESVVGKENV--GSDVMCTACEMAVVWIE 383
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +TKE +L Y N+LC+ LP+P GES + C I MPN++FTI K F L+PEQYI
Sbjct: 384 NQLRENKTKELILQYANQLCERLPSPNGESTVSCHEISKMPNLAFTIAGKTFVLTPEQYI 443
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K + VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD G+ RIGFAE+A
Sbjct: 444 VKLEQSGQTVCISGFMAFDIPPPRGPLWILGDVFMGAYHTVFDFGEDRIGFAESA 498
>gi|384040313|gb|AFH58568.1| aspartic acid protease [Ananas comosus]
Length = 514
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 189/298 (63%), Positives = 235/298 (78%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+ E VFSFW NR+ + EGGEIVFGG DP H+KG HTYVPVT+KGYWQFE+GD+
Sbjct: 217 MVDQGLIKEPVFSFWFNRNANDGEGGEIVFGGADPNHYKGNHTYVPVTQKGYWQFEMGDV 276
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G QSTG C GGCAAI DSGTSLLAGPT ++ EIN IG GVVS ECK VV++YG I
Sbjct: 277 LVGGQSTGFCNGGCAAIADSGTSLLAGPTTIIAEINQKIGASGVVSQECKAVVAEYGQQI 336
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE--NVSAGDS-AVCSACEMAVV 177
+L++ + P K+C IGLC F+G + VS GI++VV K+ +AG S A+C+ CEMAVV
Sbjct: 337 LQMLLAEVQPGKICSSIGLCTFDGKQGVSAGIESVVNKDTRRSAAGLSDAMCNVCEMAVV 396
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQ+ Q QT+E + +Y+N+LC+ LP+PMGES +DC + +MP++SFTIG K F+L PE
Sbjct: 397 WMQNQISQNQTQELIFNYLNQLCEKLPSPMGESSVDCSSVASMPDISFTIGGKKFSLKPE 456
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYIL+ GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD G +R+GFA+AA
Sbjct: 457 QYILQVGEGYAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNMRVGFADAA 514
>gi|226506070|ref|NP_001150729.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195641348|gb|ACG40142.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 518
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 193/299 (64%), Positives = 232/299 (77%), Gaps = 4/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+S+ VFSFW NR EGGEIVFGG+D H+KG HTYVPVT+KGYWQF +GD+
Sbjct: 220 MVKQGLISDPVFSFWFNRHAGEGEGGEIVFGGMDSSHYKGDHTYVPVTQKGYWQFNMGDV 279
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 280 LVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQI 339
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAV 176
DLL++ P K+C Q+GLC F+G VSTGI++VV+ K N +C+ACEMAV
Sbjct: 340 LDLLLAETQPAKICSQVGLCTFDGTHGVSTGIRSVVDDKAGKSNGGLKSDPMCNACEMAV 399
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F L P
Sbjct: 400 VWMQNQLAQNKTQELILTYINQLCERLPSPMGESAVDCASLGSMPDIAFTIGGKKFKLKP 459
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD KLR+GFAE+A
Sbjct: 460 EQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYXKLRVGFAESA 518
>gi|326494022|dbj|BAJ85473.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511208|dbj|BAJ87618.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 498
Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust.
Identities = 187/295 (63%), Positives = 236/295 (80%), Gaps = 2/295 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L++++VFSFWLNRD DA GGE+VFGG+DP H+KG HTYVPV++KGYWQF +GD+
Sbjct: 206 MQEQKLLADDVFSFWLNRDSDALSGGELVFGGMDPHHYKGNHTYVPVSRKGYWQFNMGDL 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVSQYG++I
Sbjct: 266 LIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKEVVSQYGEMI 325
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
++L++ P+KVC QIGLC F+G + VS GI+++V KENV G +C+ACEMAVVW++
Sbjct: 326 LEMLIAQTQPQKVCSQIGLCLFDGTQSVSNGIESIVGKENV--GSDLMCTACEMAVVWIE 383
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +TKE +L Y N+LC+ LP+P GES + C + MPN++F I +K F L+PEQYI
Sbjct: 384 NQLRENKTKELILQYANQLCERLPSPNGESTVSCHEMSKMPNLAFAIANKTFVLTPEQYI 443
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+K + VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD GK RIGFAE+A
Sbjct: 444 VKLEQSGQTVCISGFMAFDIPPPRGPLWILGDVFMGGYHTVFDFGKDRIGFAESA 498
>gi|459426|emb|CAA54478.1| aspartic protease [Brassica oleracea]
Length = 292
Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust.
Identities = 192/293 (65%), Positives = 232/293 (79%), Gaps = 5/293 (1%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
LVSE FSFWLNR+ D EEGGE+VFGGVDPKHFKG+H YVPVT+KGYWQF++GD+LIG
Sbjct: 2 LVSE--FSFWLNRNADDEEGGELVFGGVDPKHFKGQHIYVPVTQKGYWQFDMGDVLIGGA 59
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLV 125
TG CE GC+AI DSGTSLLAGPT ++T INHAIG GV S +CK VV QYG I DLL+
Sbjct: 60 PTGYCESGCSAIADSGTSLLAGPTTIITMINHAIGASGVASQQCKTVVDQYGQTILDLLL 119
Query: 126 SGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVVWVQNQ 182
S P+K+C QIGLC F+G VS GI++VV+KEN + A CSACEMAVVW+Q+Q
Sbjct: 120 SETQPKKICSQIGLCTFDGKRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVVWIQSQ 179
Query: 183 LKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK 242
L+Q T+E++L Y+NELC +P+PMGES +DC ++ TMP VS TIG K+F+L+P +Y+LK
Sbjct: 180 LRQNMTQERILDYVNELCRRIPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPHEYVLK 239
Query: 243 TGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
GEG A CISGF+A D+ PPRGPLWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 240 VGEGAAAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGKAQVGFAEAA 292
>gi|388517285|gb|AFK46704.1| unknown [Medicago truncatula]
Length = 510
Score = 412 bits (1060), Expect = e-113, Method: Compositional matrix adjust.
Identities = 192/298 (64%), Positives = 236/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + VFSFWLNR+P+ E+GGE+VFGGVDP HFKG+HTYVPVT+KGYWQF +GD+
Sbjct: 213 MVKQGLVKDPVFSFWLNRNPNEEQGGELVFGGVDPAHFKGEHTYVPVTRKGYWQFAMGDV 272
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI + TG C C+AI DSGTSLLAGPT V+T IN AIG GV S EC+ VV QYG I
Sbjct: 273 LIDGKPTGYCANDCSAIADSGTSLLAGPTTVITMINQAIGASGVYSQECRTVVDQYGHSI 332
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK-ENVSAG--DSAVCSACEMAVV 177
LLV+ P+KVC QIGLC F+G + +S GI++VVE+ + +S+G A C CEMAVV
Sbjct: 333 LQLLVAEAQPKKVCSQIGLCTFDGTQGISMGIQSVVEQTDRISSGGHQDATCFVCEMAVV 392
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQLKQ QT+E++++Y + LCD +PNP+G+S +DC +I +MP VSFTIG K F+L+PE
Sbjct: 393 WMQNQLKQNQTEERIINYADSLCDKMPNPLGQSSVDCAKISSMPKVSFTIGGKKFDLAPE 452
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG A CISGF A D+PPPRGPLWI GD+FMG YHTVFD GKLR+GFAEAA
Sbjct: 453 EYILKVGEGAAAQCISGFTALDVPPPRGPLWIPGDIFMGRYHTVFDYGKLRVGFAEAA 510
>gi|222630120|gb|EEE62252.1| hypothetical protein OsJ_17039 [Oryza sativa Japonica Group]
Length = 501
Score = 412 bits (1059), Expect = e-113, Method: Compositional matrix adjust.
Identities = 190/303 (62%), Positives = 240/303 (79%), Gaps = 13/303 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L++++VFSFWLNRDPDA GGE+VFGG+DPKH+KG HTYVPV++KGYWQF +GD+
Sbjct: 204 MQEQELLADDVFSFWLNRDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++I
Sbjct: 264 LIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKEVVSEYGEMI 323
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+LL++ P+KVC Q+GLC F+G VS GI++VV+KEN+ G A+CS CEMAVVW++
Sbjct: 324 LNLLIAQTDPQKVCSQVGLCMFDGKRSVSNGIESVVDKENL--GSDAMCSVCEMAVVWIE 381
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ-- 238
NQL++ +TKE +L+Y N+LC+ LP+P GES + C +I MPN++FTI +K F L+PEQ
Sbjct: 382 NQLRENKTKELILNYANQLCERLPSPNGESTVSCHQISKMPNLAFTIANKTFILTPEQDP 441
Query: 239 ------YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
Y K G+ VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD GK RIGFA
Sbjct: 442 DAFEVVYYFKRGQ---TVCISGFMAFDIPPPRGPLWILGDVFMGAYHTVFDFGKDRIGFA 498
Query: 293 EAA 295
++A
Sbjct: 499 KSA 501
>gi|414871124|tpg|DAA49681.1| TPA: hypothetical protein ZEAMMB73_239621 [Zea mays]
Length = 299
Score = 412 bits (1059), Expect = e-113, Method: Compositional matrix adjust.
Identities = 192/299 (64%), Positives = 232/299 (77%), Gaps = 4/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+
Sbjct: 1 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 60
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ +STG C GGCAAI DSGTSLLAGP ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 61 LVDGKSTGFCAGGCAAIADSGTSLLAGPIAIITEINEKIGAAGVVSQECKTVVSQYGQQI 120
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAV 176
DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAV
Sbjct: 121 LDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMAV 180
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F L P
Sbjct: 181 VWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKLKP 240
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 241 EQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAESA 299
>gi|22330379|ref|NP_176419.2| phytepsin [Arabidopsis thaliana]
gi|79320483|ref|NP_001031219.1| phytepsin [Arabidopsis thaliana]
gi|75331143|sp|Q8VYL3.1|APA2_ARATH RecName: Full=Aspartic proteinase A2; AltName: Full=Aspartic
protease 57; Short=AtASP57; Flags: Precursor
gi|17979428|gb|AAL49856.1| putative aspartic protease [Arabidopsis thaliana]
gi|23297031|gb|AAN13225.1| putative aspartic protease [Arabidopsis thaliana]
gi|222424000|dbj|BAH19961.1| AT1G62290 [Arabidopsis thaliana]
gi|332195825|gb|AEE33946.1| phytepsin [Arabidopsis thaliana]
gi|332195826|gb|AEE33947.1| phytepsin [Arabidopsis thaliana]
Length = 513
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 187/297 (62%), Positives = 236/297 (79%), Gaps = 3/297 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHF+G+HT+VPVT++GYWQF++G++
Sbjct: 216 MLKQGLIKRPVFSFWLNRDPKSEEGGEIVFGGVDPKHFRGEHTFVPVTQRGYWQFDMGEV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI +STG C GC+AI DSGTSLLAGPT VV IN AIG GVVS +CK VV QYG I
Sbjct: 276 LIAGESTGYCGSGCSAIADSGTSLLAGPTAVVAMINKAIGASGVVSQQCKTVVDQYGQTI 335
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL++ P+K+C QIGLCA++G VS GI++VV+KEN + A C ACEMAVV
Sbjct: 336 LDLLLAETQPKKICSQIGLCAYDGTHGVSMGIESVVDKENTRSSSGLRDAGCPACEMAVV 395
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q T+E++++YINE+C+ +P+P GES +DC ++ MP VSFTIG K+F+L+PE
Sbjct: 396 WIQSQLRQNMTQERIVNYINEICERMPSPNGESAVDCSQLSKMPTVSFTIGGKVFDLAPE 455
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+Y+LK GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD G ++GFAEA
Sbjct: 456 EYVLKIGEGPVAQCISGFTALDIPPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEA 512
>gi|4589716|dbj|BAA76870.1| aspartic proteinase [Helianthus annuus]
Length = 509
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 188/298 (63%), Positives = 237/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFW NR EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+
Sbjct: 212 MVNQGLVQEPVFSFWFNRHTGEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG+++TG C GGCAAI DSGTSLLAGPT ++T+INHAIG GV+S +CK +V QYG I
Sbjct: 272 LIGDKTTGFCSGGCAAIADSGTSLLAGPTTIITQINHAIGAAGVMSQQCKTLVDQYGKTI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAG-DSAVCSACEMAVV 177
++L+S P+K+C ++ LC F+G+ VS+ I++VV+K N SAG + +C+ CEMAVV
Sbjct: 332 IEMLLSEAQPDKICSRMNLCTFDGSRDVSSIIESVVDKNNGKSSAGLNDGICAFCEMAVV 391
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QLK+ QT++ +++Y+NELCD +P+PMGES +DC + MPN++FTIG K F+L+PE
Sbjct: 392 WMQSQLKRNQTEDSIINYVNELCDRIPSPMGESAVDCQTLSNMPNIAFTIGGKTFDLTPE 451
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK GEG CISGF A D+ PP GPLWI GDVFMG YHTVFD GK R+GFAEAA
Sbjct: 452 QYILKVGEGEVAQCISGFTALDVAPPHGPLWIHGDVFMGQYHTVFDFGKSRVGFAEAA 509
>gi|297837199|ref|XP_002886481.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297332322|gb|EFH62740.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 187/297 (62%), Positives = 235/297 (79%), Gaps = 3/297 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHFKG+HT+VPVT++GYWQF++G++
Sbjct: 216 MLKQGLIERPVFSFWLNRDPKSEEGGEIVFGGVDPKHFKGEHTFVPVTQRGYWQFDMGEV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GC+AI DSGTSLLAGPT V+ IN AIG GVVS +CK VV QYG I
Sbjct: 276 LIAGDSTGYCGSGCSAIADSGTSLLAGPTAVIAMINKAIGASGVVSQQCKTVVDQYGQTI 335
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL++ P+K+C QIGLCAF+G VS GI++VV+KEN + A C ACEMAV+
Sbjct: 336 LDLLLAETQPKKICSQIGLCAFDGTHGVSMGIESVVDKENTRSSSGLRDAGCPACEMAVM 395
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL+Q T+E++++YINE+C+ +P+P GES +DC ++ MP VSFTIG K+F+L+PE
Sbjct: 396 WIQSQLRQNMTQERIVNYINEICERMPSPNGESAVDCSQLSKMPTVSFTIGGKVFDLAPE 455
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+Y+LK GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD G ++GFAEA
Sbjct: 456 EYVLKIGEGPVAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEA 512
>gi|413942271|gb|AFW74920.1| hypothetical protein ZEAMMB73_522985 [Zea mays]
Length = 468
Score = 410 bits (1054), Expect = e-112, Method: Compositional matrix adjust.
Identities = 193/299 (64%), Positives = 232/299 (77%), Gaps = 4/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+
Sbjct: 170 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 229
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 230 LVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQI 289
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAV 176
DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAV
Sbjct: 290 LDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEARKSNGGLKSDPMCNACEMAV 349
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP++ FTIG K F L P
Sbjct: 350 VWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLVSMPDIVFTIGGKKFKLKP 409
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 410 EQYILKVGEGQAVQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAESA 468
>gi|20800441|gb|AAB03843.2| aspartic proteinase [Vigna unguiculata]
gi|33339734|gb|AAQ14346.1| aspartic proteinase [Vigna unguiculata]
Length = 513
Score = 410 bits (1054), Expect = e-112, Method: Compositional matrix adjust.
Identities = 193/298 (64%), Positives = 234/298 (78%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ E VFSFWLNR + EEGGE+VFGGVDP H+KG+HTYVPVT+KGYWQF++GD+
Sbjct: 216 MVEQGLIKEPVFSFWLNRKTEEEEGGELVFGGVDPAHYKGEHTYVPVTRKGYWQFDMGDV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG + TG C GGCAAI DSGTSLLAGPT ++T INHAIG GV+S ECK VV++YG I
Sbjct: 276 LIGGKPTGYCAGGCAAIADSGTSLLAGPTAIITMINHAIGASGVMSQECKTVVAEYGQTI 335
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV-EKENVSAG--DSAVCSACEMAVV 177
+LL++ P+K+C QIGLC F+G V GI++VV E S+G A CSACEMAVV
Sbjct: 336 LNLLLAETQPKKICSQIGLCTFDGTRGVDMGIESVVDENARKSSGGLHDAGCSACEMAVV 395
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
WVQNQL + QT++++LSY+N+LCD +P+PMGES + C I ++P VSFTIG + F+L PE
Sbjct: 396 WVQNQLSRNQTQDQILSYVNQLCDKMPSPMGESSVGCGDISSLPVVSFTIGGRTFDLRPE 455
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG CISGF A D+ PPRGPLWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 456 EYILKVGEGPVAQCISGFTAIDIAPPRGPLWILGDVFMGPYHTVFDFGNQRVGFAEAA 513
>gi|223929912|gb|ACN24614.1| aspartic acid protease [Phaseolus vulgaris]
Length = 513
Score = 410 bits (1054), Expect = e-112, Method: Compositional matrix adjust.
Identities = 189/298 (63%), Positives = 233/298 (78%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ E VFSFW NR P+ EEGGEIVFGGVDP H+KGKHTYVPVT+KGYW+F++GD+
Sbjct: 216 MVEQGLIKEPVFSFWFNRKPEEEEGGEIVFGGVDPAHYKGKHTYVPVTRKGYWRFDMGDV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG + TG C GC AI DSGTSLLAGPT ++T INHAIG G++S ECK VV++YG I
Sbjct: 276 LIGGKPTGYCADGCLAIADSGTSLLAGPTTIITMINHAIGAAGIMSQECKTVVAEYGQTI 335
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE---KENVSAGDSAVCSACEMAVV 177
+LL++ P+K+C QIGLC F+G + GI +VV+ +++ A CSACEMAVV
Sbjct: 336 LNLLLAETQPKKICSQIGLCTFDGTRGIDMGIASVVDEIARKSSGGLHDAACSACEMAVV 395
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL + QT++++LSYIN+LCD +P+PMGES ID I ++P VSFTIG + F+L PE
Sbjct: 396 WMQNQLSRNQTQDQILSYINQLCDKMPSPMGESSIDRGNISSLPVVSFTIGGRTFDLLPE 455
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD G LR+GFA+AA
Sbjct: 456 EYILKVGEGPVAQCISGFTAIDIPPPRGPLWILGDVFMGRYHTVFDFGNLRVGFADAA 513
>gi|449466825|ref|XP_004151126.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 513
Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust.
Identities = 185/298 (62%), Positives = 237/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV E VFSFW NR+ D E+GGEIVFGGVDP H+KG+HTYVPVTKKGYWQF++GD+
Sbjct: 216 MVDQNLVKEPVFSFWFNRNADEEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI +TG C GGC+AI DSGTSLLAGPT ++T++NHAIG GVVS ECK VV++YG+ I
Sbjct: 276 LINGSTTGFCSGGCSAIADSGTSLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETI 335
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
+L++ P+K+C +GLCAF+G VS GI++VV+ + + +C+ACEMAVV
Sbjct: 336 IKMLLAKDQPKKICSTLGLCAFDGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVV 395
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W Q+QLK+++T++++L+YI+ LC+ LP+PMGES+IDCD + T+P++SFTIG K+F L PE
Sbjct: 396 WAQSQLKEEKTQDQILNYIDGLCEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPE 455
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+LK EG CISGF A D+PPPRGPLWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 456 QYVLKVTEGPVTECISGFAALDVPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 513
>gi|449503193|ref|XP_004161880.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust.
Identities = 185/298 (62%), Positives = 237/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV E VFSFW NR+ D E+GGEIVFGGVDP H+KG+HTYVPVTKKGYWQF++GD+
Sbjct: 219 MVDQNLVKEPVFSFWFNRNADEEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDV 278
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI +TG C GGC+AI DSGTSLLAGPT ++T++NHAIG GVVS ECK VV++YG+ I
Sbjct: 279 LINGSTTGFCSGGCSAIADSGTSLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETI 338
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
+L++ P+K+C +GLCAF+G VS GI++VV+ + + +C+ACEMAVV
Sbjct: 339 IKMLLAKDQPKKICSTLGLCAFDGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVV 398
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W Q+QLK+++T++++L+YI+ LC+ LP+PMGES+IDCD + T+P++SFTIG K+F L PE
Sbjct: 399 WAQSQLKEEKTQDQILNYIDGLCEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPE 458
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+LK EG CISGF A D+PPPRGPLWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 459 QYVLKVTEGPVTECISGFAALDVPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 516
>gi|109675118|gb|ABG37021.1| aspartic protease [Nicotiana tabacum]
Length = 508
Score = 409 bits (1050), Expect = e-112, Method: Compositional matrix adjust.
Identities = 184/298 (61%), Positives = 234/298 (78%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV + VFSFW NR+ EEGGE+VFGGVDP HFKGKHTYVPVT KGYWQF++GD+
Sbjct: 212 MVNQGLVKKPVFSFWFNRNAQEEEGGELVFGGVDPNHFKGKHTYVPVTHKGYWQFDMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G ++TG C GGC+AI DSGTSLLAGPT ++T+INH IG GVVS ECK +V++YG I
Sbjct: 272 LVGGETTGFCSGGCSAIADSGTSLLAGPTTIITQINHVIGASGVVSQECKSLVTEYGKTI 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA---GDSAVCSACEMAVV 177
DLL S P+K+C QIGLC+ +G+ VS I++VV+K N ++ GD +C CEMAV+
Sbjct: 332 LDLLESKAAPQKICSQIGLCSSDGSRDVSMIIESVVDKHNGASNGLGDE-MCRVCEMAVI 390
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQ+++ +T + + Y+N+LCD LP+PMGES +DC + +MPNVSFT+G++ F L+P+
Sbjct: 391 WMQNQMRRNETADSIYDYVNQLCDRLPSPMGESAVDCSSLASMPNVSFTVGNQTFGLTPQ 450
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+L+ GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 451 QYVLQVGEGPVAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNSRVGFAEAA 508
>gi|357480353|ref|XP_003610462.1| Aspartic proteinase [Medicago truncatula]
gi|355511517|gb|AES92659.1| Aspartic proteinase [Medicago truncatula]
Length = 519
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 188/298 (63%), Positives = 237/298 (79%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ E VFSFWLNR P+ EEGGEIVFGGVDP H+KG HTYVPV +KGYWQF++GD+
Sbjct: 223 MVEQGLIQEPVFSFWLNRKPEEEEGGEIVFGGVDPAHYKGNHTYVPVKRKGYWQFDMGDV 282
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I +STG C GC+AI DSGTSLLAGPT V+T INHAIG GVVS ECK +V++YG I
Sbjct: 283 TIDGKSTGYCVDGCSAIADSGTSLLAGPTTVITMINHAIGASGVVSKECKTIVAEYGQTI 342
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV---EKENVSAGDSAVCSACEMAVV 177
+LL++ P+K+C +IGLC F+G V I++VV E+++ S A CSACEMAVV
Sbjct: 343 LNLLLAEAQPKKICSEIGLCTFDGTHGVDLAIESVVDGNERKSSSGLHGASCSACEMAVV 402
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL+Q +T++++L+YIN LCD +P+PMGES +DC+ I ++P +SFTIG + F+L+PE
Sbjct: 403 WMQNQLRQNKTQDQILTYINNLCDKMPSPMGESSVDCENISSLPVISFTIGGRTFDLAPE 462
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YI K GEG A CISGF+A D+PPPRGP+WILGD+FMG YHTVFD GK R+GFAEAA
Sbjct: 463 EYI-KVGEGPAAQCISGFVAIDVPPPRGPIWILGDIFMGRYHTVFDFGKSRVGFAEAA 519
>gi|413917603|gb|AFW57535.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
gi|413917604|gb|AFW57536.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 294
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 185/296 (62%), Positives = 236/296 (79%), Gaps = 3/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE-GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M EQ L++E+VFSFWLNR PDA GGE+VFGGVDP HF G HTYVPV++KGYWQF++GD
Sbjct: 1 MQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGVDPAHFSGNHTYVPVSRKGYWQFDMGD 60
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+LI STG C GCAAIVDSGTSLLAGPT ++ ++N AIG +G++S ECK VVSQYG++
Sbjct: 61 LLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIAQVNEAIGADGIISTECKEVVSQYGEM 120
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I D+L++ P++VC Q+GLC F+GA VS GI++VV KEN+ G +CSAC+MAVVW+
Sbjct: 121 ILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGIESVVGKENL--GSDVMCSACQMAVVWI 178
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
+NQL++ +TKE +L Y N+LC+ LP+P GES + C I MP+++FTI +K F L+P+QY
Sbjct: 179 ENQLRENKTKELILQYANQLCERLPSPNGESTVSCQEISKMPSLAFTIANKTFTLTPQQY 238
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
I+K +G VCISGFMA+D+PPPRGPLWILGDVFMG YHTVFD G RIGFAE+A
Sbjct: 239 IVKLEQGGQTVCISGFMAYDVPPPRGPLWILGDVFMGAYHTVFDFGNDRIGFAESA 294
>gi|226532912|ref|NP_001146573.1| hypothetical protein [Zea mays]
gi|219887869|gb|ACL54309.1| unknown [Zea mays]
gi|413917600|gb|AFW57532.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 494
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 185/296 (62%), Positives = 236/296 (79%), Gaps = 3/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE-GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M EQ L++E+VFSFWLNR PDA GGE+VFGGVDP HF G HTYVPV++KGYWQF++GD
Sbjct: 201 MQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGVDPAHFSGNHTYVPVSRKGYWQFDMGD 260
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+LI STG C GCAAIVDSGTSLLAGPT ++ ++N AIG +G++S ECK VVSQYG++
Sbjct: 261 LLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIAQVNEAIGADGIISTECKEVVSQYGEM 320
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I D+L++ P++VC Q+GLC F+GA VS GI++VV KEN+ G +CSAC+MAVVW+
Sbjct: 321 ILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGIESVVGKENL--GSDVMCSACQMAVVWI 378
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
+NQL++ +TKE +L Y N+LC+ LP+P GES + C I MP+++FTI +K F L+P+QY
Sbjct: 379 ENQLRENKTKELILQYANQLCERLPSPNGESTVSCQEISKMPSLAFTIANKTFTLTPQQY 438
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
I+K +G VCISGFMA+D+PPPRGPLWILGDVFMG YHTVFD G RIGFAE+A
Sbjct: 439 IVKLEQGGQTVCISGFMAYDVPPPRGPLWILGDVFMGAYHTVFDFGNDRIGFAESA 494
>gi|223949795|gb|ACN28981.1| unknown [Zea mays]
gi|413917601|gb|AFW57533.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
gi|413917602|gb|AFW57534.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 509
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 185/296 (62%), Positives = 236/296 (79%), Gaps = 3/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE-GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M EQ L++E+VFSFWLNR PDA GGE+VFGGVDP HF G HTYVPV++KGYWQF++GD
Sbjct: 216 MQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGVDPAHFSGNHTYVPVSRKGYWQFDMGD 275
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+LI STG C GCAAIVDSGTSLLAGPT ++ ++N AIG +G++S ECK VVSQYG++
Sbjct: 276 LLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIAQVNEAIGADGIISTECKEVVSQYGEM 335
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I D+L++ P++VC Q+GLC F+GA VS GI++VV KEN+ G +CSAC+MAVVW+
Sbjct: 336 ILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGIESVVGKENL--GSDVMCSACQMAVVWI 393
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
+NQL++ +TKE +L Y N+LC+ LP+P GES + C I MP+++FTI +K F L+P+QY
Sbjct: 394 ENQLRENKTKELILQYANQLCERLPSPNGESTVSCQEISKMPSLAFTIANKTFTLTPQQY 453
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
I+K +G VCISGFMA+D+PPPRGPLWILGDVFMG YHTVFD G RIGFAE+A
Sbjct: 454 IVKLEQGGQTVCISGFMAYDVPPPRGPLWILGDVFMGAYHTVFDFGNDRIGFAESA 509
>gi|110162110|emb|CAL07969.1| aspartic proteinase [Cynara cardunculus]
Length = 506
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 176/294 (59%), Positives = 234/294 (79%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFWLNR+ D EEGGE+VFGGVDP HF+G HTYVPVT+KGYWQFE+GD+
Sbjct: 212 MVNQGLVEEAVFSFWLNRNVDEEEGGELVFGGVDPNHFRGNHTYVPVTRKGYWQFEMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG++S+G C GGCAAI DSGTSL+AGPT ++T+IN AIG +GV++ +CK +VSQYG +
Sbjct: 272 LIGDKSSGFCAGGCAAIADSGTSLIAGPTAIITQINQAIGAKGVLNQQCKTLVSQYGKNM 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+L S + P+++C Q+ LC F+GA +V + I++VV+K N + +C+ CEMA+VW+Q
Sbjct: 332 IQMLTSEVQPDQICSQMKLCTFDGARHVRSMIESVVDKNNDKSSGDEICTFCEMALVWMQ 391
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
N++K+ +T++ +++++NELCD LP ESI+DC+ I +MPN +FTIG K+F L+PEQYI
Sbjct: 392 NEIKRNETEDNIINHVNELCDHLPTSSAESIVDCNGISSMPNTAFTIGRKLFELTPEQYI 451
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
K GEG A CISGF A D+ P+GP+WILGD+FMG YHTVFD GKLR+GF EA
Sbjct: 452 FKVGEGEAATCISGFTALDIMSPQGPIWILGDMFMGPYHTVFDYGKLRVGFTEA 505
>gi|115439013|ref|NP_001043786.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|113533317|dbj|BAF05700.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|215701483|dbj|BAG92907.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218188796|gb|EEC71223.1| hypothetical protein OsI_03158 [Oryza sativa Indica Group]
gi|222618996|gb|EEE55128.1| hypothetical protein OsJ_02912 [Oryza sativa Japonica Group]
gi|385717674|gb|AFI71272.1| unnamed protein [Oryza sativa Japonica Group]
Length = 522
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 190/298 (63%), Positives = 224/298 (75%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV + VFSFW NR D +GGEIVFGG+DP H+KG HTYVPVT+KGYWQF +GD+
Sbjct: 225 MVRQGLVVDPVFSFWFNRHADEGQGGEIVFGGIDPNHYKGNHTYVPVTRKGYWQFNMGDV 284
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG STG C GCAAI DSGTSLL GPT ++T+IN IG GVVS ECK VVSQYG I
Sbjct: 285 LIGGNSTGFCAAGCAAIADSGTSLLTGPTAIITQINEKIGATGVVSQECKAVVSQYGQQI 344
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-NVSAG--DSAVCSACEMAVV 177
D L + P KVC +GLC F+G VS GI++VV+ E S+G SA+C+ACE AVV
Sbjct: 345 LDQLRAETKPAKVCSSVGLCTFDGTHGVSAGIRSVVDDEVGKSSGPFSSAMCNACETAVV 404
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+ QL Q QT++ VL YI++LCD LP+PMGES +DC + +MP+++FTIG F L PE
Sbjct: 405 WMHTQLAQNQTQDLVLQYIDQLCDRLPSPMGESSVDCSSLASMPDIAFTIGGNKFVLKPE 464
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD G L++GFAEAA
Sbjct: 465 QYILKVGEGTATQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVFDYGNLKVGFAEAA 522
>gi|509163|emb|CAA48939.1| cyprosin [Cynara cardunculus]
Length = 474
Score = 402 bits (1034), Expect = e-110, Method: Compositional matrix adjust.
Identities = 183/300 (61%), Positives = 237/300 (79%), Gaps = 8/300 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV E VFSFWLNR+ D +EGGE+VFGGVDP HFKG+HTYVPVT+KGYWQFE+GD+
Sbjct: 178 MLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFEMGDV 237
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG+++TG C GCAAI DSGTSLLAG T +VT+IN AIG GV+S +CK +V QYG +
Sbjct: 238 LIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVDQYGKSM 297
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSA-----VCSACEMA 175
++L+S PEK+C Q+ LC+F+G+ S I++VV+K S G S+ +C+ C+MA
Sbjct: 298 IEMLLSEEQPEKICSQMKLCSFDGSHDTSMIIESVVDK---SKGKSSGLHDEMCTMCQMA 354
Query: 176 VVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLS 235
VVW+QNQ++Q +T+E +++Y+++LC+ LP+PMGES +DC + +MPN++FT+G K FNLS
Sbjct: 355 VVWMQNQIRQNETEENIINYVDKLCERLPSPMGESAVDCSSLSSMPNIAFTVGGKTFNLS 414
Query: 236 PEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PEQY+LK GEG CISGF A D+ PP GPLWILGDVFMG YHTVFD G LR+GFAEAA
Sbjct: 415 PEQYVLKVGEGATAQCISGFTAMDVAPPHGPLWILGDVFMGQYHTVFDYGNLRVGFAEAA 474
>gi|1168536|sp|P42210.1|ASPR_HORVU RecName: Full=Phytepsin; AltName: Full=Aspartic proteinase;
Contains: RecName: Full=Phytepsin 32 kDa subunit;
Contains: RecName: Full=Phytepsin 29 kDa subunit;
Contains: RecName: Full=Phytepsin 16 kDa subunit;
Contains: RecName: Full=Phytepsin 11 kDa subunit; Flags:
Precursor
gi|18904|emb|CAA39602.1| aspartic proteinase [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 399 bits (1025), Expect = e-109, Method: Compositional matrix adjust.
Identities = 194/298 (65%), Positives = 237/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGLVS+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVPVT+KGYWQF++GD+
Sbjct: 211 MIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVPVTQKGYWQFDMGDV 270
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK +VSQYG I
Sbjct: 271 LVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTIVSQYGQQI 330
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL++ P+K+C Q+GLC F+G VS GI++VV+ E V + +CSACEMAVV
Sbjct: 331 LDLLLAETQPKKICSQVGLCTFDGTRGVSAGIRSVVDDEPVKSNGLRADPMCSACEMAVV 390
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL Q +T++ +L Y+N+LC+ LP+PMGES +DC + +MP++ FTIG K F L PE
Sbjct: 391 WMQNQLAQNKTQDLILDYVNQLCNRLPSPMGESAVDCGSLGSMPDIEFTIGGKKFALKPE 450
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD GKLRIGFA+AA
Sbjct: 451 EYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGPYHTVFDYGKLRIGFAKAA 508
>gi|5822248|pdb|1QDM|A Chain A, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase.
gi|5822249|pdb|1QDM|B Chain B, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase.
gi|5822250|pdb|1QDM|C Chain C, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase
Length = 478
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/298 (65%), Positives = 237/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGLVS+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVPVT+KGYWQF++GD+
Sbjct: 181 MIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVPVTQKGYWQFDMGDV 240
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK +VSQYG I
Sbjct: 241 LVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTIVSQYGQQI 300
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL++ P+K+C Q+GLC F+G VS GI++VV+ E V + +CSACEMAVV
Sbjct: 301 LDLLLAETQPKKICSQVGLCTFDGTRGVSAGIRSVVDDEPVKSNGLRADPMCSACEMAVV 360
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QNQL Q +T++ +L Y+N+LC+ LP+PMGES +DC + +MP++ FTIG K F L PE
Sbjct: 361 WMQNQLAQNKTQDLILDYVNQLCNRLPSPMGESAVDCGSLGSMPDIEFTIGGKKFALKPE 420
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD GKLRIGFA+AA
Sbjct: 421 EYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGPYHTVFDYGKLRIGFAKAA 478
>gi|357130655|ref|XP_003566963.1| PREDICTED: aspartic proteinase oryzasin-1-like [Brachypodium
distachyon]
Length = 520
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 186/298 (62%), Positives = 228/298 (76%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV + +FSFW NR +GGEIVFGG+DP H KG HTYVPV KKGYWQF++ D+
Sbjct: 224 MVSQGLVGDLIFSFWFNRHAGEGQGGEIVFGGIDPSHHKGNHTYVPVPKKGYWQFDMSDV 283
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG STG C+ GCAA+ DSGTSLL+GPT +VT+IN IG GVVS ECK VVSQYG I
Sbjct: 284 LIGGNSTGFCKDGCAAMADSGTSLLSGPTAIVTQINKKIGATGVVSQECKAVVSQYGKQI 343
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
DLL+ +K+C +GLC F+GA VS GI++VV+ + + D C+ CEMAVV
Sbjct: 344 LDLLLK-YSRKKICSSVGLCTFDGAHGVSAGIQSVVDDKVWGSNDIFSKVTCNMCEMAVV 402
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q+QL Q QT+E VL YIN+LCDS P+PMGES +DC+R+ +MP+++F+IG K F L+PE
Sbjct: 403 WMQHQLAQNQTQEFVLQYINQLCDSFPSPMGESSVDCNRLASMPDIAFSIGGKQFVLTPE 462
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK GEG+A CISGF A D+PPPRGPLWILGD+FMG YHTVFD G L++GFAEAA
Sbjct: 463 QYILKVGEGVATQCISGFTAVDIPPPRGPLWILGDIFMGAYHTVFDYGNLKVGFAEAA 520
>gi|116793748|gb|ABK26865.1| unknown [Picea sitchensis]
Length = 284
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 177/283 (62%), Positives = 229/283 (80%), Gaps = 4/283 (1%)
Query: 16 LNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCA 75
+NR+ D E+GGEIVFGGVDP HFKG+H Y VT+KGYWQF++GD LI NQSTG C GGCA
Sbjct: 1 MNRNSDEEDGGEIVFGGVDPNHFKGEHEYASVTRKGYWQFDMGDFLIDNQSTGFCAGGCA 60
Query: 76 AIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQ 135
AIVDSGTSLLAGP+ ++T+IN+AIG G+VS ECK VVSQYGD+I +LL++ P+K+C
Sbjct: 61 AIVDSGTSLLAGPSGIITQINNAIGASGIVSQECKTVVSQYGDVIMELLMAQTNPKKICS 120
Query: 136 QIGLCAFNGAEYVSTGIKTVV----EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEK 191
QIGLC+++GA V GI +V+ EKE +S+ C+ACEMAVVWVQNQ+ + QTKE+
Sbjct: 121 QIGLCSYDGARDVGIGIASVLEKTHEKETLSSISDGTCTACEMAVVWVQNQIARNQTKEQ 180
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
+++Y+N+LCD LP+P GES++DCD++ +MP VSF+IG+K F+L+P+QYIL+ GEG C
Sbjct: 181 IMTYLNQLCDRLPSPNGESVVDCDQVSSMPTVSFSIGNKTFSLTPDQYILQVGEGSVAQC 240
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+SGFM D+ PP GP+WILGD+FMGVYHTVFD G R+GFAEA
Sbjct: 241 VSGFMGLDVSPPLGPIWILGDIFMGVYHTVFDYGNSRVGFAEA 283
>gi|1169175|sp|P40782.2|CYPR1_CYNCA RecName: Full=Cyprosin; Flags: Precursor
gi|1585067|prf||2124255A cyprosin
Length = 473
Score = 395 bits (1016), Expect = e-108, Method: Compositional matrix adjust.
Identities = 180/296 (60%), Positives = 232/296 (78%), Gaps = 1/296 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV E VFSFWLNR+ D +EGGE+VFGGVDP HFKG+HTYVPVT+KGYWQFE+GD+
Sbjct: 178 MLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFEMGDV 237
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG+++TG C GCAAI DSGTSLLAG T +VT+IN AIG GV+S +CK +V QYG +
Sbjct: 238 LIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVDQYGKSM 297
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK-ENVSAGDSAVCSACEMAVVWV 179
++L+S PEK+C Q+ LC+F+G+ S I++VV+K + S+G C C VVW+
Sbjct: 298 IEMLLSEEQPEKICSQMKLCSFDGSHDTSMIIESVVDKSKGKSSGLPMRCVPCARWVVWM 357
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQ++Q +T+E +++Y+++LC+ LP+PMGES +DC + +MPN++FT+G K FNLSPEQY
Sbjct: 358 QNQIRQNETEENIINYVDKLCERLPSPMGESAVDCSSLSSMPNIAFTVGGKTFNLSPEQY 417
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+LK GEG CISGF A D+ PP GPLWILGDVFMG YHTVFD G LR+GFAEAA
Sbjct: 418 VLKVGEGATAQCISGFTAMDVAPPHGPLWILGDVFMGQYHTVFDYGNLRVGFAEAA 473
>gi|224068986|ref|XP_002302872.1| predicted protein [Populus trichocarpa]
gi|222844598|gb|EEE82145.1| predicted protein [Populus trichocarpa]
Length = 505
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 184/298 (61%), Positives = 236/298 (79%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E+VFSFWLNR+ + EEGGEIVFGGVDP H+KG+HTYVPVT KGYWQF++GD+
Sbjct: 208 MVNQGLVKEKVFSFWLNRNVEGEEGGEIVFGGVDPNHYKGEHTYVPVTHKGYWQFDMGDL 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG ++TG+C GGC AI DSGTSLLAGPT V+T+IN+AIG G+VS ECK VV+QYG +I
Sbjct: 268 LIGTETTGLCAGGCKAIADSGTSLLAGPTTVITQINNAIGASGIVSEECKTVVAQYGKII 327
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMAVV 177
++LV+ P KVC QI C F+G + VS I++VVE+ + + D A+C+ACEM VV
Sbjct: 328 LEMLVAQAQPRKVCSQISFCTFDGTQGVSMNIESVVEENSDKSSDGLHDAMCTACEMMVV 387
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W++N+L+ T++++L Y+N LCD LP+P GES ++C + +MP++SF IG K+F LSPE
Sbjct: 388 WMENRLRLNDTEDQILDYVNNLCDRLPSPNGESAVECSSLSSMPSISFEIGGKLFELSPE 447
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+LK GEG++ CISGF A D+PPP GPLWILGDVFMG YHTVFD G L +GFA+AA
Sbjct: 448 QYVLKVGEGVSAQCISGFTALDVPPPHGPLWILGDVFMGRYHTVFDYGNLTVGFADAA 505
>gi|168029783|ref|XP_001767404.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681300|gb|EDQ67728.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 499
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 182/296 (61%), Positives = 225/296 (76%), Gaps = 4/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV E VFSFWLNR+PD GGE+V GGVDPKHFKG+H Y PVT+KGYWQF+LGD+
Sbjct: 207 MLDQGLVKEPVFSFWLNRNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQFDLGDV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I ++TG C GC AI DSGTSLLAGP+ +V EIN AIG GVVS +CK+VV QYGD I
Sbjct: 267 TINGRTTGFCANGCTAIADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKMVVQQYGDQI 326
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENV-SAGDSAVCSACEMAVVWV 179
++L++ + P KVC +GLC F E GI +VVEK+ S + +C+ CEMAVVW
Sbjct: 327 VEMLLAQMNPGKVCTTLGLCNFGAGE---PGIASVVEKDQSHSLREDPLCTVCEMAVVWA 383
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL Q +TKE++ +Y+N+LC+ LP+P GES +DC+ + +MPNV+FTI +K F L PE+Y
Sbjct: 384 QNQLSQNRTKEQIDAYLNQLCERLPSPNGESAVDCNSLSSMPNVAFTISNKTFELKPEEY 443
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILK GEG CISGF+ D+PPP GPLWILGDVFMGVYHTVFD G R+GFAEAA
Sbjct: 444 ILKIGEGAEAQCISGFLGLDVPPPAGPLWILGDVFMGVYHTVFDFGNTRLGFAEAA 499
>gi|449433980|ref|XP_004134774.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449526063|ref|XP_004170034.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 392 bits (1008), Expect = e-107, Method: Compositional matrix adjust.
Identities = 186/298 (62%), Positives = 231/298 (77%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV E+VFSFWLNR+ + +EGGE+VFGGVDPKHFKG+HTYVPVT KGYWQF++GDI
Sbjct: 220 MVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDI 279
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG ++T C GGC+AI DSGTSLLAGP+ +V IN AIG V ECK +VSQYG I
Sbjct: 280 LIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYGRAI 339
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV-EKENVSAG--DSAVCSACEMAVV 177
DLL++ PEK+C +IG+C F+ VS I+ VV +K+ S+G A+CSACEMAV+
Sbjct: 340 MDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVVSDKDGRSSGGFSEAMCSACEMAVL 399
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q++LKQ +T+E ++ +NELCD N E+++DC RI MPNVSFTIGD++F L+ +
Sbjct: 400 WIQDELKQNKTQEDIIENVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRLFELTSK 458
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILK GEG A CISGF+ FD+PPPRGPLWILGDVFMG YHTVFD GK R+GFAEAA
Sbjct: 459 DYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA 516
>gi|75338567|sp|Q9XFX4.1|CARDB_CYNCA RecName: Full=Procardosin-B; Contains: RecName: Full=Cardosin-B
heavy chain; AltName: Full=Cardosin-B 34 kDa subunit;
Contains: RecName: Full=Cardosin-B light chain; AltName:
Full=Cardosin-B 14 kDa subunit; Flags: Precursor
gi|4582534|emb|CAB40349.1| preprocardosin B [Cynara cardunculus]
Length = 506
Score = 392 bits (1007), Expect = e-107, Method: Compositional matrix adjust.
Identities = 172/294 (58%), Positives = 230/294 (78%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSFWLNR+ D EEGGE+VFGGVDP HF+G HTYVPVT+KGYWQFE+GD+
Sbjct: 212 MVNQGLVEEAVFSFWLNRNVDEEEGGELVFGGVDPNHFRGNHTYVPVTRKGYWQFEMGDV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG++S+G C GGCAAI DSGTS AGPT ++T+IN AIG +GV++ +CK +V QYG +
Sbjct: 272 LIGDKSSGFCAGGCAAIADSGTSFFAGPTAIITQINQAIGAKGVLNQQCKTLVGQYGKNM 331
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+L S + P+K+C + LC F+GA V + I++VV+K N + +C+ CEMA+V +Q
Sbjct: 332 IQMLTSEVQPDKICSHMKLCTFDGAHDVRSMIESVVDKNNDKSSGGEICTFCEMALVRMQ 391
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
N++K+ +T++ +++++NE+CD LP ESI+DC+ I +MPN++FTIG K+F ++PEQYI
Sbjct: 392 NEIKRNETEDNIINHVNEVCDQLPTSSAESIVDCNGISSMPNIAFTIGSKLFEVTPEQYI 451
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
K GEG A CISGF A D+ P+GP+WILGD+FMG YHTVFD GKLR+GFAEA
Sbjct: 452 YKVGEGEAATCISGFTALDIMSPQGPIWILGDMFMGPYHTVFDYGKLRVGFAEA 505
>gi|73912433|dbj|BAE20413.1| aspartic proteinase [Triticum aestivum]
Length = 508
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 192/299 (64%), Positives = 235/299 (78%), Gaps = 4/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPD-AEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
MVEQGL+S+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVP T+KGYWQF++GD
Sbjct: 210 MVEQGLISDPVFSFWLNRHADDEGEGGEIIFGGMDPKHYVGEHTYVPATQKGYWQFDMGD 269
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK +VSQYG
Sbjct: 270 VLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTIVSQYGQQ 329
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG---DSAVCSACEMAV 176
I DLL++ P+KVC Q+GLC F+G VS GI++VV+ E V + +CSACEMAV
Sbjct: 330 ILDLLLAETQPKKVCSQVGLCTFDGTRGVSAGIRSVVDDEPVKSNGLHTDPMCSACEMAV 389
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QNQL Q +T++ +L Y+N+LC+ LP+PMGES +DC + +MP++ FTI K F L P
Sbjct: 390 VWMQNQLAQNKTQDLILDYVNQLCNRLPSPMGESAVDCASLGSMPDIEFTISGKKFALKP 449
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
E+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD GKLR+GFA+AA
Sbjct: 450 EEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGPYHTVFDYGKLRVGFAKAA 508
>gi|356556454|ref|XP_003546541.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 505
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 178/298 (59%), Positives = 230/298 (77%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+E GL+S ++FS WLN+DP E GGEIVFGG+D +HF+G+HTYVP+++KGYWQ +LGDI
Sbjct: 208 MLEGGLISHKIFSLWLNQDPSEEMGGEIVFGGIDYRHFRGEHTYVPLSQKGYWQIDLGDI 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ N STG+CEGGCAA+VDSGTSL+AGPT VVT+INHAIG EG S ECK ++ YGD I
Sbjct: 268 LLANNSTGLCEGGCAAVVDSGTSLIAGPTTVVTQINHAIGAEGYTSFECKSILHNYGDSI 327
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG---DSAVCSACEMAVV 177
W+ L++GL P+ +C IG C+ N + IKTVV ++ + +S CS C M V+
Sbjct: 328 WESLIAGLYPDIICSAIGFCSNNEFNTMDDVIKTVVHNQSWNRSQTRESPFCSFCNMIVL 387
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q QLKQ KEKVL Y++ELC+ LPNP G+S I+C+RI TMP+++FTIG+K F LSPE
Sbjct: 388 WIQVQLKQSNVKEKVLKYVDELCEKLPNPPGQSFINCNRIATMPHITFTIGNKSFPLSPE 447
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+L+ EG + VC GF+A D+PPP+GPLW+LG +F+G YHTVFD G LRIGFAEAA
Sbjct: 448 QYVLRVEEGCSTVCYGGFVAIDVPPPQGPLWVLGSIFLGAYHTVFDYGNLRIGFAEAA 505
>gi|40641523|emb|CAE52913.1| putative vacuaolar aspartic proteinase [Physcomitrella patens]
Length = 504
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 181/296 (61%), Positives = 224/296 (75%), Gaps = 4/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV E VFSFWLNR+PD GGE+V GGVDPKHFKG+H Y PVT+KGYWQF+LGD+
Sbjct: 207 MLDQGLVKEPVFSFWLNRNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQFDLGDV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I ++TG C GC AI DSGTSLLAGP+ +V EIN AIG GVVS +CK+VV QYGD I
Sbjct: 267 TINGRTTGFCANGCTAIADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKMVVQQYGDQI 326
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENV-SAGDSAVCSACEMAVVWV 179
++L++ + P KVC +GLC F E GI +VVEK+ S + +C+ C MAVVW
Sbjct: 327 VEMLLAQMNPGKVCTTLGLCNFGAGE---PGIASVVEKDQSHSLREDPLCTVCGMAVVWA 383
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL Q +TKE++ +Y+N+LC+ LP+P GES +DC+ + +MPNV+FTI +K F L PE+Y
Sbjct: 384 QNQLSQNRTKEQIDAYLNQLCERLPSPNGESAVDCNSLSSMPNVAFTISNKTFELKPEEY 443
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILK GEG CISGF+ D+PPP GPLWILGDVFMGVYHTVFD G R+GFAEAA
Sbjct: 444 ILKIGEGAEAQCISGFLGLDVPPPAGPLWILGDVFMGVYHTVFDFGNTRLGFAEAA 499
>gi|297809619|ref|XP_002872693.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297318530|gb|EFH48952.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 175/295 (59%), Positives = 227/295 (76%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVE+GLV + VFSFWLNR+P +EGGEIVFGGVDPKHFKG+HTYVPVT KGYWQF++GD+
Sbjct: 213 MVEKGLVKDPVFSFWLNRNPQDQEGGEIVFGGVDPKHFKGEHTYVPVTHKGYWQFDMGDL 272
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I + TG C GC+AI DSGTSLL GP+ V+T INHAIG +G+VS ECK VV QYG +
Sbjct: 273 QIAGKPTGYCAKGCSAIADSGTSLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGKTM 332
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+ L++ P+KVC QIG+CA++G VS I++VV+ + A+CSACEMA VW++
Sbjct: 333 LNSLLAQEDPKKVCSQIGVCAYDGTHSVSMDIQSVVDDGTSGLLNQAMCSACEMAAVWME 392
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
++L Q QT+E++L+Y ELC+ +P +S +DC+R+ +MP VSF+IG + F+LSP+ YI
Sbjct: 393 SELTQNQTQERILAYAAELCNHIPTKNQQSAVDCERVSSMPIVSFSIGGRTFDLSPQDYI 452
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
K G+G+ C SGF A D+PPPRGPLWILGD+FMG YHTVFD GK R+GFA+AA
Sbjct: 453 FKIGDGVESQCTSGFTAMDIPPPRGPLWILGDIFMGPYHTVFDYGKARVGFAKAA 507
>gi|255556616|ref|XP_002519342.1| Aspartic proteinase oryzasin-1 precursor, putative [Ricinus
communis]
gi|223541657|gb|EEF43206.1| Aspartic proteinase oryzasin-1 precursor, putative [Ricinus
communis]
Length = 500
Score = 389 bits (999), Expect = e-106, Method: Compositional matrix adjust.
Identities = 181/299 (60%), Positives = 229/299 (76%), Gaps = 6/299 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG V++++FS WLNRDP A GGEIVFGG+D +HF+G+HTYVPVT+KGYWQ E+GD+
Sbjct: 204 MVRQGHVNQKLFSLWLNRDPTAGMGGEIVFGGLDWRHFRGEHTYVPVTEKGYWQIEVGDV 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I +STG+CE GCAAIVDSGTS +AGPT +VT+INHAIG +G+VS ECK VV+++GDLI
Sbjct: 264 FIAKKSTGMCEYGCAAIVDSGTSFIAGPTTIVTQINHAIGAQGIVSLECKSVVTKFGDLI 323
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN----VSAGDSAVCSACEMAV 176
W+ L+SGL PE VC IGLC +N T IKT + + S +SA+C+ CEM V
Sbjct: 324 WESLISGLRPEIVCVDIGLCVYNNNS--RTVIKTKADDRDGDKSSSLDESALCTFCEMIV 381
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
W+Q QLKQ++ +EK+ Y++ELC+ LP+PMG+S IDC I MP V+F IG+K F LSP
Sbjct: 382 FWIQVQLKQQKAEEKIFKYVDELCEKLPDPMGKSFIDCGDITNMPYVTFIIGNKSFPLSP 441
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQY++K E +C+SGF A D+PPP+GPLWILGDVF+G YHTVFD G LRIGFA AA
Sbjct: 442 EQYVVKVEEKYGTICLSGFTALDVPPPQGPLWILGDVFLGAYHTVFDFGNLRIGFARAA 500
>gi|356545806|ref|XP_003541325.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 495
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 177/298 (59%), Positives = 229/298 (76%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGLV++++FS WLN+DP A+ GGEIVFGG+D +HFKG+HTYVP+T+K YWQ E+GDI
Sbjct: 198 MIEQGLVTQKIFSLWLNQDPVAKLGGEIVFGGIDWRHFKGEHTYVPLTQKDYWQIEVGDI 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I N TG+CEGGCAAI+DSGTSL+AGPT +VT+INHAIG EG VS ECK ++ YGD I
Sbjct: 258 QIANNPTGLCEGGCAAIIDSGTSLIAGPTKIVTQINHAIGAEGYVSYECKNIIHNYGDSI 317
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
W+ ++SGL PE +C IGLC+ N + I+T V E+ +S +C+ C+M V
Sbjct: 318 WEYIISGLKPEIICVDIGLCSRNRTFITNDVIETAVYNESWGESRTKESPLCTFCDMIVF 377
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q QLKQK TKEK+L Y++ELC+ LPNP+G++ IDC+ I MP ++FTIG+K F LSPE
Sbjct: 378 WMQVQLKQKNTKEKILKYVDELCEKLPNPVGQTFIDCNDIANMPQITFTIGNKSFPLSPE 437
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+L+ EG VC GF+ D+P P+GPLW+LGD+F+G YHTVFD G LRIGFAEAA
Sbjct: 438 QYMLRIEEGCNTVCYGGFVPLDVPAPQGPLWVLGDLFLGAYHTVFDYGNLRIGFAEAA 495
>gi|302820804|ref|XP_002992068.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
gi|300140190|gb|EFJ06917.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
Length = 499
Score = 385 bits (990), Expect = e-105, Method: Compositional matrix adjust.
Identities = 176/295 (59%), Positives = 228/295 (77%), Gaps = 4/295 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV+ FKGKH Y PVT++GYWQF +GD
Sbjct: 207 MLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVNKDRFKGKHVYTPVTREGYWQFNMGD 266
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+++ QSTG C GCAAI DSGTSLLAGPT +V +IN AIG G+VS ECK+VV+QYGDL
Sbjct: 267 VVVDGQSTGFCAKGCAAIADSGTSLLAGPTGIVAQINQAIGATGLVSEECKMVVTQYGDL 326
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL++ + P+KVC Q G+C + I +V++KEN GD +CS CEMAVVWV
Sbjct: 327 IVELLLAQVTPDKVCAQAGVCTLRND---NPHIASVLDKENQKVGDDVLCSVCEMAVVWV 383
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL+Q +TK+++ Y+N+LC+ LP+P G+S+++C +I ++PNVSFTI ++ F L+P+QY
Sbjct: 384 QNQLRQNRTKQQIEDYLNQLCERLPSPNGQSVVECAKISSLPNVSFTIANQTFELTPKQY 443
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
IL+ GEG A C+SGF D+PPP GP+WILGDVFMGVYHTVFD G RIGFA+A
Sbjct: 444 ILQVGEGAAAQCLSGFTGMDVPPPAGPIWILGDVFMGVYHTVFDFGNKRIGFAKA 498
>gi|302761358|ref|XP_002964101.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
gi|300167830|gb|EFJ34434.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
Length = 505
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/295 (59%), Positives = 228/295 (77%), Gaps = 4/295 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV+ FKGKH Y PVT++GYWQF +GD
Sbjct: 213 MLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVNKDRFKGKHVYTPVTREGYWQFNMGD 272
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ + QSTG C GCAAI DSGTSLLAGPT +V +IN AIG G+VS ECK+VV+QYGDL
Sbjct: 273 VAVDGQSTGFCAKGCAAIADSGTSLLAGPTGIVAQINQAIGATGLVSEECKMVVAQYGDL 332
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL++ + P++VC Q G+C+ + I +V++KEN GD +CS CEMAVVWV
Sbjct: 333 IVELLLAQVTPDRVCAQAGVCSLRND---NPHIASVLDKENQKVGDDVLCSVCEMAVVWV 389
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL+Q +TK+++ Y+N+LC+ LP+P G+S+++C +I ++PNVSFTI ++ F L+P+QY
Sbjct: 390 QNQLRQNRTKQQIEDYLNQLCERLPSPNGQSVVECAKISSLPNVSFTIANQTFELTPKQY 449
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
IL+ GEG A CISGF D+PPP GP+WILGDVFMGVYHTVFD G RIGFA+A
Sbjct: 450 ILQVGEGAAAQCISGFTGMDVPPPAGPIWILGDVFMGVYHTVFDFGNKRIGFAKA 504
>gi|297736824|emb|CBI26025.3| unnamed protein product [Vitis vinifera]
Length = 500
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/294 (59%), Positives = 221/294 (75%), Gaps = 5/294 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LVS E+ SFWLNRDP A+ GGE++FGGVD +HF G HT+VP+T+K YWQ E+GDI
Sbjct: 210 MVQQSLVSMEIVSFWLNRDPKAKIGGEVIFGGVDWRHFMGDHTFVPITRKDYWQIEVGDI 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG CEGGCAAIVD+GTS++AGPT VVT+INHAIG EG+VS CK VV++YG LI
Sbjct: 270 LIAGSSTGFCEGGCAAIVDTGTSMIAGPTTVVTQINHAIGAEGIVSFNCKNVVNKYGRLI 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
W LVSG PE VC IGLCA+NG + S G++TV+ GD+A C+ CEM W+Q
Sbjct: 330 WQFLVSGFQPENVCSDIGLCAYNGTKNASAGMETVIGN-----GDNAACTFCEMIAFWIQ 384
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
QLK+ + KEKV Y+NELC++LPNP G+ ++CD + TMP +SF IGDK F L+ EQY
Sbjct: 385 VQLKEHKAKEKVFQYVNELCENLPNPGGKDFVNCDALATMPVISFAIGDKYFPLTAEQYT 444
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LK VC+SGF A D+P P GPLW+LGDVF+G YHT+FD G L++GFA++
Sbjct: 445 LKVEVNCTTVCLSGFTALDVPRPDGPLWVLGDVFLGAYHTIFDFGNLQVGFAKS 498
>gi|356542078|ref|XP_003539498.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase oryzasin-1-like
[Glycine max]
Length = 449
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 171/298 (57%), Positives = 233/298 (78%), Gaps = 3/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q +++ VFSFWLNR+ + E+GG+IVFGG+D H+KG+HTYVPVT+KGYWQ E+GD+
Sbjct: 152 MLNQHFLTQPVFSFWLNRNTNEEQGGQIVFGGIDSDHYKGEHTYVPVTQKGYWQIEIGDV 211
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI ++TG+C C AIVDSGTSLLAGPT V+ +INHAIG G+VS ECK +V+QYG I
Sbjct: 212 LINGKTTGLCAAKCLAIVDSGTSLLAGPTGVIAQINHAIGAVGIVSQECKALVAQYGKTI 271
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK---ENVSAGDSAVCSACEMAVV 177
D L++ LP+++C QIGLC F+G + VS GI++VV+K + + A C+ACEMA V
Sbjct: 272 LDKLINEALPQQICSQIGLCTFDGTQGVSIGIQSVVDKNIXRTSCSWNDAGCTACEMAAV 331
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W++N+L+ +T++++L + N LCD +P+P GES+++C+ + MPNVSFTIG ++F LSPE
Sbjct: 332 WMKNRLRLNETEDQILDHANALCDLVPSPKGESVVECNTLSEMPNVSFTIGGEVFELSPE 391
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK G+G CISGF+A D+ PPRGPLWILGD+FMG YHTVFD G +++GFAE+A
Sbjct: 392 QYILKVGKGATAQCISGFIALDIAPPRGPLWILGDIFMGSYHTVFDYGNMKVGFAESA 449
>gi|15233518|ref|NP_192355.1| phytepsin [Arabidopsis thaliana]
gi|75338508|sp|Q9XEC4.1|APA3_ARATH RecName: Full=Aspartic proteinase A3; Flags: Precursor
gi|4773885|gb|AAD29758.1|AF076243_5 putative aspartic protease [Arabidopsis thaliana]
gi|13937238|gb|AAK50111.1|AF372974_1 AT4g04460/T26N6_7 [Arabidopsis thaliana]
gi|7267203|emb|CAB77914.1| putative aspartic protease [Arabidopsis thaliana]
gi|332656990|gb|AEE82390.1| phytepsin [Arabidopsis thaliana]
Length = 508
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 174/295 (58%), Positives = 226/295 (76%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVE+GLV E +FSFWLNR+P EGGEIVFGGVDPKHFKG+HT+VPVT KGYWQF++GD+
Sbjct: 214 MVEKGLVKEPIFSFWLNRNPKDPEGGEIVFGGVDPKHFKGEHTFVPVTHKGYWQFDMGDL 273
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I + TG C GC+AI DSGTSLL GP+ V+T INHAIG +G+VS ECK VV QYG +
Sbjct: 274 QIAGKPTGYCAKGCSAIADSGTSLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGKTM 333
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+ L++ P+KVC QIG+CA++G + VS GI++VV+ + A+CSACEMA VW++
Sbjct: 334 LNSLLAQEDPKKVCSQIGVCAYDGTQSVSMGIQSVVDDGTSGLLNQAMCSACEMAAVWME 393
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
++L Q QT+E++L+Y ELCD +P +S +DC R+ +MP V+F+IG + F+L+P+ YI
Sbjct: 394 SELTQNQTQERILAYAAELCDHIPTQNQQSAVDCGRVSSMPIVTFSIGGRSFDLTPQDYI 453
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
K GEG+ C SGF A D+ PPRGPLWILGD+FMG YHTVFD GK R+GFA+AA
Sbjct: 454 FKIGEGVESQCTSGFTAMDIAPPRGPLWILGDIFMGPYHTVFDYGKGRVGFAKAA 508
>gi|224106994|ref|XP_002314336.1| predicted protein [Populus trichocarpa]
gi|222863376|gb|EEF00507.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 180/300 (60%), Positives = 229/300 (76%), Gaps = 10/300 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQG VS++VFS WLNR+P +E GGE+VFGG+D +HFKG HTYVPVT +GYWQ ++GDI
Sbjct: 189 MAEQGHVSQKVFSLWLNRNPSSELGGEVVFGGLDWRHFKGDHTYVPVTGRGYWQIQVGDI 248
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I N STG+C GGC+AIVDSGTS L+GPT +V +INHAIG G+VS ECK VVS+Y + I
Sbjct: 249 FIANNSTGLCAGGCSAIVDSGTSFLSGPTRIVAQINHAIGARGIVSLECKEVVSKYWNSI 308
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-----NVSAGDSAVCSACEMA 175
WD ++SGL PE +C +GLC +N +T I+TVV+ E +V G A+C+ CEM
Sbjct: 309 WDSMISGLRPEIICVDVGLCLYNN----NTVIETVVDGEATDRLSVDEG-GALCTFCEMI 363
Query: 176 VVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLS 235
V W+Q QLK+K+ KEK+ Y++ELC+ LPNP+G+S I+CD I MP VSFTIG++ F LS
Sbjct: 364 VFWIQVQLKEKKAKEKIFHYVDELCERLPNPLGKSFINCDEITAMPYVSFTIGNRSFPLS 423
Query: 236 PEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PEQYI++ E A +C+SGF A D+PP +GPLWILGDVF+G YHTVFD G RIGFA+AA
Sbjct: 424 PEQYIVRVEESYATICLSGFAALDMPPRQGPLWILGDVFLGAYHTVFDFGNHRIGFAKAA 483
>gi|334186351|ref|NP_001190671.1| phytepsin [Arabidopsis thaliana]
gi|332656991|gb|AEE82391.1| phytepsin [Arabidopsis thaliana]
Length = 504
Score = 382 bits (982), Expect = e-104, Method: Compositional matrix adjust.
Identities = 174/295 (58%), Positives = 225/295 (76%), Gaps = 4/295 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVE+GLV E +FSFWLNR+P EGGEIVFGGVDPKHFKG+HT+VPVT KGYWQF++GD+
Sbjct: 214 MVEKGLVKEPIFSFWLNRNPKDPEGGEIVFGGVDPKHFKGEHTFVPVTHKGYWQFDMGDL 273
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I + TG C GC+AI DSGTSLL GP+ V+T INHAIG +G+VS ECK VV QYG
Sbjct: 274 QIAGKPTGYCAKGCSAIADSGTSLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGKT- 332
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+++ LL +KVC QIG+CA++G + VS GI++VV+ + A+CSACEMA VW++
Sbjct: 333 ---MLNSLLAQKVCSQIGVCAYDGTQSVSMGIQSVVDDGTSGLLNQAMCSACEMAAVWME 389
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
++L Q QT+E++L+Y ELCD +P +S +DC R+ +MP V+F+IG + F+L+P+ YI
Sbjct: 390 SELTQNQTQERILAYAAELCDHIPTQNQQSAVDCGRVSSMPIVTFSIGGRSFDLTPQDYI 449
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
K GEG+ C SGF A D+ PPRGPLWILGD+FMG YHTVFD GK R+GFA+AA
Sbjct: 450 FKIGEGVESQCTSGFTAMDIAPPRGPLWILGDIFMGPYHTVFDYGKGRVGFAKAA 504
>gi|356565563|ref|XP_003551009.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 494
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 177/298 (59%), Positives = 226/298 (75%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQG VS+++FS WLN+DP A+ GGEIVFGG+D +HFKG HTYVP+T+K YWQ E+GDI
Sbjct: 198 MIEQGHVSQKIFSLWLNQDPVAKVGGEIVFGGIDWRHFKGDHTYVPLTQKDYWQIEVGDI 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI N TG+CEGGCAAI+DSGTSL+AGPT +VT+IN AIG EG VS ECK ++ YGD I
Sbjct: 258 LIANNPTGLCEGGCAAIIDSGTSLIAGPTKIVTQINRAIGAEGYVSYECKNIIHNYGDSI 317
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVV 177
W+ ++SGL PE +C IGLC+ E S I+T E+ +S +C+ C+M V
Sbjct: 318 WEYIISGLKPEIICVDIGLCSLY-LETCSDVIETATHNESWGESRTKESPLCTFCDMIVF 376
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q QLKQK TKEK+L Y++ELC+ LPNP+G++ IDC+ I MP ++FTIG+K F LSPE
Sbjct: 377 WMQVQLKQKNTKEKILKYVDELCEKLPNPVGQTFIDCNDIANMPQITFTIGNKSFPLSPE 436
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QY+L+ EG VC GF+ D+P P+GPLW+LGD+F+G YHTVFD G LRIGFAEAA
Sbjct: 437 QYMLRIEEGCNTVCYGGFVPLDVPAPQGPLWVLGDLFLGAYHTVFDYGNLRIGFAEAA 494
>gi|359477267|ref|XP_002275241.2| PREDICTED: aspartic proteinase [Vitis vinifera]
Length = 502
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 175/296 (59%), Positives = 220/296 (74%), Gaps = 7/296 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LVS E+ SFWLNRDP A+ GGE++FGGVD +HF G HT+VP+T+K YWQ E+GDI
Sbjct: 210 MVQQSLVSMEIVSFWLNRDPKAKIGGEVIFGGVDWRHFMGDHTFVPITRKDYWQIEVGDI 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG CEGGCAAIVD+GTS++AGPT VVT+INHAIG EG+VS CK VV++YG LI
Sbjct: 270 LIAGSSTGFCEGGCAAIVDTGTSMIAGPTTVVTQINHAIGAEGIVSFNCKNVVNKYGRLI 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVS--TGIKTVVEKENVSAGDSAVCSACEMAVVW 178
W LVSG PE VC IGLCA+NG + G++TV+ GD+A C+ CEM W
Sbjct: 330 WQFLVSGFQPENVCSDIGLCAYNGTKNARQGAGMETVIGN-----GDNAACTFCEMIAFW 384
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+Q QLK+ + KEKV Y+NELC++LPNP G+ ++CD + TMP +SF IGDK F L+ EQ
Sbjct: 385 IQVQLKEHKAKEKVFQYVNELCENLPNPGGKDFVNCDALATMPVISFAIGDKYFPLTAEQ 444
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
Y LK VC+SGF A D+P P GPLW+LGDVF+G YHT+FD G L++GFA++
Sbjct: 445 YTLKVEVNCTTVCLSGFTALDVPRPDGPLWVLGDVFLGAYHTIFDFGNLQVGFAKS 500
>gi|357450315|ref|XP_003595434.1| Aspartic proteinase [Medicago truncatula]
gi|355484482|gb|AES65685.1| Aspartic proteinase [Medicago truncatula]
Length = 507
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 173/301 (57%), Positives = 227/301 (75%), Gaps = 6/301 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQG +S++VFS W N+DP AE GGEIVFGGVD +HF+G HTY P+++KGYWQ E+GDI
Sbjct: 207 MIEQGHISDKVFSLWFNKDPMAEVGGEIVFGGVDKRHFRGDHTYFPISQKGYWQIEVGDI 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ N +TG+CEGGCAAIVDSGTSL+AGPT VVT+INH IG EG VS ECK +V YG+LI
Sbjct: 267 LLANNTTGLCEGGCAAIVDSGTSLIAGPTGVVTQINHVIGTEGYVSYECKNIVHNYGNLI 326
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG---DSAVCSACEMAVV 177
W+ L+SGL PE +C I LC+ NG + ++ I+TVV E+ +S CS C M V+
Sbjct: 327 WESLISGLNPEILCADIRLCSDNGFQRMNDVIETVVHNESRDGSPLKESLFCSFCNMVVL 386
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+Q Q+KQ KEKVL Y++ELC+ LPNP+G+S I+C + MP+++FT G+K+F LSPE
Sbjct: 387 WMQVQIKQSNVKEKVLKYVDELCEKLPNPVGQSFINCSSVSDMPHITFTFGNKLFPLSPE 446
Query: 238 QYILKT---GEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
QY+L+ E + VC SGF+A D+P P+GPLW++GD+F+ YHTVFD LRIGFAE+
Sbjct: 447 QYVLRVESDDEDCSPVCYSGFVALDVPSPQGPLWVVGDIFLQAYHTVFDYANLRIGFAES 506
Query: 295 A 295
Sbjct: 507 T 507
>gi|56182674|gb|AAV84086.1| aspartic proteinase 12 [Fagopyrum esculentum]
Length = 387
Score = 375 bits (964), Expect = e-102, Method: Compositional matrix adjust.
Identities = 174/279 (62%), Positives = 219/279 (78%), Gaps = 5/279 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+SE VFSFW NR+ + EEGGE+VFGG+DP HF+G+HTYVPVT+KGYWQF++ D+
Sbjct: 110 MIDQGLISEPVFSFWFNRNAEEEEGGELVFGGIDPDHFRGQHTYVPVTQKGYWQFDMDDV 169
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GGCAAI DSGTSLLAGP VV +INHAIG G+VS ECK VV++YG I
Sbjct: 170 LIDGMSTGFCAGGCAAIADSGTSLLAGPMAVVAQINHAIGATGIVSQECKTVVAEYGKEI 229
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAV----CSACEMAV 176
++L+S P K+C Q+GLC F+G VS GI++VV+K NV ++ C ACEMAV
Sbjct: 230 IEMLLSEAQPLKICSQVGLCTFDGTRGVSMGIESVVDK-NVXKSSGSLKEXKCVACEMAV 288
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QN+L Q QT+E +L Y N+LC+ LP+PMGES +DC + T+P+VSFTIG K F+L+P
Sbjct: 289 VWIQNRLIQNQTEELILDYANQLCERLPSPMGESAVDCSSLSTLPDVSFTIGGKTFDLAP 348
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
EQY+L+ GEG A CISGF+A D+PPPRGPLWILGDVFM
Sbjct: 349 EQYVLQVGEGPAAQCISGFIALDVPPPRGPLWILGDVFM 387
>gi|357135633|ref|XP_003569413.1| PREDICTED: aspartic proteinase oryzasin-1-like [Brachypodium
distachyon]
Length = 560
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 176/299 (58%), Positives = 218/299 (72%), Gaps = 5/299 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV E+ FSFWLNRD + EGGEIVFGG DPKH+KG HTY VT+K YWQFE+GD
Sbjct: 263 MIDQGLVKEKSFSFWLNRDANDGEGGEIVFGGSDPKHYKGSHTYTRVTRKAYWQFEMGDF 322
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG +STG+C GCAAI DSGTSL+AGP V+ +IN IG GV + ECK VV+ YG +
Sbjct: 323 LIGGKSTGICVDGCAAIADSGTSLIAGPVAVIAQINEKIGANGVANEECKQVVAGYGQQM 382
Query: 121 WDLL-VSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV-EKENVSAGD--SAVCSACEMAV 176
+LL P +VC +IGLC F+G VS GIK+VV E + + G A C+ACEMAV
Sbjct: 383 IELLEAKQTAPAQVCSKIGLCTFDGTRAVSAGIKSVVGEAQKTALGGMFDATCNACEMAV 442
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
W+Q++ +TKE L Y+N LCD +P+P+G S +DC I ++ +VSF+IG KIF L P
Sbjct: 443 TWMQSEFVHNRTKEDTLEYVNRLCDHMPSPVGSS-VDCRHIDSLQSVSFSIGGKIFELKP 501
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
EQYILK G+G CISGF A D+PPP GPLWILGDVFMG YHT+FD GK+R+GFAE+A
Sbjct: 502 EQYILKVGDGFMARCISGFTALDIPPPVGPLWILGDVFMGAYHTIFDYGKMRVGFAESA 560
>gi|302756359|ref|XP_002961603.1| hypothetical protein SELMODRAFT_230037 [Selaginella moellendorffii]
gi|300170262|gb|EFJ36863.1| hypothetical protein SELMODRAFT_230037 [Selaginella moellendorffii]
Length = 423
Score = 369 bits (947), Expect = e-100, Method: Compositional matrix adjust.
Identities = 166/295 (56%), Positives = 221/295 (74%), Gaps = 3/295 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQ LV E VFSFWLNRD GGE+V GGVDPKHFKGKH Y P+T++GYW+ +GD+
Sbjct: 132 IVEQQLVKEPVFSFWLNRDATDGNGGELVLGGVDPKHFKGKHNYAPITREGYWEIRMGDV 191
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI TG+C GCAAIVDSGTSLLAGP+ ++ EINHAIG GVVS ECKL+V QYG++I
Sbjct: 192 LIDGHGTGMCSKGCAAIVDSGTSLLAGPSAIIAEINHAIGASGVVSQECKLIVDQYGNII 251
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+LL++ + P+KVC Q+G+C+ E I +V++KE + C ACE AV+W++
Sbjct: 252 INLLLAQVSPDKVCSQLGVCSATRNE---PDIASVLDKEREGIDNDLACEACERAVIWIE 308
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +++E+++SY++ELC LP+P GES +DC + MP +SFTI ++ + LSPEQYI
Sbjct: 309 NQLRKNRSREEIVSYLDELCSRLPSPNGESAVDCSSVSRMPKISFTIANRNYELSPEQYI 368
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LK G+G + C+SGF+ D+P P GPLWILGD+FMGVYHTVFD G ++GFA AA
Sbjct: 369 LKIGDGNKKQCLSGFIGLDVPAPAGPLWILGDIFMGVYHTVFDFGNKQVGFAPAA 423
>gi|168031065|ref|XP_001768042.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680680|gb|EDQ67114.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 455
Score = 369 bits (947), Expect = e-100, Method: Compositional matrix adjust.
Identities = 172/297 (57%), Positives = 214/297 (72%), Gaps = 2/297 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV E VFSFWLNRD + E+GGE+V GGVDP HFKG HTY PVT+ GYWQF++GD+
Sbjct: 159 MLYQKLVQEPVFSFWLNRDVEGEKGGELVLGGVDPHHFKGNHTYTPVTRLGYWQFDMGDV 218
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+ QSTG C GGCAAI DSGTSLLAGPT +V EIN+AIG G++S ECKLVV QY D I
Sbjct: 219 LLDGQSTGFCAGGCAAIADSGTSLLAGPTGIVAEINYAIGATGIISGECKLVVDQYADFI 278
Query: 121 WDLLVSGLL-PEKVCQQIGLCAF-NGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVW 178
+L+S LL P K+C + G C G + I +V+EK G+ C CEM V+W
Sbjct: 279 IQMLMSKLLTPLKICAKAGACLVEEGTSTRNPNIASVLEKHENDLGNGVTCVFCEMVVIW 338
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
QNQL++ T+ ++ ++N+LC+ LPNP GES++DC+ + +MP+VSFTI F L+PEQ
Sbjct: 339 AQNQLRKNGTQAQIKEHLNQLCERLPNPNGESMVDCNSLSSMPDVSFTISGTTFKLTPEQ 398
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
Y+LK GEG C SGF+ D+PPP GPLWILGDVFMG YHTVFD G R+GFA AA
Sbjct: 399 YVLKVGEGDDAQCTSGFLGIDIPPPAGPLWILGDVFMGAYHTVFDFGNQRLGFALAA 455
>gi|302761354|ref|XP_002964099.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
gi|300167828|gb|EFJ34432.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
Length = 497
Score = 369 bits (947), Expect = e-99, Method: Compositional matrix adjust.
Identities = 174/295 (58%), Positives = 223/295 (75%), Gaps = 6/295 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV+ FKGKH Y PVT++GYWQF +GD
Sbjct: 207 MLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVNKDRFKGKHVYTPVTREGYWQFNMGD 266
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+++ QSTG C GCAAI DSGTSLL GPT +V +IN AIG G+VS ECK+VV+QYGDL
Sbjct: 267 VVVDGQSTGFCAKGCAAIADSGTSLLVGPTGIVAQINQAIGATGLVSEECKMVVAQYGDL 326
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL++ + P+KVC Q G+C + I +V++KEN GD +CS CEMAVV V
Sbjct: 327 IVELLLAQVTPDKVCAQAGVCTLRND---NPHIASVLDKENQKVGDHGLCSVCEMAVVSV 383
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL+Q TK+++ +N+LC+ LP+P G+S +DC +I ++PNVSFTI +++F L+P+QY
Sbjct: 384 QNQLRQNPTKQQI--DLNQLCERLPSPNGQSFVDCAKISSLPNVSFTIANQMFELTPKQY 441
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
IL+ GEG A CISGF D+ PP GP+WILGDVFMGVYHTVFD G RIGFA+A
Sbjct: 442 ILQVGEGAAAQCISGFTGMDVAPPAGPIWILGDVFMGVYHTVFDFGNKRIGFAKA 496
>gi|302775562|ref|XP_002971198.1| hypothetical protein SELMODRAFT_147484 [Selaginella moellendorffii]
gi|300161180|gb|EFJ27796.1| hypothetical protein SELMODRAFT_147484 [Selaginella moellendorffii]
Length = 423
Score = 367 bits (943), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 166/295 (56%), Positives = 220/295 (74%), Gaps = 3/295 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQ LV E VFSFWLNRD GGE+V GGVDPKHFKGKH Y P+T++GYW+ +GD+
Sbjct: 132 IVEQQLVKEPVFSFWLNRDATDGNGGELVLGGVDPKHFKGKHNYAPITREGYWEIRMGDV 191
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI TG+C GCAAIVDSGTSLLAGP+ ++ EINHAIG GVVS ECKL+V QYG++I
Sbjct: 192 LIDGHGTGMCSKGCAAIVDSGTSLLAGPSAIIAEINHAIGASGVVSQECKLIVDQYGNII 251
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
+LL++ + P+KVC Q+G+C+ E I +V++KE + C ACE AV+W++
Sbjct: 252 INLLLAQVSPDKVCSQLGVCSATRNE---PDIASVLDKEREGIDNDLACEACERAVIWIE 308
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +++E+++SY++ELC LP+P GES +DC + MP +SFTI + + LSPEQYI
Sbjct: 309 NQLRKNRSREEIVSYLDELCSRLPSPNGESAVDCSSVSRMPKISFTIANHNYELSPEQYI 368
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LK G+G + C+SGF+ D+P P GPLWILGD+FMGVYHTVFD G ++GFA AA
Sbjct: 369 LKIGDGNKKQCLSGFIGLDVPAPAGPLWILGDIFMGVYHTVFDFGNKQVGFALAA 423
>gi|356547093|ref|XP_003541952.1| PREDICTED: LOW QUALITY PROTEIN: cyprosin-like, partial [Glycine
max]
Length = 470
Score = 366 bits (940), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 171/298 (57%), Positives = 224/298 (75%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+++ VFSFWLNR+ D ++GG+IVFGGVD H+ G+HTYVPVT KGYWQ E+GD+
Sbjct: 174 MLNQHLLAQPVFSFWLNRNTDEKQGGQIVFGGVDSDHYXGEHTYVPVTHKGYWQTEIGDV 233
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI ++T C C+AI DSGTSLLAGPT + +INHAIG GVV+ ECK VV+QYG I
Sbjct: 234 LIDRKTTEFCASKCSAIDDSGTSLLAGPTGAIAQINHAIGAVGVVNQECKAVVAQYGKTI 293
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK---ENVSAGDSAVCSACEMAVV 177
D L++ LP++VC Q LC F+G + VS GI++VV+K + + + A C+ACEMAVV
Sbjct: 294 LDKLINEALPQQVCSQX-LCTFDGTKGVSMGIQSVVDKTIEKTSYSWNDAGCTACEMAVV 352
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W++N L+ +T++++L Y N LCD LP+P GES+++C + MPNVSFTIG K+F LSPE
Sbjct: 353 WIKNPLRLNETEDQILDYANALCDMLPSPNGESVVECSTLSEMPNVSFTIGGKVFELSPE 412
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK G+G CI GF+A D+ PPRGPLWILGD+FMG YHTVF G ++GFAE+A
Sbjct: 413 QYILKVGKGATAQCIRGFIALDIAPPRGPLWILGDIFMGRYHTVFFYGNKKVGFAESA 470
>gi|8272388|dbj|BAA96446.1| aspartic endopeptidase [Pyrus pyrifolia]
Length = 273
Score = 366 bits (940), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 167/273 (61%), Positives = 215/273 (78%), Gaps = 3/273 (1%)
Query: 26 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 85
GEIVFGGVD HFKG+HTYVPVT+KGYWQF++GD+LI +S+G C GC+AI DSGTSLL
Sbjct: 1 GEIVFGGVDSSHFKGEHTYVPVTQKGYWQFDMGDVLIDGESSGFCANGCSAIADSGTSLL 60
Query: 86 AGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGA 145
AGPT VVT+INHAIG GVVS ECK VV QYG I ++L++ P+K+C QIG C F+G
Sbjct: 61 AGPTTVVTQINHAIGASGVVSQECKTVVEQYGKTIIEMLMAKSQPQKICSQIGFCTFDGT 120
Query: 146 EYVSTGIKTVVEKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDS 202
VS GI+++V++ D A C+ACEM VV +Q +L++ QT+E++L Y+N+LC+
Sbjct: 121 RGVSPGIESLVDQNPEKQSDGVHDATCAACEMPVVLMQIRLRKNQTEEQILDYVNQLCER 180
Query: 203 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 262
LP+P GES++ CD + ++P+VSFTIG K+F+L+PEQY+LK GEG+A CISGF+A D+ P
Sbjct: 181 LPSPSGESVVQCDSLSSLPSVSFTIGGKVFDLAPEQYVLKVGEGVAAQCISGFIALDVAP 240
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PRGPLWILGD+FMG YHTVFD G L +GFAEAA
Sbjct: 241 PRGPLWILGDIFMGRYHTVFDYGNLSVGFAEAA 273
>gi|168033581|ref|XP_001769293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679399|gb|EDQ65847.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 175/296 (59%), Positives = 217/296 (73%), Gaps = 17/296 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV E VFSFWLNRD D +GGE+VFGGVDP HFKG+HTY PVT+KGYWQF+LGD
Sbjct: 206 MLDQGLVKEPVFSFWLNRDSDESDGGELVFGGVDPDHFKGEHTYTPVTRKGYWQFDLGD- 264
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
GC+AI DSGTSLLAGP+ +V EIN AIG G+VS +CK+VV QYG+ I
Sbjct: 265 ------------GCSAIADSGTSLLAGPSGIVAEINQAIGATGIVSQQCKMVVQQYGEQI 312
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENV-SAGDSAVCSACEMAVVWV 179
++LV+ + P KVC +GLC E GI +V+EKE V S C+ CEMA+VW
Sbjct: 313 VEMLVAQMNPGKVCASLGLCQLAAGE---PGIASVLEKEEVHSLHADPRCTVCEMALVWA 369
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL+ +TKE++ +Y+N+LC+ LP+P GES +DC+ + MPNV FTI K F L+PEQY
Sbjct: 370 QNQLRMNRTKEEIDAYLNQLCERLPSPNGESAVDCNALSYMPNVGFTIAGKSFELTPEQY 429
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILK GEG + C+SGF+ D+PPP GPLWILGDVFMGVYHTVFD G R+GFA+AA
Sbjct: 430 ILKIGEGPEKQCVSGFLGLDVPPPAGPLWILGDVFMGVYHTVFDFGNSRLGFAKAA 485
>gi|302761356|ref|XP_002964100.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
gi|300167829|gb|EFJ34433.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
Length = 503
Score = 365 bits (937), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 172/295 (58%), Positives = 223/295 (75%), Gaps = 6/295 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV+ FKGKH Y PVT++GYWQF +GD
Sbjct: 213 MLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVNKDRFKGKHVYTPVTREGYWQFNMGD 272
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+++ QSTG C GCAAI DSGTSLL GPT +V +IN AIG G+VS ECK+VV+QYGDL
Sbjct: 273 VVVDGQSTGFCAKGCAAIADSGTSLLVGPTGIVAQINQAIGATGLVSEECKMVVAQYGDL 332
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL++ + P+KVC Q G+C + I +V++KEN GD +CS CEMAVV V
Sbjct: 333 IVELLLAQVTPDKVCAQAGVCTLRND---NPHIASVLDKENQKVGDDVLCSVCEMAVVSV 389
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQL+Q TK+++ +N+LC+ LP+P G+S+++C +I ++PNVSFTI +++F L+P+QY
Sbjct: 390 QNQLRQNPTKQQI--DLNQLCERLPSPNGQSLVECAKISSLPNVSFTIANQMFELTPKQY 447
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
IL+ GEG A CISGF D+ PP P+WILGDVFMGVYHTVFD G RIGFA+A
Sbjct: 448 ILQVGEGAAAQCISGFTGMDVAPPAVPIWILGDVFMGVYHTVFDFGNKRIGFAKA 502
>gi|413953120|gb|AFW85769.1| hypothetical protein ZEAMMB73_486102 [Zea mays]
Length = 267
Score = 364 bits (935), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 169/267 (63%), Positives = 207/267 (77%), Gaps = 4/267 (1%)
Query: 33 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 92
+D H+KG HT+VPVT+KGYWQF +GD+L+ +STG C GGCAA+ DSGTSLLAGPT ++
Sbjct: 1 MDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAMADSGTSLLAGPTAII 60
Query: 93 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 152
TEIN IG GVVS ECK VVSQYG I DLL++ P K+C Q+GLC F+G VS GI
Sbjct: 61 TEINEKIGVAGVVSQECKTVVSQYGQQILDLLLAETQPAKICSQVGLCTFDGTHGVSAGI 120
Query: 153 KTVVE----KENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMG 208
++VV+ K N +C+ACEMAVVW+QNQL Q +T+E +L+YIN+LC+ LP+PMG
Sbjct: 121 RSVVDDEAGKSNGGLKSDPMCNACEMAVVWMQNQLAQNKTQELILNYINQLCERLPSPMG 180
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
ES +DC + +MP+++FTIG K F L PEQYILK GEG A CISGF A D+PPPRGPLW
Sbjct: 181 ESAVDCGSLASMPDIAFTIGGKKFKLKPEQYILKVGEGQAAQCISGFKAMDIPPPRGPLW 240
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 241 ILGDVFMGVYHTVFDYGKLRVGFAESA 267
>gi|75267434|sp|Q9XFX3.1|CARDA_CYNCA RecName: Full=Procardosin-A; Contains: RecName: Full=Cardosin-A
intermediate form 35 kDa subunit; Contains: RecName:
Full=Cardosin-A heavy chain; AltName: Full=Cardosin-A 31
kDa subunit; Contains: RecName: Full=Cardosin-A
intermediate form 30 kDa subunit; Contains: RecName:
Full=Cardosin-A light chain; AltName: Full=Cardosin-A 15
kDa subunit; Flags: Precursor
gi|4581209|emb|CAB40134.1| preprocardosin A [Cynara cardunculus]
Length = 504
Score = 361 bits (927), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 172/298 (57%), Positives = 220/298 (73%), Gaps = 4/298 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV E FSFWLNR+ D EEGGE+VFGG+DP HF+G HTYVPVT + YWQF +GD+
Sbjct: 208 MLNQGLVKERRFSFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDV 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG GV++ +CK VVS+YG I
Sbjct: 268 LIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIGANGVMNQQCKTVVSRYGRDI 327
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG---DSAVCSACEMAVV 177
++L S + P+K+C + LC F+GA VS+ I++VV+K N + +C+ CEMAVV
Sbjct: 328 IEMLRSKIQPDKICSHMKLCTFDGARDVSSIIESVVDKNNDKSSGGIHDEMCTFCEMAVV 387
Query: 178 WVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPE 237
W+QN++KQ +T++ +++Y NELC+ L E +DC+ + +MPNVSFTIG K F L+PE
Sbjct: 388 WMQNEIKQSETEDNIINYANELCEHLSTSSEELQVDCNTLSSMPNVSFTIGGKKFGLTPE 447
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
QYILK G+G A CISGF A D GPLWILGDVFM YHTVFD G L +GFAEAA
Sbjct: 448 QYILKVGKGEATQCISGFTAMD-ATLLGPLWILGDVFMRPYHTVFDYGNLLVGFAEAA 504
>gi|357511711|ref|XP_003626144.1| Aspartic proteinase [Medicago truncatula]
gi|355501159|gb|AES82362.1| Aspartic proteinase [Medicago truncatula]
Length = 426
Score = 353 bits (907), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 166/240 (69%), Positives = 201/240 (83%), Gaps = 2/240 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ L+ E+VFSFWLN +P+A++GGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ E+GD
Sbjct: 182 MLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVTEKGYWQIEMGDF 241
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQYG+LI
Sbjct: 242 FIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQYGELI 301
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
WDLLVSG+ P VC Q+GLC+ G + S GI+ V +KE +SA D+ +CS+C+M V+W
Sbjct: 302 WDLLVSGVKPGDVCSQVGLCSIRGDQSNSAGIEMVTDKEQSELSAKDTPLCSSCQMLVLW 361
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
VQNQLKQK TKE+V +Y+N+LC+SLP+P GES+I C+ I MPN+SFTIG+K F L+PEQ
Sbjct: 362 VQNQLKQKATKERVFNYVNQLCESLPSPSGESVISCNDISKMPNISFTIGNKPFVLTPEQ 421
>gi|413934460|gb|AFW69011.1| hypothetical protein ZEAMMB73_821214 [Zea mays]
Length = 324
Score = 350 bits (898), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 163/265 (61%), Positives = 199/265 (75%), Gaps = 4/265 (1%)
Query: 8 SEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQST 67
S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+L+ +ST
Sbjct: 60 SDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKST 119
Query: 68 GVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSG 127
G C GGCAA+ DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I DLL++
Sbjct: 120 GFCAGGCAAMADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQILDLLLAE 179
Query: 128 LLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAVVWVQNQL 183
P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAVVW+QNQL
Sbjct: 180 TQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMAVVWMQNQL 239
Query: 184 KQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT 243
Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP++ FTIG K F L PEQYILK
Sbjct: 240 AQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIVFTIGGKKFKLKPEQYILKV 299
Query: 244 GEGIAEVCISGFMAFDLPPPRGPLW 268
GEG A CISGF A D+PPPRGPLW
Sbjct: 300 GEGQAAQCISGFTAMDIPPPRGPLW 324
>gi|3551952|gb|AAC34854.1| senescence-associated protein 4 [Hemerocallis hybrid cultivar]
Length = 517
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 165/273 (60%), Positives = 202/273 (73%), Gaps = 3/273 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV E VFSFWLNR + EGGEIVFGGVDP H KG+H YVPVT+KGYWQF++GD+
Sbjct: 215 MVEQGLVKEAVFSFWLNRKSEDGEGGEIVFGGVDPSHHKGEHVYVPVTQKGYWQFDMGDV 274
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+G QSTG CEGGCAAI DSGTSL+AGPT V+TEINH IG GVVS ECK VV QYG I
Sbjct: 275 LVGGQSTGFCEGGCAAIADSGTSLIAGPTTVITEINHKIGAAGVVSQECKAVVQQYGQQI 334
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-NVSAGDSAVCSACEMAVVWV 179
D+L++ P K+C QIGLC F+G VS GI++VV + S A+CSACEMAVVW+
Sbjct: 335 LDMLIAQTQPMKICSQIGLCTFDGTRGVSMGIESVVNGNVDKSVASDAMCSACEMAVVWM 394
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQ+K +T++ +L+YIN+LC+ LP+PMGES +DC + TMP++SFTIG K F+L+ EQY
Sbjct: 395 QNQIKHNKTQDLILNYINQLCERLPSPMGESAVDCSVLSTMPSISFTIGGKQFDLTAEQY 454
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGP--LWIL 270
+LK GEG A CI RG W++
Sbjct: 455 VLKVGEGPAAQCIKWIHCLGHSSSRGHSGYWVM 487
>gi|293335451|ref|NP_001169605.1| uncharacterized protein LOC100383486 precursor [Zea mays]
gi|224030337|gb|ACN34244.1| unknown [Zea mays]
Length = 556
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 165/296 (55%), Positives = 214/296 (72%), Gaps = 4/296 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV++ VFSFWLNR+P EGGEIVFGG D +H+KG HTY VT+KGYWQFE+GD
Sbjct: 264 MVNQNLVAQPVFSFWLNRNPFDGEGGEIVFGGSDEQHYKGSHTYTRVTRKGYWQFEMGDF 323
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG +STG+C GCAAI DSGTSL+AGP + +IN IG GVV+ ECK VV+ YG I
Sbjct: 324 LIGGRSTGICVDGCAAIADSGTSLIAGPLVAIAQINEQIGAAGVVNQECKQVVAGYGLQI 383
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVS-TGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
LL + P +VC ++GLC F+G VS GI++V +V A+C+ACE+ V W
Sbjct: 384 AGLLEAQTPPSEVCSKVGLCTFDGTRGVSAAGIESV--PGSVDGMAEALCNACEIVVFWT 441
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
Q++L ++ E L Y++ LC+S+P+P+G S +DC R+ ++ V+F+IG + F L P+QY
Sbjct: 442 QSELSPNRSNEGTLEYVDRLCESMPDPVG-SRVDCGRVGSLQTVAFSIGGRAFELRPDQY 500
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+LK GEG A CISGF A D+PPP GPLWILGDVFMG YHT+FD GK+RIGFA++A
Sbjct: 501 VLKVGEGFAAHCISGFTALDVPPPVGPLWILGDVFMGAYHTIFDYGKMRIGFADSA 556
>gi|242053731|ref|XP_002456011.1| hypothetical protein SORBIDRAFT_03g028820 [Sorghum bicolor]
gi|241927986|gb|EES01131.1| hypothetical protein SORBIDRAFT_03g028820 [Sorghum bicolor]
Length = 567
Score = 343 bits (879), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 167/300 (55%), Positives = 215/300 (71%), Gaps = 11/300 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
MV Q LV + VFSFWLNR+P D EEGGEIVFGG D +H+KG HTY VT+K YWQFE+GD
Sbjct: 274 MVNQSLVPQPVFSFWLNRNPFDGEEGGEIVFGGSDEQHYKGSHTYTRVTRKAYWQFEMGD 333
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
LIG +STG+C GCAAI DSGTSL+AGP + +IN IG GVV+ ECK VV+ YG
Sbjct: 334 FLIGERSTGICVDGCAAIADSGTSLIAGPLVAIAQINEQIGAAGVVNHECKQVVAGYGLE 393
Query: 120 IWDLLVSGLL-PEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMA 175
+ +LL + P +VC +IGLC +G VS GI++V +GD A+C+ACEM
Sbjct: 394 MVELLKAQQTPPSQVCSKIGLCTLDGTHGVSAGIESV-----SGSGDGMSEAICNACEMI 448
Query: 176 VVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLS 235
V W+Q++ +TKE L Y++ LC+++P+P+G S +DC I ++ V+F+IG + F L
Sbjct: 449 VFWMQSEFNTNKTKEGTLEYVDRLCENMPDPVG-SHVDCRHIGSLQTVAFSIGGRAFELR 507
Query: 236 PEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
P+QYIL+ GEG A CISGF A D+PPP GPLWILGDVFMG YHT+FD GK+RIGFA++A
Sbjct: 508 PDQYILRVGEGFAAHCISGFTALDIPPPIGPLWILGDVFMGAYHTIFDYGKMRIGFADSA 567
>gi|414881317|tpg|DAA58448.1| TPA: hypothetical protein ZEAMMB73_088821 [Zea mays]
Length = 557
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 166/302 (54%), Positives = 216/302 (71%), Gaps = 15/302 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV++ VFSFWLNR+P EGGEIVFGG D +H+KG HTY VT+KGYWQFE+GD
Sbjct: 264 MVNQNLVAQPVFSFWLNRNPFDGEGGEIVFGGSDEQHYKGSHTYTRVTRKGYWQFEMGDF 323
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG +STG+C GCAAI DSGTSL+AGP + +IN IG GVV+ ECK VV+ YG
Sbjct: 324 LIGGRSTGICVDGCAAIADSGTSLIAGPLVAIAQINEQIGAAGVVNQECKQVVAGYG--- 380
Query: 121 WDLLVSGLL------PEKVCQQIGLCAFNGAEYVS-TGIKTVVEKENVSAGDSAVCSACE 173
L ++GLL P +VC ++GLC F+G VS GI++V +V A+C+ACE
Sbjct: 381 --LQIAGLLEAQQTPPSEVCSKVGLCTFDGTRGVSAAGIESV--PGSVDGMAEALCNACE 436
Query: 174 MAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFN 233
+ V W Q++L ++ E L Y++ LC+S+P+P+G S +DC R+ ++ V+F+IG + F
Sbjct: 437 IVVFWTQSELSPNRSNEGTLEYVDRLCESMPDPVG-SRVDCGRVGSLQTVAFSIGGRAFE 495
Query: 234 LSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 293
L P+QY+LK GEG A CISGF A D+PPP GPLWILGDVFMG YHT+FD GK+RIGFA+
Sbjct: 496 LRPDQYVLKVGEGFAAHCISGFTALDVPPPVGPLWILGDVFMGAYHTIFDYGKMRIGFAD 555
Query: 294 AA 295
+A
Sbjct: 556 SA 557
>gi|413946823|gb|AFW79472.1| hypothetical protein ZEAMMB73_587615 [Zea mays]
Length = 488
Score = 334 bits (856), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 152/239 (63%), Positives = 194/239 (81%), Gaps = 3/239 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGG-EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M +Q LV+++VFSFWLNRDPDA GG E+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD
Sbjct: 252 MKQQKLVAKDVFSFWLNRDPDASSGGGELVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGD 311
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
++IG STG C GGCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVS+YG++
Sbjct: 312 LIIGGHSTGFCAGGCAAIVDSGTSLLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEM 371
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I +LL+S P+KVC QIGLC F+GA VS I++VVEK+ G C+ACEMAVVW+
Sbjct: 372 ILELLISQTSPQKVCTQIGLCVFDGAHSVSNPIESVVEKQK--RGSDLFCTACEMAVVWI 429
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
QNQL++ +TKE +L+Y N+LC+ LP+P GES +DC +I MPN++FTI +K F L+PEQ
Sbjct: 430 QNQLRENKTKELILNYANQLCERLPSPNGESTVDCHQISKMPNLAFTIANKTFTLTPEQ 488
>gi|413948512|gb|AFW81161.1| hypothetical protein ZEAMMB73_941917 [Zea mays]
Length = 243
Score = 327 bits (839), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 155/243 (63%), Positives = 188/243 (77%), Gaps = 4/243 (1%)
Query: 57 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 116
+GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQY
Sbjct: 1 MGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQY 60
Query: 117 GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSAC 172
G I DLL++ P K+C Q+GLC F+G VSTGI++VV+ K N +C+AC
Sbjct: 61 GQQILDLLLAETQPAKICSQVGLCTFDGTHGVSTGIRSVVDDKAGKSNGGLKSDPMCNAC 120
Query: 173 EMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIF 232
EMAVVW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F
Sbjct: 121 EMAVVWMQNQLAQNKTQELILTYINQLCERLPSPMGESAVDCASLGSMPDIAFTIGGKKF 180
Query: 233 NLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
L PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFA
Sbjct: 181 KLKPEQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFA 240
Query: 293 EAA 295
E+A
Sbjct: 241 ESA 243
>gi|326510801|dbj|BAJ91748.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 143/242 (59%), Positives = 190/242 (78%), Gaps = 2/242 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L++++VFSFWLNRD DA GGE+VFGG+DP H+KG HTYVPV++KGYWQF +GD+
Sbjct: 206 MQEQKLLADDVFSFWLNRDSDALSGGELVFGGMDPHHYKGNHTYVPVSRKGYWQFNMGDL 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI STG C GCAAIVDSGTSLLAGPT +V ++NHAIG EG++S ECK VVSQYG++I
Sbjct: 266 LIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKEVVSQYGEMI 325
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
++L++ P+KVC QIGLC F+G + VS GI+++V KENV G +C+ACEMAVVW++
Sbjct: 326 LEMLIAQTQPQKVCSQIGLCLFDGTQSVSNGIESIVGKENV--GSDLMCTACEMAVVWIE 383
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
NQL++ +TKE +L Y N+LC+ LP+P GES + C + MPN++F I +K F L+PEQ +
Sbjct: 384 NQLRENKTKELILQYANQLCERLPSPNGESTVSCHEMSKMPNLAFAIANKTFVLTPEQVL 443
Query: 241 LK 242
+
Sbjct: 444 FR 445
>gi|255567717|ref|XP_002524837.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223535897|gb|EEF37557.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 456
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 142/242 (58%), Positives = 187/242 (77%), Gaps = 4/242 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV +GLV E+VFSFWLNR+ AEEGGEIVFGG+DP H+KG+HTYVPVT+KGYWQF++G++
Sbjct: 211 MVNEGLVKEQVFSFWLNRNVQAEEGGEIVFGGMDPNHYKGQHTYVPVTQKGYWQFDMGEV 270
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN+ TG+C GC AI DSGTSLLAGPT V+T+INHAIG G+VS ECK VV QYG I
Sbjct: 271 LIGNEITGLCADGCKAIADSGTSLLAGPTTVITQINHAIGASGIVSQECKTVVEQYGKFI 330
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD----SAVCSACEMAV 176
++L + P+K+C QIG C F+G + VST I++VV+K +A D + C+ CEM V
Sbjct: 331 LEMLTAQAQPQKICSQIGFCTFDGTQGVSTNIESVVDKSKETASDGLQQDSACTVCEMIV 390
Query: 177 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
VW+QN+L+ +T +++L+Y+N+LCD LP+P GES +DC + +MP VSFTIG K F L+
Sbjct: 391 VWMQNRLRLNETVDQILNYVNKLCDRLPSPNGESAVDCSSLSSMPIVSFTIGGKAFKLTA 450
Query: 237 EQ 238
+Q
Sbjct: 451 DQ 452
>gi|148910494|gb|ABR18322.1| unknown [Picea sitchensis]
Length = 471
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 135/223 (60%), Positives = 167/223 (74%), Gaps = 2/223 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGL+ E VFSFW+NR EEGGEIVFGGVDP HFKGKHTYVPVT++GYWQF +GD
Sbjct: 213 MVNQGLIKEPVFSFWMNRKVGDEEGGEIVFGGVDPNHFKGKHTYVPVTREGYWQFNMGDF 272
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG QSTG C GGCAAIVDSGTSLLAGP+ +V +IN AIG G+ S ECK VVSQYGDLI
Sbjct: 273 LIGGQSTGFCSGGCAAIVDSGTSLLAGPSGIVAQINEAIGASGLASQECKSVVSQYGDLI 332
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVW 178
+LL++ P+KVC QIGLC +G V I +V+EK N S S +C+ACEMAVVW
Sbjct: 333 MELLMAQTNPQKVCSQIGLCLSDGTRDVGMRIASVLEKGNEATSTSSSGMCAACEMAVVW 392
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
+NQ+ + +K+++++Y+N+LCD LPNP G++ +DC P
Sbjct: 393 AKNQIARNASKDQIMTYLNQLCDRLPNPNGQAAVDCKTYQACP 435
>gi|384245845|gb|EIE19337.1| putative aspartic protease [Coccomyxa subellipsoidea C-169]
Length = 508
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 148/300 (49%), Positives = 200/300 (66%), Gaps = 9/300 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV E VFSFWLNR+ + GGE+V GGVDP H+ G+H +V VT++ YWQF+LG I
Sbjct: 210 MVQQGLVPEPVFSFWLNRNDPSGPGGELVLGGVDPSHYTGEHLWVNVTRRAYWQFDLGGI 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+ ++ C GC AI DSGTSL+ GP+ + EIN AIG +GV+ AEC+ +V QY I
Sbjct: 270 SVPGTNS-PCADGCQAIADSGTSLIVGPSDEIAEINRAIGAKGVLPAECRELVRQYVPEI 328
Query: 121 WDLLVSGLLPEKVCQQIGLCAFN-----GAEYVSTGIKTVVEKENVSAGDSAVCSACEMA 175
++S L E+VC IGLC+ + GA + + +VE E + A D VC CEMA
Sbjct: 329 MKAVIS-LPEEQVCGAIGLCSASSLHRGGAAKAAASRRLLVEDEALGAPDP-VCQFCEMA 386
Query: 176 VVWVQNQLKQKQTKEKVLSYINELCDSLP-NPMGESIIDCDRIPTMPNVSFTIGDKIFNL 234
V +V+ L +T+E+++ ++ LCD+L ++++DC+ IP+MP V+FTI K F L
Sbjct: 387 VSYVKIALANHETQEQIIGQLDGLCDTLAIFSSSQALVDCEAIPSMPPVTFTIAGKKFTL 446
Query: 235 SPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
S E Y+L+ G A C+SGFM DLPPP GPLWILGDVFMG YHTVFD G R+GFA++
Sbjct: 447 SAEDYVLQVSAGGATQCVSGFMGLDLPPPAGPLWILGDVFMGAYHTVFDVGNERVGFADS 506
>gi|4586590|dbj|BAA76427.1| aspartic proteinase [Cicer arietinum]
Length = 204
Score = 290 bits (741), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 132/204 (64%), Positives = 166/204 (81%), Gaps = 2/204 (0%)
Query: 94 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 153
EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P +C Q+GLC+ + S GI+
Sbjct: 1 EINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVNPGDICSQVGLCSVRSDQSKSAGIE 60
Query: 154 TVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESI 211
V E + +SA D+ +CS+C+M V+WVQNQLKQK TKE+V +Y+N+LC+SLP+P GES+
Sbjct: 61 MVTENKQSEMSATDTPLCSSCQMLVIWVQNQLKQKATKERVFNYVNQLCESLPSPSGESV 120
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
I C+ + MPN+SFTIGDK F L+PEQY+L+TGEGI EVC+S F+AFD+PPP+GPLWILG
Sbjct: 121 ISCNDLSRMPNISFTIGDKPFVLTPEQYVLRTGEGITEVCLSAFIAFDIPPPKGPLWILG 180
Query: 272 DVFMGVYHTVFDSGKLRIGFAEAA 295
DVFM YHTVFD G L++GFAEAA
Sbjct: 181 DVFMRAYHTVFDYGNLQVGFAEAA 204
>gi|145352062|ref|XP_001420378.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580612|gb|ABO98671.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 454
Score = 276 bits (707), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 135/303 (44%), Positives = 188/303 (62%), Gaps = 10/303 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPD-----AEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 55
M+EQGLV ++FSFWLNR +E GGE++FGG DP HF G+HTY PVT++GYWQ
Sbjct: 154 MMEQGLVENKMFSFWLNRTSTKDGMPSEVGGELIFGGSDPDHFIGEHTYAPVTREGYWQI 213
Query: 56 ELGDILIGNQSTGVCEG--GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 113
++ D + +S G C+G GC I D+GTSLLAGPT +V +IN IG ++ EC+L++
Sbjct: 214 KMDDFKVDGRSLGACDGDDGCQVIADTGTSLLAGPTEIVNKINDYIGAHSMIGEECRLLI 273
Query: 114 SQYGD-LIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSAC 172
QY + + DL E++C IG C +G E + + K + S C+AC
Sbjct: 274 DQYAEQFVEDL--ENYSSEQICASIGACDADGVEAMEADDDDDLGKSSSSFEGQIACTAC 331
Query: 173 EMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIF 232
+ V + Q+ L Q T++ +++ + +CD +P+ G + +DCD IP MP+V F IG F
Sbjct: 332 KTVVNYAQDMLAQNVTEKIIVNEVKRVCDMVPSVGGTASVDCDNIPNMPDVEFVIGGVPF 391
Query: 233 NLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
L+PEQY+LK + C+SGFM D+P P GPLWILGDVF+G YHT FD R+GFA
Sbjct: 392 KLTPEQYVLKVYQDGEAQCVSGFMGMDIPKPAGPLWILGDVFLGPYHTEFDYANRRVGFA 451
Query: 293 EAA 295
AA
Sbjct: 452 PAA 454
>gi|146454530|gb|ABQ41931.1| aspartic proteinase 1 [Sonneratia caseolaris]
Length = 203
Score = 276 bits (706), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 123/203 (60%), Positives = 164/203 (80%), Gaps = 3/203 (1%)
Query: 24 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 83
EGGE+VFGGVDP H+KG+HTYVPVT+KGYWQF++G++LIG+Q++G C GCAAI DSGTS
Sbjct: 1 EGGELVFGGVDPSHYKGEHTYVPVTQKGYWQFDMGEVLIGDQASGFCGSGCAAIADSGTS 60
Query: 84 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 143
LLAGPT ++T+INHAIG GVVS ECK VV+QYG I ++L+S PEK+C QIG C F+
Sbjct: 61 LLAGPTSIITQINHAIGASGVVSQECKAVVAQYGKTILEMLLSQSQPEKICSQIGFCTFD 120
Query: 144 GAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 200
G V GIK+VV+ + ++ S A CSACEMAVVW+QN+L+Q QT++++L+Y+NELC
Sbjct: 121 GTRGVDMGIKSVVDDDKSTSSGSVHDASCSACEMAVVWMQNKLRQNQTEDQILNYVNELC 180
Query: 201 DSLPNPMGESIIDCDRIPTMPNV 223
+ +P+PMGES+++C + TMP V
Sbjct: 181 ERIPSPMGESVVECSSLSTMPKV 203
>gi|146454528|gb|ABQ41930.1| aspartic proteinase 1 [Sonneratia alba]
Length = 203
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 123/203 (60%), Positives = 163/203 (80%), Gaps = 3/203 (1%)
Query: 24 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 83
EGGE+VFGGVDP H+KG+HTYVPVT+KGYWQF++G++LIG+Q++G C GCAAI DSGTS
Sbjct: 1 EGGELVFGGVDPSHYKGEHTYVPVTQKGYWQFDMGEVLIGDQASGFCGSGCAAIADSGTS 60
Query: 84 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 143
LLAGPT ++T+INHAIG GVVS ECK VV+QYG I ++L+S PEK+C QIG C F+
Sbjct: 61 LLAGPTSIITQINHAIGASGVVSQECKAVVAQYGKTILEMLLSQSQPEKICSQIGFCTFD 120
Query: 144 GAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 200
G V GIK+VV+ ++ S A CSACEMAVVW+QN+L+Q QT++++L+Y+NELC
Sbjct: 121 GTRGVDMGIKSVVDDNKSTSSGSVRDASCSACEMAVVWMQNKLRQNQTEDQILNYVNELC 180
Query: 201 DSLPNPMGESIIDCDRIPTMPNV 223
+ +P+PMGES+++C + TMP V
Sbjct: 181 ERIPSPMGESVVECSSLSTMPKV 203
>gi|62319754|dbj|BAD93734.1| putative aspartic proteinase [Arabidopsis thaliana]
Length = 205
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 158/204 (77%), Gaps = 3/204 (1%)
Query: 95 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKT 154
INHAIG GVVS +CK VV QYG I DLL+S P+K+C QIGLC F+G VS GI++
Sbjct: 2 INHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGIES 61
Query: 155 VVEKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESI 211
VV+KEN + A CSACEMAVVW+Q+QL+Q T+E++L+Y+NELC+ LP+PMGES
Sbjct: 62 VVDKENAKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPSPMGESA 121
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
+DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG CISGF+A D+ PPRGPLWILG
Sbjct: 122 VDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRGPLWILG 181
Query: 272 DVFMGVYHTVFDSGKLRIGFAEAA 295
DVFMG YHTVFD G ++GFAEAA
Sbjct: 182 DVFMGKYHTVFDFGNEQVGFAEAA 205
>gi|146454534|gb|ABQ41933.1| aspartic proteinase 1 [Sonneratia apetala]
Length = 203
Score = 274 bits (700), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 122/203 (60%), Positives = 163/203 (80%), Gaps = 3/203 (1%)
Query: 24 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 83
EGGE+VFGGVDP H+KG+HTYVPVT+KGYWQF++G++LIG++++G C GCAAI DSGTS
Sbjct: 1 EGGELVFGGVDPSHYKGEHTYVPVTQKGYWQFDMGEVLIGDEASGFCGSGCAAIADSGTS 60
Query: 84 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 143
LLAGPT ++T+INHAIG GVVS ECK VV+QYG I ++L+S PEK+C QIG C F+
Sbjct: 61 LLAGPTSIITQINHAIGASGVVSQECKAVVAQYGKTILEMLLSQSQPEKICSQIGFCTFD 120
Query: 144 GAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 200
G V GIK+VV+ ++ S A CSACEMAVVW+QN+L+Q QT++++L+Y+NELC
Sbjct: 121 GTRGVDMGIKSVVDDNKSTSSGSVRDASCSACEMAVVWMQNKLRQNQTEDQILNYVNELC 180
Query: 201 DSLPNPMGESIIDCDRIPTMPNV 223
+ +P+PMGES+++C + TMP V
Sbjct: 181 ERIPSPMGESVVECSSLSTMPKV 203
>gi|146454532|gb|ABQ41932.1| aspartic proteinase 1 [Sonneratia ovata]
Length = 203
Score = 273 bits (697), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 122/203 (60%), Positives = 162/203 (79%), Gaps = 3/203 (1%)
Query: 24 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 83
EGGE+VFGGVDP H+K +HTYVPVT+KGYWQF++G++LIG+Q++G C GCAAI DSGTS
Sbjct: 1 EGGELVFGGVDPSHYKEEHTYVPVTQKGYWQFDMGEVLIGDQASGFCGSGCAAIADSGTS 60
Query: 84 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 143
LLAGPT ++T+INHAIG GVVS ECK VV+QYG I ++L+S PEK+C QIG C F+
Sbjct: 61 LLAGPTSIITQINHAIGASGVVSQECKAVVAQYGKTILEMLLSQSQPEKICSQIGFCTFD 120
Query: 144 GAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 200
G V GIK+VV+ ++ S A CSACEMAVVW+QN+L+Q QT++++L+Y+NELC
Sbjct: 121 GTRGVDMGIKSVVDDNKSTSSGSVRDASCSACEMAVVWMQNKLRQNQTEDQILNYVNELC 180
Query: 201 DSLPNPMGESIIDCDRIPTMPNV 223
+ +P+PMGES+++C + TMP V
Sbjct: 181 ERIPSPMGESVVECSSLSTMPKV 203
>gi|2160151|gb|AAB60773.1| Strong similarity to Brassica aspartic protease (gb|X77260)
[Arabidopsis thaliana]
Length = 433
Score = 264 bits (674), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 125/215 (58%), Positives = 160/215 (74%), Gaps = 16/215 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHF+G+HT+VPVT++GYWQF++G++
Sbjct: 216 MLKQGLIKRPVFSFWLNRDPKSEEGGEIVFGGVDPKHFRGEHTFVPVTQRGYWQFDMGEV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPT-------------PVVTEINHAIGGEGVVSA 107
LI +STG C GC+AI DSGTSLLAGPT VV IN AIG GVVS
Sbjct: 276 LIAGESTGYCGSGCSAIADSGTSLLAGPTVSKYHEFIVLFQLAVVAMINKAIGASGVVSQ 335
Query: 108 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD-- 165
+CK VV QYG I DLL++ P+K+C QIGLCA++G VS GI++VV+KEN +
Sbjct: 336 QCKTVVDQYGQTILDLLLAETQPKKICSQIGLCAYDGTHGVSMGIESVVDKENTRSSSGL 395
Query: 166 -SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINEL 199
A C ACEMAVVW+Q+QL+Q T+E++++YINE+
Sbjct: 396 RDAGCPACEMAVVWIQSQLRQNMTQERIVNYINEV 430
>gi|144228219|gb|ABO93618.1| aspartic proteinase [Vitis vinifera]
Length = 194
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/194 (63%), Positives = 151/194 (77%), Gaps = 3/194 (1%)
Query: 95 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKT 154
INHAIG GVVS ECK VV+QYG+ I DLL+S P+K+C QIGLC F+G V GI++
Sbjct: 1 INHAIGATGVVSQECKTVVAQYGETIMDLLLSEASPQKICSQIGLCTFDGTRGVGMGIES 60
Query: 155 VVEKEN---VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESI 211
VV+++N S A CSACEMAVVW+Q+QL+Q QTKE++L Y+NELCD LP+PMGES
Sbjct: 61 VVDEKNGDKSSGVHDAGCSACEMAVVWMQSQLRQNQTKERILEYVNELCDRLPSPMGESA 120
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
+DC ++ +MPNVS TI K+F+LS +Y+LK GEG A CISGF+A D+PPPRGPLWILG
Sbjct: 121 VDCLQLSSMPNVSLTISGKVFDLSANEYVLKVGEGAAAQCISGFIAMDVPPPRGPLWILG 180
Query: 272 DVFMGVYHTVFDSG 285
DVFMG YHTVFD G
Sbjct: 181 DVFMGRYHTVFDYG 194
>gi|218188712|gb|EEC71139.1| hypothetical protein OsI_02961 [Oryza sativa Indica Group]
Length = 540
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 122/242 (50%), Positives = 170/242 (70%), Gaps = 3/242 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LV+++VFSFWLNR+ + GGEIVFGG D H+KG HTY VT+K YWQFE+GD
Sbjct: 281 MIQQSLVTDKVFSFWLNRNANDINGGEIVFGGADESHYKGDHTYTRVTRKAYWQFEMGDF 340
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG +STG+C GCA I DSGTSL+AGP + +I+ IG GV + ECK VV+++G +
Sbjct: 341 LIGGRSTGICVDGCAVIADSGTSLIAGPIAAIAQIHAHIGATGVANEECKQVVARHGHEM 400
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD--SAVCSACEMAVVW 178
+LL P +VC +IGLC +GA +S GI++V+ + + SA + A C+ACEMAV W
Sbjct: 401 LELLQDKTPPAQVCSKIGLCKSDGAHGISDGIESVLGETHKSADEVSDATCNACEMAVTW 460
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+Q++ Q TKE L Y N+LC ++P+P+G S +DC I +PNV+F+IG + F L+PEQ
Sbjct: 461 MQSEFVQNHTKEGKLEYANQLCGNMPSPVG-SYVDCRHIGHLPNVAFSIGGRAFELTPEQ 519
Query: 239 YI 240
+
Sbjct: 520 VL 521
>gi|115438741|ref|NP_001043650.1| Os01g0631900 [Oryza sativa Japonica Group]
gi|55297073|dbj|BAD68642.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113533181|dbj|BAF05564.1| Os01g0631900 [Oryza sativa Japonica Group]
Length = 522
Score = 260 bits (665), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 122/240 (50%), Positives = 169/240 (70%), Gaps = 3/240 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LV+++VFSFWLNR+ + GGEIVFGG D H+KG HTY VT+K YWQFE+GD
Sbjct: 266 MIQQSLVTDKVFSFWLNRNANDINGGEIVFGGADESHYKGDHTYTRVTRKAYWQFEMGDF 325
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG +STG+C GCA I DSGTSL+AGP + +I+ IG GV + ECK VV+++G +
Sbjct: 326 LIGGRSTGICVDGCAVIADSGTSLIAGPIAAIAQIHAHIGATGVANEECKQVVARHGHEM 385
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD--SAVCSACEMAVVW 178
+LL P +VC +IGLC +GA +S GI++V+ + + SA + A C+ACEMAV W
Sbjct: 386 LELLQDKTPPAQVCSKIGLCKSDGAHGISDGIESVLGETHKSADEVSDATCNACEMAVTW 445
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+Q++ Q TKE L Y N+LC ++P+P+G S +DC I +PNV+F+IG + F L+PEQ
Sbjct: 446 MQSEFVQNHTKEGKLEYANQLCGNMPSPVG-SYVDCRHIGHLPNVAFSIGGRAFELTPEQ 504
>gi|255085919|ref|XP_002508926.1| predicted protein [Micromonas sp. RCC299]
gi|226524204|gb|ACO70184.1| predicted protein [Micromonas sp. RCC299]
Length = 557
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 196/333 (58%), Gaps = 41/333 (12%)
Query: 2 VEQGLVSEEVFSFWLNRDPDAEEGGE---IVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
V+QGLV E VFSFWLNRD D GE IV GGVDP HF G+HT++ VT++GYWQ +
Sbjct: 227 VDQGLVVENVFSFWLNRDADEGGDGEGGEIVLGGVDPAHFVGEHTWLNVTREGYWQIAMD 286
Query: 59 DILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 117
D+L+G S G C + GCAAIVD+GTSLLAGPT VV +N IG + V+ EC++++ QYG
Sbjct: 287 DVLLGGVSVGQCGKKGCAAIVDTGTSLLAGPTKVVEALNKRIGAKSVLGEECRVMIDQYG 346
Query: 118 D-LIWDLLVSGLLPEKVCQQIGLCAFNGAEYVST-------------------------- 150
D LI DL + +C +GLC + ST
Sbjct: 347 DELIRDL--AEFSATDICTSVGLCGPSSETKTSTSRRRGERRRARLGSSWLEWARGWARV 404
Query: 151 GIKTVVEKENVSAGD------SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLP 204
G VV + + D +AVC AC AV + ++ L Q T+ +L +CD +P
Sbjct: 405 GRDAVVLGSDAAPIDADGLEGAAVCQACVYAVDYAKSLLTQNATESIILDEFKSVCDLIP 464
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK--TGEGIAEVCISGFMAFDLPP 262
+ GE+ +DCD + MP+V F +G + F L+P+QY+LK G+G CISGFM D+PP
Sbjct: 465 SSGGEAAVDCDAVSKMPDVEFVLGGRPFKLTPDQYVLKVDAGQGGPAQCISGFMGLDIPP 524
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
P GPLWILGDVF+G YH+VFD R+G A+AA
Sbjct: 525 PAGPLWILGDVFIGPYHSVFDYDNARVGLADAA 557
>gi|307103455|gb|EFN51715.1| hypothetical protein CHLNCDRAFT_59800 [Chlorella variabilis]
Length = 523
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 197/327 (60%), Gaps = 34/327 (10%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L+ E VFSFWLNR + EEGGE+V GGVDP HF G+HT+VPVT++G+WQF++ +
Sbjct: 199 MLQQSLLPEPVFSFWLNRKVEGEEGGELVLGGVDPDHFVGEHTWVPVTRRGFWQFKMDGM 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+ C+GGC AI D+GTSLL GP V+ IN AIG E V+ +CK +V QY I
Sbjct: 259 EVEGGGE-FCKGGCQAIADTGTSLLVGPPDVIDAINAAIGAEPVLVEQCKEMVHQYLPEI 317
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNG-----------AEY---VSTGIKTVVEKENVSAGD- 165
L++ + P+ VCQ +GLC+ G A+Y + + +++ ++AG
Sbjct: 318 IK-LINNMPPQAVCQSVGLCSAAGVGEDRRVLSKSAQYRRLLKMYGQQQGQEQPLAAGTG 376
Query: 166 ----------------SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLP-NPMG 208
+ C C+ V +++ L +T +++ ++ C++ G
Sbjct: 377 EGEEEAQAGGVGGAAANDSCEMCQFVVQYLKIALANNETMAQIMHNLDRACETFSFGSGG 436
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
ES++DC + MP+++FT+G K F L PEQY+LK G E C+SGFM D+PPP GPLW
Sbjct: 437 ESVVDCKALHKMPSIAFTVGGKEFVLGPEQYVLKIGSMGEEQCVSGFMGLDIPPPLGPLW 496
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGD+F+G YHTVFD G R+GFA+AA
Sbjct: 497 ILGDMFIGPYHTVFDYGNERVGFAQAA 523
>gi|23237804|dbj|BAC16371.1| aspartic proteinase 5 [Glycine max]
Length = 175
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 116/176 (65%), Positives = 147/176 (83%), Gaps = 3/176 (1%)
Query: 122 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVWV 179
DLLVSG+ P+ VC Q+GLC F + S GI+ V EKE +S D+A+C++C+M VVW+
Sbjct: 1 DLLVSGVRPDDVCSQVGLC-FKRTKSESNGIEMVTEKEQRELSTKDTALCTSCQMLVVWI 59
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
QNQLKQK+TKE V +Y+N+LC+SLP+P GES++DC+ I +PN++FT+GDK F L+PEQY
Sbjct: 60 QNQLKQKKTKEIVFNYVNQLCESLPSPNGESVVDCNSIYGLPNITFTVGDKPFTLTPEQY 119
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILKTGEGIAEVC+SGF+AFD+PPPRGPLWILGDVFM VYHTVFD G LR+GFA+AA
Sbjct: 120 ILKTGEGIAEVCLSGFIAFDIPPPRGPLWILGDVFMRVYHTVFDYGNLRVGFAKAA 175
>gi|320165710|gb|EFW42609.1| lysosomal aspartic protease [Capsaspora owczarzaki ATCC 30864]
Length = 462
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 126/295 (42%), Positives = 181/295 (61%), Gaps = 28/295 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDA--EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
MV QGL++ F+FWL+R+ + + GGE+V GGVDP H+ G TY+PV+K GYWQF L
Sbjct: 190 MVAQGLLNSASFAFWLSRNGTSILKPGGELVLGGVDPSHYTGAFTYIPVSKPGYWQFALD 249
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD 118
+ +G+ + G G I DSGTSLLAGP V +IN IG G+++ EC +++ QY
Sbjct: 250 SVQVGSTTFGANTQG---IADSGTSLLAGPVADVKKINAQIGAIGILAEECDMIIEQYEP 306
Query: 119 LIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVW 178
+I + LV L P +C++IG C N S C C++ +
Sbjct: 307 IIVEGLVQRLDPVTICKEIGSCKANA---------------------STSCYTCKLLITA 345
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+ +L +T+ + + + C+ LP+P GES++DC ++ TMP +SF +G K F L+P+Q
Sbjct: 346 LDAELGNNRTQAAIEAALEGQCNRLPSPDGESLVDCTKLDTMPTISFVLGGKSFPLTPKQ 405
Query: 239 YILK-TGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
Y+L+ T EG +E CISGF+ D+PPP GPL+ILGDVFMGVY+T FD R+GFA
Sbjct: 406 YVLEVTSEGQSE-CISGFIGLDVPPPLGPLYILGDVFMGVYYTHFDMANKRVGFA 459
>gi|412987808|emb|CCO19204.1| cathepsin D (lysosomal aspartyl protease) [Bathycoccus prasinos]
Length = 628
Score = 244 bits (624), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/349 (39%), Positives = 195/349 (55%), Gaps = 56/349 (16%)
Query: 2 VEQGLVSEEVFSFWLNRDPDAE---EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
V Q V ++ FSFWLNRD D + +GGE+VFGGVD KHF G+H +V +TKKGYWQF+L
Sbjct: 281 VAQNQVEKDQFSFWLNRDQDGDGVVDGGELVFGGVDEKHFVGEHVWVDLTKKGYWQFDLD 340
Query: 59 DILIG--------NQSTGVCEGGCA---AIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 107
D+ +G N T V AI D+GTSLLAGP+ V+ +IN AIG E ++
Sbjct: 341 DVKVGEFSFIDDKNDKTTVSFSSSTKHQAIADTGTSLLAGPSAVIDKINDAIGAENLMIQ 400
Query: 108 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLC-------AFNGAEYVSTGIKTVVEKEN 160
ECK+ + +YG+ D + + ++C+ + +C A TG+ K
Sbjct: 401 ECKIAIKRYGEEFLDDIET-YDSSQICESLNICPAAAETNAIEKEISEPTGVLATSRKLL 459
Query: 161 VSAGDSA--------------------------------VCSACEMAVVWVQNQLKQKQT 188
++ + CSACEMAV + + L+ T
Sbjct: 460 MTTREEKKHRGLRGGLSLLGDLFKPSKKNEEKETKKSKVACSACEMAVDYAKELLQANVT 519
Query: 189 KEKVLSYINELCDSLP-NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGI 247
+ VL+ + ++CD +P P G++ +DC+ I MPN+SFTI K F L+P+QY+L+ +G
Sbjct: 520 RTVVLNELEKVCDFVPAQPGGQAGVDCNAIVEMPNISFTIAGKSFELTPKQYVLEIDDGQ 579
Query: 248 -AEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ CISGFM D+P P GPLWILGDVF+G YHTVFD G R+GFA+AA
Sbjct: 580 GSNTCISGFMGLDVPKPMGPLWILGDVFLGPYHTVFDHGGSRVGFAKAA 628
>gi|440803835|gb|ELR24718.1| aspartic proteinase, partial [Acanthamoeba castellanii str. Neff]
Length = 489
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 121/299 (40%), Positives = 169/299 (56%), Gaps = 35/299 (11%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V+E VF+FWLNRDP GGE+V GGVD H+ G TY P+TK+GYWQF D
Sbjct: 220 LLAQSQVAEPVFAFWLNRDPSGISGGELVLGGVDESHYTGDFTYTPITKEGYWQFLAHDF 279
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
LI +S G C GGC AI D+GTSLLAGP+ +V +IN I G++ +EC ++V+QY
Sbjct: 280 LINGKSMGFCPAGGCKAIADTGTSLLAGPSKIVAQINKMINATGILESECDMLVNQYAGQ 339
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
I ++ GL P++VC + LC C C++ V +
Sbjct: 340 IIQYILQGLQPDQVCSAVNLCP------------------------GGSCQLCKVLVSTI 375
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI----GDKIFNLS 235
L +++++++ + +C GE+ +DC +P++P I G K F L
Sbjct: 376 DAILGTDPSQQEIVALLKYIC------TGEATVDCKTLPSLPTFDVVIPTANGPKTFTLK 429
Query: 236 PEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PE YILK G E CISGF+ D+P P GPLWI+GDVF+G Y+T FD G ++GFA A
Sbjct: 430 PEDYILKQSMGPEETCISGFIGLDIPAPYGPLWIMGDVFLGPYYTKFDFGNKQLGFAVA 488
>gi|23237802|dbj|BAC16370.1| aspartic proteinase 4 [Glycine max]
Length = 169
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 110/169 (65%), Positives = 138/169 (81%), Gaps = 3/169 (1%)
Query: 130 PEKVCQQIGLCAFNGAEYVSTGIKTVVEK-ENVSAGD--SAVCSACEMAVVWVQNQLKQK 186
P+K+C QIGLC F+G VS GI++VV+K E S+G A CSACEMAV+W+QNQL+Q
Sbjct: 1 PKKICSQIGLCTFDGTHGVSMGIESVVDKNERKSSGSIRDAGCSACEMAVIWMQNQLRQN 60
Query: 187 QTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEG 246
QT+++++ Y NELCD LPNPMG+S +DC+++ +MP VSFTIG K+F+LSP++YILK GEG
Sbjct: 61 QTEDRIIDYANELCDKLPNPMGQSSVDCEKLSSMPIVSFTIGGKVFDLSPQEYILKVGEG 120
Query: 247 IAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
CISGF A D+PPPRGPLWILGDVFMG YHT+FD GKLR+GFAEAA
Sbjct: 121 PEAQCISGFTALDVPPPRGPLWILGDVFMGRYHTIFDYGKLRVGFAEAA 169
>gi|255639243|gb|ACU19920.1| unknown [Glycine max]
Length = 177
Score = 230 bits (587), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 102/166 (61%), Positives = 135/166 (81%), Gaps = 2/166 (1%)
Query: 131 EKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQT 188
+ +C Q+GLC+ E S GI+ V EKE ++A D+ +CS+C+M V+W+QNQLKQK T
Sbjct: 11 DDICSQVGLCSSKRHESKSAGIEMVTEKEQGELTARDNPLCSSCQMLVLWIQNQLKQKAT 70
Query: 189 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 248
K++V +Y+N+LC+SLP+P GES+I C+ + MPN++FTIG+K F L+PEQYILKTGEGI
Sbjct: 71 KDRVFNYVNQLCESLPSPSGESVISCNSLSKMPNITFTIGNKPFVLTPEQYILKTGEGIT 130
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
EVC+SGF+AFD+PPP+GPLWILGDVFM YHTVFD G L++GFAEA
Sbjct: 131 EVCLSGFIAFDVPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEA 176
>gi|14193251|gb|AAK55849.1|AF266465_1 aspartic protease [Manihot esculenta]
Length = 159
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 102/159 (64%), Positives = 124/159 (77%), Gaps = 3/159 (1%)
Query: 140 CAFNGAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYI 196
C F+G+ VS I++VV + + S A+CS CEMAV+W+QNQLKQ T E++L+Y
Sbjct: 1 CTFDGSRGVSMTIESVVNENSQEVAGSLHDAMCSTCEMAVIWMQNQLKQNATLERILNYA 60
Query: 197 NELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM 256
NELC+ LP+PMGES +DC + TMPNVSFTIG K+F+LSPEQY+LK GEG A CISGF
Sbjct: 61 NELCERLPSPMGESAVDCGSLSTMPNVSFTIGGKVFDLSPEQYVLKVGEGEAAQCISGFT 120
Query: 257 AFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
A D+PPPRGPLWILGDVFMG +HTVFD G LR+GFAEAA
Sbjct: 121 ALDVPPPRGPLWILGDVFMGRFHTVFDYGNLRVGFAEAA 159
>gi|357167304|ref|XP_003581098.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like
[Brachypodium distachyon]
Length = 225
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 101/199 (50%), Positives = 140/199 (70%), Gaps = 9/199 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L+++++F+FWLNR+ DA GGE+VF D H+KG HTYVPV ++G WQF +GD+
Sbjct: 23 MQEQKLLADDIFTFWLNREADASSGGELVF--XDSNHYKGNHTYVPVRRRGXWQFNMGDL 80
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LI +QSTG C GCA IV SGTSLLAGP + ++NHAIG E +++ ECK VSQYG++
Sbjct: 81 LIDDQSTGFCAKGCADIVYSGTSLLAGPICIFAQVNHAIGAERIINTECKEEVSQYGEMT 140
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
LL+ P+KVC F+G GI++VV K+NV G +C+ACEMA+VW++
Sbjct: 141 LHLLLVQTKPQKVCS-----XFDGTLSDYNGIESVVGKKNV--GSVVICTACEMAIVWIE 193
Query: 181 NQLKQKQTKEKVLSYINEL 199
NQL+ +TKE +L Y+N++
Sbjct: 194 NQLRXNKTKELILQYVNQV 212
>gi|255644659|gb|ACU22832.1| unknown [Glycine max]
Length = 144
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 90/138 (65%), Positives = 120/138 (86%)
Query: 157 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 216
E+E ++A D+ +CS+C+M V+W+QNQLKQK TK++V +Y+N+LC+SLP+P GES+I C+
Sbjct: 6 EQEELAARDTPLCSSCQMLVLWIQNQLKQKATKDRVFNYVNQLCESLPSPSGESVISCNS 65
Query: 217 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 276
+ MPN++FTIG+K F L+PEQYIL+TGEGI EVC+SGF+AFD+PPP+GPLWILGDVFM
Sbjct: 66 LSKMPNITFTIGNKPFVLTPEQYILRTGEGITEVCLSGFIAFDVPPPKGPLWILGDVFMR 125
Query: 277 VYHTVFDSGKLRIGFAEA 294
YHTVFD G L++GFAEA
Sbjct: 126 AYHTVFDYGNLQVGFAEA 143
>gi|12231180|dbj|BAB20973.1| aspartic proteinase 5 [Nepenthes alata]
Length = 358
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 98/128 (76%), Positives = 110/128 (85%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLVSE+VFSFW NRDP A+ GGEIVFGG+D KHF G+H YVP+T+KGYWQFE+G+
Sbjct: 209 MVEQGLVSEKVFSFWFNRDPKAKIGGEIVFGGIDEKHFVGEHIYVPITRKGYWQFEMGNF 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIGN STG C GGC AIVDSGTSLLAGP VVTE+NHAIG EG+ S ECK VV QYGD+I
Sbjct: 269 LIGNYSTGFCRGGCDAIVDSGTSLLAGPMHVVTEVNHAIGAEGIASMECKEVVYQYGDMI 328
Query: 121 WDLLVSGL 128
WDLLVSG+
Sbjct: 329 WDLLVSGV 336
>gi|62319547|dbj|BAD94980.1| putative aspartic proteinase [Arabidopsis thaliana]
Length = 149
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 93/148 (62%), Positives = 119/148 (80%), Gaps = 3/148 (2%)
Query: 151 GIKTVVEKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 207
GI++VV+KEN + A CSACEMAVVW+Q+QL+Q T+E++L+Y+NELC+ LP+PM
Sbjct: 2 GIESVVDKENAKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPSPM 61
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GES +DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG CISGF+A D+ PPRGPL
Sbjct: 62 GESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRGPL 121
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 122 WILGDVFMGKYHTVFDFGNEQVGFAEAA 149
>gi|47027053|gb|AAT08741.1| aspartic proteinase [Hyacinthus orientalis]
Length = 141
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 90/139 (64%), Positives = 109/139 (78%)
Query: 157 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 216
E ++ S A CS+CEMAV+W+QNQL Q +T+E +L+Y N+LC+ LPNPMGES IDC +
Sbjct: 3 EHKSTSGQIDARCSSCEMAVIWLQNQLLQNKTEEHILNYANQLCEKLPNPMGESAIDCKK 62
Query: 217 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 276
+ MP VSFTIG K F+L+ EQY+LK GEG A CISGF A D+PPPRGPLWILGDVFM
Sbjct: 63 LARMPTVSFTIGGKTFDLTAEQYVLKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMS 122
Query: 277 VYHTVFDSGKLRIGFAEAA 295
YHT FD GKLR+GFA+AA
Sbjct: 123 AYHTEFDYGKLRVGFAKAA 141
>gi|149391945|gb|ABR25873.1| retrotransposon protein, sine subclass, expressed [Oryza sativa
Indica Group]
Length = 140
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 91/142 (64%), Positives = 111/142 (78%), Gaps = 2/142 (1%)
Query: 154 TVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 213
+VV+KE V G C+ACEMAVV +QNQL+ QT+E +L Y ++LC+ LP+P GES +D
Sbjct: 1 SVVDKEKV--GSDLSCTACEMAVVGIQNQLRHNQTRELILQYADQLCERLPSPNGESAVD 58
Query: 214 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 273
CD I MPN+SFTI +K F L+PEQY++K + VCISGFMAFD+PPPRGPLWILGDV
Sbjct: 59 CDEISNMPNLSFTIANKTFTLTPEQYVVKLEQQGQTVCISGFMAFDVPPPRGPLWILGDV 118
Query: 274 FMGVYHTVFDSGKLRIGFAEAA 295
FMG YHTVFD GK RIGFAE+A
Sbjct: 119 FMGAYHTVFDFGKNRIGFAESA 140
>gi|303285091|ref|XP_003061836.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457166|gb|EEH54466.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 647
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 159/305 (52%), Gaps = 68/305 (22%)
Query: 4 QGLVSEEVFSFWLNRD---PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
QGLV ++VFSFWLNRD PD GGE+V GGVDP H+ G+H ++PVT++GYWQ + D+
Sbjct: 238 QGLVPDDVFSFWLNRDESHPDGP-GGELVLGGVDPAHYVGEHAWLPVTREGYWQVRMDDV 296
Query: 61 LIGNQSTGVCE--GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD 118
++ S G C+ GCAAI+D+GTSLLAGP V+ +IN IG +++ EC++++ QYG+
Sbjct: 297 IVDGASAGECDETDGCAAILDTGTSLLAGPKDVIEKINAKIGARPILNEECRVMIEQYGE 356
Query: 119 LIWDLLVSGLLPEKVCQQIGLCA-----------------------FNGAEYVSTGIKTV 155
+ D V P+ +C GLC + ++ + V
Sbjct: 357 ELID-DVKKFGPKAICVSAGLCHEKTERQPPQRPASSSPFDILGRLAKKSRARASVTRRV 415
Query: 156 VEKEN-----------VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLP 204
+E +A A C ACEMAV + Q+ +K T+ +L+ + LCD +P
Sbjct: 416 LEGRRGRLWADAAADADAASQPASCRACEMAVAYAQSLIKTNVTRALILNELKSLCDHIP 475
Query: 205 NPMGESI---------------------------IDCDRIPTMPNVSFTIGDKIFNLSPE 237
+ GE++ +DCD + MP+VSF +G K + L+P
Sbjct: 476 SKGGEAVRRLPVRPSFVRHVSLTDTRAPDSSSKGVDCDAVDAMPDVSFVLGGKAWTLTPR 535
Query: 238 QYILK 242
QY+L+
Sbjct: 536 QYVLR 540
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 34/47 (72%), Positives = 38/47 (80%)
Query: 248 AEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
AE C+SGFM D+PPP GPLWILGDVF+G YHTVFD G R+G AEA
Sbjct: 600 AEQCVSGFMGLDVPPPAGPLWILGDVFIGPYHTVFDHGNARVGIAEA 646
>gi|33347413|gb|AAQ15289.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 81/116 (69%), Positives = 97/116 (83%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGL+ E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 81 MVNQGLLKEPVFSFWFNRNADEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDV 140
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 116
+I Q+TG C GC+AI DSGTSLL GPT ++TE+NHAIG G+VS ECK VV++Y
Sbjct: 141 MIDGQTTGFCADGCSAIADSGTSLLVGPTTIITELNHAIGASGIVSQECKTVVAEY 196
>gi|24417300|gb|AAN60260.1| unknown [Arabidopsis thaliana]
Length = 168
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 83/113 (73%), Positives = 96/113 (84%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+
Sbjct: 56 MLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDV 115
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 113
LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK VV
Sbjct: 116 LIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVV 168
>gi|33347411|gb|AAQ15288.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 80/116 (68%), Positives = 96/116 (82%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGL+ E VFS W NR+ D EEGGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+
Sbjct: 81 MVNQGLLKEPVFSLWFNRNADEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDV 140
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 116
+I Q+TG C GC+AI DSGTSLL GPT ++TE+NHAIG G+VS ECK VV++Y
Sbjct: 141 MIDGQTTGFCADGCSAIADSGTSLLVGPTTIITELNHAIGASGIVSQECKTVVAEY 196
>gi|62319708|dbj|BAD95255.1| putative aspartic protease [Arabidopsis thaliana]
Length = 128
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 74/128 (57%), Positives = 99/128 (77%)
Query: 168 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 227
+CSACEMA VW++++L Q QT+E++L+Y ELCD +P +S +DC R+ +MP V+F+I
Sbjct: 1 MCSACEMAAVWMESELTQNQTQERILAYAAELCDHIPTQNQQSAVDCGRVSSMPIVTFSI 60
Query: 228 GDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL 287
G + F+L+P+ YI K GEG+ C SGF A D+ PPRGPLWILGD+FMG YHTVFD GK
Sbjct: 61 GGRSFDLTPQDYIFKIGEGVESQCTSGFTAMDIAPPRGPLWILGDIFMGPYHTVFDYGKG 120
Query: 288 RIGFAEAA 295
R+GFA+AA
Sbjct: 121 RVGFAKAA 128
>gi|351722839|ref|NP_001235978.1| uncharacterized protein LOC100500551 [Glycine max]
gi|255630611|gb|ACU15665.1| unknown [Glycine max]
Length = 122
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 75/122 (61%), Positives = 95/122 (77%)
Query: 174 MAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFN 233
M V+W+Q QLKQ KEKV Y++ELC+ LPNP G+S I+C I TMP+++FTIG+K F
Sbjct: 1 MIVLWIQVQLKQSNVKEKVFKYVDELCEKLPNPPGQSFINCKTIATMPHITFTIGNKSFP 60
Query: 234 LSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 293
LSPEQY+L+ EG + VC F+A D+PPP+GPLW+LG +F+G YHTVFD G LRIGFAE
Sbjct: 61 LSPEQYVLRVEEGCSTVCYGSFVAIDVPPPQGPLWVLGSIFLGAYHTVFDYGNLRIGFAE 120
Query: 294 AA 295
AA
Sbjct: 121 AA 122
>gi|117662285|gb|ABK55693.1| aspartic proteinase [Cucumis sativus]
Length = 196
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 75/105 (71%), Positives = 89/105 (84%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV E VFSFW NR+ D E+GGEIVFGGVDP H+KG+HTYVPVTKKGYWQF++GD+
Sbjct: 92 MVDQNLVKEPVFSFWFNRNADEEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDV 151
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 105
LI +TG C GGC+AI DSGTSLLAGPT ++T++NHAIG GVV
Sbjct: 152 LINGSTTGFCSGGCSAIADSGTSLLAGPTTIITQVNHAIGASGVV 196
>gi|15076933|gb|AAK82987.1| aspartic protease [Oryza sativa Japonica Group]
Length = 118
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 70/117 (59%), Positives = 90/117 (76%)
Query: 179 VQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
+QN L Q +T++ +L+YI +LCD P+PM +S +DC + +MP +SFTI K F L PE+
Sbjct: 2 IQNPLAQTKTQDLILNYIIQLCDKFPSPMEDSSVDCASLASMPEISFTIEAKKFALKPEE 61
Query: 239 YILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
YILK G+ A CISGF A D+PPPRGPLWILGDVFMG YHTVFD GK+R+GFA++A
Sbjct: 62 YILKVGKEAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVFDYGKMRVGFAKSA 118
>gi|4389326|pdb|1B5F|A Chain A, Native Cardosin A From Cynara Cardunculus L.
gi|6729875|pdb|1B5F|C Chain C, Native Cardosin A From Cynara Cardunculus L
Length = 239
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 69/100 (69%), Positives = 82/100 (82%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV E FSFWLNR+ D EEGGE+VFGG+DP HF+G HTYVPVT + YWQF +GD+
Sbjct: 138 MLNQGLVKERRFSFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDV 197
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG
Sbjct: 198 LIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIG 237
>gi|33352213|emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii]
Length = 578
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 74/149 (49%), Positives = 100/149 (67%), Gaps = 3/149 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VE+G ++ VFSFWLNRDP+A GGE+V GG+DP HF G+HT+VPVT++GYWQF + +
Sbjct: 203 LVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHFTGEHTWVPVTRQGYWQFTMEGL 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+G S +C GCAAI D+GTSL+AGP+ V +NHAIG +SA+C+ +V Y I
Sbjct: 263 DLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIGATSALSAQCRQLVRDYLPQI 322
Query: 121 WDLLVSGLLP-EKVCQQIGLCAFNGAEYV 148
L LP ++VC IGLC A +
Sbjct: 323 IAQLHD--LPLDQVCASIGLCPMAAASTI 349
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 69/135 (51%), Positives = 94/135 (69%), Gaps = 4/135 (2%)
Query: 162 SAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
SAGDS VCS C+ AV +++ L+ T E++ + +LCD + + G S++DCD+I T+P
Sbjct: 447 SAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAVGQLCDQV-SFGGPSVVDCDKISTLP 505
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEV-CISGFMAFDLPPPRGPLWILGDVFMGVYHT 280
+SF IG ++F L PEQY+L+ G E+ CISGFM D+P GPLWILGD+F+G YHT
Sbjct: 506 VISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGFMGLDVPA--GPLWILGDIFLGAYHT 563
Query: 281 VFDSGKLRIGFAEAA 295
VFD G R+GFA AA
Sbjct: 564 VFDYGAARLGFANAA 578
>gi|147743000|sp|P85137.1|CARDF_CYNCA RecName: Full=Cardosin-F; Contains: RecName: Full=Cardosin-F heavy
chain; Contains: RecName: Full=Cardosin-F light chain
Length = 281
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 70/111 (63%), Positives = 83/111 (74%), Gaps = 2/111 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV FSFWLNR+ D EEGGE+VFGG+DP HF+G HTYVPVT + YWQF +GD+
Sbjct: 114 MLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDV 171
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG G K
Sbjct: 172 LIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIGANGSEELNVKF 222
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/64 (67%), Positives = 45/64 (70%), Gaps = 4/64 (6%)
Query: 232 FNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGF 291
F L+PEQYILK G A CISGF A D GPLWILGDVFM YHTVFD G L +GF
Sbjct: 222 FGLTPEQYILK---GEATQCISGFTAMD-ATLLGPLWILGDVFMRPYHTVFDYGNLLVGF 277
Query: 292 AEAA 295
AEAA
Sbjct: 278 AEAA 281
>gi|147743007|sp|P85138.1|CARDG_CYNCA RecName: Full=Cardosin-G; Contains: RecName: Full=Cardosin-G heavy
chain; Contains: RecName: Full=Cardosin-G light chain
Length = 266
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 69/100 (69%), Positives = 81/100 (81%), Gaps = 2/100 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV FSFWLNR+ D EEGGE+VFGG+DP HF+G HTYVPVT + YWQF +GD+
Sbjct: 100 MVNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDV 157
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG
Sbjct: 158 LIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIG 197
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/64 (67%), Positives = 45/64 (70%), Gaps = 4/64 (6%)
Query: 232 FNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGF 291
F L+PEQYILK G A CISGF A D GPLWILGDVFM YHTVFD G L +GF
Sbjct: 207 FGLTPEQYILK---GEATQCISGFTAMD-ATLLGPLWILGDVFMRPYHTVFDYGNLLVGF 262
Query: 292 AEAA 295
AEAA
Sbjct: 263 AEAA 266
>gi|302840660|ref|XP_002951885.1| hypothetical protein VOLCADRAFT_81669 [Volvox carteri f.
nagariensis]
gi|300262786|gb|EFJ46990.1| hypothetical protein VOLCADRAFT_81669 [Volvox carteri f.
nagariensis]
Length = 559
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 75/140 (53%), Positives = 95/140 (67%), Gaps = 3/140 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+ GL+SE VFSFWLNRD A GGE+V GGVDP HF G+HT+V VT++GYWQF L I
Sbjct: 199 LVDSGLLSEPVFSFWLNRDSSAAVGGELVLGGVDPAHFTGEHTWVDVTRRGYWQFNLDGI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+G+Q +C GC AI D+GTSL+AGP V INHAIG +SA+C+ +V +Y I
Sbjct: 259 HLGSQR--LCTQGCPAIADTGTSLIAGPVDEVAAINHAIGATSALSAQCRTLVREYLPEI 316
Query: 121 WDLLVSGLLPEKVCQQIGLC 140
L L ++VC IGLC
Sbjct: 317 VAAL-HNLPLDQVCASIGLC 335
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 85/135 (62%), Gaps = 4/135 (2%)
Query: 162 SAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
+ GDS CS C+ AV +++ L+ T E++ + LCD + + G S++DC ++ +P
Sbjct: 428 TTGDSVACSFCQTAVQYIRIALESNATIEQIADAVGNLCDQV-SFGGPSVVDCTKLSKLP 486
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAE-VCISGFMAFDLPPPRGPLWILGDVFMGVYHT 280
+ +G + F L PEQY+L+ G E C+SGFM D+P GPLWILGD+F+G YHT
Sbjct: 487 ILELEVGGRTFPLRPEQYVLRVDAGGGEEQCVSGFMGLDVP--VGPLWILGDIFLGAYHT 544
Query: 281 VFDSGKLRIGFAEAA 295
VFD G R+GFA AA
Sbjct: 545 VFDYGGSRLGFAVAA 559
>gi|147743015|sp|P85139.1|CARDH_CYNCA RecName: Full=Cardosin-H; Contains: RecName: Full=Cardosin-H heavy
chain; Contains: RecName: Full=Cardosin-H light chain
Length = 265
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 68/100 (68%), Positives = 81/100 (81%), Gaps = 2/100 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV FSFWLNR+ D EEGGE+VFGG+DP HF+G HTYVPVT + YWQF +GD+
Sbjct: 100 MLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDV 157
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG
Sbjct: 158 LIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIG 197
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/66 (65%), Positives = 46/66 (69%), Gaps = 4/66 (6%)
Query: 230 KIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 289
+F L+PEQYILK G A CISGF A D GPLWILGDVFM YHTVFD G L +
Sbjct: 204 NVFGLTPEQYILK---GEATQCISGFTAMD-ATLLGPLWILGDVFMRPYHTVFDYGNLLV 259
Query: 290 GFAEAA 295
GFAEAA
Sbjct: 260 GFAEAA 265
>gi|47197223|emb|CAF89153.1| unnamed protein product [Tetraodon nigroviridis]
Length = 268
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/294 (33%), Positives = 143/294 (48%), Gaps = 44/294 (14%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+P + GGE++ GG DP+++ G +YV VT++ YWQ + ++
Sbjct: 19 IMSQKKVEKNVFSFYLNRNPQTQPGGELLLGGTDPQYYTGDFSYVNVTRQAYWQIHVDEL 78
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+G+Q T +C+ GC AIVD+GTSLL GP+ V + AIG ++ E + D I
Sbjct: 79 SVGSQLT-LCKSGCEAIVDTGTSLLTGPSEEVRSLQKAIGALPLIQGEYMVSC----DKI 133
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
L V + IG G Y TG + V++ A C C V
Sbjct: 134 PTLPV-------ITFNIG-----GKPYSLTGDQYVLKVRATPASRGRRCGCCGDNRATVA 181
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
+ + L C P C R+ L P Q +
Sbjct: 182 SL----KGNTGALFGFRACCSPSSCPA-----LCLRL---------------RLPPIQQV 217
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ G+ I C+SGFM D+P P GPLWILGDVF+G Y+ +FD R+GFA+A
Sbjct: 218 SQAGKTI---CLSGFMGLDIPAPAGPLWILGDVFIGQYYNLFDRDNNRVGFAKA 268
>gi|414888160|tpg|DAA64174.1| TPA: hypothetical protein ZEAMMB73_634609 [Zea mays]
Length = 117
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 64/99 (64%), Positives = 80/99 (80%)
Query: 197 NELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM 256
N+LC+ LP+P GES +DC +I MPN++FTI +K F L+PEQYI+K + +CISGFM
Sbjct: 19 NQLCERLPSPNGESTVDCHQISKMPNLAFTIANKTFTLTPEQYIVKLEQAGQTICISGFM 78
Query: 257 AFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
AFD+PPPRGPLWILGDVFM VYHTVFD G+ IGFA++A
Sbjct: 79 AFDVPPPRGPLWILGDVFMSVYHTVFDFGENMIGFAKSA 117
>gi|147742993|sp|P85136.1|CARDE_CYNCA RecName: Full=Cardosin-E; Contains: RecName: Full=Cardosin-E heavy
chain; Contains: RecName: Full=Cardosin-E light chain
Length = 224
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 63/89 (70%), Positives = 75/89 (84%)
Query: 12 FSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCE 71
FSFWLNR+ D EEGGE+VFGG+DP HF+G HTYVPVT + YWQF +GD+LIG++STG C
Sbjct: 67 FSFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCA 126
Query: 72 GGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
GC A DSGTSLL+GPT +VT+INHAIG
Sbjct: 127 PGCQAFADSGTSLLSGPTAIVTQINHAIG 155
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 43/64 (67%), Positives = 45/64 (70%), Gaps = 4/64 (6%)
Query: 232 FNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGF 291
F L+PEQYILK G A CISGF A D GPLWILGDVFM YHTVFD G L +GF
Sbjct: 165 FGLTPEQYILK---GEATQCISGFTAMD-ATLLGPLWILGDVFMRPYHTVFDYGNLLVGF 220
Query: 292 AEAA 295
AEAA
Sbjct: 221 AEAA 224
>gi|163256765|emb|CAO02646.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256777|emb|CAO02652.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256799|emb|CAO02663.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256801|emb|CAO02664.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256809|emb|CAO02668.1| aspartic protease precursor [Medicago truncatula var. truncatula]
Length = 83
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/83 (84%), Positives = 74/83 (89%)
Query: 45 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 104
VPVT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV
Sbjct: 1 VPVTEKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGV 60
Query: 105 VSAECKLVVSQYGDLIWDLLVSG 127
+S ECK VVSQYG+LIWDLLVSG
Sbjct: 61 LSVECKEVVSQYGELIWDLLVSG 83
>gi|308809631|ref|XP_003082125.1| putative vacuaolar aspartic proteinase (ISS) [Ostreococcus tauri]
gi|116060592|emb|CAL55928.1| putative vacuaolar aspartic proteinase (ISS) [Ostreococcus tauri]
Length = 505
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 92/149 (61%), Gaps = 9/149 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAE-----EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 55
M+EQ L+ + +FSFWLNR + + GGE+VFGG DPKHF G+HTY PVT+ GYWQ
Sbjct: 184 MMEQKLIDKNMFSFWLNRTSNVDGTPSVTGGELVFGGSDPKHFVGEHTYAPVTRAGYWQI 243
Query: 56 ELGDILIGNQSTGVCEG--GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 113
++ D + +S GVC+G GC I D+GTSLL GP VV +IN IG ++ EC++++
Sbjct: 244 KMDDFKVAGRSLGVCKGENGCQVIADTGTSLLTGPADVVKKINDYIGAHSMLGEECRMLI 303
Query: 114 SQYGD--LIWDLLVSGLLPEKVCQQIGLC 140
QY D + E++C IG C
Sbjct: 304 DQYADEXXXXXXXLETYTSEQICTSIGAC 332
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%)
Query: 169 CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIG 228
C AC V + QN L + T + S + +CD +P+ G + +DC+ IP MPNV F IG
Sbjct: 434 CKACTTVVNYAQNLLSENATSRVIASEVKRVCDMIPSYGGTAAVDCEDIPHMPNVEFVIG 493
>gi|872030|emb|CAA61253.1| aspartic protease [Brassica oleracea]
Length = 91
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 63/91 (69%), Positives = 76/91 (83%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
+PMGES +DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG A CISGF+A D+ PPR
Sbjct: 1 SPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGAAAQCISGFIALDVAPPR 60
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
GPLWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 61 GPLWILGDVFMGKYHTVFDFGKEQVGFAEAA 91
>gi|163256769|emb|CAO02648.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256773|emb|CAO02650.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256775|emb|CAO02651.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256779|emb|CAO02653.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256781|emb|CAO02654.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256785|emb|CAO02656.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256787|emb|CAO02657.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256789|emb|CAO02658.1| aspartic protease precursor [Medicago truncatula var. truncatula]
Length = 82
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 69/82 (84%), Positives = 73/82 (89%)
Query: 46 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 105
PVT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+
Sbjct: 1 PVTEKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVL 60
Query: 106 SAECKLVVSQYGDLIWDLLVSG 127
S ECK VVSQYG+LIWDLLVSG
Sbjct: 61 SVECKEVVSQYGELIWDLLVSG 82
>gi|340708074|pdb|3RFI|A Chain A, Crystal Structure Of The Saposin-Like Domain Of Plant
Aspartic Protease From Solanum Tuberosum
Length = 108
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 64/104 (61%), Positives = 86/104 (82%), Gaps = 2/104 (1%)
Query: 103 GVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN-- 160
+VS ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +GA++VS+ IKTVVE+E
Sbjct: 5 AIVSMECKTIVSQYGEMIWDLLVSGVRPDQVCSQAGLCFVDGAQHVSSNIKTVVERETEG 64
Query: 161 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLP 204
S G++ +C+ACEMAVVW+QNQLKQ+ TKEKVL Y+N+LC+ +P
Sbjct: 65 SSVGEAPLCTACEMAVVWMQNQLKQEGTKEKVLEYVNQLCEKIP 108
>gi|241865152|gb|ACS68654.1| aspartic proteinase [Sonneratia alba]
gi|241865384|gb|ACS68724.1| aspartic proteinase [Sonneratia alba]
Length = 114
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 59/112 (52%), Positives = 90/112 (80%), Gaps = 3/112 (2%)
Query: 130 PEKVCQQIGLCAFNGAEYVSTGIKTVV-EKENVSAG--DSAVCSACEMAVVWVQNQLKQK 186
P+K+C Q+G C F+G + GI++VV E + S+G A+C+ACEMAVVW+QNQ+++
Sbjct: 3 PQKICSQVGFCTFDGTHGIRMGIESVVDENSDRSSGGLRDAMCTACEMAVVWMQNQIRKN 62
Query: 187 QTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
QT+E+++ Y+N+LCD LP+PMGES++DCD + ++P+VSFTIG K+F+LSPE+
Sbjct: 63 QTEEQIIDYVNQLCDRLPSPMGESVVDCDNLSSLPSVSFTIGGKVFDLSPEE 114
>gi|163256803|emb|CAO02665.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256815|emb|CAO02671.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256823|emb|CAO02675.1| aspartic protease precursor [Medicago truncatula var. truncatula]
Length = 82
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 68/82 (82%), Positives = 72/82 (87%)
Query: 46 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 105
PVT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGPT VV EINHAIG EGV+
Sbjct: 1 PVTEKGYWQVEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTAVVAEINHAIGAEGVL 60
Query: 106 SAECKLVVSQYGDLIWDLLVSG 127
S ECK VVSQYG+LIWDLLVSG
Sbjct: 61 SVECKEVVSQYGELIWDLLVSG 82
>gi|297737371|emb|CBI26572.3| unnamed protein product [Vitis vinifera]
Length = 121
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 61/97 (62%), Positives = 76/97 (78%)
Query: 47 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 106
+T+KGYWQFE+G+ LIG QST CE GCAAI+DS TSL+AGPT +VTEINHAIG EG+VS
Sbjct: 22 ITQKGYWQFEMGEFLIGYQSTSFCEAGCAAIIDSRTSLIAGPTAIVTEINHAIGAEGIVS 81
Query: 107 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 143
ECK V+SQYG++IWD L+S + P+ VC C +
Sbjct: 82 QECKEVISQYGNMIWDHLISMVQPDAVCSLHARCYIH 118
>gi|163256791|emb|CAO02659.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256793|emb|CAO02660.1| aspartic protease precursor [Medicago truncatula var.
longiaculeata]
gi|163256797|emb|CAO02662.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256805|emb|CAO02666.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256807|emb|CAO02667.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256811|emb|CAO02669.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256813|emb|CAO02670.1| aspartic protease precursor [Medicago tornata]
gi|163256817|emb|CAO02672.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256821|emb|CAO02674.1| aspartic protease precursor [Medicago truncatula var. truncatula]
gi|163256825|emb|CAO02676.1| aspartic protease precursor [Medicago truncatula var. truncatula]
Length = 82
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 68/82 (82%), Positives = 72/82 (87%)
Query: 46 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 105
PVT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGPT VV EINHAIG EGV+
Sbjct: 1 PVTEKGYWQVEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTVVVAEINHAIGAEGVL 60
Query: 106 SAECKLVVSQYGDLIWDLLVSG 127
S ECK VVSQYG+LIWDLLVSG
Sbjct: 61 SVECKEVVSQYGELIWDLLVSG 82
>gi|111185852|emb|CAL10023.1| cyprosin B precursor [Cynara cardunculus]
Length = 87
Score = 137 bits (344), Expect = 8e-30, Method: Composition-based stats.
Identities = 61/87 (70%), Positives = 71/87 (81%)
Query: 199 LCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAF 258
LCD LP+PMGES +DC+ + +MPN++FTIG K+F+L PEQYILK GEG A CISGF A
Sbjct: 1 LCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFDLCPEQYILKIGEGEAAQCISGFTAM 60
Query: 259 DLPPPRGPLWILGDVFMGVYHTVFDSG 285
D+ PPRGPL ILGDVFMG YHTVFD G
Sbjct: 61 DVAPPRGPLRILGDVFMGRYHTVFDYG 87
>gi|5103293|dbj|BAA78908.1| aspartic proteinase [Oryza sativa]
Length = 111
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 61/92 (66%), Positives = 74/92 (80%)
Query: 204 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
P+P GES + C +I MPN++FTI ++ F L+PEQYI+K +G VCISGFMAFD+PPP
Sbjct: 1 PSPNGESTVSCHQISKMPNLAFTIANRTFILTPEQYIVKLEQGGQTVCISGFMAFDIPPP 60
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
RGPLWILGDVFMG YHTVFD GK RIG A++A
Sbjct: 61 RGPLWILGDVFMGAYHTVFDFGKDRIGVAKSA 92
>gi|163256795|emb|CAO02661.1| aspartic protease precursor [Medicago truncatula var. truncatula]
Length = 82
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/82 (81%), Positives = 72/82 (87%)
Query: 47 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 106
VT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGPT VV EINHAIG EGV+S
Sbjct: 1 VTEKGYWQVEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTVVVAEINHAIGAEGVLS 60
Query: 107 AECKLVVSQYGDLIWDLLVSGL 128
ECK VVSQYG+LIWDLLVSG+
Sbjct: 61 VECKEVVSQYGELIWDLLVSGV 82
>gi|356506538|ref|XP_003522037.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like [Glycine
max]
Length = 134
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 63/112 (56%), Positives = 84/112 (75%), Gaps = 3/112 (2%)
Query: 130 PEKVCQQIGLCAFNGAEYVSTGIKTVV-EKENVSAGDS--AVCSACEMAVVWVQNQLKQK 186
P+K+C +IGLC F+G V GIK+VV E E S+GD A C ACEMA+VW+QNQL +
Sbjct: 22 PKKICSRIGLCXFDGTRGVDVGIKSVVDENERKSSGDHHGAACPACEMAIVWMQNQLSRN 81
Query: 187 QTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 238
QT++++LSYIN+LCD +P+PMGES +DC I ++P VSFTIG + F L PE+
Sbjct: 82 QTQDQILSYINQLCDKMPSPMGESAVDCANISSLPVVSFTIGGRTFELIPEE 133
>gi|350411706|ref|XP_003489428.1| PREDICTED: lysosomal aspartic protease-like [Bombus impatiens]
Length = 386
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 59/109 (54%), Positives = 77/109 (70%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + VFSF+LNR+PD + GGE++ GG DP H++G TYVPV +KGYWQF + I
Sbjct: 193 MVKQGLVPQPVFSFYLNRNPDDKAGGELILGGSDPNHYEGPFTYVPVDRKGYWQFRMDGI 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 109
+G+Q +CE GC AI D+GTSL+AGP V IN AIG + + E
Sbjct: 253 KVGSQHLAICEKGCEAIADTGTSLIAGPVKEVEAINSAIGATNIAAGEA 301
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 44/87 (50%), Positives = 63/87 (72%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE+++DC+ IP +P ++F +G + F L+ + Y+LK + VC+SGFM D+P P GPL
Sbjct: 299 GEAMVDCNSIPNLPTINFVLGGRSFPLTGKDYVLKVTQFGKTVCLSGFMGMDIPEPNGPL 358
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+T FD G R+GFA+A
Sbjct: 359 WILGDVFIGRYYTEFDMGNNRVGFAKA 385
>gi|336454164|gb|AEI58896.1| cathepsin D [Pinctada maxima]
Length = 390
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 82/108 (75%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + +FSF+L+RDP A EGGE++ GG D KH+KG TY+PVT++GYWQFE+ +
Sbjct: 197 MVKQGLVPQPIFSFYLDRDPSASEGGELILGGSDTKHYKGNFTYLPVTRQGYWQFEMDGV 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G S C GGC AI D+GTSL+AGPT ++++N AIG + +V+ E
Sbjct: 257 SVGG-SAKFCSGGCNAIADTGTSLIAGPTSEISKLNKAIGAKPLVAGE 303
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 80/138 (57%), Gaps = 3/138 (2%)
Query: 157 EKENVSAGDSA-VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
E + VS G SA CS A+ L T E +S +N+ + P GE +DC+
Sbjct: 252 EMDGVSVGGSAKFCSGGCNAIADTGTSLIAGPTSE--ISKLNKAIGAKPLVAGEYTVDCN 309
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P ++FT+G K F+L + Y+L + C+SGF D+PPP GPLWILGDVF+
Sbjct: 310 AIPKLPKITFTLGGKQFDLEGKDYVLTVTQQGQTTCLSGFAPIDVPPPAGPLWILGDVFI 369
Query: 276 GVYHTVFDSGKLRIGFAE 293
G ++T FD G ++GFA+
Sbjct: 370 GKFYTEFDMGNTQVGFAQ 387
>gi|218944225|gb|ACL13150.1| cathepsin D [Azumapecten farreri]
Length = 396
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 57/108 (52%), Positives = 75/108 (69%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV VFSF+L+RDPDA GGE++ GG DPKH+ G +Y P+TKKGYWQF++ I
Sbjct: 202 MVQQKLVDSPVFSFYLDRDPDASAGGELIIGGSDPKHYSGNFSYAPITKKGYWQFDMAGI 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G +++ C GGC+AI D+GTSLL GPT V ++N IG E
Sbjct: 262 QVGGKASAYCNGGCSAIADTGTSLLVGPTAEVQQLNKQIGATPFAGGE 309
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 73/129 (56%), Gaps = 2/129 (1%)
Query: 166 SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSF 225
SA C+ A+ L T E + +N+ + P GE +DCD+I ++P +SF
Sbjct: 268 SAYCNGGCSAIADTGTSLLVGPTAE--VQQLNKQIGATPFAGGEYTVDCDKISSLPPISF 325
Query: 226 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
I ++F L YILK + +C+SGF D+P P GPLWILGDVF+G +++ FD G
Sbjct: 326 MIDKQLFTLQGSDYILKVTQQGQTICLSGFAGIDVPAPLGPLWILGDVFLGKFYSEFDLG 385
Query: 286 KLRIGFAEA 294
++GFA+
Sbjct: 386 NNKVGFAQT 394
>gi|329754204|gb|AEC03508.1| cathepsin-D [Polyrhachis vicina]
Length = 384
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 61/108 (56%), Positives = 79/108 (73%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ VFSF+LNRDPDA+EGGE++ GG DP H+KG TYVPV +K YWQF++ +
Sbjct: 192 MVKQGLVSQPVFSFYLNRDPDAKEGGELILGGSDPNHYKGDFTYVPVDRKAYWQFKMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
IG+ +C+ GC AI D+GTSL+AGP + IN AIG +V E
Sbjct: 252 QIGSD-LKLCKQGCEAIADTGTSLIAGPVKEIEAINKAIGATPIVGGE 298
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 68/104 (65%)
Query: 191 KVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 250
K + IN+ + P GE ++DC+ IP +P ++F +G K F L E Y+LK + +
Sbjct: 280 KEIEAINKAIGATPIVGGEYMVDCNSIPNLPTINFVLGGKSFTLEGEDYVLKVAQFGKTI 339
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+SGFM D+PPP GPLWILGDVF+G Y+T FD G R+GFA A
Sbjct: 340 CLSGFMGMDIPPPNGPLWILGDVFIGKYYTEFDMGNNRVGFATA 383
>gi|449533814|ref|XP_004173866.1| PREDICTED: aspartic proteinase-like, partial [Cucumis sativus]
Length = 290
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 57/74 (77%), Positives = 65/74 (87%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGLV E VFSFWLNR+ + EEGGEIVFGGVDPKH+ GKHTYVPVT+KGYWQF++GD+
Sbjct: 217 MVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYTGKHTYVPVTQKGYWQFDMGDV 276
Query: 61 LIGNQSTGVCEGGC 74
LI + TG CEGGC
Sbjct: 277 LIDGKPTGYCEGGC 290
>gi|443723962|gb|ELU12180.1| hypothetical protein CAPTEDRAFT_225009 [Capitella teleta]
Length = 364
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 58/108 (53%), Positives = 78/108 (72%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q VS VFSF+LNRDP A EGGE++ GG DPK+++G TY+PV+KKGYWQF++ +
Sbjct: 153 MVKQKSVSSPVFSFFLNRDPSASEGGELILGGSDPKYYEGNFTYLPVSKKGYWQFKMDGM 212
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ ST C+GGC AI D+GTSLLAGP+ V ++N +GG + E
Sbjct: 213 KLAGSSTSYCDGGCQAIADTGTSLLAGPSAEVQKLNQELGGTAIPGGE 260
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 62/88 (70%)
Query: 206 PMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG 265
P GE IIDC++IP +PN++F + K F L+ + YIL + +CISGF+ D+P P G
Sbjct: 257 PGGEYIIDCNKIPQLPNITFMLAGKPFTLTGKDYILAVKQLGKTICISGFIGLDVPAPLG 316
Query: 266 PLWILGDVFMGVYHTVFDSGKLRIGFAE 293
PLWILGDVF+G ++T FD G R+GFA+
Sbjct: 317 PLWILGDVFIGRFYTEFDFGNNRVGFAK 344
>gi|225717994|gb|ACO14843.1| Lysosomal aspartic protease precursor [Caligus clemensi]
Length = 386
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 58/108 (53%), Positives = 81/108 (75%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +QGL+ E VFSF+LNRDP+A+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ +
Sbjct: 195 MFKQGLIQEPVFSFYLNRDPNAQVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDGM 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ +++ C+ GC AI D+GTSL+AGP+ V +N +GG +V+ E
Sbjct: 255 KVSSKT--FCQNGCQAIADTGTSLIAGPSVEVNALNQLLGGMPIVNGE 300
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 69/99 (69%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+L +P GE + +C +PT+P ++FTIG F L+ E Y++K + VC+SGF
Sbjct: 287 LNQLLGGMPIVNGEYMFNCADVPTLPAITFTIGGTDFVLTGEDYVMKITQFGKTVCLSGF 346
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+P P GP+WILGDVF+G Y+T+FD GK R+GFA++
Sbjct: 347 MGLDVPAPMGPIWILGDVFIGRYYTIFDMGKDRVGFAQS 385
>gi|290561455|gb|ADD38128.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 57/108 (52%), Positives = 81/108 (75%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +QGL+ E +FSF+LNR+PDA+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ +
Sbjct: 193 MFKQGLIQEPIFSFYLNRNPDAKVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDKM 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ ++S C+ GC AI D+GTSL+AGP+ V +N +GG +++ E
Sbjct: 253 EVNSKS--FCQNGCQAIADTGTSLIAGPSIEVNALNQLLGGTPIINGE 298
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 69/99 (69%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+L P GE + +C+ IP +P ++FTIG + F LS E Y+++ + VC+SGF
Sbjct: 285 LNQLLGGTPIINGEYMFNCEDIPNLPPITFTIGGEEFVLSGEDYVMQITQFGKTVCLSGF 344
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+P P GP+WILGDVF+G Y+TVFD GK R+GFA++
Sbjct: 345 MGLDVPEPMGPIWILGDVFIGRYYTVFDMGKDRVGFAQS 383
>gi|225713714|gb|ACO12703.1| Lysosomal aspartic protease precursor [Lepeophtheirus salmonis]
gi|290462953|gb|ADD24524.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 57/108 (52%), Positives = 81/108 (75%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +QGL+ E +FSF+LNR+PDA+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ +
Sbjct: 193 MFKQGLIQEPIFSFYLNRNPDAKVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDKM 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ ++S C+ GC AI D+GTSL+AGP+ V +N +GG +++ E
Sbjct: 253 EVNSKS--FCQNGCQAIADTGTSLIAGPSIEVNALNQLLGGTPIINGE 298
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 69/99 (69%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+L P GE + +C+ IP +P ++FTIG + F LS E Y+++ + VC+SGF
Sbjct: 285 LNQLLGGTPIINGEYMFNCEDIPNLPPITFTIGGEEFVLSGEDYVMQITQFGKTVCLSGF 344
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+P P GP+WILGDVF+G Y+TVFD GK R+GFA++
Sbjct: 345 MGLDVPEPMGPIWILGDVFIGRYYTVFDMGKDRVGFAQS 383
>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
Length = 394
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 78/108 (72%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S+ VFSF+LNRDP A EGGEI+FGG DP H+KG+ TY+PVT+K YWQ ++
Sbjct: 203 MYEQGLISQPVFSFYLNRDPKAPEGGEIIFGGSDPNHYKGEFTYLPVTRKAYWQIKMDSA 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN + +C+GGC I D+GTSL+A P T IN AIGG ++ +
Sbjct: 263 SMGNLN--LCQGGCQVIADTGTSLIALPPSEATSINKAIGGTPIMGGQ 308
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 64/99 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ P G+ ++ C+ IP +P + F +G K F L + YIL+ + +C+SGF
Sbjct: 295 INKAIGGTPIMGGQYMVACENIPKLPVIRFVLGGKTFELEGKDYILRIAQMGKTICLSGF 354
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GP+WILGDVF+G Y+T FD G R+GFAEA
Sbjct: 355 MGIDIPPPNGPIWILGDVFIGKYYTEFDMGNDRVGFAEA 393
>gi|195380081|ref|XP_002048799.1| GJ21122 [Drosophila virilis]
gi|194143596|gb|EDW59992.1| GJ21122 [Drosophila virilis]
Length = 391
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S+ VFSF+LNRDP A EGGEI+FGG DP H+ G TY+PVT+KGYWQ ++
Sbjct: 200 MFEQGLISQPVFSFYLNRDPKAPEGGEIIFGGSDPNHYTGDFTYLPVTRKGYWQIKMDSA 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ N +C+GGC I D+GTSL+A P T IN AIGG +V +
Sbjct: 260 QLNNLE--LCKGGCQIIADTGTSLIAAPVAEATSINQAIGGTPIVGGQ 305
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ P G+ I+ CD IP +P + F +G K F L + YIL+ + +C+SGF
Sbjct: 292 INQAIGGTPIVGGQYIVSCDMIPNLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGF 351
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA+A
Sbjct: 352 MGMDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFADA 390
>gi|227018334|gb|ACP18833.1| aspartic proteinase 1 [Chrysomela tremula]
Length = 386
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 57/108 (52%), Positives = 80/108 (74%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQG+V E VFSF+LNRDP+AE GGEIVFGG DPK++ G TY+PVT+KGYWQ ++
Sbjct: 194 LIEQGVVKEPVFSFYLNRDPNAEVGGEIVFGGSDPKYYTGDFTYLPVTRKGYWQIKMDKA 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++ S +C+GGC AIVD+GTSL+ GP+ + +I A+G + + E
Sbjct: 254 VV--DSNTLCDGGCQAIVDTGTSLITGPSDEIEKIVKAVGATAITAGE 299
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 58/87 (66%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +DC+++ +MPN+ F +G K F L+P+ Y+L+ + C+ GFM D+ P GPL
Sbjct: 298 GEYTVDCNKLSSMPNIDFVLGGKTFTLTPKDYVLQVKQLFLTTCLLGFMGLDVAEPAGPL 357
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+T FD G R+G A A
Sbjct: 358 WILGDVFIGKYYTEFDLGNNRVGLAPA 384
>gi|163256819|emb|CAO02673.1| aspartic protease precursor [Medicago truncatula var. truncatula]
Length = 81
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 63/81 (77%), Positives = 69/81 (85%)
Query: 48 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 107
T+KGYWQ ++GD IG STG CEGGCAAIVDSGTSLLAGPT V EINHAIG EGV+S
Sbjct: 1 TEKGYWQVKMGDFFIGGLSTGDCEGGCAAIVDSGTSLLAGPTVFVAEINHAIGAEGVLSV 60
Query: 108 ECKLVVSQYGDLIWDLLVSGL 128
ECK VVSQYG+LIWDLLVSG+
Sbjct: 61 ECKEVVSQYGELIWDLLVSGV 81
>gi|380018765|ref|XP_003693293.1| PREDICTED: lysosomal aspartic protease-like [Apis florea]
Length = 385
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 60/111 (54%), Positives = 78/111 (70%), Gaps = 1/111 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + VFSF+LNR+PD + GGE++ GG DP H++G TYVPV KKGYWQF++ I
Sbjct: 193 MVKQGLVPQPVFSFYLNRNPDDKYGGELILGGSDPNHYEGSFTYVPVDKKGYWQFKMDSI 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
IG+ VC+ GC AI D+GTSL+AGP V IN AIG + + E +
Sbjct: 253 QIGSD-LKVCQQGCEAIADTGTSLIAGPVKEVGAINKAIGATPIAAGEAMI 302
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 69/99 (69%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ + P GE++IDC+ IP +P ++F +G K F+L E Y+LK + VC+SGF
Sbjct: 286 INKAIGATPIAAGEAMIDCNSIPNLPTINFVLGGKSFSLKGEDYVLKVTQFRKTVCLSGF 345
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA+A
Sbjct: 346 MGMDIPPPNGPLWILGDVFIGRYYTEFDMGNNRVGFAKA 384
>gi|383859202|ref|XP_003705085.1| PREDICTED: lysosomal aspartic protease-like [Megachile rotundata]
Length = 384
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 61/119 (51%), Positives = 80/119 (67%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ VFSF+LNRDP+AE GGE++ GG DP H+ G TYVPV KKGYWQF + +
Sbjct: 192 MVKQGLVSQPVFSFYLNRDPNAEFGGEMILGGSDPNHYVGPFTYVPVDKKGYWQFAMDRV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G+ VCE GC AI D+GTSL+AGP + +N IG + + E + + DL
Sbjct: 252 EVGSD-VKVCEKGCEAIADTGTSLIAGPVKEIELLNKKIGATPIAAGEAMVECDKIPDL 309
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 52/104 (50%), Positives = 70/104 (67%)
Query: 191 KVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 250
K + +N+ + P GE++++CD+IP +P ++F G + F L E Y+LK + V
Sbjct: 280 KEIELLNKKIGATPIAAGEAMVECDKIPDLPTITFVFGGRSFPLRGEDYVLKVTQLGKTV 339
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
CISGFM D+PPP GPLWILGDVF+G Y+T FD G RIGFAEA
Sbjct: 340 CISGFMGMDIPPPNGPLWILGDVFIGRYYTEFDMGNNRIGFAEA 383
>gi|195120065|ref|XP_002004549.1| GI19550 [Drosophila mojavensis]
gi|193909617|gb|EDW08484.1| GI19550 [Drosophila mojavensis]
Length = 387
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 58/108 (53%), Positives = 74/108 (68%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ L+++ VFSF+LNRDP A EGGEI+FGG DP H+ G TY+PVT+KGYWQ ++
Sbjct: 196 MFEQSLIAQPVFSFYLNRDPKAPEGGEIIFGGSDPNHYTGDFTYLPVTRKGYWQIKMDSA 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
I N +C+GGC I D+GTSL+A P T IN AIGG +V +
Sbjct: 256 QINNVE--LCKGGCQVIADTGTSLIAAPAAEATSINQAIGGTPIVGGQ 301
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 64/99 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ P G+ ++ CD IP +P + F +G K F L + YIL+ + +C+SGF
Sbjct: 288 INQAIGGTPIVGGQYVVSCDMIPNLPVIKFVLGGKTFELEGKDYILRIAQMGKTICLSGF 347
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA+A
Sbjct: 348 MGMDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFADA 386
>gi|66560290|ref|XP_392857.2| PREDICTED: lysosomal aspartic protease [Apis mellifera]
Length = 385
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 59/111 (53%), Positives = 77/111 (69%), Gaps = 1/111 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + VFSF+LNR+PD + GGE++ GG DP H++G TYVPV KKGYWQF + I
Sbjct: 193 MVKQGLVPQPVFSFYLNRNPDDKYGGELILGGSDPNHYEGSFTYVPVDKKGYWQFRMDSI 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
IG+ VC+ GC AI D+GTSL+AGP + IN AIG + + E +
Sbjct: 253 QIGSD-LKVCQQGCEAIADTGTSLIAGPVKEIEAINKAIGATPIAAGEAMI 302
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 51/104 (49%), Positives = 69/104 (66%)
Query: 191 KVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 250
K + IN+ + P GE++IDC+ IP +P ++F +G K F+L E Y+LK + V
Sbjct: 281 KEIEAINKAIGATPIAAGEAMIDCNSIPNLPTINFVLGGKSFSLKGEDYVLKVTQFGKTV 340
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+SGFM D+ PP GPLWILGDVF+G Y+T FD G R+GFA A
Sbjct: 341 CLSGFMGMDISPPNGPLWILGDVFIGRYYTEFDMGNNRVGFATA 384
>gi|291223847|ref|XP_002731917.1| PREDICTED: putative gut cathepsin D-like aspartic protease-like
[Saccoglossus kowalevskii]
Length = 389
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 56/108 (51%), Positives = 78/108 (72%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L+ + VFSF+L+R+ + +GGE+ GG DPK++ G TYVPVT+KGYWQF++ I
Sbjct: 196 MIQQQLIEKPVFSFYLDRNVNDSQGGELFLGGSDPKYYTGNFTYVPVTRKGYWQFKMDGI 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G ++ C+GGC AI D+GTSL+AGPT V IN AIG +VS E
Sbjct: 256 TLGGSASQFCKGGCQAIADTGTSLIAGPTEEVQAINKAIGATPIVSGE 303
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 67/99 (67%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ + P GE +++C++I ++P+++F + +K F L YI++ + +C+SGF
Sbjct: 290 INKAIGATPIVSGEYMVNCNKIDSLPDITFVLNNKPFILKGRDYIMQVSQSGVTLCLSGF 349
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GP+WILGDVF+G ++T FD G R+GFA A
Sbjct: 350 MGMDIPPPMGPIWILGDVFIGRFYTEFDRGNDRVGFATA 388
>gi|224548868|dbj|BAH24176.1| aspartic proteinase [Sitophilus zeamais]
Length = 389
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 79/108 (73%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LV+ +FSF+LNRDP+A+ GGEI+ GG DP +++G TY+PV ++ YWQF++ +
Sbjct: 196 MIDQNLVAAPIFSFYLNRDPNAQTGGEIILGGSDPNYYEGDFTYLPVDRQAYWQFKMDSV 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ +QS +C+GGC AI D+GTSL+AGPT + +N AIG +V E
Sbjct: 256 QVADQS--LCKGGCEAIADTGTSLIAGPTEEIAALNKAIGASAIVGGE 301
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 53/138 (38%), Positives = 80/138 (57%), Gaps = 2/138 (1%)
Query: 157 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 216
+ ++V D ++C A+ L T+E ++ +N+ + GE I+DC+
Sbjct: 251 KMDSVQVADQSLCKGGCEAIADTGTSLIAGPTEE--IAALNKAIGASAIVGGEYIVDCNS 308
Query: 217 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 276
I ++P ++ T+G +F L E Y+LK E CISGF+ D+P P GPLWILGDVF+G
Sbjct: 309 ISSLPKINITLGGNLFTLEGEDYVLKVSELGQVTCISGFLGLDVPAPAGPLWILGDVFIG 368
Query: 277 VYHTVFDSGKLRIGFAEA 294
Y+T FD+G R+GFA A
Sbjct: 369 KYYTEFDAGNNRVGFATA 386
>gi|195429864|ref|XP_002062977.1| GK21682 [Drosophila willistoni]
gi|194159062|gb|EDW73963.1| GK21682 [Drosophila willistoni]
Length = 389
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 76/116 (65%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S VFSF+LNRDP A EGGEI+FGG DP H+ G TY+PVT+K YWQ ++
Sbjct: 198 MYEQGLISSPVFSFYLNRDPSAPEGGEIIFGGSDPNHYTGDFTYLPVTRKAYWQIKMDSA 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+G+ VC+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 258 SVGDLQ--VCQGGCQVIADTGTSLIAAPLSEATSINQKIGGTPIIGGQYVVSCDLI 311
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ ++ S GD VC + L E + IN+ P G+ ++ CD
Sbjct: 252 IKMDSASVGDLQVCQGGCQVIADTGTSLIAAPLSEA--TSINQKIGGTPIIGGQYVVSCD 309
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P + F +G + F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 310 LIPNLPVIKFVLGGRTFELEGKDYILRVSQMGKSICLSGFMGMDIPPPNGPLWILGDVFI 369
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA A
Sbjct: 370 GKYYTEFDMGNDRVGFANA 388
>gi|170063951|ref|XP_001867326.1| lysosomal aspartic protease [Culex quinquefasciatus]
gi|167881401|gb|EDS44784.1| lysosomal aspartic protease [Culex quinquefasciatus]
Length = 387
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 60/116 (51%), Positives = 77/116 (66%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGL+ VFSF+LNRDP A EGGEI+FGG D + G TY+PV +K YWQF++ +
Sbjct: 196 MYNQGLIDSPVFSFYLNRDPSAAEGGEIIFGGSDSAKYTGDFTYLPVDRKAYWQFKMDSV 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+G+ T C GC AI D+GTSL+AGPT VT IN AIGG +++ E C L+
Sbjct: 256 KVGD--TEFCNNGCEAIADTGTSLIAGPTSEVTAINKAIGGTPIINGEYMVDCSLI 309
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 79/137 (57%), Gaps = 4/137 (2%)
Query: 159 ENVSAGDSAVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRI 217
++V GD+ C+ CE A+ L T E ++ IN+ P GE ++DC I
Sbjct: 253 DSVKVGDTEFCNNGCE-AIADTGTSLIAGPTSE--VTAINKAIGGTPIINGEYMVDCSLI 309
Query: 218 PTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGV 277
P +P + F +G K F L YIL+ + +C+SGFM D+PPP GPLWILGDVF+G
Sbjct: 310 PKLPKIKFVLGGKEFELEGADYILRIAQMGKTICLSGFMGIDIPPPNGPLWILGDVFIGK 369
Query: 278 YHTVFDSGKLRIGFAEA 294
Y+T FD G R+GFA A
Sbjct: 370 YYTEFDMGNDRVGFATA 386
>gi|17981530|gb|AAL51056.1|AF454831_1 cathepsin D [Apriona germari]
Length = 386
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 76/108 (70%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLVS+ VFSF+LNRDPDA EGGE+ GG DPK++KG TY+ V ++ YWQF++ I
Sbjct: 196 MITQGLVSQPVFSFYLNRDPDAAEGGELSLGGSDPKYYKGNFTYLSVDRQAYWQFKMDKI 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G T C+ GC AI D+GTSL+AGP VT IN IGG ++ E
Sbjct: 256 QLGK--TVFCKSGCQAIADTGTSLVAGPVDEVTSINKLIGGTPIIGGE 301
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 55/137 (40%), Positives = 78/137 (56%), Gaps = 3/137 (2%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+ + G + C + A+ L E ++ IN+L P GE ++DC IP
Sbjct: 253 DKIQLGKTVFCKSGCQAIADTGTSLVAGPVDE--VTSINKLIGGTPIIGGEYVVDC-LIP 309
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
+P + F +G K + L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+G +
Sbjct: 310 KLPEIDFILGGKTYTLEGKDYILRVSQAGKTICLSGFMGIDIPPPNGPLWILGDVFIGKF 369
Query: 279 HTVFDSGKLRIGFAEAA 295
+T FD G RIGFAEAA
Sbjct: 370 YTEFDLGNNRIGFAEAA 386
>gi|125807245|ref|XP_001360320.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|195149648|ref|XP_002015768.1| GL11239 [Drosophila persimilis]
gi|54635492|gb|EAL24895.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|194109615|gb|EDW31658.1| GL11239 [Drosophila persimilis]
Length = 388
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 77/116 (66%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S VFSF+LNRDP + EGGEI+FGG DPKH+ G TY+PVT+K YWQ ++
Sbjct: 197 MYEQGLISSPVFSFYLNRDPASPEGGEIIFGGSDPKHYTGDFTYLPVTRKAYWQIKMDSA 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+G+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 257 ALGDLE--LCKGGCQVIADTGTSLIAAPMTEATSINQKIGGTPIIGGQYIVSCDLI 310
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ ++ + GD +C + L E + IN+ P G+ I+ CD
Sbjct: 251 IKMDSAALGDLELCKGGCQVIADTGTSLIAAPMTEA--TSINQKIGGTPIIGGQYIVSCD 308
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 309 LIPKLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGIDIPPPNGPLWILGDVFI 368
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA+A
Sbjct: 369 GKYYTEFDMGNDRVGFADA 387
>gi|332376487|gb|AEE63383.1| unknown [Dendroctonus ponderosae]
Length = 388
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 82/116 (70%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG+VS+ VFSF+LNR+PD + GGE++FGG DP ++ G TY+PV ++ YWQF++ ++
Sbjct: 197 MVNQGIVSQAVFSFYLNRNPDGKVGGELIFGGSDPNYYSGNFTYLPVDRQAYWQFKMDEV 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
++G ++ C+GGC AI D+GTSL+AGP V +N AIG +V E C L+
Sbjct: 257 IVGQKT--FCKGGCEAIADTGTSLIAGPVDEVKALNEAIGATPLVGGEYAVDCSLI 310
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 58/97 (59%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+NE + P GE +DC IP +P + F +G F L + Y+L VC+SGF
Sbjct: 289 LNEAIGATPLVGGEYAVDCSLIPNLPAIKFILGGNTFVLEGKDYVLAESAMGKTVCLSGF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
D+PPP GPLWILGDVF+G Y+T FD+ R+GFA
Sbjct: 349 FGIDIPPPNGPLWILGDVFIGKYYTEFDAQNNRVGFA 385
>gi|60678793|gb|AAX33731.1| Blo t allergen [Blomia tropicalis]
Length = 402
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 77/108 (71%), Gaps = 5/108 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + VFSF+LNRD + + GGEI+FGG DP ++KG TY P+TK GYWQF++ I
Sbjct: 198 MVKQGLVQQPVFSFYLNRDTNGKVGGEIIFGGSDPAYYKGDFTYAPLTKIGYWQFQMHGI 257
Query: 61 LIGNQSTG-----VCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 103
L+ N+S VCE GC AI D+GTSL+AGP+ V +N A+G G
Sbjct: 258 LLENKSNNKTVGHVCESGCEAIADTGTSLIAGPSDQVEHLNRALGAIG 305
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 3/85 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G +++C I T+P++ F I F LSP+QY+++ E+CIS F++ P PL
Sbjct: 309 GIFVLNCSHINTLPSIIFQINGVKFPLSPDQYVMRQSAMGKEICISSFISL---PANIPL 365
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y+T FD G R+GFA
Sbjct: 366 WILGDVFIGNYYTEFDYGNKRVGFA 390
>gi|195027894|ref|XP_001986817.1| GH21578 [Drosophila grimshawi]
gi|193902817|gb|EDW01684.1| GH21578 [Drosophila grimshawi]
Length = 388
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 57/108 (52%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S+ VFSF+LNRDP A EGGEI+FGG DP H+ G TY+PVT+KGYWQ ++
Sbjct: 197 MYEQGLISDPVFSFYLNRDPKAPEGGEIIFGGSDPNHYTGDFTYLPVTRKGYWQIKMDSA 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + +C+GGC I D+GTSL+A P T IN AIGG ++ +
Sbjct: 257 QLNDIE--LCKGGCQVIADTGTSLIAAPQDEATSINQAIGGTPILGGQ 302
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 61/99 (61%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ P G+ ++ CD IP +P + F K F L + YIL+ + +C+SGF
Sbjct: 289 INQAIGGTPILGGQYVVSCDAIPNLPVIKFVFNGKTFELEGKDYILRVAQMGKTICLSGF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA A
Sbjct: 349 MGMDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFANA 387
>gi|336454162|gb|AEI58895.1| cathepsin D [Pteria penguin]
Length = 392
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 76/108 (70%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV VFSF+LNRDP A +GGE++ GG D K++KG TY+PVTK+GYW+F++ I
Sbjct: 197 MVKQNLVPASVFSFYLNRDPGASDGGELILGGSDSKYYKGNFTYLPVTKQGYWRFKMDGI 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++ +++ C GGC AI D+GTSLLAGP V +N IG + + E
Sbjct: 257 MMNGKASKYCSGGCKAIADTGTSLLAGPKTEVDALNKQIGATPLAAGE 304
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 62/97 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ + P GE ++DC + +P +SF +G + F+L + Y+L + +C+SGF
Sbjct: 291 LNKQIGATPLAAGEYMVDCSSVSKLPVISFMLGGQQFDLQGKDYVLTVTQQGQTICLSGF 350
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
D+PPP GPLWILGDVF+G ++T FD G ++GFA
Sbjct: 351 TGIDVPPPNGPLWILGDVFIGKFYTEFDLGNNQVGFA 387
>gi|432870116|ref|XP_004071815.1| PREDICTED: cathepsin D-like [Oryzias latipes]
Length = 397
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/140 (45%), Positives = 88/140 (62%), Gaps = 3/140 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
++ + V G+ +C A ++V L +E + +++ +LP MGE IDC
Sbjct: 259 IQMDRVDVGNQLTLCKAGCQSIVDTGTSLMVGPAEE--IRALHKAIGALPLLMGEYFIDC 316
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+IP++P +SF IG K FNL+ E YILK + A +C+SGFMA D+PPP GPLWILGDVF
Sbjct: 317 KKIPSLPVISFNIGGKTFNLTGEDYILKESQMGASICLSGFMAMDIPPPAGPLWILGDVF 376
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G Y+TVFD R+GFA A
Sbjct: 377 IGKYYTVFDRNADRVGFAAA 396
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 74/113 (65%), Gaps = 5/113 (4%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
L+ + +FS +++RD AE GGE++ GG+DP++F G YV VT+K YWQ ++ + +GNQ
Sbjct: 210 LLPQNIFSVYISRDTAAEVGGELILGGIDPQYFSGDLHYVNVTRKAYWQIQMDRVDVGNQ 269
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
T +C+ GC +IVD+GTSL+ GP + ++ AIG ++ E CK + S
Sbjct: 270 LT-LCKAGCQSIVDTGTSLMVGPAEEIRALHKAIGALPLLMGEYFIDCKKIPS 321
>gi|340729556|ref|XP_003403066.1| PREDICTED: lysosomal aspartic protease-like [Bombus terrestris]
Length = 385
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 54/96 (56%), Positives = 71/96 (73%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + VFSF+LNR+PD + GGE++ GG DP H++G TYVPV +KGYWQF + I
Sbjct: 192 MVKQGLVPQPVFSFYLNRNPDDKAGGELILGGSDPNHYEGPFTYVPVDRKGYWQFRMDGI 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 96
+G+Q +C+ GC AI D+GTSL+AGP V IN
Sbjct: 252 KVGSQHLAICQKGCEAIADTGTSLIAGPVKEVEAIN 287
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 61/86 (70%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E+++DC IP +P ++F +G K F L+ + Y+LK + VC+SGFM D+P P GPLW
Sbjct: 299 EAMVDCSSIPNLPTINFVLGGKSFPLTGKDYVLKVTQFGKTVCLSGFMGMDIPEPNGPLW 358
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
ILGDVF+G Y+T FD G R+GFA+A
Sbjct: 359 ILGDVFIGRYYTEFDMGNNRVGFAKA 384
>gi|60678795|gb|AAX33732.1| Blo t allergen isoform 2 [Blomia tropicalis]
Length = 402
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 76/108 (70%), Gaps = 5/108 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV VFSF+LNRD + + GGEI+FGG DP ++KG TY P+TK GYWQF++ I
Sbjct: 198 MVKQGLVQHPVFSFYLNRDTNGKVGGEIIFGGSDPAYYKGDFTYAPLTKIGYWQFQMHGI 257
Query: 61 LIGNQSTG-----VCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 103
L+ N+S VCE GC AI D+GTSL+AGP+ V +N A+G G
Sbjct: 258 LLENKSNNKTVGHVCESGCEAIADTGTSLIAGPSDQVEHLNRALGAIG 305
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 3/85 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G +++C I +PN+ F I F LSP+QY+++ E+CIS F++ P PL
Sbjct: 309 GIFVLNCSHINALPNIIFQINGVKFPLSPDQYVMRQSAMGKEICISSFISL---PANIPL 365
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y+T FD G R+GFA
Sbjct: 366 WILGDVFIGNYYTEFDYGNKRVGFA 390
>gi|321472775|gb|EFX83744.1| hypothetical protein DAPPUDRAFT_92408 [Daphnia pulex]
Length = 379
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 56/120 (46%), Positives = 85/120 (70%), Gaps = 1/120 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGLV + VFSFWL+R+PDA +GGEI FGG DP+ + G+ ++ PVT+K YWQF++ +
Sbjct: 184 MIEQGLVEDPVFSFWLSRNPDAAQGGEITFGGADPERYTGEISWAPVTRKAYWQFKVDGV 243
Query: 61 LIGNQSTGV-CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ N++ G C+GGC I D+GTSL+AGP + ++N IGG +++ E + S+ +L
Sbjct: 244 QVSNEADGAFCQGGCQMIADTGTSLIAGPVDEIKKLNTLIGGIPIMAGEYFINCSRIDEL 303
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 3/101 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK--TGEGIAEVCIS 253
+N L +P GE I+C RI +P +SF+IG K F+L ++Y+++ GI+ CIS
Sbjct: 279 LNTLIGGIPIMAGEYFINCSRIDELPTISFSIGGKSFSLEGKEYVMQIVKSNGIS-ACIS 337
Query: 254 GFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GF+ ++PPP GPLWILGDVF+G Y+T+FD G R+GFA+A
Sbjct: 338 GFIGLEIPPPAGPLWILGDVFIGRYYTIFDFGNDRVGFADA 378
>gi|46309251|dbj|BAD15111.1| cathepsin D [Todarodes pacificus]
Length = 392
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV + VFSF+LNRDP +EGGE++ GG DPKH+ G TY+PVT+KGYWQ ++ +
Sbjct: 201 IISQKLVDQPVFSFYLNRDPSGKEGGELILGGSDPKHYTGNFTYLPVTRKGYWQIKMDKV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ G + C GGC AI D+GTSL+AGP + ++N AIGG + E
Sbjct: 261 VSGENT--FCSGGCQAIADTGTSLIAGPVDEIKKLNEAIGGRALPGGE 306
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 80/139 (57%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ + V +G++ CS A+ L E + +NE P GE ++DC
Sbjct: 255 IKMDKVVSGENTFCSGGCQAIADTGTSLIAGPVDE--IKKLNEAIGGRALPGGEYMVDCA 312
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +PNV F +G K F+L Y+L + +C+SGFM ++PPP GPLWILGDVF+
Sbjct: 313 SIPKLPNVDFVLGGKTFSLKTSDYVLTIKQAGQTICLSGFMGINIPPPAGPLWILGDVFI 372
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+TVFD GK ++GFA A
Sbjct: 373 GKYYTVFDLGKNQVGFAVA 391
>gi|332024025|gb|EGI64243.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 381
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 57/111 (51%), Positives = 81/111 (72%), Gaps = 7/111 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LVS+ VFSF+LNRD A EGGE++ GG DP H++G+ TY+PVT+KGYWQF++ +
Sbjct: 194 MVKQNLVSQAVFSFYLNRDSSAAEGGEMILGGSDPDHYEGEFTYIPVTRKGYWQFKMDGV 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH-----AIGGEGVVS 106
+ + + C+ GC AI D+GTSL+AGPT + +IN +IGGE +V+
Sbjct: 254 QVKDHA--FCKEGCQAIADTGTSLIAGPTSEIKDINEMIGATSIGGEAMVN 302
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 77/134 (57%), Gaps = 6/134 (4%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+ V D A C A+ L T E + INE+ + + GE++++C++I
Sbjct: 251 DGVQVKDHAFCKEGCQAIADTGTSLIAGPTSE--IKDINEMIGA-TSIGGEAMVNCNQIS 307
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
+MP++SFT+G+K F L E Y+LK + +C+SGFM DLP LWILGDVF+G Y
Sbjct: 308 SMPSISFTLGNKNFTLIGEDYVLKIKQFGKTICMSGFMGMDLPQ---SLWILGDVFIGRY 364
Query: 279 HTVFDSGKLRIGFA 292
+T FD R+GFA
Sbjct: 365 YTEFDMENDRVGFA 378
>gi|312861579|gb|ADR10277.1| cathepsin D [Branchiostoma belcheri]
Length = 395
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 58/119 (48%), Positives = 79/119 (66%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV + VFSF+LNRDP + GE++ GG DPK++ G T++ VTK GYWQF++ I
Sbjct: 197 MVQQKLVDKPVFSFYLNRDPSSTTRGELLLGGTDPKYYTGDFTFLDVTKPGYWQFKMDGI 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+I ++T C+GGCAAI D+GTSL+AGPT V +N IG + E + SQ L
Sbjct: 257 MINGKATDYCKGGCAAIADTGTSLIAGPTTEVQALNKQIGATPIPGGEYMVDCSQVSSL 315
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 65/100 (65%), Gaps = 2/100 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ + P P GE ++DC ++ ++P +SF +G K F L + Y+L+ VC+SGF
Sbjct: 291 LNKQIGATPIPGGEYMVDCSQVSSLPPISFMLGGKAFELQGKDYVLQVTTMGQTVCVSGF 350
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ D+ P GPLWILGDVF+G Y+T+FD G R+GFA A
Sbjct: 351 LGIDV--PAGPLWILGDVFIGPYYTLFDMGNNRVGFAPTA 388
>gi|194757447|ref|XP_001960976.1| GF11236 [Drosophila ananassae]
gi|190622274|gb|EDV37798.1| GF11236 [Drosophila ananassae]
Length = 388
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 76/116 (65%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S +FSF+LNRDP EGGEI+FGG DPKH+ G TY+PVT+K YWQ ++
Sbjct: 197 MYEQGLISAPIFSFYLNRDPAGPEGGEIIFGGSDPKHYSGDFTYLPVTRKAYWQIKMDAA 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 257 SIGDLE--LCKGGCQVIADTGTSLIAAPMSEATSINQKIGGTPIIGGQYVVSCDLI 310
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ + S GD +C + L E + IN+ P G+ ++ CD
Sbjct: 251 IKMDAASIGDLELCKGGCQVIADTGTSLIAAPMSEA--TSINQKIGGTPIIGGQYVVSCD 308
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 309 LIPNLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 368
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA+A
Sbjct: 369 GKYYTEFDMGNDRVGFADA 387
>gi|380746491|gb|AFE48185.1| cathepsin D [Pinctada margaritifera]
Length = 390
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 83/119 (69%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV + +FSF+L+RDP A EGGE++ GG D KH+KG TY+PVT++GYW+F++ +
Sbjct: 197 MIKQGLVPQPIFSFYLDRDPSASEGGELILGGSDTKHYKGNFTYLPVTRQGYWEFKMDGV 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G ++ C GGC I D+GTSL+AGP+ V ++N AIG + E + ++ DL
Sbjct: 257 SVG-ENHKFCTGGCNTIADTGTSLIAGPSSEVKKLNAAIGATAIPGGEYMIDCTKIPDL 314
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 64/98 (65%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N + P GE +IDC +IP +P ++F++G + F+L + Y+L + C+SGF
Sbjct: 290 LNAAIGATAIPGGEYMIDCTKIPDLPKITFSLGGQQFDLEGKDYVLTVTQQGQTTCLSGF 349
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 293
D+PPP GPLWILGDVF+G ++T FD G ++GFA+
Sbjct: 350 AGIDVPPPAGPLWILGDVFIGKFYTEFDMGNTQVGFAQ 387
>gi|195332251|ref|XP_002032812.1| GM20753 [Drosophila sechellia]
gi|194124782|gb|EDW46825.1| GM20753 [Drosophila sechellia]
Length = 392
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 77/116 (66%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 201 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLI 314
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 255 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYVVSCD 312
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 313 LIPQLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 372
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA++
Sbjct: 373 GKYYTEFDMGNDRVGFADS 391
>gi|91093044|ref|XP_966517.1| PREDICTED: similar to cathepsin D isoform 1 [Tribolium castaneum]
gi|270002651|gb|EEZ99098.1| hypothetical protein TcasGA2_TC004989 [Tribolium castaneum]
Length = 384
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 78/116 (67%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LV++ VFSF+LNRDP A +GGEI+ GG DP H+KG TY+ V ++ YWQF++ I
Sbjct: 192 MIQQNLVAQPVFSFYLNRDPSAAQGGEIILGGSDPAHYKGDFTYLSVDRQAYWQFKMDSI 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+G ++T C GC AI D+GTSL+AGP V IN AIG +V E C L+
Sbjct: 252 SVGGKNT-FCANGCEAIADTGTSLIAGPVSEVQGINKAIGATPIVGGEYMVDCNLI 306
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 67/100 (67%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ + P GE ++DC+ IP +P + FT+G K F L + Y+L+ + +C+SGF
Sbjct: 285 INKAIGATPIVGGEYMVDCNLIPNLPLIDFTLGGKNFTLEGKDYVLRVAQMGKTICLSGF 344
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D+PPP GPLWILGDVF+G ++T FD G R+GFA AA
Sbjct: 345 MGIDIPPPNGPLWILGDVFIGKFYTEFDLGNNRVGFAVAA 384
>gi|195474504|ref|XP_002089531.1| GE23596 [Drosophila yakuba]
gi|194175632|gb|EDW89243.1| GE23596 [Drosophila yakuba]
Length = 392
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 77/116 (66%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 201 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLI 314
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 255 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYVVSCD 312
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 313 LIPKLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 372
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA+A
Sbjct: 373 GKYYTEFDMGNDRVGFADA 391
>gi|31197673|ref|XP_307784.1| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|347969584|ref|XP_003436430.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|347969586|ref|XP_003436431.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|347969588|ref|XP_003436432.1| AGAP003277-PD [Anopheles gambiae str. PEST]
gi|30179074|gb|EAA03535.2| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|333466215|gb|EGK96172.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|333466216|gb|EGK96173.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|333466217|gb|EGK96174.1| AGAP003277-PD [Anopheles gambiae str. PEST]
Length = 389
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/118 (50%), Positives = 76/118 (64%), Gaps = 6/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QG + VFSF+LNRDP A EGGEI+FGG D KH+ G TY+ V +K YWQF++ +
Sbjct: 198 MFNQGKIDAPVFSFYLNRDPSAAEGGEIIFGGSDSKHYTGDFTYLSVDRKAYWQFKMDSV 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
+G+ C GC AI D+GTSL+AGP VT IN AIGG V++ E C L+ S
Sbjct: 258 TVGDAQ--YCNNGCEAIADTGTSLIAGPVAEVTAINKAIGGTPVLNGEYMVDCSLIPS 313
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 83/139 (59%), Gaps = 4/139 (2%)
Query: 157 EKENVSAGDSAVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+ ++V+ GD+ C+ CE A+ L E ++ IN+ P GE ++DC
Sbjct: 253 KMDSVTVGDAQYCNNGCE-AIADTGTSLIAGPVAE--VTAINKAIGGTPVLNGEYMVDCS 309
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP++P ++FT+G K F L YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 310 LIPSLPKITFTLGGKQFTLEGADYILRVAQMGKTICLSGFMGIDIPPPNGPLWILGDVFI 369
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA A
Sbjct: 370 GKYYTEFDMGNDRVGFATA 388
>gi|194863696|ref|XP_001970568.1| GG10707 [Drosophila erecta]
gi|190662435|gb|EDV59627.1| GG10707 [Drosophila erecta]
Length = 390
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 77/116 (66%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 199 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 259 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLI 312
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 253 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYVVSCD 310
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 311 LIPKLPVIKFVLGGKTFELEGKDYILRVSQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 370
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA+A
Sbjct: 371 GKYYTEFDMGNDRVGFADA 389
>gi|147832549|emb|CAN74997.1| hypothetical protein VITISV_036759 [Vitis vinifera]
Length = 182
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/176 (40%), Positives = 102/176 (57%), Gaps = 12/176 (6%)
Query: 123 LLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDSAVCSACEMAVVWV 179
LL+S P K+ QIGL F+G V GI+ V+++N + CSACEMAVVW+
Sbjct: 3 LLLSEASPHKIFSQIGLYIFDGTRGVGMGIENTVDEKNDDKLFVIHDVGCSACEMAVVWM 62
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
Q+QL + QT+E +L Y+NEL + LP+PMGES +DC ++ +M NVSFTIG K+ +LS +Y
Sbjct: 63 QSQLWKNQTQECILEYVNELYERLPSPMGESTMDCLQLSSMTNVSFTIGGKVCDLSANEY 122
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+LK E + I G AF L R + + D H + R+ F+ +
Sbjct: 123 VLKEVESLT---IHG--AFKLGGHRASILEMKDTI----HAAHEHATTRLLFSHLS 169
>gi|21355083|ref|NP_652013.1| cathD [Drosophila melanogaster]
gi|6685167|gb|AAF23824.1|AF220040_1 cathepsin D precursor [Drosophila melanogaster]
gi|7304149|gb|AAF59186.1| cathD [Drosophila melanogaster]
gi|15292549|gb|AAK93543.1| SD07085p [Drosophila melanogaster]
gi|220946566|gb|ACL85826.1| cathD-PA [synthetic construct]
Length = 392
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 77/116 (66%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 201 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLI 314
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 255 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYVVSCD 312
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 313 LIPQLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGLDIPPPNGPLWILGDVFI 372
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA+A
Sbjct: 373 GKYYTEFDMGNDRVGFADA 391
>gi|195581342|ref|XP_002080493.1| GD10217 [Drosophila simulans]
gi|194192502|gb|EDX06078.1| GD10217 [Drosophila simulans]
Length = 324
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 77/116 (66%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 133 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 192
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 193 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYLVSCDLI 246
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 187 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYLVSCD 244
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 245 LIPQLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 304
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA+A
Sbjct: 305 GKYYTEFDMGNDRVGFADA 323
>gi|307167890|gb|EFN61279.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 354
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 57/104 (54%), Positives = 72/104 (69%), Gaps = 2/104 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV + VFSF+LNRDP AE GGE++ GG DP H++ TYVPVT+KGYWQF + I
Sbjct: 165 MVKQKLVPKAVFSFYLNRDPSAEVGGELILGGSDPDHYEADLTYVPVTRKGYWQFSMDGI 224
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 104
+GN++ C GC AI D+GTSL+AGP V IN IG +
Sbjct: 225 EVGNRT--FCNNGCQAIADTGTSLIAGPVADVAAINKLIGASAI 266
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 61/87 (70%), Gaps = 1/87 (1%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G++I+DC++IP +P ++F +G+K F+LS E Y+L+ + +C+SGFM FD+ G
Sbjct: 268 GQAIVDCNKIPQLPEINFNLGNKKFSLSGEDYVLQIKQFGTTICMSGFMGFDI-GSHGLE 326
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+T FD R+GFA A
Sbjct: 327 WILGDVFIGRYYTEFDLDNDRVGFAPA 353
>gi|94732449|emb|CAK11131.1| cathepsin D [Danio rerio]
gi|94733132|emb|CAK05390.1| cathepsin D [Danio rerio]
gi|158253911|gb|AAI54316.1| Ctsd protein [Danio rerio]
Length = 398
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 73/93 (78%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++DC ++PT+P +SF++G K+++L+ EQYILK +G ++C+SGFM D+P
Sbjct: 303 AIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLTGEQYILKESQGGHDICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKA 395
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 73/116 (62%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + VFSF+LNR+PD + GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 204 MMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
IG+ +C+GGC AIVD+GTSL+ GP V + AIG +G +CK V
Sbjct: 264 SIGS-GLSLCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPLMQGEYMVDCKKV 318
>gi|22651403|gb|AAL61540.1| cathepsin D precursor [Danio rerio]
Length = 398
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 73/93 (78%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++DC ++PT+P +SF++G K+++L+ EQYILK +G ++C+SGFM D+P
Sbjct: 303 AIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLTGEQYILKESQGGHDICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKA 395
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 73/116 (62%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + VFSF+LNR+PD + GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 204 MMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
IG+ +C+GGC AIVD+GTSL+ GP V + AIG +G +CK V
Sbjct: 264 SIGS-GLSLCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPLMQGEYMVDCKKV 318
>gi|27503926|gb|AAH42316.1| Ctsd protein [Danio rerio]
gi|38571742|gb|AAH62824.1| Ctsd protein [Danio rerio]
gi|197247273|gb|AAI64814.1| Ctsd protein [Danio rerio]
Length = 398
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 73/93 (78%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++DC ++PT+P +SF++G K+++L+ EQYILK +G ++C+SGFM D+P
Sbjct: 303 AIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLTGEQYILKESQGGHDICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKA 395
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 73/116 (62%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + VFSF+LNR+PD + GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 204 MMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
IG+ +C+GGC AIVD+GTSL+ GP V + AIG +G +CK V
Sbjct: 264 SIGS-GLSLCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPLMQGEYMVDCKKV 318
>gi|18858489|ref|NP_571785.1| cathepsin D [Danio rerio]
gi|12053845|emb|CAC20111.1| cathepsin D enzyme [Danio rerio]
Length = 399
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 73/93 (78%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++DC ++PT+P +SF++G K+++L+ EQYILK +G ++C+SGFM D+P
Sbjct: 304 AIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLTGEQYILKESQGGHDICLSGFMGLDIP 363
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 364 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKA 396
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 73/118 (61%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + VFSF+LNR+PD + GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 203 MMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGM 262
Query: 61 LIGNQSTGVCEGGCAAIVDSG--TSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
IG+ +C+GGC AIVD+G TSL+ GP V + AIG +G +CK V
Sbjct: 263 SIGS-GLSLCKGGCEAIVDTGTSTSLITGPAAEVKALQKAIGAIPLMQGEYMVDCKKV 319
>gi|159468321|ref|XP_001692331.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158278517|gb|EDP04281.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 303
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 52/89 (58%), Positives = 70/89 (78%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VE+G ++ VFSFWLNRDP+A GGE+V GG+DP HF G+HT+VPVT++GYWQF + +
Sbjct: 203 LVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHFTGEHTWVPVTRQGYWQFNMEGL 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPT 89
+G S +C GCAAI D+GTSL+AGP+
Sbjct: 263 DLGPGSQKMCAKGCAAIADTGTSLIAGPS 291
>gi|407728652|gb|AFU24355.1| cathepsin D [Ctenopharyngodon idella]
Length = 398
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 71/94 (75%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++DC ++PT+P +SF +G K ++L+ EQYILK + E+C+SGFM D+P
Sbjct: 303 AIPLIQGEYMVDCKKVPTLPTISFVLGGKTYSLTGEQYILKESQAGQEICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKAA 396
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 75/116 (64%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + +FSF+LNR+PD + GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 204 MMSQKKVEKNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
IG++ T +C+GGC AIVD+GTSL+ GP + + AIG +G +CK V
Sbjct: 264 SIGSELT-LCKGGCEAIVDTGTSLITGPATEIKALQKAIGAIPLIQGEYMVDCKKV 318
>gi|237874218|ref|NP_001153867.1| cathepsin D [Acyrthosiphon pisum]
Length = 393
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 77/119 (64%), Gaps = 2/119 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG++ +FSF+LNRDP + GGEI+FGG DP+ + G TYVPVT+ GYWQF L ++
Sbjct: 202 MVNQGIIKSAIFSFYLNRDPSSTPGGEIIFGGSDPEKYTGPFTYVPVTRHGYWQFGLDEV 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
++GN T + G AI D+GTSL+AGP + +IN +GG + E + Q +L
Sbjct: 262 IVGN--TSIVSGALQAIADTGTSLIAGPVDNIKQINELLGGTAIPGGEYIIACDQIDNL 318
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/135 (41%), Positives = 74/135 (54%), Gaps = 2/135 (1%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+ V G++++ S A+ L + INEL P GE II CD+I
Sbjct: 259 DEVIVGNTSIVSGALQAIADTGTSLIAGPVDN--IKQINELLGGTAIPGGEYIIACDQID 316
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
+P +SF IG F L + YILK + +C+SGFM D+PPP GPLWILGDVF+G Y
Sbjct: 317 NLPVLSFVIGSTTFKLEGKDYILKVSQFGKTICLSGFMGIDIPPPNGPLWILGDVFIGRY 376
Query: 279 HTVFDSGKLRIGFAE 293
+T FD R+GFA
Sbjct: 377 YTEFDLENNRVGFAN 391
>gi|226437842|gb|ACO56332.1| putative gut cathepsin D-like aspartic protease [Callosobruchus
maculatus]
Length = 389
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 77/108 (71%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LVS+ VFSF+LNRDP A +GGE++ GG D H+KG+ TY+PV ++ YWQF++ +
Sbjct: 197 MVSQKLVSQPVFSFYLNRDPAAPQGGELILGGSDKAHYKGEFTYLPVDRQAYWQFKMDKV 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G ++T +C GC AI D+GTSL+AGP+ V IN AIG ++ E
Sbjct: 257 QVGPETT-LCAKGCEAIADTGTSLIAGPSEEVKAINKAIGATPIMGGE 303
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 66/99 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ + P GE ++ C+ IP +P ++F +G K F L + YIL+ + +C+SGF
Sbjct: 290 INKAIGATPIMGGEYLVSCESIPKLPTINFVLGGKPFALEGKDYILRVSQAGQTLCLSGF 349
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GPLWILGDVF+G Y+T FD G R+GFAEA
Sbjct: 350 MGIDIPPPNGPLWILGDVFIGRYYTEFDLGNNRVGFAEA 388
>gi|158254091|gb|AAI54325.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 96/157 (61%), Gaps = 6/157 (3%)
Query: 142 FNG-AEYVSTGIKTV--VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYIN 197
FNG YV+ K ++ + V G + +C + A+V + +E + +
Sbjct: 257 FNGDLHYVNVTRKAYWQIKMDEVQVGSTLTLCKSGCQAIVDTGTSMITGPVQE--VRALQ 314
Query: 198 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMA 257
+ ++P MGE IDC +IPT+P VSF++G K+FNL+ ++Y++K VC+SGFMA
Sbjct: 315 KAIGAIPLLMGEYWIDCKKIPTLPVVSFSLGGKMFNLTGQEYVMKVSHMGMNVCLSGFMA 374
Query: 258 FDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
D+PPP GPLWILGDVF+G Y+TVFD + R+GFA A
Sbjct: 375 MDIPPPAGPLWILGDVFIGRYYTVFDRDQDRVGFAPA 411
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 68/103 (66%), Gaps = 1/103 (0%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
++ + +FSF++NRDP + GGE++ GG D ++F G YV VT+K YWQ ++ ++ +G+
Sbjct: 225 ILPQNIFSFYINRDPAGDVGGELMLGGFDQQYFNGDLHYVNVTRKAYWQIKMDEVQVGST 284
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
T +C+ GC AIVD+GTS++ GP V + AIG ++ E
Sbjct: 285 LT-LCKSGCQAIVDTGTSMITGPVQEVRALQKAIGAIPLLMGE 326
>gi|24571212|gb|AAN62917.1| cathepsin D [Ctenopharyngodon idella]
Length = 203
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 71/94 (75%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++DC ++PT+P +SF +G K ++L+ EQYILK + E+C+SGFM D+P
Sbjct: 108 AIPLIQGEYMVDCKKVPTLPTISFVLGGKTYSLTGEQYILKESQAGQEICLSGFMGLDIP 167
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 168 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKAA 201
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 75/116 (64%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + +FSF+LNR+PD + GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 9 MMSQKKVEKNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGM 68
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
IG++ T +C+GGC AIVD+GTSL+ GP + + AIG +G +CK V
Sbjct: 69 SIGSELT-LCKGGCEAIVDTGTSLITGPATEIKALQKAIGAIPLIQGEYMVDCKKV 123
>gi|115279794|gb|ABI85390.1| cathepsin D [Hippoglossus hippoglossus]
Length = 399
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 77/106 (72%), Gaps = 1/106 (0%)
Query: 189 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 248
+E+V + ++ +LP MGE +IDC +IP++P VSF IG K+ NL+ E YI+K + +
Sbjct: 294 REEVRA-LHRAIGALPLIMGEYLIDCKKIPSLPVVSFNIGGKMLNLTGEDYIMKEFQKGS 352
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+C+SGFMA D+PPP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 353 SICLSGFMAMDIPPPAGPLWILGDVFIGKYYTVFDRNADRLGFAPA 398
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 72/113 (63%), Gaps = 5/113 (4%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
L+ + VFSF+++RD A GGE++ GG DP+++ G YV VT+K YWQ ++ + +G Q
Sbjct: 212 LLPQNVFSFYISRDASATVGGELILGGTDPQYYTGDLHYVNVTRKAYWQIKMDGVEVGTQ 271
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
T +C+ GC AIVD+GTSL+ GP V ++ AIG ++ E CK + S
Sbjct: 272 LT-LCKAGCQAIVDTGTSLIVGPREEVRALHRAIGALPLIMGEYLIDCKKIPS 323
>gi|41053329|ref|NP_956325.1| uncharacterized protein LOC336746 precursor [Danio rerio]
gi|34783813|gb|AAH56836.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 96/157 (61%), Gaps = 6/157 (3%)
Query: 142 FNG-AEYVSTGIKTV--VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYIN 197
FNG YV+ K ++ + V G + +C + A+V + +E + +
Sbjct: 257 FNGDLHYVNVTRKAYWQIKMDEVQVGSTLTLCKSGCQAIVDTGTSMITGPVQE--VRALQ 314
Query: 198 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMA 257
+ ++P MGE IDC +IPT+P VSF++G K+FNL+ ++Y++K VC+SGFMA
Sbjct: 315 KAIGAIPLLMGEYWIDCKKIPTLPVVSFSLGGKMFNLTGQEYVMKMSHMGMNVCLSGFMA 374
Query: 258 FDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
D+PPP GPLWILGDVF+G Y+TVFD + R+GFA A
Sbjct: 375 MDIPPPAGPLWILGDVFIGRYYTVFDRDQDRVGFAPA 411
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 68/103 (66%), Gaps = 1/103 (0%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
++ + +FSF++NRDP + GGE++ GG D ++F G YV VT+K YWQ ++ ++ +G+
Sbjct: 225 ILPQNIFSFYINRDPAGDVGGELMLGGFDQQYFNGDLHYVNVTRKAYWQIKMDEVQVGST 284
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
T +C+ GC AIVD+GTS++ GP V + AIG ++ E
Sbjct: 285 LT-LCKSGCQAIVDTGTSMITGPVQEVRALQKAIGAIPLLMGE 326
>gi|260826476|ref|XP_002608191.1| hypothetical protein BRAFLDRAFT_115250 [Branchiostoma floridae]
gi|229293542|gb|EEN64201.1| hypothetical protein BRAFLDRAFT_115250 [Branchiostoma floridae]
Length = 225
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 56/115 (48%), Positives = 77/115 (66%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV + VFSF+LNRD GGE++ GG DPK++ G T++ VTK GYWQF++ I
Sbjct: 18 MVQQKLVDKPVFSFYLNRDASGATGGELLLGGTDPKYYSGDFTFLDVTKPGYWQFKMDGI 77
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+I +++ C+GGCAAI D+GTSL+AGPT V +N IG + E + SQ
Sbjct: 78 MINGKASDFCKGGCAAIADTGTSLIAGPTSEVKALNEQIGATPIPGGEYMVDCSQ 132
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 66/110 (60%), Gaps = 12/110 (10%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA------- 248
+NE + P P GE ++DC +I ++P +SF +G K F L + Y+L+ + +
Sbjct: 112 LNEQIGATPIPGGEYMVDCSQIKSLPPISFMLGGKAFQLEGKDYVLQVSQSQSCSPQVTE 171
Query: 249 ---EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
VC+SGF+ D+P GPLWILGDVF+G Y+T+FD G R+GFA
Sbjct: 172 MGQTVCVSGFLGIDVP--AGPLWILGDVFIGPYYTLFDMGNNRVGFAHTT 219
>gi|157112486|ref|XP_001657556.1| cathepsin d [Aedes aegypti]
gi|205831550|sp|Q03168.2|ASPP_AEDAE RecName: Full=Lysosomal aspartic protease; Flags: Precursor
gi|108878060|gb|EAT42285.1| AAEL006169-PA [Aedes aegypti]
Length = 387
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGL+ VFSF+LNRDP A EGGEI+FGG D + G TY+ V +K YWQF++ +
Sbjct: 196 MFNQGLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFTYLSVDRKAYWQFKMDSV 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+G+ T C GC AI D+GTSL+AGP VT IN AIGG +++ E C L+
Sbjct: 256 KVGD--TEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTPIMNGEYMVDCSLI 309
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 81/139 (58%), Gaps = 4/139 (2%)
Query: 157 EKENVSAGDSAVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+ ++V GD+ C+ CE A+ L E ++ IN+ P GE ++DC
Sbjct: 251 KMDSVKVGDTEFCNNGCE-AIADTGTSLIAGPVSE--VTAINKAIGGTPIMNGEYMVDCS 307
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P +SF +G K F+L Y+L+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 308 LIPKLPKISFVLGGKSFDLEGADYVLRVAQMGKTICLSGFMGIDIPPPNGPLWILGDVFI 367
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA A
Sbjct: 368 GKYYTEFDMGNDRVGFATA 386
>gi|293230|gb|AAA29350.1| aspartic protease [Aedes aegypti]
Length = 387
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGL+ VFSF+LNRDP A EGGEI+FGG D + G TY+ V +K YWQF++ +
Sbjct: 196 MFNQGLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFTYLSVDRKAYWQFKMDSV 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+G+ T C GC AI D+GTSL+AGP VT IN AIGG +++ E C L+
Sbjct: 256 KVGD--TEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTPIMNGEYMVDCSLI 309
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 81/139 (58%), Gaps = 4/139 (2%)
Query: 157 EKENVSAGDSAVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+ ++V GD+ C+ CE A+ L E ++ IN+ P GE ++DC
Sbjct: 251 KMDSVKVGDTEFCNNGCE-AIADTGTSLIAGPVSE--VTAINKAIGGTPIMNGEYMVDCS 307
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P +SF +G K F+L Y+L+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 308 LIPKLPKISFVLGGKSFDLEGADYVLRVAQMGKTICLSGFMGIDIPPPNGPLWILGDVFI 367
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G Y+T FD G R+GFA A
Sbjct: 368 GKYYTEFDMGNDRVGFATA 386
>gi|322796189|gb|EFZ18765.1| hypothetical protein SINV_10075 [Solenopsis invicta]
Length = 366
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 58/117 (49%), Positives = 77/117 (65%), Gaps = 5/117 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV + +FSF+LNRDP A +GGE++ GG DP+H+ G TYV VT+KGYWQF + I
Sbjct: 175 MVKQKLVPQPIFSFYLNRDPTAAQGGEMILGGSDPEHYVGSMTYVDVTRKGYWQFTMDRI 234
Query: 61 LIGNQSTG--VCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAECKLV 112
+G+ S +C+ C AI D+GTSL+AGPT + EIN IG G C +V
Sbjct: 235 TVGDSSPSHILCKNTCQAIADTGTSLIAGPTVEINEINKQIGATMIGGQALVNCAMV 291
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 64/101 (63%), Gaps = 5/101 (4%)
Query: 196 INELCDSLPNPM--GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCIS 253
INE+ + M G+++++C +P +P V+F +G K F+L E Y+L+ E +C+S
Sbjct: 268 INEINKQIGATMIGGQALVNCAMVPHLPKVNFILGGKTFSLKGEDYVLEITEMGHTICMS 327
Query: 254 GFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GF D+ PLWILGDVF+G Y+T FD G R+GFAEA
Sbjct: 328 GFQGMDM---GDPLWILGDVFIGRYYTEFDLGNNRVGFAEA 365
>gi|262232673|gb|ACY38599.1| cathepsin D-like aspartic protease [Anisakis simplex]
Length = 453
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 69/108 (63%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV+E VF+FWLNR PD E GGEI FGG DPKHF Y PVT++ YWQF++ D
Sbjct: 236 MVNQQLVAEPVFAFWLNRTPDDEIGGEITFGGTDPKHFVEPIVYAPVTRRAYWQFKM-DK 294
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ G T C GC AI D+GTSL+AGP V I IG E + S E
Sbjct: 295 ISGQDGTLACSDGCQAIADTGTSLIAGPKQQVQLIQKYIGAEPLFSGE 342
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 44/87 (50%), Positives = 62/87 (71%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +I CD++P++P+VS IG K F+L+ Y+L + +C+SGFM DLP G L
Sbjct: 341 GEYMIPCDKVPSLPDVSLVIGGKTFSLTSLDYVLNITKAGKSICLSGFMGIDLPERVGQL 400
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G ++TVFD G+ R+GFA+A
Sbjct: 401 WILGDVFIGRFYTVFDMGQERVGFAQA 427
>gi|122938524|gb|ABM69086.1| aspartic proteinase AspMD03 [Musca domestica]
Length = 390
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 73/108 (67%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQGL+S+ +FSF+LNRDP A EGGEI+FGG DP H+ G TY+PVT+K YWQ ++
Sbjct: 199 MYEQGLISQPIFSFYLNRDPKAPEGGEIIFGGSDPDHYTGDFTYLPVTRKAYWQIKMDSA 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ +GGC I D+GTSL+A P T IN AIGG ++ +
Sbjct: 259 SMGDLK--CAKGGCQVIADTGTSLIALPPSEATSINQAIGGTPIMGGQ 304
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ P G+ ++ C+ IP +P + F +G K F L + Y+L+ + +C+SGF
Sbjct: 291 INQAIGGTPIMGGQYMVACEDIPKLPVIKFVLGGKTFELEGKDYVLRIAQMGKTICLSGF 350
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA A
Sbjct: 351 MGIDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFAIA 389
>gi|156553448|ref|XP_001600543.1| PREDICTED: lysosomal aspartic protease-like [Nasonia vitripennis]
Length = 384
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/109 (50%), Positives = 75/109 (68%), Gaps = 2/109 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV + +FSF+LNRDP+A+ GGE++ GG D H+ G TYVPV++K YWQF++ I
Sbjct: 193 MVKQNLVPQPIFSFYLNRDPNAKIGGEMILGGSDSAHYTGDFTYVPVSRKAYWQFKMDKI 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 109
IG++ CE GC AI D+GTSL+AGP + IN IG +V+ E
Sbjct: 253 TIGDKL--FCENGCEAIADTGTSLIAGPVGEIEGINKKIGATPIVAGEA 299
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 81/139 (58%), Gaps = 4/139 (2%)
Query: 157 EKENVSAGDSAVC-SACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+ + ++ GD C + CE A+ L E + IN+ + P GE+++ CD
Sbjct: 248 KMDKITIGDKLFCENGCE-AIADTGTSLIAGPVGE--IEGINKKIGATPIVAGEAMVSCD 304
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+P +P + F +G K F+L E Y+LK + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 305 AVPNLPTIDFVVGGKKFSLKGEDYVLKVSQFGKTICLSGFMGIDIPPPNGPLWILGDVFI 364
Query: 276 GVYHTVFDSGKLRIGFAEA 294
G ++T FD G RIGFA A
Sbjct: 365 GRFYTEFDMGNDRIGFANA 383
>gi|391329068|ref|XP_003738999.1| PREDICTED: lysosomal aspartic protease-like [Metaseiulus
occidentalis]
Length = 384
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/109 (50%), Positives = 75/109 (68%), Gaps = 1/109 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q ++ +FSF+L RD D G E+V GG+DPKH KG+ TY+PV++KGYWQF++ +
Sbjct: 192 MVAQNAIAAPIFSFYLTRDNDHPTGSELVIGGIDPKHHKGEITYIPVSRKGYWQFKMDSV 251
Query: 61 LIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
IG+ S T +C GC AI D+GTSL+AGPT V +N AIG ++ E
Sbjct: 252 KIGDVSKTTLCANGCQAIADTGTSLIAGPTSEVKALNKAIGAAPFLNGE 300
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 85/142 (59%), Gaps = 7/142 (4%)
Query: 157 EKENVSAGDSAVCSACE---MAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 213
+ ++V GD + + C A+ L T E + +N+ + P GE +++
Sbjct: 247 KMDSVKIGDVSKTTLCANGCQAIADTGTSLIAGPTSE--VKALNKAIGAAPFLNGEYLVN 304
Query: 214 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 273
C+ +PTMPN++FT+G K F L+P Y++K +G +C+SGF+ D+ PRGPLWILGDV
Sbjct: 305 CNNLPTMPNITFTLGGKDFELTPNDYVMKMSQGGLPLCLSGFIGLDV--PRGPLWILGDV 362
Query: 274 FMGVYHTVFDSGKLRIGFAEAA 295
F+G Y TVFD R+GFA AA
Sbjct: 363 FIGRYFTVFDRQSDRVGFAVAA 384
>gi|348511299|ref|XP_003443182.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 397
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 77/106 (72%), Gaps = 1/106 (0%)
Query: 189 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 248
KE+V + + + ++P MGE++I+C +IPT+P +SF IG K FNL+ E Y++K +
Sbjct: 292 KEEVKA-LQKAIGAIPLLMGEALIECTKIPTLPVISFDIGGKTFNLTGEDYVVKESQMGV 350
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+C+SGFMA D+PPP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 351 TICLSGFMAMDIPPPTGPLWILGDVFIGKYYTVFDRDADRVGFATA 396
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 68/104 (65%), Gaps = 1/104 (0%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
L+ + +FSF+++RDP A GGE++ GG DP+++ G YV VT+K +WQ + + +GNQ
Sbjct: 210 LLPQNIFSFYISRDPTAAVGGELMLGGTDPQYYTGDLHYVNVTRKAFWQIGMNRVDVGNQ 269
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 109
T +C+ GC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 270 LT-LCKAGCQAIVDTGTSLIVGPKEEVKALQKAIGAIPLLMGEA 312
>gi|238816833|gb|ACR56787.1| aspartic protease 3 [Strongyloides ratti]
Length = 389
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 52/94 (55%), Positives = 69/94 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE I C +IPTMPNV++TIG + F L+P Y+LK + + VCISGFM D+P
Sbjct: 296 AIPIMQGEYEIICSKIPTMPNVTYTIGGQQFVLTPNDYVLKVTQAMTTVCISGFMGLDIP 355
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
P GPLWILGDVF+G +++VFD G R+GFA+AA
Sbjct: 356 APNGPLWILGDVFIGKFYSVFDHGNKRVGFAQAA 389
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 66/114 (57%), Gaps = 4/114 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAE-EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 61
+ + +VF+FWLNRD + +GGE+ +D H+KG+ + +T + YW+ +G +
Sbjct: 200 NKNVCKNQVFAFWLNRDLNGNTQGGEMTLCDMDSSHYKGQIAWESLTAEDYWRINIGGVN 259
Query: 62 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+G Q G +AIVD+GTSLL GPT + +I IG ++ E +++ S+
Sbjct: 260 VGGQQV---SGAVSAIVDTGTSLLTGPTAEIEKIQKQIGAIPIMQGEYEIICSK 310
>gi|209154266|gb|ACI33365.1| Cathepsin D precursor [Salmo salar]
Length = 402
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/140 (42%), Positives = 89/140 (63%), Gaps = 3/140 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+E NV G+ +C A A+V L +E + +++ +LP MGE IDC
Sbjct: 264 IEMSNVEVGNQLTLCKAGCQAIVDTGTSLIIGPAEE--VRVLHKAIGALPLLMGEYWIDC 321
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
++P++P ++F +G K+FNL+ + YILK + ++C+SGFMA D+PPP GPLWILGDVF
Sbjct: 322 KKVPSLPVIAFNLGGKMFNLTGDDYILKESQMGLKICLSGFMAMDIPPPAGPLWILGDVF 381
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G Y++VFD R+GFA A
Sbjct: 382 IGRYYSVFDRDADRMGFAPA 401
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 74/113 (65%), Gaps = 5/113 (4%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
L+ + +FSF+++RDP A GGE++ GG DP ++ G YV VT+K YWQ E+ ++ +GNQ
Sbjct: 215 LLPQNIFSFYISRDPLAAVGGELMLGGTDPLYYTGDLHYVNVTRKAYWQIEMSNVEVGNQ 274
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
T +C+ GC AIVD+GTSL+ GP V ++ AIG ++ E CK V S
Sbjct: 275 LT-LCKAGCQAIVDTGTSLIIGPAEEVRVLHKAIGALPLLMGEYWIDCKKVPS 326
>gi|347451476|gb|AEO94539.1| aspartate protease cathepsin D [Triatoma infestans]
Length = 393
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 76/116 (65%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +QG V VFSF+LNRDP A GGEI+FGG DP+ + G TYVPV K+GYWQF + +
Sbjct: 201 MYQQGAVQNPVFSFYLNRDPAAAVGGEIIFGGSDPEKYVGDFTYVPVDKQGYWQFNMDKV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
++ ++ C+GGC AI D+GTSL+AGPT V +N +GG + E C L+
Sbjct: 261 IVNGKT--FCKGGCQAIADTGTSLIAGPTEDVIALNKLLGGTPIAGGEYMISCDLI 314
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 62/97 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+L P GE +I CD IP +P + F IG F+L + YIL+ +C+SGF
Sbjct: 293 LNKLLGGTPIAGGEYMISCDLIPKLPKIDFVIGGNKFSLEGKDYILRVSAMGKTICLSGF 352
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
+ D+PPP GPLWILGDVF+G ++T FD G R+GFA
Sbjct: 353 LGLDVPPPHGPLWILGDVFIGRFYTEFDLGNNRVGFA 389
>gi|257228998|gb|ACV53024.1| cathepsin D2 [Homarus americanus]
Length = 385
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 72/116 (62%), Gaps = 4/116 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q L+ VFSF+LNRDP + EGGE++ GG DP ++ G TY+PV +KGYWQ ++ I
Sbjct: 192 MVKQSLIPAPVFSFYLNRDPSSPEGGELILGGSDPNYYSGNFTYIPVDRKGYWQIKMDGI 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+ CEGGC AI D+GTSL+A P IN IG + + S E C L+
Sbjct: 252 QMNGARVPFCEGGCEAIADTGTSLIAAPVEEARSINKKIGAKPIASGEWSVDCSLI 307
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 61/99 (61%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ + P GE +DC IP +P +SF + + F L + YILK E C+SGF
Sbjct: 286 INKKIGAKPIASGEWSVDCSLIPHLPKISFVLNGQPFTLEGKDYILKVSVFGREECVSGF 345
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D+PPP GPLWILGD F+G ++T FD G R+GFA A
Sbjct: 346 IGLDVPPPMGPLWILGDTFIGRFYTEFDLGNNRVGFAIA 384
>gi|405951067|gb|EKC19012.1| Lysosomal aspartic protease [Crassostrea gigas]
Length = 439
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 77/119 (64%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV +FSF+L+R+P GGE++ GG DPK++ G TYV VT+KGYWQF++ +
Sbjct: 198 MVAQKLVPAPIFSFYLDRNPTGTPGGEMILGGSDPKYYSGNFTYVNVTRKGYWQFKMDGV 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ +++ C GGC AI D+GTSLLAGP+ V +N IG + + E + S+ G L
Sbjct: 258 KVNGKASKYCSGGCNAIADTGTSLLAGPSTEVKSLNAMIGAKPFAAGEYTVDCSKIGSL 316
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 59/94 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N + + P GE +DC +I ++P VSFT+ K F L + YIL E +C+SGF
Sbjct: 292 LNAMIGAKPFAAGEYTVDCSKIGSLPPVSFTLNGKDFTLQGKDYILTVSEMGQTICLSGF 351
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 289
+ D+P P GPLWILGD+F+G ++T FD G R+
Sbjct: 352 IGLDIPAPAGPLWILGDIFIGAFYTEFDMGNSRV 385
>gi|307203870|gb|EFN82801.1| Lysosomal aspartic protease [Harpegnathos saltator]
Length = 374
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 78/116 (67%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV + VFSF+LNRDPDA+ GGE++ GG D H++G+ TYVPV++KGYWQF + I
Sbjct: 183 MVKQDLVPKAVFSFYLNRDPDAKVGGEMLLGGSDSDHYEGEFTYVPVSRKGYWQFAMDSI 242
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+ + +C GC AI D+GTSL+AGP V IN IG +++ E C L+
Sbjct: 243 QVHGHT--LCASGCQAIADTGTSLIAGPVEEVAVINSLIGATTIIAGEAIVDCDLI 296
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 67/102 (65%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
++ IN L + GE+I+DCD I +P + IG K+F+LS + YIL+ + +C+
Sbjct: 272 VAVINSLIGATTIIAGEAIVDCDLIEKLPGIDVIIGGKMFSLSGKDYILRVKQFGKTICM 331
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
SGFM D+PPP GPLWILGDVF+G ++T FD R+GFA A
Sbjct: 332 SGFMGMDIPPPNGPLWILGDVFIGRFYTEFDMENDRVGFAVA 373
>gi|348565205|ref|XP_003468394.1| PREDICTED: cathepsin D-like [Cavia porcellus]
Length = 407
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 77/108 (71%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LV + +FSF+LNRDP A+ GGE+V GG+D K++KG TY+ VT+K YWQ + +
Sbjct: 213 LMEQKLVEKNIFSFYLNRDPTAQPGGELVLGGIDSKYYKGSFTYLNVTRKAYWQVHMDQL 272
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G++ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 273 QVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGALPLIQGE 319
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
+LP GE +I C+++ ++P+V+ +G + L+ E Y+LK + +C+SGFM D+P
Sbjct: 312 ALPLIQGEYMIPCEKVSSLPSVTLKLGGTDYTLASEDYVLKVSQAGKTICLSGFMGMDIP 371
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA++A
Sbjct: 372 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAQSA 405
>gi|380036056|ref|NP_001244039.1| cathepsin D1 precursor [Ictalurus punctatus]
gi|330689904|gb|AEC33270.1| cathepsin D1 [Ictalurus punctatus]
Length = 396
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 70/93 (75%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++DC ++P++P +SF +G + + L+ EQYILK + E+C+SGFMA D+P
Sbjct: 303 AIPLIQGEYMVDCKKVPSLPTISFNLGGQTYTLTGEQYILKESQAGREICLSGFMALDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+T+FD R+GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYTMFDRENNRVGFAKA 395
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 54/118 (45%), Positives = 75/118 (63%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + VFSF+LNR+PD + GGE++ GG DPK + G YV +T++ YWQ + +
Sbjct: 204 MMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKFYTGDFHYVNITRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
IG+Q T +C+GGC AIVD+GTSL+ GP V + AIG +G +CK V S
Sbjct: 264 TIGSQLT-LCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPLIQGEYMVDCKKVPS 320
>gi|146386352|gb|ABQ23964.1| cathepsin D [Oryctolagus cuniculus]
Length = 292
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 77/108 (71%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+LNRDP A+ GGE++ GGVDPK+++G +Y+ VT+K YWQ + +
Sbjct: 115 LMQQKLVEKNVFSFYLNRDPAAQPGGELMLGGVDPKYYQGSLSYLNVTRKAYWQVHMDQL 174
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +CEGGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 175 NVGSGLT-LCEGGCEAIVDTGTSLLVGPVDEVRELQRAIGAVPLIQGE 221
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 38/79 (48%), Positives = 55/79 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE II C+++ ++P V+ +G + + LS E Y LK +G +C+SGFM D+P
Sbjct: 214 AVPLIQGEYIIPCEKVSSLPPVTLKLGGRDYTLSSEDYTLKVSQGGKTICLSGFMGMDIP 273
Query: 262 PPRGPLWILGDVFMGVYHT 280
PP GPLWILGDVF+G Y+T
Sbjct: 274 PPAGPLWILGDVFIGRYYT 292
>gi|238816835|gb|ACR56788.1| aspartic protease 4 [Strongyloides ratti]
Length = 428
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q V + +F+FWLNR D +GGEI GG+DPKH+KG TYVPV++KGYWQF++ D
Sbjct: 197 LIDQHKVPQPIFAFWLNRIADDSDGGEITLGGMDPKHYKGDITYVPVSRKGYWQFKM-DG 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ C+ GC AI D+GTSL+AGP V I IG E + E
Sbjct: 256 FVGDNEKIACKNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPLARGE 303
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 61/99 (61%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + + P GE ++ CD++ ++P V+ IG + F LS + YIL + +SGF
Sbjct: 290 IQKFIGAEPLARGEYMVPCDKVSSLPIVNIVIGGQAFALSGKDYILNVTAMGKSIRLSGF 349
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M DLP G LWILGDVF+G Y+TVFD GK R+GFA A
Sbjct: 350 MGMDLPERVGELWILGDVFIGRYYTVFDFGKDRVGFAVA 388
>gi|227336874|gb|ACP21315.1| aspartic proteinase precursor [Rhipicephalus microplus]
Length = 391
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG+ + VFS +L+R+ +GGE++FGG+D H+ G TYVPVT+KGYWQF + +
Sbjct: 199 MVAQGVAANPVFSVYLDRNTSDPQGGEVLFGGIDKAHYTGNITYVPVTRKGYWQFHMDGV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G +T C GGC AI D+GTSL+AGPT + ++N AIG ++ E
Sbjct: 259 TVGTNAT-FCNGGCEAIADTGTSLIAGPTAEIQKLNMAIGAAPFLAGE 305
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 65/99 (65%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N + P GE ++ C IP +PN++FT+ + F L + YI++ + +C+SGF
Sbjct: 292 LNMAIGAAPFLAGEYMVSCKSIPKLPNITFTLNGQEFQLQGKDYIMQVSQAGIPMCLSGF 351
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D+P P GPLWILGDVF+G Y+T+FD G R+GFA++
Sbjct: 352 IGLDVPAPMGPLWILGDVFIGRYYTIFDRGNDRVGFAQS 390
>gi|83319201|dbj|BAE53722.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG+ + VFS +L+R+ GGE++FGG+D H+ G TYVPVT+KGYWQF + +
Sbjct: 199 MVAQGVAPKPVFSVYLDRNASDPNGGEVLFGGIDEAHYTGNITYVPVTRKGYWQFHMNGV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ +T C GGCAAI D+GTSL+AGPT + ++N AIG ++ E
Sbjct: 259 KVGDNAT-FCNGGCAAIADTGTSLIAGPTEEIHKLNVAIGAAPFMAGE 305
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 161 VSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 219
V GD+A C+ A+ L T+E + +N + P GE I+ C IPT
Sbjct: 258 VKVGDNATFCNGGCAAIADTGTSLIAGPTEE--IHKLNVAIGAAPFMAGEYIVSCKSIPT 315
Query: 220 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 279
+P ++F + F L + Y+L+ + +C+SGF+ D+P P GPLWILGDVF+G Y+
Sbjct: 316 LPKINFNLNGNEFVLEGKDYVLQVSQAGIPLCLSGFIGLDVPAPLGPLWILGDVFIGRYY 375
Query: 280 TVFDSGKLRIGFAEA 294
T+FD G R+GFAE+
Sbjct: 376 TIFDRGNDRVGFAES 390
>gi|157779726|gb|ABV71391.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG+ + VFS +L+R+ GGE++FGG+D H+ G TYVPVT+KGYWQF + +
Sbjct: 199 MVAQGVAPKPVFSVYLDRNASDPNGGEVLFGGIDEAHYTGNITYVPVTRKGYWQFHMNGV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ +T C GGCAAI D+GTSL+AGPT + ++N AIG ++ E
Sbjct: 259 KVGDNAT-FCNGGCAAIADTGTSLIAGPTEEIHKLNVAIGAAPFMAGE 305
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 161 VSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 219
V GD+A C+ A+ L T+E + +N + P GE I+ C IPT
Sbjct: 258 VKVGDNATFCNGGCAAIADTGTSLIAGPTEE--IHKLNVAIGAAPFMAGEYIVSCKSIPT 315
Query: 220 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 279
+P ++F + F L + Y+L+ + +C+SGF+ D+P P GPLWILGDVF+G Y+
Sbjct: 316 LPKINFNLNGNEFVLEGKDYVLQVSQAGIPLCLSGFIGLDVPAPLGPLWILGDVFIGRYY 375
Query: 280 TVFDSGKLRIGFAEA 294
T+FD G R+GFAE+
Sbjct: 376 TIFDRGNDRVGFAES 390
>gi|190576608|gb|ACE79095.1| cathepsin D precursor (predicted) [Sorex araneus]
Length = 405
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 92/155 (59%), Gaps = 7/155 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG+D K+FKG TY VT++ YWQ + I
Sbjct: 211 LMQQKLVDKNIFSFYLNRDPTAQPGGELMLGGIDSKYFKGSMTYHNVTRQAYWQVHMDQI 270
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+GN T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E + + DL
Sbjct: 271 DVGNGLT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVPLIQGEYIIPCEKLPDLP 329
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV 155
L G ++ L + A VS G KT+
Sbjct: 330 TVSLTLG------GKEYSLSPHDYALQVSQGGKTI 358
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 49/93 (52%), Positives = 67/93 (72%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE II C+++P +P VS T+G K ++LSP Y L+ +G +C+SGFM D+P
Sbjct: 310 AVPLIQGEYIIPCEKLPDLPTVSLTLGGKEYSLSPHDYALQVSQGGKTICLSGFMGMDIP 369
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD + R+G AEA
Sbjct: 370 PPAGPLWILGDVFIGRYYTVFDREQNRVGLAEA 402
>gi|224050910|ref|XP_002199093.1| PREDICTED: cathepsin D [Taeniopygia guttata]
Length = 396
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP GGE+V GG DPK++KG+ ++ VT+K YWQ + +
Sbjct: 204 VMKQKLVEKNMFSFYLNRDPSGVPGGEMVLGGTDPKYYKGEFSWFNVTRKAYWQIHMDSV 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN T VCEGGC AIVD+GTSL+ GPT V +I AIG + ++ E
Sbjct: 264 DVGNGPT-VCEGGCEAIVDTGTSLITGPTKEVKKIQEAIGAKPLIKGE 310
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 82/141 (58%), Gaps = 3/141 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ ++V G+ VC A+V L TKE + I E + P GE +I C
Sbjct: 258 IHMDSVDVGNGPTVCEGGCEAIVDTGTSLITGPTKE--VKKIQEAIGAKPLIKGEYMIPC 315
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+++PT+P VS IG K F L+ +QY+LK +C+SGF D+PPP GPLWILGDVF
Sbjct: 316 EKVPTLPVVSMNIGGKTFGLTGDQYVLKMTAQGETICMSGFSGLDIPPPGGPLWILGDVF 375
Query: 275 MGVYHTVFDSGKLRIGFAEAA 295
+G Y+T FD R+GFA++A
Sbjct: 376 IGPYYTSFDRDNNRVGFAQSA 396
>gi|21552717|gb|AAM62283.1|AF396662_1 cathepsin D preproprotein [Silurus asotus]
Length = 395
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 70/93 (75%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++DC ++P++P +SF +G + + L+ EQYILK + E+C+SGFMA D+P
Sbjct: 302 AIPLIQGEYMVDCKKVPSLPTISFNLGGQTYTLTGEQYILKESQAGREICLSGFMALDIP 361
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+T+FD ++GFA+A
Sbjct: 362 PPAGPLWILGDVFIGQYYTMFDRENNQVGFAKA 394
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 73/118 (61%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q + VFSF+LNR+PD + GGE++ GG DPK + G YV +T++ YWQ + +
Sbjct: 203 MMSQKKFEKNVFSFYLNRNPDTQPGGELLLGGTDPKFYTGDFHYVNITRQAYWQIHMDGM 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
IG+Q + +C GGC AIVD+GTSL+ GP V + AIG +G +CK V S
Sbjct: 263 SIGSQLS-LCNGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPLIQGEYMVDCKKVPS 319
>gi|291416142|ref|XP_002724306.1| PREDICTED: cathepsin D [Oryctolagus cuniculus]
Length = 377
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 77/108 (71%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+LNRDP A+ GGE++ GGVDPK+++G +Y+ VT+K YWQ + +
Sbjct: 183 LMQQKLVEKNVFSFYLNRDPAAQPGGELMLGGVDPKYYQGSLSYLNVTRKAYWQVHMDQL 242
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +CEGGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 243 NVGSGLT-LCEGGCEAIVDTGTSLLVGPVDEVRELQRAIGAVPLIQGE 289
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE II C+++ ++P V+ +G + + LS E Y LK +G +C+SGFM D+P
Sbjct: 282 AVPLIQGEYIIPCEKVSSLPPVTLKLGGRDYTLSSEDYTLKVSQGGKTICLSGFMGMDIP 341
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 342 PPAGPLWILGDVFIGRYYTVFDRDGNRVGFAEAA 375
>gi|146217392|gb|ABQ10738.1| cathepsin D [Penaeus monodon]
Length = 386
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 75/116 (64%), Gaps = 4/116 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q +V +FSF+LNRDP A EGGE++ GG DP ++ G TYVPV ++GYWQF++ +
Sbjct: 193 MVNQNVVPAPIFSFYLNRDPAAAEGGELILGGSDPAYYTGDFTYVPVDRQGYWQFKMDGL 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV----SAECKLV 112
+ + C+GGC AI D+GTSL+A P+ IN IG + ++ S +C L+
Sbjct: 253 QMNGTTVPFCDGGCEAIADTGTSLIAAPSEEARLINKKIGAKPIMGGEWSVDCNLI 308
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 64/99 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ + P GE +DC+ IP +P +SF + K F L + YIL+ + C+SGF
Sbjct: 287 INKKIGAKPIMGGEWSVDCNLIPHLPTISFVLAGKPFTLEGKDYILRVSQFGQTTCLSGF 346
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D+PPP GP+WILGD+F+G ++T FD G R+GFAE+
Sbjct: 347 IGLDVPPPMGPIWILGDIFIGRFYTEFDMGNNRVGFAES 385
>gi|326433118|gb|EGD78688.1| cathepsin D [Salpingoeca sp. ATCC 50818]
Length = 385
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 81/137 (59%), Gaps = 3/137 (2%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+++S + CS C+ A+ L T + +N+ ++P GE ++DC +IP
Sbjct: 252 DSLSVNGDSYCSNCQ-AIADTGTSLLAGPTD--AIKKLNKQIGAIPIAQGEYMVDCKKIP 308
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
TMPNV + + F L+P+QY+L+ C+SGF D+PPP GPLWILGDVF+G Y
Sbjct: 309 TMPNVDIVLNGQKFTLTPQQYVLQVSAQGQTECLSGFFGLDVPPPAGPLWILGDVFIGAY 368
Query: 279 HTVFDSGKLRIGFAEAA 295
TVFD G R+GFA +A
Sbjct: 369 TTVFDMGNNRVGFAPSA 385
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 77/142 (54%), Gaps = 8/142 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V G V +++FWLNR A GGE+ GG DPKH G +VP+T+ GYWQF + +
Sbjct: 195 LVSAGKVESNMYTFWLNRVQGAPSGGELTLGGYDPKHMSGPIQWVPLTRDGYWQFAMDSL 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----GEGVVSAECKLVVSQY 116
+ S C C AI D+GTSLLAGPT + ++N IG +G +CK + +
Sbjct: 255 SVNGDS--YCS-NCQAIADTGTSLLAGPTDAIKKLNKQIGAIPIAQGEYMVDCKKIPTMP 311
Query: 117 G-DLIWDLLVSGLLPEKVCQQI 137
D++ + L P++ Q+
Sbjct: 312 NVDIVLNGQKFTLTPQQYVLQV 333
>gi|118344558|ref|NP_001072052.1| cathepsin D1 precursor [Takifugu rubripes]
gi|55771082|dbj|BAD69801.1| cathepsin D1 [Takifugu rubripes]
Length = 396
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/125 (44%), Positives = 82/125 (65%), Gaps = 3/125 (2%)
Query: 170 SACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGD 229
S CE A+V L ++E + + + ++P GE ++ CD+IP++P ++F IG
Sbjct: 274 SGCE-AIVDTGTSLLTGPSEE--VKALQKAIGAMPLIQGEYMVSCDKIPSLPVITFNIGG 330
Query: 230 KIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 289
K F+LS +QY+LK + +C+SGFMA D+P P GPLWILGDVF+G Y+TVFD R+
Sbjct: 331 KPFSLSGDQYVLKVSQAGKTICLSGFMALDIPAPAGPLWILGDVFIGQYYTVFDRDNNRV 390
Query: 290 GFAEA 294
GFA+A
Sbjct: 391 GFAKA 395
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD + GGE++ GG DPK++ G YV VT++ YWQ + +
Sbjct: 204 IMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFDYVNVTRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C+ GC AIVD+GTSLL GP+ V + AIG ++ E
Sbjct: 264 SVGSQLS-LCKSGCEAIVDTGTSLLTGPSEEVKALQKAIGAMPLIQGE 310
>gi|307166067|gb|EFN60339.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 370
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 71/99 (71%), Gaps = 2/99 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LVS+ +FSF+LNR+P AEEGGE++ GG DP H+ G+ TYVPVT +GYWQF + +
Sbjct: 186 MIQQRLVSQPIFSFYLNRNPSAEEGGELILGGCDPNHYVGEFTYVPVTVEGYWQFTMDSV 245
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 99
+ GN +C GC AI D+GTSL+ GP+ + IN I
Sbjct: 246 IAGNYI--LCAQGCQAIADTGTSLIVGPSEDIDVINGYI 282
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 56/100 (56%), Gaps = 11/100 (11%)
Query: 195 YINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISG 254
YI + D+ N +DCD+I +P ++F + K NL+P YI++ E +C SG
Sbjct: 281 YIQNISDNDGN------VDCDKINELPTINFILSGKPHNLTPHDYIIRDTEDGVAICYSG 334
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
F L WILGDVF+G ++TVFD G R+GFA +
Sbjct: 335 FQGSYLSG-----WILGDVFIGHFYTVFDMGNNRVGFAPS 369
>gi|387915174|gb|AFK11196.1| cathepsin D1 [Callorhinchus milii]
Length = 394
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/118 (47%), Positives = 78/118 (66%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V Q LV VFSF+LNR+PD+ GGE++ GG DPK++ G Y+ VT+K YWQ ++ ++
Sbjct: 202 IVTQKLVPNNVFSFYLNRNPDSLPGGELILGGTDPKYYTGDFHYLNVTRKAYWQVKMDEV 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
IG Q T +C+GGCAAIVD+GTSL+ GP + + AIG ++ E CK V S
Sbjct: 262 SIGEQLT-LCKGGCAAIVDTGTSLITGPAQEIKALQKAIGAIPLIQGEYLIDCKKVAS 318
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
V+ + VS G+ +C A+V L +E + + + ++P GE +IDC
Sbjct: 256 VKMDEVSIGEQLTLCKGGCAAIVDTGTSLITGPAQE--IKALQKAIGAIPLIQGEYLIDC 313
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
++ ++P ++F +G +++ L+ EQY+L + +C+SGFM D+PPP GPLWILGDVF
Sbjct: 314 KKVASLPAINFKLGGQVYTLTAEQYVLNETQAGHSICLSGFMGLDIPPPGGPLWILGDVF 373
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G Y+T+FD K R+GFA++
Sbjct: 374 IGQYYTMFDREKDRVGFAKS 393
>gi|242013446|ref|XP_002427417.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
gi|212511797|gb|EEB14679.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
Length = 383
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 72/108 (66%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ LV +FSF+LNR+P+ + GGE++ GG DP ++KG TY+PV +K YWQF++ +
Sbjct: 192 MYEQRLVDSPIFSFYLNRNPNEKVGGELLLGGSDPNYYKGNFTYLPVNRKAYWQFQMDKV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++ + VC GGC AI D+GTSL+AGP V +IN + G V E
Sbjct: 252 MM--EDITVCRGGCQAIADTGTSLIAGPVEDVNKINKKLNGVPVSGGE 297
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 65/100 (65%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ + +P GE +I+C IP +P ++F + + F L + YIL+ + VC+SGF
Sbjct: 284 INKKLNGVPVSGGEYMIECRNIPNLPKINFVLKGRSFVLEAKDYILRVSQFGKTVCLSGF 343
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D+P P GPLWILGDVF+G ++T FD R+GFAE+A
Sbjct: 344 MGIDIPKPNGPLWILGDVFIGKFYTEFDMKNNRVGFAESA 383
>gi|25452827|sp|Q9DEX3.1|CATD_CLUHA RecName: Full=Cathepsin D; Flags: Precursor
gi|11037777|gb|AAG27733.1|AF312364_1 muscular cathepsin D [Clupea harengus]
Length = 396
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 69/94 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +IDC ++PT+P +SF +G K ++L+ EQY+LK +G +C+SG M ++P
Sbjct: 303 AIPLIQGEYMIDCKKVPTLPTISFNVGGKTYSLTGEQYVLKESQGGKTICLSGLMGLEIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRESNRVGFAKST 396
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 55/116 (47%), Positives = 75/116 (64%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G YVPVT++ YWQ + +
Sbjct: 204 MMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNYVPVTRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
IG+Q T +C+ GC AIVD+GTSL+ GP V + AIG +G +CK V
Sbjct: 264 SIGSQLT-LCKDGCEAIVDTGTSLITGPPAEVRALQKAIGAIPLIQGEYMIDCKKV 318
>gi|260822088|ref|XP_002606435.1| hypothetical protein BRAFLDRAFT_67687 [Branchiostoma floridae]
gi|229291776|gb|EEN62445.1| hypothetical protein BRAFLDRAFT_67687 [Branchiostoma floridae]
Length = 423
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 59/130 (45%), Positives = 77/130 (59%), Gaps = 2/130 (1%)
Query: 166 SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSF 225
SA C A+ L T E + +N+L + P GE +DC++IP++P +SF
Sbjct: 289 SAYCKGGCNAIADTGTSLIAGPTSE--IQALNKLIGATPIVGGEYTVDCNKIPSLPTISF 346
Query: 226 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
+G K F L E Y+LK CISGF+ D+PPPRGPLWILGDVF+G Y+T FD G
Sbjct: 347 VLGGKSFGLKGEDYVLKVSTMGQTECISGFLGIDVPPPRGPLWILGDVFIGPYYTQFDLG 406
Query: 286 KLRIGFAEAA 295
R+GFA AA
Sbjct: 407 NNRVGFARAA 416
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 44/91 (48%), Positives = 61/91 (67%)
Query: 18 RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 77
RDP GGE++ GG DPK++ G T+V VT+ GYWQF++ I+I Q++ C+GGC AI
Sbjct: 240 RDPLGTTGGELLLGGTDPKYYSGDFTFVNVTEPGYWQFKMDGIMINGQASAYCKGGCNAI 299
Query: 78 VDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
D+GTSL+AGPT + +N IG +V E
Sbjct: 300 ADTGTSLIAGPTSEIQALNKLIGATPIVGGE 330
>gi|427789779|gb|JAA60341.1| Putative cathepsin d isoform 1 protein [Rhipicephalus pulchellus]
Length = 391
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG+ ++ VFS +L+R+ +GGE++FGG+D H+ G TYVPVT+KGYWQF + +
Sbjct: 199 MVAQGVAAKPVFSVYLDRNASDPQGGEVLFGGIDKAHYTGNITYVPVTRKGYWQFHMDGV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G +T C GGC AI D+GTSL+AGP+ + ++N AIG + E
Sbjct: 259 TVGTNTT-FCNGGCEAIADTGTSLIAGPSEEIQKLNLAIGAAPFTAGE 305
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 64/99 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N + P GE ++ C IP +PN++FT+ F L + Y+++ + +C+SGF
Sbjct: 292 LNLAIGAAPFTAGEYLVSCKSIPKLPNITFTLNGHDFQLQGKDYVMQVSQAGIPLCLSGF 351
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D+P P GPLWILGDVF+G Y+T+FD G R+GFA++
Sbjct: 352 IGLDVPAPMGPLWILGDVFIGRYYTIFDRGNDRVGFAQS 390
>gi|146286061|sp|O93428.2|CATD_CHIHA RecName: Full=Cathepsin D; Flags: Precursor
Length = 396
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
+ P GE +++CD +P++P +SFT+G +++ L+ EQYILK + +C+SGFM D+P
Sbjct: 303 AFPLIQGEYMVNCDTVPSLPVISFTVGGQVYTLTGEQYILKVTQAGKTMCLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVFMG Y+TVFD R+GFA+A
Sbjct: 363 APAGPLWILGDVFMGQYYTVFDRDANRVGFAKA 395
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G YV VT++ YWQ + +
Sbjct: 204 IMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNYVNVTRQAYWQIRVDSM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C GGC AIVDSGTSL+ GP+ V + AIG ++ E
Sbjct: 264 AVGDQLS-LCTGGCEAIVDSGTSLITGPSVEVKALQKAIGAFPLIQGE 310
>gi|417400425|gb|JAA47158.1| Putative cathepsin d [Desmodus rotundus]
Length = 409
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 77/108 (71%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV E VFSF+LNRDP+A+ GGE++ GGVD K++KG TY+ VT+K YWQ + ++
Sbjct: 215 LMQQKLVDENVFSFYLNRDPNAQPGGELMLGGVDSKYYKGPITYLNVTRKAYWQVHMDEV 274
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+ GC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 275 AVGSGLT-LCKEGCEAIVDTGTSLLVGPVEEVRELQKAIGAVPLIQGE 321
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 65/94 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++ C+++ ++P V+ +G K + LS E Y LK +G +C+SGFM D+P
Sbjct: 314 AVPLIQGEYMVPCEKVSSLPEVTLKLGGKAYRLSAEDYTLKVSQGGKSICLSGFMGMDIP 373
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEA
Sbjct: 374 PPAGPLWILGDVFIGRYYTVFDRDENRVGLAEAT 407
>gi|213625094|gb|AAI69806.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +S GD +C A+V L +E ++ + ++P GE +I C
Sbjct: 261 IRMDQLSVGDQLTLCKGGCEAIVDTGTSLITGPVEE--VTALQRAIGAIPLIRGEYMILC 318
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D IP++P +SFT G ++++L+ EQY+LK + VC+SGF+ D+PPP GPLWI+GDVF
Sbjct: 319 DNIPSLPVISFTFGGRVYSLTGEQYVLKISKAGRTVCLSGFLGLDIPPPAGPLWIIGDVF 378
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G Y+TVFD R+GFA+A
Sbjct: 379 IGQYYTVFDRANDRVGFAKA 398
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 76/112 (67%), Gaps = 1/112 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + +
Sbjct: 207 IMEQKLVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQL 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 112
+G+Q T +C+GGC AIVD+GTSL+ GP VT + AIG ++ E ++
Sbjct: 267 SVGDQLT-LCKGGCEAIVDTGTSLITGPVEEVTALQRAIGAIPLIRGEYMIL 317
>gi|260810438|ref|XP_002599971.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
gi|229285255|gb|EEN55983.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
Length = 388
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 76/119 (63%), Gaps = 2/119 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG V E VFSF+LNRDP A GGE++ GG DP ++ G T++ VT+KGYWQF++ I
Sbjct: 197 MVSQGAVPEPVFSFYLNRDPSATAGGELILGGADPNYYTGDFTFLDVTRKGYWQFKMDGI 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G + C+ GC AI D+GTSL+AGP V +++ IG + E K+ S+ L
Sbjct: 257 NVGGST--FCQEGCQAIADTGTSLIAGPIEEVNKLHKQIGATPLAGGEYKVDCSKVTSL 313
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 80/136 (58%), Gaps = 2/136 (1%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+ ++ G S C A+ L +E ++ +++ + P GE +DC ++
Sbjct: 254 DGINVGGSTFCQEGCQAIADTGTSLIAGPIEE--VNKLHKQIGATPLAGGEYKVDCSKVT 311
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
++P +SF +G K F L+ ++YIL+ + +C+SGFM D+PPP GPLWILGDVF+G Y
Sbjct: 312 SLPTISFILGGKEFELTGKEYILQVKQFGMTICLSGFMGMDIPPPAGPLWILGDVFIGSY 371
Query: 279 HTVFDSGKLRIGFAEA 294
+T FD GK +GFA A
Sbjct: 372 YTQFDLGKNLVGFATA 387
>gi|49522906|gb|AAH75134.1| LOC443721 protein, partial [Xenopus laevis]
Length = 398
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +S GD +C A+V L +E ++ + ++P GE +I C
Sbjct: 260 IRMDQLSVGDQLTLCKGGCEAIVDTGTSLITGPVEE--VAALQRAIGAIPLIRGEYMILC 317
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D IP++P +SFT G ++++L+ EQY+LK + VC+SGF+ D+PPP GPLWI+GDVF
Sbjct: 318 DNIPSLPVISFTFGGRVYSLTGEQYVLKISKAGRTVCLSGFLGLDIPPPAGPLWIIGDVF 377
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G Y+TVFD R+GFA+A
Sbjct: 378 IGQYYTVFDRANDRVGFAKA 397
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 75/112 (66%), Gaps = 1/112 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + +
Sbjct: 206 IMEQKLVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQL 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 112
+G+Q T +C+GGC AIVD+GTSL+ GP V + AIG ++ E ++
Sbjct: 266 SVGDQLT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIPLIRGEYMIL 316
>gi|148231809|ref|NP_001085308.1| cathepsin D precursor [Xenopus laevis]
gi|62739292|gb|AAH94178.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +S GD +C A+V L +E ++ + ++P GE +I C
Sbjct: 261 IRMDQLSVGDQLTLCKGGCEAIVDTGTSLITGPVEE--VAALQRAIGAIPLIRGEYMILC 318
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D IP++P +SFT G ++++L+ EQY+LK + VC+SGF+ D+PPP GPLWI+GDVF
Sbjct: 319 DNIPSLPVISFTFGGRVYSLTGEQYVLKISKAGRTVCLSGFLGLDIPPPAGPLWIIGDVF 378
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G Y+TVFD R+GFA+A
Sbjct: 379 IGQYYTVFDRANDRVGFAKA 398
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 75/112 (66%), Gaps = 1/112 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + +
Sbjct: 207 IMEQKLVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQL 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 112
+G+Q T +C+GGC AIVD+GTSL+ GP V + AIG ++ E ++
Sbjct: 267 SVGDQLT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIPLIRGEYMIL 317
>gi|3378161|emb|CAA07719.1| cathepsin D precursor [Chionodraco hamatus]
Length = 396
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
+ P GE +++CD +P++P +SFT+G +++ L+ EQYILK + +C+SGFM D+P
Sbjct: 303 AFPLIQGEYMVNCDTVPSLPVISFTVGGQVYTLTGEQYILKVTQAGKTMCLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVFMG Y+TVFD R+GFA+A
Sbjct: 363 APAGPLWILGDVFMGQYYTVFDRDANRVGFAKA 395
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G YV VT++ YWQ + +
Sbjct: 204 IMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNYVNVTRQAYWQIRVDSM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C GGC AIVDSGTSL+ GP+ V + AIG ++ E
Sbjct: 264 AVGDQLS-LCTGGCEAIVDSGTSLITGPSVEVKALQKAIGAFPLIQGE 310
>gi|197631813|gb|ACH70630.1| cathepsin D [Salmo salar]
gi|223648160|gb|ACN10838.1| Cathepsin D precursor [Salmo salar]
Length = 398
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 69/93 (74%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
+ P GE +++CD+IPTMP+++F +G + ++L+ EQY+LK + +C+SGFM D+P
Sbjct: 305 ATPLIQGEYMVNCDKIPTMPDITFNLGGQSYSLTAEQYVLKESQAGKTICLSGFMGLDIP 364
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 365 APAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 397
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+P++E GGE++ GG DPK++ G Y+ V+++ YWQ + +
Sbjct: 206 IMSQKKVEQNVFSFYLNRNPESEPGGELLLGGTDPKYYSGDFQYLNVSRQAYWQVHMDGM 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C+GGC AIVD+GTSL+ GPT V + AIG ++ E
Sbjct: 266 GVGSQLS-LCKGGCEAIVDTGTSLITGPTAEVKALQKAIGATPLIQGE 312
>gi|156406785|ref|XP_001641225.1| predicted protein [Nematostella vectensis]
gi|156228363|gb|EDO49162.1| predicted protein [Nematostella vectensis]
Length = 370
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV VFSF+LNR+P A GGE++ GG DPK++KG +YVPVT++GYWQF++ I
Sbjct: 179 MVLQKLVPAPVFSFYLNREPGASPGGELLLGGSDPKYYKGNFSYVPVTQEGYWQFKMDGI 238
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ S C GC AI D+GTSL+AGPT + ++N+ IG + ++ E
Sbjct: 239 SVKEGS--FCSDGCQAIADTGTSLIAGPTDEIEKLNNLIGAKIIIGGE 284
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/136 (42%), Positives = 81/136 (59%), Gaps = 2/136 (1%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+ +S + + CS A+ L T E + +N L + GE ++C I
Sbjct: 236 DGISVKEGSFCSDGCQAIADTGTSLIAGPTDE--IEKLNNLIGAKIIIGGEYTVNCSAID 293
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
++P+++FTIG K + L+ +QYILK VCISGF+ D+PPPRGPLWILGDVF+G Y
Sbjct: 294 SLPDITFTIGGKKYVLTGKQYILKVTTLGQSVCISGFLGLDVPPPRGPLWILGDVFIGPY 353
Query: 279 HTVFDSGKLRIGFAEA 294
+T FD G R+GFAEA
Sbjct: 354 YTEFDFGNKRVGFAEA 369
>gi|308483047|ref|XP_003103726.1| CRE-ASP-4 protein [Caenorhabditis remanei]
gi|308259744|gb|EFP03697.1| CRE-ASP-4 protein [Caenorhabditis remanei]
Length = 462
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 73/108 (67%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQ V VFSFWLNR+PD++ GGEI FGG+DP+ + TY PVT+KGYWQF++ D
Sbjct: 239 LFEQKKVPSNVFSFWLNRNPDSDLGGEITFGGIDPRRYVEPITYTPVTRKGYWQFKM-DK 297
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++G+ G C GC AI D+GTSL+AGP + I + IG E ++ E
Sbjct: 298 VVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPLIKGE 344
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 66/99 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + P GE +I CD+IPT+P VSF IG + F+L E Y+LK +G +C+SGF
Sbjct: 331 IQNFIGAEPLIKGEYMISCDKIPTLPPVSFVIGGQEFSLKGEDYVLKIAQGGKTICLSGF 390
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M DLP G LWILGDVF+G Y+TVFD + R+GFA+A
Sbjct: 391 MGIDLPERVGELWILGDVFIGRYYTVFDFDQNRVGFAQA 429
>gi|326920173|ref|XP_003206349.1| PREDICTED: cathepsin D-like [Meleagris gallopavo]
Length = 397
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 77/108 (71%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q L+ + +FSF+LNRDP A+ GGE++ GG DPK+++G ++V VT+K YWQ + +
Sbjct: 205 VMKQKLIEKNIFSFYLNRDPTAQPGGELLLGGTDPKYYRGDFSWVNVTRKAYWQVHMDSV 264
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ N T +C+GGC AIVD+GTSL+ GPT V E+ AIG + ++ +
Sbjct: 265 NVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPLIKGQ 311
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 80/141 (56%), Gaps = 9/141 (6%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL---PNPMGESII 212
++ NV+ G + CE A+V L TKE + EL ++ P G+ II
Sbjct: 261 MDSVNVANGLTLCKGGCE-AIVDTGTSLITGPTKE-----VKELQTAIGAKPLIKGQYII 314
Query: 213 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 272
CD+I ++P V+ +G K + L+ EQY+ K +C+SGF D+PPP GPLWILGD
Sbjct: 315 PCDKISSLPVVTLMLGGKPYKLTGEQYVFKVSAQGETICLSGFSGLDVPPPGGPLWILGD 374
Query: 273 VFMGVYHTVFDSGKLRIGFAE 293
VF+G Y+TVFD +GFA+
Sbjct: 375 VFIGPYYTVFDRDNDSVGFAK 395
>gi|354496335|ref|XP_003510282.1| PREDICTED: cathepsin D [Cricetulus griseus]
gi|344248735|gb|EGW04839.1| Cathepsin D [Cricetulus griseus]
Length = 408
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 69/94 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P+V+ +G K + LSP +Y+LK +G +C+SGFM D+P
Sbjct: 313 AVPLIQGEYMIPCEKVSSLPSVTLKLGGKDYELSPSKYVLKVSQGGKTICLSGFMGMDIP 372
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 373 PPSGPLWILGDVFIGTYYTVFDRDNNRVGFAKAA 406
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP + GGE++ GG+D K+++G+ +Y+ VT+K YWQ + +
Sbjct: 214 LMQQKLVEKNIFSFFLNRDPTGQPGGELMLGGIDSKYYEGELSYLNVTRKAYWQVHMDQL 273
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ N T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 274 DVANGLT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVPLIQGE 320
>gi|74198620|dbj|BAE39786.1| unnamed protein product [Mus musculus]
Length = 410
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 MMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVGEVKELQKAIGAVPLIQGE 322
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|351712803|gb|EHB15722.1| Cathepsin D, partial [Heterocephalus glaber]
Length = 390
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV + VFSF+LNRDP A+ GGE++ GG+D K++KG TY+ VT+K YWQ + +
Sbjct: 196 LMSQKLVDKNVFSFYLNRDPSAQPGGELMLGGIDSKYYKGSFTYLNVTRKAYWQVHMDQL 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 256 EVGS-GLNLCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAIPLIQGE 302
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 67/94 (71%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++ C+++ ++P+V+ +G + LSPE Y+LK + +C+SGFM D+P
Sbjct: 295 AIPLIQGEYMVPCEKVSSLPSVTLKLGGSAYPLSPEDYVLKVSQAGRTICLSGFMGMDIP 354
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 355 PPTGPLWILGDVFIGRYYTVFDRDNNRVGFAQAA 388
>gi|195997417|ref|XP_002108577.1| hypothetical protein TRIADDRAFT_19349 [Trichoplax adhaerens]
gi|190589353|gb|EDV29375.1| hypothetical protein TRIADDRAFT_19349, partial [Trichoplax
adhaerens]
Length = 370
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 73/106 (68%), Gaps = 2/106 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGL+ + VFS +LNR GGE+VFGG DP ++ G +YVP++K+GYWQFEL
Sbjct: 180 MLQQGLIQKPVFSVYLNRQGTQNVGGELVFGGSDPNYYTGAFSYVPLSKEGYWQFELDGG 239
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 106
I N+ CEGGC A++D+GTSL+ GP V +INH IG + + S
Sbjct: 240 TIENEF--FCEGGCQAVIDTGTSLIVGPNEEVAKINHLIGADSIQS 283
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
+S+++C+ +P +P ++ TIG K ++LS ++YILK +G E+C SGF + G W
Sbjct: 282 QSLVNCNSMPELPVITLTIGGKEYSLSGQEYILKYRQGEQEICRSGFQGGNFEGI-GVQW 340
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
ILGDVF+G Y+T FD G R+GFA+A
Sbjct: 341 ILGDVFIGTYYTEFDKGNGRLGFAKA 366
>gi|4389327|pdb|1B5F|B Chain B, Native Cardosin A From Cynara Cardunculus L.
gi|4389329|pdb|1B5F|D Chain D, Native Cardosin A From Cynara Cardunculus L
Length = 87
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/84 (66%), Positives = 63/84 (75%), Gaps = 1/84 (1%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
+DC+ + +MPNVSFTIG K F L+PEQYILK G+G A CISGF A D GPLWILG
Sbjct: 5 VDCNTLSSMPNVSFTIGGKKFGLTPEQYILKVGKGEATQCISGFTAMD-ATLLGPLWILG 63
Query: 272 DVFMGVYHTVFDSGKLRIGFAEAA 295
DVFM YHTVFD G L +GFAEAA
Sbjct: 64 DVFMRPYHTVFDYGNLLVGFAEAA 87
>gi|307175237|gb|EFN65289.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 464
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/100 (50%), Positives = 68/100 (68%), Gaps = 2/100 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLVS +FSF+LNRD AE GGE + GG DP H+ G TY+PVT+KG+WQF + +I
Sbjct: 91 MIKQGLVSSPIFSFYLNRDSSAELGGEFILGGSDPAHYDGNFTYIPVTRKGFWQFTMDNI 150
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+I + +C C AI D+G S + GP +T+IN IG
Sbjct: 151 IINDHI--LCVESCQAIADTGASHICGPKSDITKINKLIG 188
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 7/100 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ + V+ + + +D E GGE + GG DP H++G TY+ + K WQF + +
Sbjct: 281 MIARSFVTSKPY-----KDSSNETGGEFILGGSDPDHYEGDFTYLSILHKEQWQFNMDTV 335
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+I + +CE GC AI D+G+S ++GP +T IN IG
Sbjct: 336 IINDHI--LCEEGCLAIADTGSSDISGPISDITYINKFIG 373
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVC 251
++YIN+ + N G+ ++C RI +P +SF + + F+L+ + Y+++ +C
Sbjct: 365 ITYINKFIGTF-NVNGQERVNCSRISELPTISFILDNVAFDLTGKDYVVQALYNETYTIC 423
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
S F WILG F+G Y+T F+ +GFA
Sbjct: 424 TSRFRGIAWFKFE---WILGVPFIGRYYTEFNVESELVGFA 461
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK 242
++ IN+L ++ N G+ ++C RI +P + F +GDK+FNL + YI++
Sbjct: 180 ITKINKLIGTI-NVDGDERVNCSRISELPTIRFILGDKVFNLIGKDYIIQ 228
>gi|26354406|dbj|BAC40831.1| unnamed protein product [Mus musculus]
Length = 445
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 47/70 (67%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILG 271
PP GPLWIL
Sbjct: 375 PPSGPLWILA 384
>gi|318977821|ref|NP_001187407.1| cathepsin D precursor [Ictalurus punctatus]
gi|308322929|gb|ADO28602.1| cathepsin D [Ictalurus punctatus]
Length = 398
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 70/99 (70%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ ++P MGE I C +IP++P VSF +G K+FNL+ Y+ K+ + VC+SGF
Sbjct: 299 LNKAIGAVPLIMGEYWISCSKIPSLPVVSFHLGGKVFNLTGGDYVYKSTKMGVSVCLSGF 358
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
MA D+PPP GPLWILGDVFMG ++TVFD ++GFA A
Sbjct: 359 MALDIPPPAGPLWILGDVFMGRFYTVFDRDNNQVGFAPA 397
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 69/103 (66%), Gaps = 1/103 (0%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
L+ + +FSF++NRDP AE GGE++ GG D ++F G Y+ VT+K YWQ ++ + +G+
Sbjct: 211 LLQQNIFSFYINRDPKAEVGGELMLGGCDKQYFDGDLHYLNVTRKAYWQIKMDTVEVGST 270
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
T +C+ GC AIVDSGTS++ GP + +N AIG ++ E
Sbjct: 271 LT-LCKDGCQAIVDSGTSMITGPVEEIRALNKAIGAVPLIMGE 312
>gi|195997283|ref|XP_002108510.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589286|gb|EDV29308.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 389
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 77/119 (64%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR A+ GGE++ GG D KH+ G TY+PVTKKGYWQF++ I
Sbjct: 196 IMAQKQVGKSVFSFYLNRKEGAKPGGELILGGSDSKHYSGNFTYLPVTKKGYWQFKMDGI 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ + + C+GGC AI D+GTSLLAGPT V +I IG +++ E + S+ L
Sbjct: 256 SVKGKGS-FCKGGCQAIADTGTSLLAGPTAEVNKIQTLIGATPLLNGEYTIDCSKISSL 313
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 66/98 (67%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I L + P GE IDC +I ++P ++FT+G K F L+ +QY+LK +VC+SGF
Sbjct: 289 IQTLIGATPLLNGEYTIDCSKISSLPPITFTLGGKKFTLTGKQYVLKVSSLGLDVCLSGF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 293
D+P PRGPLWILGDVF+G Y+T FD K R+GFA+
Sbjct: 349 TGIDIPKPRGPLWILGDVFIGQYYTEFDMAKNRVGFAK 386
>gi|74207446|dbj|BAE30902.1| unnamed protein product [Mus musculus]
Length = 410
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYMKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|47213062|emb|CAF91576.1| unnamed protein product [Tetraodon nigroviridis]
Length = 395
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 81/125 (64%), Gaps = 3/125 (2%)
Query: 170 SACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGD 229
S CE A+V L ++E + + + +LP GE ++ CD+IPT+P ++F IG
Sbjct: 274 SGCE-AIVDTGTSLLTGPSEE--VRSLQKAIGALPLIQGEYMVSCDKIPTLPVITFNIGG 330
Query: 230 KIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 289
K ++L+ +QY+LK + +C+SGFM D+P P GPLWILGDVF+G Y+TVFD R+
Sbjct: 331 KPYSLTGDQYVLKVSQAGKTICLSGFMGLDIPAPAGPLWILGDVFIGQYYTVFDRDNNRV 390
Query: 290 GFAEA 294
GFA+A
Sbjct: 391 GFAKA 395
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+P + GGE++ GG DP+++ G +YV VT++ YWQ + ++
Sbjct: 204 IMSQKKVEKNVFSFYLNRNPQTQPGGELLLGGTDPQYYTGDFSYVNVTRQAYWQIHVDEL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+ GC AIVD+GTSLL GP+ V + AIG ++ E
Sbjct: 264 SVGSQLT-LCKSGCEAIVDTGTSLLTGPSEEVRSLQKAIGALPLIQGE 310
>gi|403305561|ref|XP_003943328.1| PREDICTED: cathepsin D [Saimiri boliviensis boliviensis]
Length = 522
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ + +
Sbjct: 183 LMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHMDQV 242
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 243 EVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 289
Score = 40.4 bits (93), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 27/41 (65%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK 242
++P GE +I C+++ T+P ++ +G K + LSPE Y LK
Sbjct: 282 AVPLIQGEYMIPCEKVSTLPTITLKLGGKDYKLSPEDYTLK 322
>gi|194246053|gb|ACF35518.1| putative aspartic protease [Dermacentor variabilis]
Length = 246
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 74/102 (72%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQG+ + VF+F+L+R+ GGE+VFGG+D H+KG TY+PVTKKGYWQF + +
Sbjct: 54 MLEQGVAEKPVFAFYLDRNVADPNGGEVVFGGIDEAHYKGDITYLPVTKKGYWQFNMDGV 113
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +++T CEGGCAA+ D+GTS+L P+ V +IN +G +
Sbjct: 114 SVADKAT-FCEGGCAAVADTGTSMLVAPSEDVKKINKLLGAK 154
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 67/99 (67%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+L + G+ ++DC+ +P++P ++F + + F LSP+ Y+LK + C+SGF
Sbjct: 147 INKLLGAKEASPGQYLVDCESLPSLPKITFHLNKREFVLSPDDYVLKVTQEGTTFCLSGF 206
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D PP GPLWILGD+F+G Y+T+FD G R+GFAEA
Sbjct: 207 IPMDFPPEMGPLWILGDMFIGRYYTIFDRGNDRVGFAEA 245
>gi|346469557|gb|AEO34623.1| hypothetical protein [Amblyomma maculatum]
Length = 391
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG+ ++ VFS +L+R+ GGE++FGG+D H+ G +YVPVT+KGYWQF + +
Sbjct: 199 MVAQGVATKPVFSVYLDRNATDPNGGEVLFGGIDEAHYTGNISYVPVTRKGYWQFHMDGL 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ +T C GGC AI D+GTSL+AGPT + ++N AIG + E
Sbjct: 259 KVGDNAT-FCNGGCEAIADTGTSLIAGPTEEIQKLNLAIGAAPFTAGE 305
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 78/137 (56%), Gaps = 3/137 (2%)
Query: 159 ENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRI 217
+ + GD+A C+ A+ L T+E + +N + P GE ++ C I
Sbjct: 256 DGLKVGDNATFCNGGCEAIADTGTSLIAGPTEE--IQKLNLAIGAAPFTAGEYLVSCKSI 313
Query: 218 PTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGV 277
PT+P ++F + F L + YIL+ + +C+SGF+ D+P P GPLWILGDVF+G
Sbjct: 314 PTLPKITFNLNGHEFVLEGKDYILQVSQAGIPLCLSGFIGLDVPAPLGPLWILGDVFIGR 373
Query: 278 YHTVFDSGKLRIGFAEA 294
Y+T+FD G R+GFAE+
Sbjct: 374 YYTIFDRGNDRVGFAES 390
>gi|74198040|dbj|BAE35200.1| unnamed protein product [Mus musculus]
Length = 410
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|121543617|gb|ABM55520.1| putative cathepsin D [Maconellicoccus hirsutus]
Length = 391
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/100 (54%), Positives = 70/100 (70%), Gaps = 4/100 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV + VFSF+LNR+ A +GGEI+FGGVDP F+G TYVPV+ KGYWQF + I
Sbjct: 200 MVDQGLVKDSVFSFYLNRNTSAADGGEIIFGGVDPSKFRGNFTYVPVSVKGYWQFGMEKI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G + + AI D+GTSL+AGP+ + IN AIG
Sbjct: 260 SLGGKDIQTSQ----AIADTGTSLIAGPSEDIAAINKAIG 295
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/87 (50%), Positives = 61/87 (70%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ + C+ I +P+++FTI + LS Y+L+ + +CISGFM D+PPPRGPL
Sbjct: 302 GQYTVSCESIDQLPDITFTINGVDYTLSGRDYVLQVSQLGRTLCISGFMGIDIPPPRGPL 361
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+TVFD G R+GFAE+
Sbjct: 362 WILGDVFIGKYYTVFDLGNNRLGFAES 388
>gi|74142218|dbj|BAE31874.1| unnamed protein product [Mus musculus]
Length = 410
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 64/93 (68%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GF A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFTNA 407
>gi|74204520|dbj|BAE35336.1| unnamed protein product [Mus musculus]
Length = 410
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|6753556|ref|NP_034113.1| cathepsin D precursor [Mus musculus]
gi|115718|sp|P18242.1|CATD_MOUSE RecName: Full=Cathepsin D; Flags: Precursor
gi|50299|emb|CAA37067.1| cathepsin D [Mus musculus]
gi|50301|emb|CAA37423.1| unnamed protein product [Mus musculus]
gi|817945|emb|CAA48453.1| cathepsin d [Mus musculus]
gi|32452040|gb|AAH54758.1| Cathepsin D [Mus musculus]
gi|34785578|gb|AAH57931.1| Cathepsin D [Mus musculus]
gi|74139562|dbj|BAE40918.1| unnamed protein product [Mus musculus]
gi|74139905|dbj|BAE31791.1| unnamed protein product [Mus musculus]
gi|74151769|dbj|BAE29674.1| unnamed protein product [Mus musculus]
gi|74177956|dbj|BAE29773.1| unnamed protein product [Mus musculus]
gi|74178091|dbj|BAE29834.1| unnamed protein product [Mus musculus]
gi|74181413|dbj|BAE29980.1| unnamed protein product [Mus musculus]
gi|74184920|dbj|BAE39078.1| unnamed protein product [Mus musculus]
gi|74185047|dbj|BAE39131.1| unnamed protein product [Mus musculus]
gi|74185557|dbj|BAE30245.1| unnamed protein product [Mus musculus]
gi|74186716|dbj|BAE34813.1| unnamed protein product [Mus musculus]
gi|74189047|dbj|BAE39288.1| unnamed protein product [Mus musculus]
gi|74191359|dbj|BAE30262.1| unnamed protein product [Mus musculus]
gi|74191542|dbj|BAE30346.1| unnamed protein product [Mus musculus]
gi|74197068|dbj|BAE35086.1| unnamed protein product [Mus musculus]
gi|74197198|dbj|BAE35144.1| unnamed protein product [Mus musculus]
gi|74199016|dbj|BAE30724.1| unnamed protein product [Mus musculus]
gi|74204247|dbj|BAE39883.1| unnamed protein product [Mus musculus]
gi|74207294|dbj|BAE30833.1| unnamed protein product [Mus musculus]
gi|74207430|dbj|BAE30895.1| unnamed protein product [Mus musculus]
gi|74212520|dbj|BAE31001.1| unnamed protein product [Mus musculus]
gi|74212556|dbj|BAE31018.1| unnamed protein product [Mus musculus]
gi|74212558|dbj|BAE31019.1| unnamed protein product [Mus musculus]
gi|74213416|dbj|BAE35523.1| unnamed protein product [Mus musculus]
gi|74214708|dbj|BAE31193.1| unnamed protein product [Mus musculus]
gi|74217133|dbj|BAE31236.1| unnamed protein product [Mus musculus]
gi|74219445|dbj|BAE29499.1| unnamed protein product [Mus musculus]
gi|74220283|dbj|BAE31319.1| unnamed protein product [Mus musculus]
gi|74220373|dbj|BAE31412.1| unnamed protein product [Mus musculus]
gi|74220638|dbj|BAE31529.1| unnamed protein product [Mus musculus]
gi|74220740|dbj|BAE31342.1| unnamed protein product [Mus musculus]
gi|74222921|dbj|BAE42305.1| unnamed protein product [Mus musculus]
gi|74225262|dbj|BAE31566.1| unnamed protein product [Mus musculus]
gi|74225282|dbj|BAE31575.1| unnamed protein product [Mus musculus]
gi|148686195|gb|EDL18142.1| cathepsin D, isoform CRA_a [Mus musculus]
Length = 410
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|74220304|dbj|BAE31329.1| unnamed protein product [Mus musculus]
Length = 410
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|123993743|gb|ABM84473.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
Length = 412
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 67/94 (71%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y+LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYMLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|45384002|ref|NP_990508.1| cathepsin D precursor [Gallus gallus]
gi|461696|sp|Q05744.1|CATD_CHICK RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|259835|gb|AAB24157.1| prepro-cathepsin D [Gallus gallus]
Length = 398
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q L+ + +FSF+LNRDP A+ GGE++ GG DPK++ G ++V VT+K YWQ + +
Sbjct: 206 VMQQKLIEKNIFSFYLNRDPTAQPGGELLLGGTDPKYYSGDFSWVNVTRKAYWQVHMDSV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ N T +C+GGC AIVD+GTSL+ GPT V E+ AIG + ++ +
Sbjct: 266 DVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPLIKGQ 312
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/138 (37%), Positives = 78/138 (56%), Gaps = 3/138 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++ +V+ G + CE A+V L TKE + + + P G+ +I CD
Sbjct: 262 MDSVDVANGLTLCKGGCE-AIVDTGTSLITGPTKE--VKELQTAIGAKPLIKGQYVISCD 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+I ++P V+ +G K + L+ EQY+ K +C+SGF D+PPP GPLWILGDVF+
Sbjct: 319 KISSLPVVTLMLGGKPYQLTGEQYVFKVSAQGETICLSGFSGLDVPPPGGPLWILGDVFI 378
Query: 276 GVYHTVFDSGKLRIGFAE 293
G Y+TVFD +GFA+
Sbjct: 379 GPYYTVFDRDNDSVGFAK 396
>gi|301619112|ref|XP_002938948.1| PREDICTED: cathepsin D-like [Xenopus (Silurana) tropicalis]
Length = 355
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 70/93 (75%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P G+ ++ CD++PT+P +S T+G +++ L+ EQYI+K + + +C+SGFM ++P
Sbjct: 262 AVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLTGEQYIMKVSQLGSTICLSGFMGLNIP 321
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y++VFD R+GFA+A
Sbjct: 322 PPAGPLWILGDVFIGQYYSVFDRANNRVGFAKA 354
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV +FSF+LNR+PD + GGE++ GG DPK++ G Y+ VT+K YWQ + +
Sbjct: 163 IMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQL 222
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+GGC IVD+GTSL+ GP VT + AIG ++ +
Sbjct: 223 GVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLIQGQ 269
>gi|74198157|dbj|BAE35255.1| unnamed protein product [Mus musculus]
Length = 335
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 141 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 200
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 201 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAMPLIQGE 247
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 240 AMPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 299
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 300 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 332
>gi|170649686|gb|ACB21270.1| cathepsin D preproprotein (predicted) [Callicebus moloch]
Length = 412
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ + +
Sbjct: 218 LMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHMDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 65/94 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKDYRLSPEDYTLKVSQAGKAICLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAQAT 410
>gi|18043133|gb|AAH19682.1| Ctsd protein, partial [Mus musculus]
Length = 217
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 23 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 82
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 83 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 129
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 65/94 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 122 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 181
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 182 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANAV 215
>gi|45360583|ref|NP_988964.1| cathepsin D precursor [Xenopus (Silurana) tropicalis]
gi|38174445|gb|AAH61433.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 398
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 88/141 (62%), Gaps = 5/141 (3%)
Query: 156 VEKENVSAGDS-AVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 213
+ + +S GD ++C CE A+V L +E ++ + ++P GE +I
Sbjct: 260 IHMDQLSVGDRLSLCKDGCE-AIVDTGTSLITGPVEE--VTALQRAIGAIPLICGEYMIL 316
Query: 214 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 273
CD IP++P +SFT G + ++L+ EQY+LK + VC+SGF+ D+PPP GPLWI+GDV
Sbjct: 317 CDSIPSLPVISFTFGGRAYSLTGEQYVLKISKAGRTVCLSGFLGLDIPPPAGPLWIIGDV 376
Query: 274 FMGVYHTVFDSGKLRIGFAEA 294
F+G Y+TVFD R+GFA+A
Sbjct: 377 FIGQYYTVFDRANDRVGFAKA 397
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 72/112 (64%), Gaps = 1/112 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LV +FSF+LNR+PD GGE++ GG DP + G Y+ VT+K YWQ + +
Sbjct: 206 IMEQKLVDSNIFSFYLNRNPDTLPGGELLLGGTDPAFYTGDFNYMNVTRKAYWQIHMDQL 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 112
+G++ + +C+ GC AIVD+GTSL+ GP VT + AIG ++ E ++
Sbjct: 266 SVGDRLS-LCKDGCEAIVDTGTSLITGPVEEVTALQRAIGAIPLICGEYMIL 316
>gi|17549909|ref|NP_510191.1| Protein ASP-4 [Caenorhabditis elegans]
gi|3879202|emb|CAA90633.1| Protein ASP-4 [Caenorhabditis elegans]
Length = 444
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 73/108 (67%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQ V +FSFWLNR+PD+E GGEI FGG+D + + TYVPVT+KGYWQF++ D
Sbjct: 222 LFEQKKVPSNLFSFWLNRNPDSEIGGEITFGGIDSRRYVEPITYVPVTRKGYWQFKM-DK 280
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++G+ G C GC AI D+GTSL+AGP + I + IG E ++ E
Sbjct: 281 VVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPLIKGE 327
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 66/99 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + P GE +I CD++PT+P VSF IG + F+L E Y+LK +G +C+SGF
Sbjct: 314 IQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSLKGEDYVLKVSQGGKTICLSGF 373
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M DLP G LWILGDVF+G Y++VFD + R+GFA+A
Sbjct: 374 MGIDLPERVGELWILGDVFIGRYYSVFDFDQNRVGFAQA 412
>gi|185132376|ref|NP_001118183.1| cathepsin D precursor [Oncorhynchus mykiss]
gi|1858020|gb|AAC60301.1| cathepsin D [Oncorhynchus mykiss]
Length = 398
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
+ P GE +++CD+IPTMP ++F +G + ++L+ EQY+LK + +C+SGFM D+P
Sbjct: 305 ATPLIQGEYMVNCDKIPTMPVITFNLGGQSYSLTAEQYVLKESQAGKTICLSGFMGLDIP 364
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 365 APAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 397
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD+E GGE++ GG DPK++ G Y+ V+++ YWQ + +
Sbjct: 206 IMSQKKVEQNVFSFYLNRNPDSEPGGELLLGGTDPKYYSGDFQYLDVSRQAYWQIHMDGM 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C+GGC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 266 GVGSQLS-LCKGGCEAIVDTGTSLITGPAAEVKALQRAIGATPLIQGE 312
>gi|449280808|gb|EMC88033.1| Cathepsin D, partial [Columba livia]
Length = 387
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q L+ + +FSF+LNRDP A+ GGE++ GG DPK++ G ++V VT+K YWQ + +
Sbjct: 195 IMQQKLIEKNIFSFYLNRDPSAQPGGELLLGGTDPKYYSGDFSWVNVTRKAYWQVHMDAV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ N T +C+GGC AIVD+GTSL+ GPT V E+ AIG + ++ +
Sbjct: 255 DVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPLIKGQ 301
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 77/134 (57%), Gaps = 3/134 (2%)
Query: 160 NVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 219
+V+ G + CE A+V L TKE + + + P G+ +I CD++ +
Sbjct: 255 DVANGLTLCKGGCE-AIVDTGTSLITGPTKE--VKELQTAIGAKPLIKGQYVIPCDKVSS 311
Query: 220 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 279
+P ++ T+G K + L+ EQY+ K +C+SGF D+PPP GPLWILGDVF+G Y+
Sbjct: 312 LPVITLTLGGKPYQLTGEQYVFKVSVQGETICLSGFSGLDVPPPGGPLWILGDVFIGPYY 371
Query: 280 TVFDSGKLRIGFAE 293
TVFD +GFA+
Sbjct: 372 TVFDRDNDSVGFAK 385
>gi|147906891|ref|NP_001082550.1| cathepsin D precursor [Xenopus laevis]
gi|28436104|dbj|BAC57431.1| cathepsin D [Xenopus laevis]
Length = 409
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV VFSF+LNR+PD + GGE++ GG DPK++ G Y+ VT+K YWQ + +
Sbjct: 207 IMAQKLVESNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLNVTRKAYWQIHMDQL 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+GGC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 267 GVGDQLT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIPLIRGE 313
>gi|404498469|dbj|BAM44630.1| cathepsin, partial [Acipenser schrenckii]
Length = 249
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/113 (47%), Positives = 72/113 (63%), Gaps = 5/113 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +LNR+PD GGE++FGG DP HF G+ +VPVTK+GYWQ + +I
Sbjct: 58 MMAQNLVEMPIFSVYLNRNPDYSSGGELIFGGFDPSHFSGELYWVPVTKQGYWQILVDNI 117
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAEC 109
+ N C GGC AIVD+GTS+L GPT +T + +G EG + EC
Sbjct: 118 KV-NGEVMFCSGGCQAIVDTGTSMLTGPTADITALQQVLGASPLDEGTYAVEC 169
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 75/132 (56%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G+ CS A+V + T + ++ + ++ + P G ++C I +P+V
Sbjct: 121 GEVMFCSGGCQAIVDTGTSMLTGPTAD--ITALQQVLGASPLDEGTYAVECGSINILPDV 178
Query: 224 SFTIGDKIFNLSPEQYI-LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVF 282
+FTI F L+P YI ++ EG+ C +GF D+PPP GPLWILGDVF+G +++VF
Sbjct: 179 TFTINGAEFTLNPAAYIVMEESEGL---CTTGFQGMDIPPPGGPLWILGDVFIGQFYSVF 235
Query: 283 DSGKLRIGFAEA 294
D G R+G A+A
Sbjct: 236 DRGNNRVGLAKA 247
>gi|281182624|ref|NP_001162374.1| cathepsin D precursor [Papio anubis]
gi|160904227|gb|ABX52210.1| cathepsin D (predicted) [Papio anubis]
Length = 412
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|386869594|ref|NP_001247483.1| cathepsin D precursor [Macaca mulatta]
gi|67971186|dbj|BAE01935.1| unnamed protein product [Macaca fascicularis]
gi|384939322|gb|AFI33266.1| cathepsin D preproprotein [Macaca mulatta]
Length = 412
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|197099366|ref|NP_001125492.1| cathepsin D precursor [Pongo abelii]
gi|55728229|emb|CAH90861.1| hypothetical protein [Pongo abelii]
Length = 412
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|321461134|gb|EFX72169.1| hypothetical protein DAPPUDRAFT_189045 [Daphnia pulex]
Length = 391
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 71/111 (63%), Gaps = 2/111 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ QGLV E VFSFWLNRD D GGEIVFGG +P H+ G+ Y+PVT+K YWQF
Sbjct: 194 MMAQGLVDEPVFSFWLNRDASDPVNGGEIVFGGTNPDHYVGEINYIPVTRKAYWQFRADG 253
Query: 60 ILI-GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 109
++I G C+GGC I D+GTS++AGP V +N +G +++ E
Sbjct: 254 LMIEGIPEYPFCDGGCEMISDTGTSVIAGPAEEVNLLNRLLGAINIINGEA 304
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 65/104 (62%), Gaps = 2/104 (1%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV-- 250
++ +N L ++ GE++I C RIP +P ++ TI + L E YILK +
Sbjct: 287 VNLLNRLLGAINIINGEAVISCLRIPYLPPITITISGLPYTLEGEDYILKVDDPTTNTST 346
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
CISGF+ D+PPP GPLWILGDVF+G +++++D G RIG A A
Sbjct: 347 CISGFLGLDIPPPSGPLWILGDVFIGKFYSIYDFGMDRIGLATA 390
>gi|342675479|gb|AEL31665.1| cathepsin D [Cynoglossus semilaevis]
Length = 396
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 70/93 (75%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +++CD+IP++P ++F +G + ++L+ EQYILK + +C+SGFMA D+P
Sbjct: 303 AIPLIQGEYMVNCDKIPSLPVITFKMGGQSYSLTGEQYILKESQAGKTICLSGFMALDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 395
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q V VFSF+LNR+PD GGE++ GG DP ++ G+ YV VT++ YWQ + ++
Sbjct: 204 IMQQKKVESNVFSFYLNRNPDTAPGGELLLGGTDPTYYTGEFNYVNVTRQAYWQVSMDEL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+GGC AIVD+GTSLL GP+ V + AIG ++ E
Sbjct: 264 AVGSQLT-LCKGGCQAIVDTGTSLLTGPSAEVKALQKAIGAIPLIQGE 310
>gi|260837471|ref|XP_002613727.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
gi|229299116|gb|EEN69736.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
Length = 392
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 52/133 (39%), Positives = 84/133 (63%), Gaps = 12/133 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+L+RDP GGE++ GG DP ++ G TY+PV+ +GYWQ + +
Sbjct: 200 IMDQKLVDKNVFSFYLDRDPSRAPGGELLLGGTDPTYYTGNFTYIPVSYQGYWQLNMDGV 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----------CK 110
+G+Q +C GGC AIVD+GTSL+AGP+ + ++ AIG + + +
Sbjct: 260 HVGDQK--LCAGGCQAIVDTGTSLIAGPSEEIHKLQAAIGSQQISPGQYLVDCGRLDDLP 317
Query: 111 LVVSQYGDLIWDL 123
+V Q+GD +++L
Sbjct: 318 VVSFQFGDKLFNL 330
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/140 (39%), Positives = 80/140 (57%), Gaps = 3/140 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+ + V GD +C+ A+V L ++E + + S G+ ++DC
Sbjct: 254 LNMDGVHVGDQKLCAGGCQAIVDTGTSLIAGPSEE--IHKLQAAIGSQQISPGQYLVDCG 311
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
R+ +P VSF GDK+FNL+ ++Y +K +VC+ GFM D+P PRGPLWILGDVF
Sbjct: 312 RLDDLPVVSFQFGDKLFNLTGQEYTVKEQASPTTQVCLVGFMPMDIPNPRGPLWILGDVF 371
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G Y+T FD G R+GFA A
Sbjct: 372 IGQYYTEFDRGNNRVGFARA 391
>gi|90076280|dbj|BAE87820.1| unnamed protein product [Macaca fascicularis]
Length = 412
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|268581165|ref|XP_002645565.1| C. briggsae CBR-ASP-4 protein [Caenorhabditis briggsae]
Length = 446
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 72/108 (66%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQ V VFSFWLNR+PD+E GGEI FGG+D + + TY PVT+KGYWQF++ D
Sbjct: 223 LFEQKKVPSNVFSFWLNRNPDSELGGEITFGGIDARRYVEPITYTPVTRKGYWQFKM-DK 281
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++G+ G C GC AI D+GTSL+AGP + I + IG E ++ E
Sbjct: 282 VVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPLIKGE 328
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 66/99 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + P GE +I CD++PT+P VSF IG + F+L E Y+LK +G +C+SGF
Sbjct: 315 IQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSLKGEDYVLKVSQGGKTICLSGF 374
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M DLP G LWILGDVF+G Y+TVFD + R+GFA+A
Sbjct: 375 MGIDLPERVGELWILGDVFIGRYYTVFDFDQNRVGFAQA 413
>gi|313219527|emb|CBY30450.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 50/100 (50%), Positives = 64/100 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN + GE I+ C + TMP++ FT+ K++ L+PE Y++K E CISGF
Sbjct: 297 INHAIGAFQFANGEWIVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKIAAEGQEQCISGF 356
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D+PPP GPLWILGDVFMG Y+T FD R+GFAE A
Sbjct: 357 MGMDIPPPAGPLWILGDVFMGKYYTAFDFDNNRVGFAELA 396
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/146 (40%), Positives = 84/146 (57%), Gaps = 10/146 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+S+ +FSF+LNR +AE+GGE+ GGVD F G ++ VT++ YWQ ++ +
Sbjct: 200 MIEQGLLSDGMFSFFLNRTANAEDGGELTIGGVDNSRFTGDFSWNEVTRQAYWQIKMDNF 259
Query: 61 LIGNQSTGVC---EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 117
+ + C E GC IVDSGTSLLA P + EINHAIG + E +V ++
Sbjct: 260 EVQGKGVSACGGNENGCQVIVDSGTSLLAVPKNLAEEINHAIGAFQFANGEW-IVPCRHM 318
Query: 118 DLIWDL------LVSGLLPEKVCQQI 137
D + D+ V L PE +I
Sbjct: 319 DTMPDIDFTLNGKVYTLTPEDYVMKI 344
>gi|426366854|ref|XP_004050458.1| PREDICTED: cathepsin D [Gorilla gorilla gorilla]
Length = 412
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|30584113|gb|AAP36305.1| Homo sapiens cathepsin D (lysosomal aspartyl protease) [synthetic
construct]
gi|60653917|gb|AAX29651.1| cathepsin D [synthetic construct]
Length = 413
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|184185542|gb|ACC68942.1| cathepsin D (predicted) [Rhinolophus ferrumequinum]
Length = 410
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 85/136 (62%), Gaps = 1/136 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+A+ GGE++ GG D +++KG +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDSRYYKGALSYLNVTRKAYWQVHMDQV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+GN T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E + + L
Sbjct: 276 DVGNSLT-LCKAGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVSSLP 334
Query: 121 WDLLVSGLLPEKVCQQ 136
+L G K+C +
Sbjct: 335 EVILKLGGKDYKLCAE 350
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 80/141 (56%), Gaps = 3/141 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
V + V G+S +C A A+V L +E + + + ++P GE +I C
Sbjct: 270 VHMDQVDVGNSLTLCKAGCEAIVDTGTSLIVGPVEE--VRELQKAIGAVPLIQGEYMIPC 327
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+++ ++P V +G K + L E Y LK + +C+SGFM D+PPP GPLWILGDVF
Sbjct: 328 EKVSSLPEVILKLGGKDYKLCAEDYTLKVSQAGKTICLSGFMGMDIPPPGGPLWILGDVF 387
Query: 275 MGVYHTVFDSGKLRIGFAEAA 295
+G Y+TVFD + R+G AEA
Sbjct: 388 IGRYYTVFDRDENRVGLAEAT 408
>gi|4503143|ref|NP_001900.1| cathepsin D preproprotein [Homo sapiens]
gi|115717|sp|P07339.1|CATD_HUMAN RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|29678|emb|CAA28955.1| cathepsin D [Homo sapiens]
gi|179948|gb|AAA51922.1| cathepsin D [Homo sapiens]
gi|181180|gb|AAB59529.1| preprocathepsin D [Homo sapiens]
gi|16740920|gb|AAH16320.1| Cathepsin D [Homo sapiens]
gi|30582659|gb|AAP35556.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|48146011|emb|CAG33228.1| CTSD [Homo sapiens]
gi|54697170|gb|AAV38957.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|61356567|gb|AAX41260.1| cathepsin D [synthetic construct]
gi|61362282|gb|AAX42193.1| cathepsin D [synthetic construct]
gi|119622866|gb|EAX02461.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622867|gb|EAX02462.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622868|gb|EAX02463.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|123994405|gb|ABM84804.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
gi|261860344|dbj|BAI46694.1| cathepsin D [synthetic construct]
Length = 412
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|407280288|pdb|1LYA|B Chain B, Crystal Structures Of Native And Inhibited Forms Of Human
Cathepsin D: Implications For Lysosomal Targeting And
Drug Design
gi|407280290|pdb|1LYA|D Chain D, Crystal Structures Of Native And Inhibited Forms Of Human
Cathepsin D: Implications For Lysosomal Targeting And
Drug Design
gi|425684912|pdb|1LYW|B Chain B, Cathepsin D At Ph 7.5
gi|425684913|pdb|1LYW|D Chain D, Cathepsin D At Ph 7.5
gi|425684914|pdb|1LYW|F Chain F, Cathepsin D At Ph 7.5
gi|425684915|pdb|1LYW|H Chain H, Cathepsin D At Ph 7.5
gi|449112963|pdb|1LYB|B Chain B, Crystal Structures Of Native And Inhibited Forms Of Human
Cathepsin D: Implications For Lysosomal Targeting And
Drug Design
gi|449112965|pdb|1LYB|D Chain D, Crystal Structures Of Native And Inhibited Forms Of Human
Cathepsin D: Implications For Lysosomal Targeting And
Drug Design
Length = 241
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 148 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 207
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 208 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 241
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 49 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 108
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 109 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 155
>gi|74191361|dbj|BAE30263.1| unnamed protein product [Mus musculus]
Length = 410
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNTFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|332264729|ref|XP_003281384.1| PREDICTED: cathepsin D [Nomascus leucogenys]
Length = 412
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
>gi|355566182|gb|EHH22561.1| Cathepsin D [Macaca mulatta]
Length = 450
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 355 AVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 414
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 415 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 448
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +Y+ VT+K YWQ L +
Sbjct: 256 LMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRLDQV 315
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 316 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 362
>gi|71043798|ref|NP_001020792.1| cathepsin D precursor [Canis lupus familiaris]
gi|85540968|sp|Q4LAL9.1|CATD_CANFA RecName: Full=Cathepsin D; Flags: Precursor
gi|70561318|emb|CAJ14973.1| cathepsin D [Canis lupus familiaris]
Length = 410
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+A+ GGE++ GG D K++KG +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVEKNIFSFYLNRDPNAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEQV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 276 DVGSSLT-LCKGGCEAIVDTGTSLIVGPVDEVRELQKAIGAVPLIQGE 322
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 67/94 (71%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P+V+ +G K++ LS E Y LK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSTLPDVTLKLGGKLYKLSSEDYTLKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G A+A
Sbjct: 375 PPGGPLWILGDVFIGCYYTVFDRDQNRVGLAQAT 408
>gi|149757990|ref|XP_001490885.1| PREDICTED: napsin-A [Equus caballus]
Length = 401
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 77/114 (67%), Gaps = 1/114 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 201 LVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPSHYIPPLTFVPVTIPAYWQIHMKRV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 114
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E L S
Sbjct: 261 KVGTGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYLLQCS 313
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+++ V G + C A++ L T+E + ++ +P GE ++ C
Sbjct: 257 MKRVKVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYLLQCS 313
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VS +G F L+ + Y+++ G +C+SGF A D+PPP GPLWILGDVF+
Sbjct: 314 TIPRLPPVSLLLGGTWFTLTAQDYVIQIVRGGVRLCLSGFAALDMPPPTGPLWILGDVFL 373
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G + VFD G + R+G A A
Sbjct: 374 GSFVAVFDRGDMNGGARVGLARA 396
>gi|4927648|gb|AAD33219.1| cathepsin D [Hynobius leechii]
Length = 397
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV + VFSF+LNR+PD GGE++ GG DP ++ G TY+ VT K YWQ + +
Sbjct: 204 IMSQKLVEKNVFSFYLNRNPDTRPGGELLLGGTDPNYYTGDFTYLNVTPKAYWQIHMDQL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+GGC AIVD+GTSL+ GP+ VT + AIG ++ E
Sbjct: 264 GVGDQLT-LCKGGCEAIVDTGTSLIIGPSAEVTALQKAIGAIPLIQGE 310
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I CD++P++P ++F +G K F +S E Y+LK + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYMIPCDKVPSLPVITFNLGGKAFTVSGEDYVLKVSQAGHTICLSGFMGMDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP G LW LGDVF+G Y+TVFD R+G A+A
Sbjct: 363 PPSG-LWTLGDVFIGPYYTVFDRENDRVGLAKA 394
>gi|296219067|ref|XP_002755720.1| PREDICTED: cathepsin D [Callithrix jacchus]
Length = 392
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG Y+ VT+K YWQ + +
Sbjct: 198 LMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLFYLNVTRKAYWQVHMDQV 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 258 EVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAMPLIQGE 304
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 64/94 (68%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P + +G K + LSP+ Y LK + +C+SGFM D+P
Sbjct: 297 AMPLIQGEYMIPCEKVSTLPVIMLKLGGKDYELSPQDYTLKVSQAGKTICLSGFMGMDIP 356
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 357 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAQAT 390
>gi|205289916|gb|ACI02330.1| aspartic protease 1 [Uncinaria stenocephala]
Length = 447
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+EQ V +F+FWLNR+PD+E GGEI GG+DP+ + T+ PVT++GYWQF++ D+
Sbjct: 224 FIEQKKVPSPMFAFWLNRNPDSELGGEITLGGMDPRRYVEPLTWTPVTRRGYWQFKM-DM 282
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ G S+ C GC AI D+GTSL+AGP V I IG E ++ E
Sbjct: 283 VQGGSSSIACPNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPLMRGE 330
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 65/99 (65%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + + P GE +I CD++P++P++SF IG + F L E Y+L G +C+SGF
Sbjct: 317 IQKFIGAEPLMRGEYMIPCDKVPSLPDLSFVIGGQTFTLKGEDYVLTVKAGGKSICLSGF 376
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D P G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 377 MGMDFPERIGELWILGDVFIGKYYTVFDVGQARLGFAQA 415
>gi|198422402|ref|XP_002130569.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 389
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 69/108 (63%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +Q ++ FSF+LNRD +A GGE+ GGVD K F G +Y PVT KGYWQ + +
Sbjct: 196 MFKQEALAHNQFSFYLNRDANASSGGELYLGGVDTKKFTGSFSYHPVTVKGYWQISMDSV 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ ST C GC AIVDSGTSLLAGPT + +IN IG ++ E
Sbjct: 256 SVGS-STSACVSGCKAIVDSGTSLLAGPTDEIEKINKLIGATKFLNGE 302
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 84/142 (59%), Gaps = 5/142 (3%)
Query: 156 VEKENVSAGDS--AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 213
+ ++VS G S A S C+ A+V L T E + IN+L + GE I+
Sbjct: 250 ISMDSVSVGSSTSACVSGCK-AIVDSGTSLLAGPTDE--IEKINKLIGATKFLNGEYIVQ 306
Query: 214 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 273
C+++ TMP+++F++ + L P Y++K +CISGFM D+PPPRGPLWILGD+
Sbjct: 307 CNKMATMPDITFSLSGVKYILKPNDYVMKESTAGESICISGFMGLDVPPPRGPLWILGDI 366
Query: 274 FMGVYHTVFDSGKLRIGFAEAA 295
FMG ++T FD R+GFA+ A
Sbjct: 367 FMGKFYTTFDFANNRVGFAQLA 388
>gi|403299328|ref|XP_003940441.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 421
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 207 LVEQGLLDKPVFSFYFNRDPEKPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G+ T +C GCAAI+D+GTSL+ GPT + +N AIGG +++ E ++ S+ L
Sbjct: 267 KVGSGLT-LCARGCAAILDTGTSLITGPTEEIQALNAAIGGFPLLAGEYIILCSEIPKL 324
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V +G + C A++ L T+E + +N P GE II C
Sbjct: 263 MERVKVGSGLTLCARGCA-AILDTGTSLITGPTEE--IQALNAAIGGFPLLAGEYIILCS 319
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ + Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 320 EIPKLPAVSFLLGGVWFNLTAQDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 379
Query: 276 GVYHTVFDSG----KLRIGFAEA 294
G Y VFD G R+G A A
Sbjct: 380 GTYVAVFDRGDRKSSARVGLARA 402
>gi|148229393|ref|NP_001085403.1| MGC82347 protein precursor [Xenopus laevis]
gi|48734644|gb|AAH72252.1| MGC82347 protein [Xenopus laevis]
Length = 401
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 72/99 (72%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ + ++P G+ ++ CD+IPT+P +S T+G +++ L+ EQYI+K + + +C+SGF
Sbjct: 302 LQKAIGAVPLIQGQYMVQCDKIPTLPVISLTLGGQVYTLTGEQYIMKVSQRGSTICLSGF 361
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M ++PPP GPLWILGDVF+G Y++VFD R+GFA++
Sbjct: 362 MGLNIPPPAGPLWILGDVFIGQYYSVFDRANDRVGFAKS 400
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV VFSF+LNR+PD + GGE++ GG DPK++ G Y+ VT+K YWQ + +
Sbjct: 209 IMAQKLVESNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLNVTRKAYWQIHMDQL 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+GGC AIVD+GTSL+ GP VT + AIG ++ +
Sbjct: 269 GVGDQLT-LCKGGCEAIVDTGTSLITGPLEEVTALQKAIGAVPLIQGQ 315
>gi|226822856|gb|ACO83090.1| cathepsin D preproprotein (predicted) [Dasypus novemcinctus]
Length = 410
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+LNRDP A+ GGE+V GG+DP+H+ G Y+ +T+K YWQ + +
Sbjct: 216 LMQQKLVDKNVFSFYLNRDPTAQPGGELVLGGMDPRHYTGSVDYLNITRKAYWQVHMDRL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 276 EVGDGLT-LCKQGCEAIVDTGTSLMVGPVAEVRELQKAIGAVPLIQGE 322
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 67/94 (71%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P ++ +G++ + LS E Y LK +G VC+SGFM D+P
Sbjct: 315 AVPLIQGEYMISCEKVASLPPITLMLGNRGYRLSGEDYTLKVSQGGQTVCLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGD+F+G ++TVFD R+GFA+AA
Sbjct: 375 PPGGPLWILGDIFIGRFYTVFDRDLNRVGFAKAA 408
>gi|324507249|gb|ADY43078.1| Cathepsin D [Ascaris suum]
Length = 437
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L++ VF+FWL+R+PD + GGEI FGG D K + TY PVT++GYWQF++ D
Sbjct: 216 MIDQQLLAAPVFAFWLDRNPDDQIGGEITFGGTDTKRYVEPITYTPVTRRGYWQFKM-DK 274
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++G ++ C GC AI D+GTSL+AGP V I IG E + E
Sbjct: 275 VVGEEAVLACANGCQAIADTGTSLIAGPKQQVDTIQKFIGAEPLFRGE 322
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + + P GE +I CD++P++P+VSF I K ++L P Y+ VCISGF
Sbjct: 309 IQKFIGAEPLFRGEYMIPCDKVPSLPDVSFVIASKTYSLKPTDYVFNMTAMGKSVCISGF 368
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M +LP G LWILGDVF+G Y+TVFD G R+GFAEA
Sbjct: 369 MGIELPERVGELWILGDVFIGRYYTVFDVGHERVGFAEA 407
>gi|119592255|gb|EAW71849.1| napsin A aspartic peptidase, isoform CRA_c [Homo sapiens]
Length = 328
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 77/117 (65%), Gaps = 1/117 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 117
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E + YG
Sbjct: 266 KVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEVRSQSGGYG 321
>gi|432102593|gb|ELK30160.1| Napsin-A [Myotis davidii]
Length = 357
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 53/101 (52%), Positives = 70/101 (69%), Gaps = 1/101 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGL+ + VFSF+LNRDP+A EGGE+V GG DP H+ TYVPVT YWQ + +
Sbjct: 207 MVDQGLLDKPVFSFYLNRDPEAAEGGELVLGGSDPAHYIPPLTYVPVTVPAYWQVHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+G T +C GC AI+D+GTSL+ GPT + ++ AIGG
Sbjct: 267 TVGPGLT-LCAQGCPAILDTGTSLITGPTEEIRALHRAIGG 306
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 30/44 (68%)
Query: 207 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 250
+G+ II+C IP +P VSF++G FNL+ + Y+++ G G +V
Sbjct: 310 LGKYIIECSVIPALPPVSFSLGGVWFNLTSQDYVIQVGSGQNDV 353
>gi|60654209|gb|AAX29797.1| cathepsin D [synthetic construct]
Length = 413
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 47/88 (53%), Positives = 63/88 (71%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+PPP GPL
Sbjct: 323 GEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPL 382
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 383 WILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIEGE 324
>gi|410982348|ref|XP_003997519.1| PREDICTED: napsin-A [Felis catus]
Length = 422
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 204 LVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAYWQIHMERM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G T +C GCAAI+D+GTSL+ GPT + +N AIGG ++ E
Sbjct: 264 KVGTGLT-LCAQGCAAILDTGTSLITGPTEEIRALNTAIGGISLLVGE 310
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 78/143 (54%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + +N + +GE +I C+
Sbjct: 260 MERMKVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IRALNTAIGGISLLVGEYLIQCE 316
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IPT+P VSF +G FNL+ + Y+++ G +C+SGF A D+P P GPLWILGDVF+
Sbjct: 317 TIPTLPPVSFLLGGVWFNLTAQDYVIQIVRGGFRLCLSGFQALDMPSPAGPLWILGDVFL 376
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
Y VFD G L R+G A +
Sbjct: 377 RTYVAVFDRGNLTSGARVGLARS 399
>gi|387015018|gb|AFJ49628.1| Cathepsin D [Crotalus adamanteus]
Length = 399
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 76/111 (68%), Gaps = 1/111 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQGL+ + +FSF+LNRDP E GGE++FGG D +++ G ++V V++K YWQ + +
Sbjct: 204 VMEQGLLEKNLFSFYLNRDPKGETGGELLFGGTDSQYYSGDFSWVNVSRKAYWQVHMDKV 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
+ N T VC+ GC AIVD+GTSL+ GPT + E+ AIG + ++ + L
Sbjct: 264 DVANGLT-VCKDGCEAIVDTGTSLITGPTKEIKELQKAIGAKPIIKGQYML 313
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/138 (38%), Positives = 81/138 (58%), Gaps = 3/138 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
++K +V+ G + CE A+V L TKE + + + + P G+ ++ CD
Sbjct: 260 MDKVDVANGLTVCKDGCE-AIVDTGTSLITGPTKE--IKELQKAIGAKPIIKGQYMLPCD 316
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
++ T+P VS +G + + L+P+QY LK +C+SGF D+PPP GPLWILGDVF+
Sbjct: 317 KLSTLPTVSLVLGGQSYALTPDQYALKVTVQGETLCLSGFSGLDVPPPGGPLWILGDVFI 376
Query: 276 GVYHTVFDSGKLRIGFAE 293
G Y+TVFD +GFA+
Sbjct: 377 GPYYTVFDRDNDSVGFAK 394
>gi|60820131|gb|AAX36524.1| cathepsin D [synthetic construct]
gi|61363243|gb|AAX42359.1| cathepsin D [synthetic construct]
Length = 412
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 47/88 (53%), Positives = 63/88 (71%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+PPP GPL
Sbjct: 323 GEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPL 382
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 383 WILGDVFIGRYYTVFDRDNNRVGFAEAA 410
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIEGE 324
>gi|334562337|gb|AEG79714.1| cathepsin D [Apostichopus japonicus]
Length = 372
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 73/110 (66%), Gaps = 1/110 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ + L+S +FSF+L++DP A GGE++ GG D K++ G TYV V+KKGYWQF + +
Sbjct: 197 IMNRKLISNNIFSFYLSKDPSAAVGGELLLGGTDSKYYTGNFTYVKVSKKGYWQFAMDKV 256
Query: 61 LIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 109
IG + G C G C+AI D+GTSL+AGPT + ++N IG ++ E
Sbjct: 257 SIGGKDAGYCTGKNCSAICDTGTSLIAGPTADINDLNKKIGAIPLIKGEA 306
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 54/80 (67%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ ++P GE+II C+ IP++P++SF + F L P+ Y+LK E +CISGF
Sbjct: 292 LNKKIGAIPLIKGEAIILCNTIPSLPDISFQLNGHDFTLKPDDYVLKVSEANETICISGF 351
Query: 256 MAFDLPPPRGPLWILGDVFM 275
+ DLPP GPLWILGDVF+
Sbjct: 352 LGIDLPPEIGPLWILGDVFI 371
>gi|307175238|gb|EFN65290.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 357
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 64/108 (59%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LVS +FSF+LNRDP AE GE + GG DP H+ G TYVPVT KG+WQF + I
Sbjct: 168 MVNQNLVSSHIFSFYLNRDPSAELDGEFILGGSDPAHYDGNFTYVPVTHKGFWQFTMDKI 227
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ N S +C+ C AI D+G GPT V IN IG + E
Sbjct: 228 EVNNIS--LCQSSCQAIADTGMGETYGPTSDVKTINELIGTTNIDGME 273
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 44/85 (51%), Gaps = 4/85 (4%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INEL + N G ++C RIP +P + F +G K FNL+ + YI++ + C S F
Sbjct: 260 INELIGT-TNIDGMERVNCSRIPELPTIRFILGGKAFNLTGKDYIIQFPDEGNTSCRSSF 318
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHT 280
+ +D W LG F+G+ T
Sbjct: 319 LGYDFKEFN---WELGVAFIGIVFT 340
>gi|74191270|dbj|BAE39462.1| unnamed protein product [Mus musculus]
gi|74204799|dbj|BAE35462.1| unnamed protein product [Mus musculus]
Length = 410
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ A G ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKATGAVPLIQGE 322
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|74219443|dbj|BAE29498.1| unnamed protein product [Mus musculus]
Length = 410
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+G SLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGASLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|115719|sp|P00795.2|CATD_PIG RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
Length = 345
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 84/141 (59%), Gaps = 3/141 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ V+ G S +C A+V L Q +E + + + ++P GE +I C
Sbjct: 207 IHMNQVAVGSSLTLCKGGCEAIVDTGTSLIVGQPEE--VRELGKAIGAVPLIQGEYMIPC 264
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+++P++P+V+ T+G K + LS E Y LK + +C+SGFM D+PPP GPLWILGDVF
Sbjct: 265 EKVPSLPDVTVTLGGKKYKLSSENYTLKVSQAGQTICLSGFMGMDIPPPGGPLWILGDVF 324
Query: 275 MGVYHTVFDSGKLRIGFAEAA 295
+G Y+TVFD R+G AEAA
Sbjct: 325 IGRYYTVFDRDLNRVGLAEAA 345
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV +++FSF+LNRDP A+ GGE++ GG+D K++KG Y VT+K YWQ + +
Sbjct: 153 LMQQKLVDKDIFSFYLNRDPGAQPGGELMLGGIDSKYYKGSLDYHNVTRKAYWQIHMNQV 212
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+GGC AIVD+GTSL+ G V E+ AIG ++ E
Sbjct: 213 AVGSSLT-LCKGGCEAIVDTGTSLIVGQPEEVRELGKAIGAVPLIQGE 259
>gi|357627475|gb|EHJ77155.1| cathepsin D [Danaus plexippus]
Length = 358
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 55/116 (47%), Positives = 75/116 (64%), Gaps = 7/116 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLV+ VFSF+LNRD A +GGE+V GG DP H++G TYVP++K YWQF++ +
Sbjct: 168 MVKQGLVAP-VFSFYLNRDASAAQGGELVLGGSDPAHYRGPLTYVPLSKDTYWQFQMDGV 226
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
L+ S C+ GC AI D+GTSL+ GP V +N IG G + +C L+
Sbjct: 227 LVNGSS--FCKRGCQAIADTGTSLIGGPVEEVAALNAKIGATPMAFGQFALDCSLI 280
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 62/99 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N + P G+ +DC IP +P V+FTI ++ F L Y+L+ + VC+SGF
Sbjct: 259 LNAKIGATPMAFGQFALDCSLIPRLPPVTFTIANQKFTLEGTDYVLRVSQFGKTVCLSGF 318
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+PPP GPLWILGDVF+G Y+T FD RIGFA A
Sbjct: 319 MGLDIPPPAGPLWILGDVFIGRYYTEFDVANRRIGFAPA 357
>gi|56417363|gb|AAV90625.1| cathepsin D protein [Sus scrofa]
Length = 395
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 67/94 (71%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++P++P+V+ T+G K + LS E Y LK + +C+SGFM D+P
Sbjct: 300 AVPLIQGEYMIPCEKVPSLPDVTVTLGGKKYKLSSENYTLKVSQAGQTICLSGFMGMDIP 359
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+G AEAA
Sbjct: 360 PPGGPLWILGDVFIGRYYTVFDRDLNRVGLAEAA 393
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG+D K++KG Y VT+K YWQ + +
Sbjct: 201 LMQQKLVDKNIFSFYLNRDPGAQPGGELMLGGIDSKYYKGSLDYHNVTRKAYWQIHMDQV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 261 AVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGE 307
>gi|9581805|emb|CAC00543.1| necepsin II [Necator americanus]
Length = 446
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+EQ V VF+FWLNR+P++E GGEI FGGVD + + T+ PVT++GYWQF++ D+
Sbjct: 223 FIEQKKVPSPVFAFWLNRNPESEIGGEITFGGVDTRRYVEPITWTPVTRRGYWQFKM-DM 281
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ G S+ C GC AI D+GTSL+AGP V I IG E ++ E
Sbjct: 282 VQGGSSSIACPNGCQAIADTGTSLIAGPKAQVEAIQKYIGAEPLMKGE 329
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + + P GE +I CD++P++P+VSF I K F L E Y+L +C+SGF
Sbjct: 316 IQKYIGAEPLMKGEYMIPCDKVPSLPDVSFIIDGKTFTLKGEDYVLTVKAAGKSICLSGF 375
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D P G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 376 MGMDFPEKIGELWILGDVFIGKYYTVFDVGQARVGFAQA 414
>gi|432850603|ref|XP_004066829.1| PREDICTED: cathepsin D-like isoform 3 [Oryzias latipes]
Length = 416
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I+CD+IP++P ++F IG + + L+ +QY+LK + +C+SGFM D+P
Sbjct: 323 AIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLTGDQYVLKESQAGKTICLSGFMGLDIP 382
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 383 APAGPLWILGDVFIGQYYTVFDRDSNRVGFAKS 415
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q V VFSF+LNR+PD E GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 224 IMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPKYYSGDFHYVNISRQAYWQIHMDGM 283
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C+GGC AIVD+GTSLL GP+ V + AIG ++ E
Sbjct: 284 AVGSQLS-LCKGGCEAIVDTGTSLLTGPSAEVKALQKAIGAIPLIQGE 330
>gi|74192771|dbj|BAE34900.1| unnamed protein product [Mus musculus]
Length = 410
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G++ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGSELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|313226363|emb|CBY21507.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 64/100 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN + GE I+ C + TMP++ FT+ K++ L+PE Y++K E CISGF
Sbjct: 297 INHAIGAFQFANGEWIVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKIAAEGQEQCISGF 356
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D+PPP GPLWILGDVFMG Y+T FD R+GFA+ A
Sbjct: 357 MGMDIPPPAGPLWILGDVFMGKYYTAFDFDNNRVGFADLA 396
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/146 (40%), Positives = 84/146 (57%), Gaps = 10/146 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+S+ +FSF+LNR +AE+GGE+ GGVD F G ++ VT++ YWQ ++ +
Sbjct: 200 MIEQGLLSDGMFSFFLNRTANAEDGGELTIGGVDNSRFTGDFSWNEVTRQAYWQIKMDNF 259
Query: 61 LIGNQSTGVC---EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 117
+ + C E GC IVDSGTSLLA P + EINHAIG + E +V ++
Sbjct: 260 EVQGKGVSACGGNENGCQVIVDSGTSLLAVPKNLAEEINHAIGAFQFANGEW-IVPCRHM 318
Query: 118 DLIWDL------LVSGLLPEKVCQQI 137
D + D+ V L PE +I
Sbjct: 319 DTMPDIDFTLNGKVYTLTPEDYVMKI 344
>gi|432850599|ref|XP_004066827.1| PREDICTED: cathepsin D-like isoform 1 [Oryzias latipes]
Length = 396
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I+CD+IP++P ++F IG + + L+ +QY+LK + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLTGDQYVLKESQAGKTICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRDSNRVGFAKS 395
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q V VFSF+LNR+PD E GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 204 IMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPKYYSGDFHYVNISRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C+GGC AIVD+GTSLL GP+ V + AIG ++ E
Sbjct: 264 AVGSQLS-LCKGGCEAIVDTGTSLLTGPSAEVKALQKAIGAIPLIQGE 310
>gi|345317110|ref|XP_001519470.2| PREDICTED: cathepsin D-like, partial [Ornithorhynchus anatinus]
Length = 157
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 69/93 (74%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++P++P +SF +G + ++L+ EQY+LK + +C+SGF+ D+P
Sbjct: 64 AVPLYQGEYMIPCNKVPSLPPISFKLGSQTYSLTGEQYVLKVTQAGRTMCLSGFLGMDIP 123
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 124 PPGGPLWILGDVFIGQYYTVFDRDNSRVGFAKA 156
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 31/63 (49%), Positives = 42/63 (66%), Gaps = 1/63 (1%)
Query: 38 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 97
F G YV VT+K YWQ + + +G+Q + +C+GGC AIVD+GTSL+AGP V +
Sbjct: 2 FTGTINYVNVTRKAYWQVHMDGVTVGSQLS-LCKGGCEAIVDTGTSLIAGPVSEVKALQA 60
Query: 98 AIG 100
AIG
Sbjct: 61 AIG 63
>gi|83523775|ref|NP_001032810.1| cathepsin D precursor [Sus scrofa]
gi|65330113|gb|AAY42144.1| cathepsin D [Sus scrofa]
Length = 410
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG+D K++KG Y VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPGAQPGGELMLGGIDSKYYKGSLDYHNVTRKAYWQIHMDQV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 276 AVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGE 322
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 67/94 (71%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++P++P+V+ T+G K + LS E Y LK + +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVPSLPDVTVTLGGKKYKLSSENYTLKVSQAGQTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+G AEAA
Sbjct: 375 PPGGPLWILGDVFIGRYYTVFDRDLNRVGLAEAA 408
>gi|198421979|ref|XP_002130758.1| PREDICTED: similar to Ctsd protein [Ciona intestinalis]
Length = 385
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/109 (47%), Positives = 69/109 (63%), Gaps = 1/109 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQG V + +F+F+LNRDP+A +GGEI GGV+P + G Y VT++GYWQ ++ +
Sbjct: 191 MFEQGAVDQNLFAFYLNRDPEAADGGEITLGGVNPARYVGDFNYHDVTRQGYWQIKMDGL 250
Query: 61 LIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
I + + T C GGC IVDSGTSL+ GP+ IN AIG V E
Sbjct: 251 SIADTAKTTACNGGCQVIVDSGTSLITGPSADTDAINQAIGAIKFVQGE 299
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 62/100 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ ++ GE ++ C RIP MP+++F + + L+P+ Y+++ C+S F
Sbjct: 286 INQAIGAIKFVQGEYLVICRRIPEMPDITFVLDGIEYVLTPQDYVIQMTADGQTQCLSAF 345
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D+P P GPLWILGD FMG ++T FD G ++GFA+ A
Sbjct: 346 MGMDIPEPTGPLWILGDAFMGKFYTSFDFGTNQVGFAKLA 385
>gi|56118817|ref|NP_001008172.1| MGC89016 protein precursor [Xenopus (Silurana) tropicalis]
gi|51950197|gb|AAH82490.1| MGC89016 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/128 (41%), Positives = 79/128 (61%), Gaps = 9/128 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV + +FSF+L+RD A+EGGEI+ GG DP H+ G TY+ VT+K YWQ ++ +
Sbjct: 198 MVNQKLVDQAIFSFYLSRDASAKEGGEIILGGSDPDHYVGNFTYLDVTRKAYWQIKMDSV 257
Query: 61 LIGNQSTGV---------CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
+ ++S + C+GGC AI D+GTSL+ GP+ V ++N IG ++S E +
Sbjct: 258 TVSSESECMNAMMVGGEYCKGGCQAIADTGTSLIVGPSSDVEKLNAEIGALPIISGEYWI 317
Query: 112 VVSQYGDL 119
S+ L
Sbjct: 318 NCSKIASL 325
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 65/97 (67%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N +LP GE I+C +I ++P ++F +G K F+L+ + Y++ + +C+SGF
Sbjct: 301 LNAEIGALPIISGEYWINCSKIASLPTINFVLGGKSFSLTGKDYVVVVTQMGQTICLSGF 360
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
+A D+PPP GPLWILGD+F+G Y+T FD R+GFA
Sbjct: 361 VAMDIPPPAGPLWILGDIFIGKYYTEFDLANNRVGFA 397
>gi|3378673|emb|CAA08878.1| Cathepsin D [Podarcis siculus]
Length = 399
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 75/110 (68%), Gaps = 1/110 (0%)
Query: 2 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 61
++Q L+ + +FSF+LNRDP GGE++FGGVDP+++ G T+V VT+K YWQ + +
Sbjct: 205 MKQALMEKNLFSFYLNRDPKGATGGELLFGGVDPQYYTGDFTWVNVTRKAYWQIHMEKVD 264
Query: 62 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
+ N T VC+ GC AIVD+GTSL+ GPT + ++ AIG + ++ + L
Sbjct: 265 VDNGLT-VCKDGCEAIVDTGTSLITGPTDEIKQLQKAIGAKPIIKGQYML 313
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 82/140 (58%), Gaps = 3/140 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+EK +V G + CE A+V L T E + + + + P G+ ++ CD
Sbjct: 260 MEKVDVDNGLTVCKDGCE-AIVDTGTSLITGPTDE--IKQLQKAIGAKPIIKGQYMLPCD 316
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
++ ++PNV+ +G K + L+P QY+LK +C+SGF D+PPP GPLWILGDVF+
Sbjct: 317 KLSSLPNVNLVLGGKSYALTPNQYVLKVTVQGETLCLSGFSGLDVPPPAGPLWILGDVFI 376
Query: 276 GVYHTVFDSGKLRIGFAEAA 295
G Y+TVFD +GFA+++
Sbjct: 377 GSYYTVFDRDNDAVGFAKSS 396
>gi|327259983|ref|XP_003214815.1| PREDICTED: cathepsin D-like [Anolis carolinensis]
Length = 399
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 75/110 (68%), Gaps = 1/110 (0%)
Query: 2 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 61
++QGL+ + +FSF+LNRDP + GGEI+FGGVDPK++ G +V VT+K YWQ + +
Sbjct: 205 MKQGLLDKNMFSFFLNRDPSSSPGGEIIFGGVDPKYYSGDFNWVNVTRKAYWQVHMDRVE 264
Query: 62 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
+ + T VC+ GC AIVD+GTSL+ GPT V + AIG + ++ + L
Sbjct: 265 VPSGLT-VCKNGCEAIVDTGTSLITGPTDEVKALQKAIGAKPIIKGQYIL 313
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 80/138 (57%), Gaps = 3/138 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+++ V +G + + CE A+V L T E + + + + P G+ I+ C+
Sbjct: 260 MDRVEVPSGLTVCKNGCE-AIVDTGTSLITGPTDE--VKALQKAIGAKPIIKGQYILPCE 316
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
++ T+P VSF +G + ++LS E Y+LK +C+SGF D+PPP GPLWILGDVF+
Sbjct: 317 KLATLPIVSFVLGGRSYSLSAENYVLKVTVQGETLCLSGFSGLDVPPPGGPLWILGDVFI 376
Query: 276 GVYHTVFDSGKLRIGFAE 293
G Y+T FD +GFA+
Sbjct: 377 GPYYTAFDRDNDAVGFAK 394
>gi|157644743|gb|ABV59077.1| cathepsin D [Lates calcarifer]
gi|396084116|gb|AFN84539.1| cathepsin D [Lates calcarifer]
Length = 396
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 69/93 (74%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +++CD++P++P ++F +G + ++L+ EQYILK + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYMVNCDKVPSLPVITFNVGGQSYSLTGEQYILKESQAGKTICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 395
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD GGE++ GG DPK++ G YV +T++ YWQ + ++
Sbjct: 204 IMSQKKVEQNVFSFYLNRNPDTAPGGELLLGGTDPKYYTGDFNYVNITRQAYWQIHMDEL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++G Q + +C+GGC AIVD+GTSL+ GP+ V + AIG ++ E
Sbjct: 264 VVGTQLS-LCKGGCEAIVDTGTSLITGPSAEVKALQKAIGAIPLIQGE 310
>gi|74151850|dbj|BAE29712.1| unnamed protein product [Mus musculus]
gi|74151877|dbj|BAE29725.1| unnamed protein product [Mus musculus]
Length = 410
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ G D K++ G+ +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGDTDSKYYHGELSYLNVTRKAYWQVHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 276 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 322
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 64/93 (68%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSHGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|432850601|ref|XP_004066828.1| PREDICTED: cathepsin D-like isoform 2 [Oryzias latipes]
Length = 398
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I+CD+IP++P ++F IG + + L+ +QY+LK + +C+SGFM D+P
Sbjct: 305 AIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLTGDQYVLKESQAGKTICLSGFMGLDIP 364
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 365 APAGPLWILGDVFIGQYYTVFDRDSNRVGFAKS 397
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q V VFSF+LNR+PD E GGE++ GG DPK++ G YV ++++ YWQ + +
Sbjct: 206 IMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPKYYSGDFHYVNISRQAYWQIHMDGM 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C+GGC AIVD+GTSLL GP+ V + AIG ++ E
Sbjct: 266 AVGSQLS-LCKGGCEAIVDTGTSLLTGPSAEVKALQKAIGAIPLIQGE 312
>gi|410974821|ref|XP_003993838.1| PREDICTED: cathepsin D [Felis catus]
Length = 418
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+A+ GGE++ GG D K++KG +Y+ VT+K YWQ + +
Sbjct: 224 LMKQKLVEKNIFSFYLNRDPNAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMDQV 283
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G T +C+GGC AI+D+GTSL+ GP V E+ AIG ++ E
Sbjct: 284 DVGTSLT-LCKGGCEAILDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 330
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 65/94 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P V+ +G K + LS + Y LK +G +C+SGFM D+P
Sbjct: 323 AVPLIQGEYMIPCEKVSTLPEVTVKLGGKGYKLSSKDYTLKVSQGGRTICLSGFMGMDIP 382
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEA
Sbjct: 383 PPGGPLWILGDVFIGRYYTVFDRDENRVGLAEAT 416
>gi|42476045|ref|NP_599161.2| cathepsin D precursor [Rattus norvegicus]
gi|38303993|gb|AAH62032.1| Cathepsin D [Rattus norvegicus]
gi|149061703|gb|EDM12126.1| cathepsin D, isoform CRA_c [Rattus norvegicus]
Length = 407
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 68/94 (72%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P ++F +G + + L PE+YILK + +C+SGFM D+P
Sbjct: 312 AVPLIQGEYMIPCEKVSSLPIITFKLGGQNYELHPEKYILKVSQAGKTICLSGFMGMDIP 371
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 372 PPSGPLWILGDVFIGCYYTVFDREYNRVGFAKAA 405
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP + GGE++ GG D +++ G+ +Y+ VT+K YWQ + +
Sbjct: 213 LMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYYHGELSYLNVTRKAYWQVHMDQL 272
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G++ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 273 EVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVPLIQGE 319
>gi|115720|sp|P24268.1|CATD_RAT RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D 12
kDa light chain; Contains: RecName: Full=Cathepsin D 9
kDa light chain; Contains: RecName: Full=Cathepsin D 34
kDa heavy chain; Contains: RecName: Full=Cathepsin D 30
kDa heavy chain; Flags: Precursor
gi|55882|emb|CAA38349.1| preprocathepsin D [Rattus norvegicus]
Length = 407
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 68/94 (72%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P ++F +G + + L PE+YILK + +C+SGFM D+P
Sbjct: 312 AVPLIQGEYMIPCEKVSSLPIITFKLGGQNYELHPEKYILKVSQAGKTICLSGFMGMDIP 371
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 372 PPSGPLWILGDVFIGCYYTVFDREYNRVGFAKAA 405
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP + GGE++ GG D +++ G+ +Y+ VT+K YWQ + +
Sbjct: 213 LMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYYHGELSYLNVTRKAYWQVHMDQL 272
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G++ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 273 EVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVPLIQGE 319
>gi|410044668|ref|XP_001154405.3| PREDICTED: cathepsin D [Pan troglodytes]
Length = 322
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C++GFM D+P
Sbjct: 227 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLTGFMGMDIP 286
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 287 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 320
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 59/90 (65%), Gaps = 1/90 (1%)
Query: 19 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 78
DPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L + + + T +C+ GC AIV
Sbjct: 146 DPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIV 204
Query: 79 DSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
D+GTSL+ GP V E+ AIG ++ E
Sbjct: 205 DTGTSLMVGPVDEVRELQKAIGAVPLIQGE 234
>gi|17389633|gb|AAH17842.1| Napsin A aspartic peptidase [Homo sapiens]
gi|123982255|gb|ABM82919.1| napsin A aspartic peptidase [synthetic construct]
gi|123997015|gb|ABM86109.1| napsin A aspartic peptidase [synthetic construct]
Length = 420
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+ L
Sbjct: 266 KVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKL 323
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAKGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 378
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKSSARVGLARA 401
>gi|332241362|ref|XP_003269849.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/119 (44%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + +FSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 207 LVEQGLLDKPIFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+ L
Sbjct: 267 KVGPGLT-LCARGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEYIILCSEIPKL 324
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 75/143 (52%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ P GE II C
Sbjct: 263 MERVKVGPGLTLCARGCA-AILDTGTSLITGPTEE--IRALHAAIGGYPLLAGEYIILCS 319
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ + Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 320 EIPKLPAVSFLLGGVWFNLTAQDYVIQTTLNGVRLCLSGFQALDVPPPAGPFWILGDVFL 379
Query: 276 GVYHTVFDSG----KLRIGFAEA 294
G Y VFD G R+G A A
Sbjct: 380 GTYVAVFDRGDRKSSARVGLARA 402
>gi|109125662|ref|XP_001116026.1| PREDICTED: napsin-A-like [Macaca mulatta]
Length = 421
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 207 LVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+ L
Sbjct: 267 KVGPGLT-LCVRGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEYIILCSEIPKL 324
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 75/143 (52%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ P GE II C
Sbjct: 263 MERVKVGPGLTLCVRGCA-AILDTGTSLITGPTEE--IRALHAAIGGYPLLAGEYIILCS 319
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ + Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 320 EIPKLPAVSFLLGGVWFNLTAQDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 379
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G R+G A A
Sbjct: 380 GTYVAVFDRGDTKSGARVGLARA 402
>gi|355681641|gb|AER96810.1| cathepsin D [Mustela putorius furo]
Length = 410
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG D K++KG +Y+ VT+K YWQ + +
Sbjct: 216 LMQQKLVEKNIFSFYLNRDPGAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEXV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 276 DVGSSLT-LCKGGCEAIVDTGTSLIVGPVDEVRELQKAIGAVPLIQGE 322
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 64/94 (68%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P V+ +G K + L E Y LK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSTLPEVTLKLGGKPYKLLSEDYTLKVSQGGKTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEA
Sbjct: 375 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAT 408
>gi|426389739|ref|XP_004061277.1| PREDICTED: napsin-A-like [Gorilla gorilla gorilla]
Length = 420
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+ L
Sbjct: 266 KVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKL 323
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 378
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKNSARVGLARA 401
>gi|402906424|ref|XP_003916002.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 421
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 207 LVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+ L
Sbjct: 267 KVGPGLT-LCVPGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEYIILCSEIPKL 324
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 73/135 (54%), Gaps = 3/135 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ P GE II C
Sbjct: 263 MERVKVGPGLTLCVPGCA-AILDTGTSLITGPTEE--IRALHAAIGGYPLLAGEYIILCS 319
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G+ FNL+ + Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 320 EIPKLPAVSFLLGEVWFNLTAQDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 379
Query: 276 GVYHTVFDSGKLRIG 290
G Y VFD G + G
Sbjct: 380 GTYVAVFDRGDTKSG 394
>gi|344269496|ref|XP_003406588.1| PREDICTED: LOW QUALITY PROTEIN: napsin-A-like [Loxodonta africana]
Length = 396
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 50/110 (45%), Positives = 73/110 (66%), Gaps = 1/110 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGLV + VFSF+LNRDP+A +GGE+V GG DP H+ ++PVT YWQ + +
Sbjct: 206 LVEQGLVDKPVFSFYLNRDPEAPDGGELVLGGSDPAHYIPPLNFMPVTIPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 110
+G +C GCAAI+D+GTSL+ GP + +N AIGG +++ + +
Sbjct: 266 KVGT-GLNLCAQGCAAILDTGTSLITGPAEEIQALNSAIGGVALLTGQVR 314
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 286
+C+SGF A D+PPP GP WI GDVFMG + VFD G+
Sbjct: 328 RLCLSGFQALDVPPPMGPFWIXGDVFMGSHVAVFDRGE 365
>gi|4758754|ref|NP_004842.1| napsin-A preproprotein [Homo sapiens]
gi|6225749|sp|O96009.1|NAPSA_HUMAN RecName: Full=Napsin-A; AltName: Full=Aspartyl protease 4;
Short=ASP4; Short=Asp 4; AltName: Full=Napsin-1;
AltName: Full=TA01/TA02; Flags: Precursor
gi|4154287|gb|AAD04917.1| napsin A [Homo sapiens]
gi|4235425|gb|AAD13215.1| napsin 1 precursor [Homo sapiens]
gi|6561818|gb|AAF17081.1| aspartyl protease 4 [Homo sapiens]
gi|119592253|gb|EAW71847.1| napsin A aspartic peptidase, isoform CRA_a [Homo sapiens]
Length = 420
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+ L
Sbjct: 266 KVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKL 323
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAKGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 378
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKSSARVGLARA 401
>gi|27803878|gb|AAO22152.1| cathepsin D-like aspartic protease [Ancylostoma ceylanicum]
Length = 446
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 70/108 (64%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+EQ V VF+FWLNR+PD+E GGEI GG+DP+ + T+ PVT++GYWQF++ D
Sbjct: 223 FIEQKKVPSPVFAFWLNRNPDSELGGEITLGGMDPRRYVEPITWTPVTRRGYWQFKM-DK 281
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ G ++ C GC AI D+GTSL+AGP V I IG E ++ E
Sbjct: 282 VQGGSTSIACPNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPLMKGE 329
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + + P GE +I CD++P++P +SF I + F L E Y+L G +C+SGF
Sbjct: 316 IQKFIGAEPLMKGEYMIPCDKVPSLPELSFVIEGRTFILKGEDYVLTVKAGGKSICLSGF 375
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D P G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 376 MGMDFPERIGELWILGDVFIGKYYTVFDIGQARLGFAQA 414
>gi|167524529|ref|XP_001746600.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774870|gb|EDQ88496.1| predicted protein [Monosiga brevicollis MX1]
Length = 381
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 70/105 (66%), Gaps = 2/105 (1%)
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEV 250
V+ IN+ + GE+I+DC++IP MPNV+ I ++LS EQY+L+ T EG E
Sbjct: 278 VVKQINKAIGATTIAAGEAIVDCNKIPHMPNVTIVINGIQYSLSAEQYVLQVTAEGETE- 336
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
CISGF D+P P GPLWILGDVF+G Y TVFD G R+GF +A
Sbjct: 337 CISGFAGIDVPAPEGPLWILGDVFIGAYTTVFDMGNNRVGFGASA 381
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 67/109 (61%), Gaps = 3/109 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q +V + ++SF+LNR+P+ GGE+ GG D HF G + VT GYWQF + +
Sbjct: 191 MVQQQVVEQNMYSFYLNRNPNGVSGGELTLGGYDESHFAGPIHWTDVTVDGYWQFTMTGL 250
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 109
I N T C C AI D+GTSLLAGPT VV +IN AIG + + E
Sbjct: 251 SIEN--TPYCT-NCKAIADTGTSLLAGPTDVVKQINKAIGATTIAAGEA 296
>gi|397485038|ref|XP_003813670.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLEKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+ L
Sbjct: 266 KVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKL 323
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 378
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKSSARVGLARA 401
>gi|114678580|ref|XP_524345.2| PREDICTED: napsin-A isoform 4 [Pan troglodytes]
Length = 420
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+ L
Sbjct: 266 KVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKL 323
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPTGPFWILGDVFL 378
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKSSARVGLARA 401
>gi|119592254|gb|EAW71848.1| napsin A aspartic peptidase, isoform CRA_b [Homo sapiens]
Length = 357
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 78/115 (67%), Gaps = 1/115 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 143 LVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 202
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E ++ S+
Sbjct: 203 KVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSE 256
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 199 MERVKVGPGLTLCAKGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 255
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 256 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 315
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G + R+G A A
Sbjct: 316 GTYVAVFDRGDMKSSARVGLARA 338
>gi|395858453|ref|XP_003801583.1| PREDICTED: napsin-A [Otolemur garnettii]
Length = 419
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 70/101 (69%), Gaps = 1/101 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDPD +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 205 LVEQGLLDKPVFSFYLNRDPDVADGGELVLGGSDPAHYIPPLTFVPVTIPAYWQIHMERV 264
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG
Sbjct: 265 KVGTGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGG 304
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 78/143 (54%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + ++ +P P GE +I+C
Sbjct: 261 MERVKVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLPPGEHLIECS 317
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
IP +P VSF +G FNL+ + Y+++ G +C+SGF D+PPP GPLWILGDVF+
Sbjct: 318 EIPRLPPVSFLLGGVWFNLTGKDYVVQITWGGVHLCLSGFQPLDMPPPAGPLWILGDVFL 377
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G R+G A A
Sbjct: 378 GAYVAVFDRGDTNTGARVGLARA 400
>gi|395544889|ref|XP_003774338.1| PREDICTED: uncharacterized protein LOC100917285 [Sarcophilus
harrisii]
Length = 370
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/87 (55%), Positives = 63/87 (72%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +I C+++ ++P VSF +G K F L+ EQY+LK +G VC+SGFM D+PPP GPL
Sbjct: 283 GEYMIPCEKVTSLPPVSFVLGGKNFKLTGEQYVLKVSQGGQTVCLSGFMGMDIPPPGGPL 342
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+T FD R+GFA A
Sbjct: 343 WILGDVFIGRYYTTFDRDNNRVGFATA 369
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 64/103 (62%), Gaps = 5/103 (4%)
Query: 16 LNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCA 75
L DP+A+ GGE+ FGG D K++KG TY+ VT+K YWQ + D++ + +C+ GC
Sbjct: 193 LRPDPNAQPGGELTFGGTDSKYYKGDFTYLNVTRKAYWQIHM-DMVKVESALTLCKDGCE 251
Query: 76 AIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
AIVD+GTSL+ GP + +N A+G +V E C+ V S
Sbjct: 252 AIVDTGTSLITGPVQEIKALNKALGAVALVGGEYMIPCEKVTS 294
>gi|311324976|gb|ADP89523.1| cathepsin D [Miichthys miiuy]
Length = 396
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++ CD+IP++P ++F +G + ++L+ EQYILK + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYMVSCDKIPSLPVITFNVGGQSYSLTGEQYILKETQAGKTICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRESNRVGFAKS 395
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD + GGE++ GG DPK++ G YV +T++ YWQ + +
Sbjct: 204 IMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYSGDFHYVNITRQAYWQIHVDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+ GC AIVD+GTSL+ GP+ V + AIG ++ E
Sbjct: 264 AVGSQLT-LCKSGCEAIVDTGTSLITGPSAEVRSLQKAIGAIPLIQGE 310
>gi|74220823|dbj|BAE31380.1| unnamed protein product [Mus musculus]
Length = 404
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 309 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 368
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 369 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 401
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 7/108 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G+ +Y+ VT+K YW +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYW------L 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 270 EVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGE 316
>gi|345322126|ref|XP_001511988.2| PREDICTED: cathepsin E-like [Ornithorhynchus anatinus]
Length = 295
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 66/102 (64%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
+ + E + P+P GE +DC R+ +MP+V+FTIG + + LS E+YI+K VCI
Sbjct: 167 IRRLQEHIGATPSPSGEYAVDCRRVSSMPHVTFTIGHREYTLSAEKYIIKEATKEETVCI 226
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
SGF A D+ P G LWILGDVFM ++ VFD G R+GFA A
Sbjct: 227 SGFQAVDIVTPSGQLWILGDVFMAEFYCVFDRGNDRVGFARA 268
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 4/49 (8%)
Query: 70 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
C GC AIVDSGTSL+ GP + + IG G + +C+ V S
Sbjct: 145 CAHGCEAIVDSGTSLITGPPAQIRRLQEHIGATPSPSGEYAVDCRRVSS 193
>gi|348530268|ref|XP_003452633.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 396
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +++C +IP++P ++F +G + + L+ EQY+L+ + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYLVNCSKIPSLPVITFNVGGQSYTLTGEQYVLQESQAGKTICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 395
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G Y ++++ YWQ + +
Sbjct: 204 MMNQKKVEKNVFSFYLNRNPDTEPGGELLLGGTDPKYYDGDFHYANISRQAYWQVHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C+GGC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 264 TVGSQLS-LCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPLIQGE 310
>gi|126681053|gb|ABO26561.1| cathepsin D-like aspartic protease [Ixodes ricinus]
Length = 382
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 73/106 (68%), Gaps = 5/106 (4%)
Query: 1 MVEQGLVSEE-VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M++QGL+ E+ VFS +LNRDP ++EGGE++FGG+D H+KG TYVPVT KGYWQF +
Sbjct: 187 MMKQGLLGEQNVFSVYLNRDPSSKEGGEVLFGGIDHDHYKGSITYVPVTAKGYWQFHVDG 246
Query: 60 ILIGNQSTG----VCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+ + S +C+ GC AI D+GTSL+ GP V +N +GG
Sbjct: 247 VKSVSASKSAPELLCKDGCEAIADTGTSLITGPPEEVDSLNQYLGG 292
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 62/88 (70%), Gaps = 3/88 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ ++DCD++ ++PNV+FTI K F+L + Y+LK + +C+SGFM+ ++P PL
Sbjct: 298 GQYLLDCDKLESLPNVTFTISGKEFSLRSKDYVLKVNQQGQTLCVSGFMSLEMPQ---PL 354
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WI GDVF+G Y+ +FD + R+GFAE A
Sbjct: 355 WIFGDVFLGPYYPIFDRDQDRVGFAEVA 382
>gi|218847782|ref|NP_001136375.1| cathepsin D-like precursor [Xenopus (Silurana) tropicalis]
gi|159155417|gb|AAI54878.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 399
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV +FSF+LNR+PD + GGE++ GG DPK++ G Y+ VT+K YWQ + +
Sbjct: 207 IMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQL 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+GGC IVD+GTSL+ GP VT + AIG ++ +
Sbjct: 267 GVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLIQGQ 313
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 69/93 (74%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P G+ ++ CD++PT+P +S T+G +++ L+ EQYI+K + + +C+SGFM ++P
Sbjct: 306 AVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLTGEQYIMKVSQRGSTICLSGFMGLNIP 365
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y++VFD +GFA+A
Sbjct: 366 PPAGPLWILGDVFIGQYYSVFDRANDCVGFAKA 398
>gi|66815097|ref|XP_641645.1| cathepsin D [Dictyostelium discoideum AX4]
gi|74960832|sp|O76856.1|CATD_DICDI RecName: Full=Cathepsin D; AltName: Full=Ddp44; Flags: Precursor
gi|3288145|emb|CAA76563.1| preprocathepsin D [Dictyostelium discoideum]
gi|6010025|emb|CAB57223.1| cathepsin D [Dictyostelium discoideum]
gi|60469656|gb|EAL67644.1| cathepsin D [Dictyostelium discoideum AX4]
Length = 383
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/117 (47%), Positives = 73/117 (62%), Gaps = 7/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLVS +FSFWL+R P A GGE+ FG +D + G TYVP+T + YW+F + D
Sbjct: 191 MLSQGLVSSTLFSFWLSRTPGAN-GGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDF 249
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG-----GEGVVSAECKLV 112
I QS G C C AI DSGTSL+AGP +T +N +G GEGV S +C ++
Sbjct: 250 AIDGQSAGFCGTTCHAICDSGTSLIAGPMADITALNEKLGAVILNGEGVFS-DCSVI 305
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 56/88 (63%), Gaps = 3/88 (3%)
Query: 208 GESII-DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 266
GE + DC I T+PNV+ T+ + F L+P++Y+L+ E C+SGFM +L G
Sbjct: 295 GEGVFSDCSVINTLPNVTITVAGREFVLTPKEYVLEVTEFGKTECLSGFMGIEL--NMGN 352
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+TVFD G ++GFA A
Sbjct: 353 FWILGDVFISAYYTVFDFGNKQVGFATA 380
>gi|307170557|gb|EFN62764.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 349
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 68/100 (68%), Gaps = 2/100 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LVS +FSF+LNRD A+ GGE++ GG DP +++G TY+PVT KGYWQ + I
Sbjct: 167 IINQSLVSSRIFSFYLNRDTSADLGGELILGGSDPAYYEGDFTYIPVTHKGYWQISIDRI 226
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I +S +CE C AIVD+G+SL+ GP + +IN IG
Sbjct: 227 KI--KSDNLCEESCQAIVDTGSSLILGPKLDIAKINTFIG 264
>gi|116284100|gb|AAI23963.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 396
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV +FSF+LNR+PD + GGE++ GG DPK++ G Y+ VT+K YWQ + +
Sbjct: 204 IMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+GGC IVD+GTSL+ GP VT + AIG ++ +
Sbjct: 264 GVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLIQGQ 310
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 69/93 (74%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P G+ ++ CD++PT+P +S T+G +++ L+ EQYI+K + + +C+SGFM ++P
Sbjct: 303 AVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLTGEQYIMKVSQRGSTICLSGFMGLNIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y++VFD +GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYSVFDRANDCVGFAKA 395
>gi|395851770|ref|XP_003798425.1| PREDICTED: cathepsin D [Otolemur garnettii]
Length = 405
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LV + +FSF+LNRDP+A+ GGE++ GGVD K++ G +Y+ VT+K YW+ + +
Sbjct: 211 LMEQKLVEKNIFSFYLNRDPNAQPGGELMLGGVDSKYYTGSLSYLNVTRKAYWEVHMEQV 270
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 271 EVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAIPLIQGE 317
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P+V+ + K + LS E Y LK +G +C+SGFM D+P
Sbjct: 310 AIPLIQGEYMIPCEKVSSLPSVTLKLAGKDYTLSGEDYTLKVSQGGKTICLSGFMGMDIP 369
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
P GPLWI+GDVF+G ++TVFD K R+GFA+AA
Sbjct: 370 KPVGPLWIIGDVFIGCFYTVFDREKDRVGFAKAA 403
>gi|313245117|emb|CBY42537.1| unnamed protein product [Oikopleura dioica]
Length = 141
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 64/100 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN + GE I+ C + TMP++ FT+ K++ L+PE Y++K E CISGF
Sbjct: 42 INHAIGAFQFANGEWIVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKIAAEGQEQCISGF 101
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D+PPP GPLWILGDVFMG Y+T FD R+GFA+ A
Sbjct: 102 MGMDIPPPAGPLWILGDVFMGKYYTAFDFDNNRVGFADLA 141
Score = 43.9 bits (102), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 7/81 (8%)
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWD 122
G + G E GC IVDSGTSLLA P + EINHAIG + E +V ++ D + D
Sbjct: 10 GVSACGGNENGCQVIVDSGTSLLAVPKNLAEEINHAIGAFQFANGEW-IVPCRHMDTMPD 68
Query: 123 L------LVSGLLPEKVCQQI 137
+ V L PE +I
Sbjct: 69 IDFTLNGKVYTLTPEDYVMKI 89
>gi|66911216|gb|AAH96630.1| LOC613063 protein, partial [Xenopus (Silurana) tropicalis]
Length = 395
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV +FSF+LNR+PD + GGE++ GG DPK++ G Y+ VT+K YWQ + +
Sbjct: 203 IMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQL 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C+GGC IVD+GTSL+ GP VT + AIG ++ +
Sbjct: 263 GVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLIQGQ 309
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 69/93 (74%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P G+ ++ CD++PT+P +S T+G +++ L+ EQYI+K + + +C+SGFM ++P
Sbjct: 302 AVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLTGEQYIMKVSQRGSTICLSGFMGLNIP 361
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y++VFD +GFA+A
Sbjct: 362 PPAGPLWILGDVFIGQYYSVFDRANDCVGFAKA 394
>gi|301769501|ref|XP_002920177.1| PREDICTED: cathepsin D-like [Ailuropoda melanoleuca]
Length = 371
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LV + +FSF+LNR+P A+ GGE++ GG D K++KG +Y+ VT+K YWQ + +
Sbjct: 216 LMEQKLVEKNIFSFYLNRNPGAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEQV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+GGC AI+D+GTSL+ GP V E+ AIG ++ E
Sbjct: 276 DVGSSLT-LCKGGCEAILDTGTSLIVGPVDEVRELQKAIGAVPLIQGE 322
Score = 37.7 bits (86), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK 242
++P GE +I C+++ T+P V+ +G + + LS E Y LK
Sbjct: 315 AVPLIQGEYMIPCEKVSTLPEVTLKLGGRAYTLSSEDYTLK 355
>gi|281344446|gb|EFB20030.1| hypothetical protein PANDA_008874 [Ailuropoda melanoleuca]
Length = 345
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 75/108 (69%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LV + +FSF+LNR+P A+ GGE++ GG D K++KG +Y+ VT+K YWQ + +
Sbjct: 201 LMEQKLVEKNIFSFYLNRNPGAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEQV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+GGC AI+D+GTSL+ GP V E+ AIG ++ E
Sbjct: 261 DVGSSLT-LCKGGCEAILDTGTSLIVGPVDEVRELQKAIGAVPLIQGE 307
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEG 246
++P GE +I C+++ T+P V+ +G + + LS E Y LK G
Sbjct: 300 AVPLIQGEYMIPCEKVSTLPEVTLKLGGRAYTLSSEDYTLKVSGG 344
>gi|13637914|sp|P80209.2|CATD_BOVIN RecName: Full=Cathepsin D; Flags: Precursor
Length = 390
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGFM D+P
Sbjct: 295 AVPLIQGEYMIPCEKVSSLPEVTVKLGGKDYALSPEDYALKVSQAETTVCLSGFMGMDIP 354
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 355 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 388
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+LNRDP A+ GGE++ GG D K+++G + VT++ YWQ + +
Sbjct: 196 LMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQL 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T VC+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 256 DVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGE 302
>gi|241687194|ref|XP_002412838.1| aspartyl protease, putative [Ixodes scapularis]
gi|215506640|gb|EEC16134.1| aspartyl protease, putative [Ixodes scapularis]
Length = 320
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/102 (52%), Positives = 72/102 (70%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEE-VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M++QGL+ E+ VFS +LNRDP ++EGGEI+FGG+D H+KG TYVPVT KGYWQF + D
Sbjct: 130 MMKQGLLGEQNVFSVYLNRDPSSKEGGEILFGGIDHDHYKGSITYVPVTAKGYWQFHV-D 188
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+ +C+ GC AI D+GTSL+ GP V +N +GG
Sbjct: 189 GASKSVPELLCKDGCEAIADTGTSLITGPPEEVDSLNQYLGG 230
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 65/100 (65%), Gaps = 3/100 (3%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ G+ ++DCD++ ++PNV+FTI K F+L + Y+LK + +C+SGF
Sbjct: 224 LNQYLGGTKTEGGQYLLDCDKLESLPNVTFTISGKEFSLRSKDYVLKINQQGQTLCVSGF 283
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M +P PLWILGDVF+G Y+T+FD + R+GFAE A
Sbjct: 284 MGLGMPQ---PLWILGDVFLGPYYTIFDRDQDRVGFAEVA 320
>gi|299522|gb|AAB26186.1| cathepsin D {EC 3.4.23.5} [cattle, Peptide Partial, 346 aa]
Length = 346
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGFM D+P
Sbjct: 251 AVPLIQGEYMIPCEKVSSLPEVTVKLGGKDYALSPEDYALKVSQAETTVCLSGFMGMDIP 310
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 311 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 344
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+LNRDP A+ GGE++ GG D K+++G + VT++ YWQ + +
Sbjct: 152 LMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQL 211
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T VC+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 212 DVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGE 258
>gi|397485042|ref|XP_003813672.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 266 KVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKL 323
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 59/96 (61%), Gaps = 4/96 (4%)
Query: 203 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 262
+P GE II C IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+
Sbjct: 306 IPLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIAS 365
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
P P+WILGDVF+G Y TVFD G + R+G A A
Sbjct: 366 PPVPVWILGDVFLGAYVTVFDRGDMKSGARVGLARA 401
>gi|114678578|ref|XP_530061.2| PREDICTED: napsin-A-like [Pan troglodytes]
Length = 420
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 79/119 (66%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 266 KVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKL 323
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 58/96 (60%), Gaps = 4/96 (4%)
Query: 203 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 262
+P GE II C IP +P VS IG F L+ + Y+++ +G +C+SGF A D+
Sbjct: 306 IPLLAGEYIIRCSEIPKLPAVSLLIGGVWFTLTAQDYVIQFAQGDVRLCLSGFRALDIAS 365
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
P P+WILGDVF+G Y TVFD G + R+G A A
Sbjct: 366 PPVPVWILGDVFLGAYVTVFDRGDMKSGARVGLARA 401
>gi|144926037|gb|ABP04018.1| cathepsin D [Gadus morhua]
Length = 87
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 64/84 (76%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
++ CD+IPT+P ++FT+G + ++L+ EQY+LK +G +C+SGFM D+P P GPLWIL
Sbjct: 1 MVSCDKIPTLPIITFTVGGQSYSLTGEQYVLKESQGGKTICLSGFMGLDIPAPAGPLWIL 60
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEA 294
GDVF+G Y+T FD R+GFA+A
Sbjct: 61 GDVFIGQYYTAFDRDNNRVGFAKA 84
>gi|262073106|ref|NP_001159993.1| cathepsin D precursor [Bos taurus]
gi|296471411|tpg|DAA13526.1| TPA: cathepsin D [Bos taurus]
Length = 410
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPQVTVKLGGKDYALSPEDYALKVSQAGTTVCLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 375 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 408
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+LNRDP A+ GGE++ GG D K+++G + VT++ YWQ + +
Sbjct: 216 LMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQL 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T VC+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 276 DVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGE 322
>gi|440899428|gb|ELR50729.1| Cathepsin D, partial [Bos grunniens mutus]
Length = 394
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGFM D+P
Sbjct: 299 AVPLIQGEYMIPCEKVSSLPQVTVKLGGKDYALSPEDYALKVSQAGTTVCLSGFMGMDIP 358
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 359 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 392
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+LNRDP A+ GGE++ GG D K+++G + VT++ YWQ + +
Sbjct: 200 LMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQL 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T VC+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 260 DVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGE 306
>gi|397490270|ref|XP_003816129.1| PREDICTED: cathepsin D [Pan paniscus]
Length = 603
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 74/108 (68%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +Y+ VT+K YWQ L +
Sbjct: 218 LMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 278 EVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 324
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 59/87 (67%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLR 288
PP GPLWILGDVF+G Y+TVFD R
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNR 403
>gi|301764903|ref|XP_002917936.1| PREDICTED: napsin-A-like [Ailuropoda melanoleuca]
Length = 406
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 52/115 (45%), Positives = 77/115 (66%), Gaps = 1/115 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 204 LVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYVPPLTFLPVTIPAYWQIHMERV 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG ++ E + S+
Sbjct: 264 NVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALHAAIGGVSLLVGEYLIQCSK 317
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 80/143 (55%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ NV G + C A++ L T+E + ++ + +GE +I C
Sbjct: 260 MERVNVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IQALHAAIGGVSLLVGEYLIQCS 316
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+IPT+P +SF +G FNL+ + Y+++ G +C+SGF A D+PPP GPLWILGDVF+
Sbjct: 317 KIPTLPPISFFLGGVWFNLTAQDYVIQIARGGVRLCLSGFQALDMPPPAGPLWILGDVFL 376
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
Y +FD G L R+G A A
Sbjct: 377 RTYVAIFDRGNLRGGARVGLARA 399
>gi|122114359|gb|AAY42145.2| cathepsin D [Sus scrofa]
Length = 410
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ G E++ GG+D K++KG Y VT+K YWQ + +
Sbjct: 216 LMQQKLVDKNIFSFYLNRDPGAQPGSELMLGGIDSKYYKGSLDYHNVTRKAYWQIHMDQV 275
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 276 AVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGE 322
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 66/94 (70%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++P++P+V+ T+G K + LS E Y LK + +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVPSLPDVTVTLGGKKYKLSSENYTLKVSQAGQTICLSGFMGMDIP 374
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVF R+G AEAA
Sbjct: 375 PPGGPLWILGDVFIGRYYTVFGRDLNRVGSAEAA 408
>gi|311258028|ref|XP_003127411.1| PREDICTED: napsin-A [Sus scrofa]
Length = 416
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 81/143 (56%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
VE+ +V G + C A++ L T+E + + +P MGE +I C
Sbjct: 258 VERVHVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IQALQAAIGGIPLLMGEYLIQCS 314
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+IPT+P VSF +G FNL+ + Y+++ G A +C+SGF A D+PPP GPLWILGDVF+
Sbjct: 315 KIPTLPPVSFHLGGVWFNLTAQDYVIQITRGGASLCLSGFQALDMPPPTGPLWILGDVFL 374
Query: 276 GVYHTVFDSG----KLRIGFAEA 294
G Y VFD G R+G A A
Sbjct: 375 GSYVAVFDRGDRKSDARVGLARA 397
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 76/115 (66%), Gaps = 1/115 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 202 LVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQVHVERV 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+G T +C GCAAI+D+GTSL+ GPT + + AIGG ++ E + S+
Sbjct: 262 HVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALQAAIGGIPLLMGEYLIQCSK 315
>gi|341884635|gb|EGT40570.1| CBN-ASP-4 protein [Caenorhabditis brenneri]
Length = 447
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 72/108 (66%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQ V +F+FWLNR+PD++ GGEI FGG+D + + TY PVT+KGYWQF++ D
Sbjct: 223 LFEQKKVPANLFAFWLNRNPDSDLGGEITFGGIDSRRYVEPITYAPVTRKGYWQFKM-DK 281
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++G+ G C GC AI D+GTSL+AGP + I + IG E ++ E
Sbjct: 282 VVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPLIKGE 328
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 66/99 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + P GE +I CD++PT+P VSF IG + F+L E Y+LK +G +C+SGF
Sbjct: 315 IQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSLKGEDYVLKVSQGGKTICLSGF 374
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M DLP G LWILGDVF+G Y+TVFD + R+GFA+A
Sbjct: 375 MGIDLPERVGELWILGDVFIGRYYTVFDFDQNRVGFAQA 413
>gi|281348334|gb|EFB23918.1| hypothetical protein PANDA_006240 [Ailuropoda melanoleuca]
Length = 379
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 52/115 (45%), Positives = 77/115 (66%), Gaps = 1/115 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 179 LVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYVPPLTFLPVTIPAYWQIHMERV 238
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+G T +C GCAAI+D+GTSL+ GPT + ++ AIGG ++ E + S+
Sbjct: 239 NVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALHAAIGGVSLLVGEYLIQCSK 292
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 80/143 (55%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ NV G + C A++ L T+E + ++ + +GE +I C
Sbjct: 235 MERVNVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IQALHAAIGGVSLLVGEYLIQCS 291
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+IPT+P +SF +G FNL+ + Y+++ G +C+SGF A D+PPP GPLWILGDVF+
Sbjct: 292 KIPTLPPISFFLGGVWFNLTAQDYVIQIARGGVRLCLSGFQALDMPPPAGPLWILGDVFL 351
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
Y +FD G L R+G A A
Sbjct: 352 RTYVAIFDRGNLRGGARVGLARA 374
>gi|54020914|ref|NP_001005701.1| napsin A aspartic peptidase precursor [Xenopus (Silurana)
tropicalis]
gi|49522956|gb|AAH75272.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 402
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 75/118 (63%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ L+ E VFSF+L+RDP A GGE+V GG DP ++ G Y+ VT+ YWQ + ++
Sbjct: 195 MMEQKLLEENVFSFYLSRDPMAMVGGELVLGGTDPNYYTGDFHYLNVTRMAYWQIKADEV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
+ NQ +C+GGC AIVD+GTSL+ GP + ++ AIG + S E CK + S
Sbjct: 255 RVANQLV-LCKGGCQAIVDTGTSLITGPREEIRALHKAIGAFPLFSGEYFVNCKRIQS 311
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 66/99 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+++ + P GE ++C RI ++P VSF +G +NL+ EQY+LK + +C+SGF
Sbjct: 288 LHKAIGAFPLFSGEYFVNCKRIQSLPTVSFILGGVAYNLTGEQYVLKISKFGHTLCLSGF 347
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+ PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 348 MGLDIRPPAGPLWILGDVFIGQYYTVFDRDNDRVGFATA 386
>gi|321461133|gb|EFX72168.1| hypothetical protein DAPPUDRAFT_227643 [Daphnia pulex]
Length = 394
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 74/113 (65%), Gaps = 8/113 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ QGLV E VFSFWLNR+ + E GGEI+FGG +P H++G+ +YVPV++K YWQF +
Sbjct: 195 MMAQGLVDEPVFSFWLNRNLNNPENGGEILFGGTNPTHYEGEISYVPVSRKAYWQFSVDG 254
Query: 60 I-LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVV 105
+ L G C GGC I D+GTSL+ GP+ +T + IG GEG+V
Sbjct: 255 VNLAGYDEYPFCNGGCEMISDTGTSLITGPSEEITLFHKLIGAQVNIVGEGIV 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 67/102 (65%), Gaps = 2/102 (1%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE--V 250
++ ++L + N +GE I+DC+ IP +P ++FTIG K F L YI+ + +
Sbjct: 288 ITLFHKLIGAQVNIVGEGIVDCNEIPNLPAMTFTIGGKPFVLEGVDYIIPFVDTTTNDTL 347
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
C+SGFM D+P P GPLWILGDVF+G +++V+D G+ RIG A
Sbjct: 348 CLSGFMGLDIPEPAGPLWILGDVFIGKFYSVYDFGQDRIGLA 389
>gi|330800100|ref|XP_003288077.1| preprocathepsin D [Dictyostelium purpureum]
gi|325081901|gb|EGC35401.1| preprocathepsin D [Dictyostelium purpureum]
Length = 386
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 70/110 (63%), Gaps = 3/110 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV + +FSFWL++ P GGE+ FG +D + G TYVP+T YW+F++ D
Sbjct: 191 MIDQGLVGQNLFSFWLSKTP-GSNGGELSFGSIDSSKYTGPITYVPLTNTTYWEFKMDDF 249
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 109
IG QS G C GC AI DSGTSL+AGP +T +N +G V+S E
Sbjct: 250 AIGGQSAGFCGSQGCPAIADSGTSLIAGPIDFITALNQKLGAV-VISGEA 298
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/88 (52%), Positives = 61/88 (69%), Gaps = 1/88 (1%)
Query: 208 GESII-DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 266
GE+I DC I T+PNV+ T+ + FNL+P+ Y+L+ E C+SGFM +LPP GP
Sbjct: 296 GEAIFPDCSVINTLPNVTVTLAGRQFNLTPKDYVLQITEFGKTECLSGFMGIELPPQVGP 355
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ Y+TVFD G ++GFA A
Sbjct: 356 LWILGDVFISTYYTVFDFGNSQVGFATA 383
>gi|148686196|gb|EDL18143.1| cathepsin D, isoform CRA_b [Mus musculus]
Length = 156
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 61 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 120
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 121 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 153
>gi|432099182|gb|ELK28547.1| Cathepsin D [Myotis davidii]
Length = 351
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP A+ GGE++ GG D K++KG Y+ VT+K YWQ + +
Sbjct: 174 LMQQKLVEKNIFSFYLNRDPSAQPGGELMLGGTDSKYYKGPIAYLNVTRKAYWQVHMDQV 233
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN T +C+ GC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 234 DVGNGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGE 280
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 56/94 (59%), Gaps = 17/94 (18%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P S + +G +C+SGFM D+P
Sbjct: 273 AVPLIQGEYMIPCEKVSSLPEPS-----------------QVSQGGKTICLSGFMGMDIP 315
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 316 PPAGPLWILGDVFIGRYYTVFDREENRVGLAEAA 349
>gi|196123668|gb|ACG70181.1| cathepsin D-like protein [Homarus americanus]
Length = 386
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 57/111 (51%), Positives = 72/111 (64%), Gaps = 4/111 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 57
MV QG V + +FSF+LN D D E GGE+V GG DP H++G+ YVPV+K GYWQ
Sbjct: 191 MVAQGAVDQPIFSFYLNHDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVSKVGYWQVTA 250
Query: 58 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
I +G+ TG C C AIVD+GTSL+AGP V EI H +GG G ++ E
Sbjct: 251 EAIKVGDNVTGFCN-PCEAIVDTGTSLIAGPNAEVKEIVHMLGGYGFIAGE 300
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 69/146 (47%), Gaps = 15/146 (10%)
Query: 156 VEKENVSAGD--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 213
V E + GD + C+ CE A+V L E + I + GE +I
Sbjct: 248 VTAEAIKVGDNVTGFCNPCE-AIVDTGTSLIAGPNAE--VKEIVHMLGGYGFIAGEYLIS 304
Query: 214 CDRIPTMPNVSFTIGDKIFNLSPEQYILK-----TGEGIAEVCISGFMAFDLPPPRGPLW 268
C ++P MP +FT+ K F++ +++ TG ++CI G M + W
Sbjct: 305 CHKVPEMPEFTFTLNGKDFSIDGPDLVIEDIDPSTG---VKICIVGIMGLQMGELEA--W 359
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
ILGD F+ ++T FD G+ RIGFA++
Sbjct: 360 ILGDPFIADWYTEFDVGQKRIGFAKS 385
>gi|194374823|dbj|BAG62526.1| unnamed protein product [Homo sapiens]
Length = 325
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 75/110 (68%), Gaps = 1/110 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 110
+G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E +
Sbjct: 266 KVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEVR 314
>gi|339237491|ref|XP_003380300.1| lysosomal aspartic protease [Trichinella spiralis]
gi|316976887|gb|EFV60084.1| lysosomal aspartic protease [Trichinella spiralis]
Length = 405
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 66/100 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ E + P MG+ + C+ + ++PNV IG ++F+L PE Y+L+ + +C+SGF
Sbjct: 302 LQEAIGAKPLIMGQYYVSCNEVDSLPNVQMKIGGRMFDLKPEDYVLRVKQMGQSICLSGF 361
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M DLPP G LWILGD+F+G+Y+TVFD G R+GFA A
Sbjct: 362 MGLDLPPQVGKLWILGDIFIGLYYTVFDVGNSRLGFANAT 401
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ L+SE VF+FWLNR+P+ + GG I FG VD K++ G T+VP+ + YW+F + I
Sbjct: 209 MIEQNLISEAVFAFWLNRNPEDDLGGLISFGTVDEKYYIGNITWVPLVNQRYWEFNMETI 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G++ G C GC I D+GTSL+AGP V + AIG + ++ +
Sbjct: 269 KVGDEHVG-CIDGCTTIADTGTSLIAGPKDEVERLQEAIGAKPLIMGQ 315
>gi|348559312|ref|XP_003465460.1| PREDICTED: napsin-A-like [Cavia porcellus]
Length = 523
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ + VFSF+LNRD + GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 203 MVEQGLLDKPVFSFYLNRDFEGTHGGELVLGGSDPAHYIPPLTFVPVTIPAYWQIHMDRV 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++G T +C GCAAIVD+GTSL+ GP+ + ++ AIGG ++ E
Sbjct: 263 MVGTGLT-LCAQGCAAIVDTGTSLITGPSEEIRALHRAIGGLPWLAGE 309
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 74/131 (56%), Gaps = 6/131 (4%)
Query: 168 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 227
+C+ A+V L ++E + ++ LP GE I C +IPT+P +SF +
Sbjct: 270 LCAQGCAAIVDTGTSLITGPSEE--IRALHRAIGGLPWLAGEHFIQCSKIPTLPPISFLL 327
Query: 228 GDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL 287
G FNL+ + Y+++ +G +C+SGF A D+PPP GPLWILGDVF+ Y VFD G
Sbjct: 328 GGVWFNLTAQDYVIQISQGGFRLCLSGFQALDVPPPAGPLWILGDVFLRTYVAVFDRGNT 387
Query: 288 ----RIGFAEA 294
R+G A +
Sbjct: 388 SRGARVGLARS 398
>gi|73947914|ref|XP_533610.2| PREDICTED: napsin-A [Canis lupus familiaris]
Length = 422
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 52/115 (45%), Positives = 76/115 (66%), Gaps = 1/115 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 204 LVDQGLLDKPVFSFYLNRDPEAVDGGELVLGGSDPAHYIPPLTFLPVTVPAYWQIHMERV 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+G +C GCAAI+D+GTSL+ GPT + +N AIGG ++ E + S+
Sbjct: 264 KVGTGLI-LCAQGCAAILDTGTSLITGPTEEIQALNAAIGGFSLLLGEYLIQCSE 317
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 79/145 (54%), Gaps = 7/145 (4%)
Query: 156 VEKENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ E V G + C+ A++ L T+E + +N +GE +I C
Sbjct: 258 IHMERVKVGTGLILCAQGCAAILDTGTSLITGPTEE--IQALNAAIGGFSLLLGEYLIQC 315
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
IPT+P +SF +G FNL+ + Y+++ G +C+SGF A D+PPP GPLWILGDVF
Sbjct: 316 SEIPTLPPISFLLGGVWFNLTAQDYVIQIARGGVRLCLSGFQALDIPPPTGPLWILGDVF 375
Query: 275 MGVYHTVFDSGKL----RIGFAEAA 295
+G + VFD G L R+G A A+
Sbjct: 376 LGAHVAVFDRGNLTGGARVGLARAS 400
>gi|119592252|gb|EAW71846.1| hCG1733572, isoform CRA_b [Homo sapiens]
Length = 512
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 54/129 (41%), Positives = 83/129 (64%), Gaps = 2/129 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 269 LVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 328
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL- 119
+G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 329 KVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKLP 387
Query: 120 IWDLLVSGL 128
LL+ G+
Sbjct: 388 AVSLLIGGV 396
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 59/96 (61%), Gaps = 4/96 (4%)
Query: 203 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 262
+P GE II C IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+
Sbjct: 369 IPLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIAS 428
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
P P+WILGDVF+G Y TVFD G + R+G A A
Sbjct: 429 PPVPVWILGDVFLGAYVTVFDRGDMKSGARVGLARA 464
>gi|224460527|gb|ACN43675.1| cathepsin D [Paralichthys olivaceus]
Length = 396
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V E VFSF+LNR+PD GGE++ GG DPK++ G YV VT++ YWQ +G +
Sbjct: 204 IMSQKKVEENVFSFYLNRNPDMAPGGELLLGGTDPKYYSGDFNYVNVTRQAYWQIHMGGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
G+Q T +C+ GC AIVD+GTSL+ GP+ V + AIG ++ E
Sbjct: 264 GAGSQLT-LCKDGCEAIVDTGTSLITGPSAEVKALQKAIGAVPLIQGE 310
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 68/93 (73%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++ CD+IP++P ++F +G + ++L+ +QY+LK + +C+SGFM D+P
Sbjct: 303 AVPLIQGEYMVSCDKIPSLPVITFNLGGQSYSLTGDQYVLKVSQAGKVICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRENNRVGFAKS 395
>gi|119592251|gb|EAW71845.1| hCG1733572, isoform CRA_a [Homo sapiens]
Length = 449
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 78/119 (65%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 266 KVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKL 323
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 59/96 (61%), Gaps = 4/96 (4%)
Query: 203 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 262
+P GE II C IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+
Sbjct: 306 IPLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIAS 365
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
P P+WILGDVF+G Y TVFD G + R+G A A
Sbjct: 366 PPVPVWILGDVFLGAYVTVFDRGDMKSGARVGLARA 401
>gi|6561816|gb|AAF17080.1| aspartyl protease 3 [Homo sapiens]
Length = 450
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 78/119 (65%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 266 KVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKL 323
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 59/96 (61%), Gaps = 4/96 (4%)
Query: 203 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 262
+P GE II C IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+
Sbjct: 306 IPLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIAS 365
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
P P+WILGDVF+G Y TVFD G + R+G A A
Sbjct: 366 PPVPVWILGDVFLGAYVTVFDRGDMKSGARVGLARA 401
>gi|18203300|sp|Q9MZS8.1|CATD_SHEEP RecName: Full=Cathepsin D; Flags: Precursor
gi|8886526|gb|AAF80494.1|AF164143_1 cathepsin D [Ovis aries]
Length = 365
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV + VFSF+LNRDP A+ G E++ GG D K+++G TY VT++ YWQ + +
Sbjct: 191 LMRQKLVDKNVFSFFLNRDPKAQPGEELMLGGTDSKYYRGSLTYHNVTRQAYWQIHMDQL 250
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T VC+GGC AIVD+GTSL+ GP V E++ AIG ++ E
Sbjct: 251 DVGSSLT-VCKGGCEAIVDTGTSLMVGPVDEVRELHKAIGAVPLIQGE 297
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 55/81 (67%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+++ ++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGF
Sbjct: 284 LHKAIGAVPLIQGEYMIPCEKVSSLPQVTLKLGGKDYTLSPEDYTLKVSQAGTTVCLSGF 343
Query: 256 MAFDLPPPRGPLWILGDVFMG 276
M D+PPP GPLWILGDVF+G
Sbjct: 344 MGMDIPPPGGPLWILGDVFIG 364
>gi|47198554|emb|CAF89152.1| unnamed protein product [Tetraodon nigroviridis]
Length = 301
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 110/242 (45%), Gaps = 73/242 (30%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+P + GGE++ GG DP+++ G +YV VT++ YWQ + ++
Sbjct: 131 IMSQKKVEKNVFSFYLNRNPQTQPGGELLLGGTDPQYYTGDFSYVNVTRQAYWQIHVDEL 190
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+G+Q T +C+ GC AIVD+GTSLL GP+ V + AIG ++ E
Sbjct: 191 SVGSQLT-LCKSGCEAIVDTGTSLLTGPSEEVRSLQKAIGALPLIQGEVS---------- 239
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
C ++G F G S VC + VV+
Sbjct: 240 -------------CGRVGPPVF---------------------GPSVVCGRVKSPVVFGF 265
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
+ CD +P T+P ++F IG K ++L+ +QY+
Sbjct: 266 QYMVS--------------CDKIP--------------TLPVITFNIGGKPYSLTGDQYV 297
Query: 241 LK 242
LK
Sbjct: 298 LK 299
>gi|449666857|ref|XP_002161366.2| PREDICTED: lysosomal aspartic protease-like [Hydra magnipapillata]
Length = 387
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 79/134 (58%), Gaps = 2/134 (1%)
Query: 161 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 220
V G+ C++ A+ L T+E ++ +N++ + P GE IIDC ++P++
Sbjct: 255 VVNGEPMFCASGCNAIADTGTSLIAGPTEE--VNKLNQMIGATPIVGGEYIIDCAKVPSL 312
Query: 221 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 280
P + F IG K F L Y+LK CISGF+A D+PPPRGPLWILGDVF+G Y+T
Sbjct: 313 PALEFWIGGKQFVLKGSDYVLKVSTLGQTECISGFIAIDVPPPRGPLWILGDVFIGPYYT 372
Query: 281 VFDSGKLRIGFAEA 294
VFD R+GFA
Sbjct: 373 VFDLKNNRVGFANT 386
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 70/108 (64%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LV + VFSF+L+R+ + GGE++ GGVD F G TY PVT +GYWQF++ +
Sbjct: 195 MVDQKLVEDAVFSFYLDRNVNDSTGGELLLGGVDSSKFVGDITYTPVTVEGYWQFKMDKV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
++ N C GC AI D+GTSL+AGPT V ++N IG +V E
Sbjct: 255 VV-NGEPMFCASGCNAIADTGTSLIAGPTEEVNKLNQMIGATPIVGGE 301
>gi|158523297|gb|ABW70789.1| cathepsin D [Scophthalmus maximus]
Length = 396
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 69/108 (63%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD GGE++ GG DPK++ G Y+ +T+K YWQ + +
Sbjct: 204 IMSQKKVEQNVFSFYLNRNPDTAPGGELLLGGTDPKYYTGDFNYINITRKAYWQIHMDGL 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q T +C GGC IVD+GTSL+ GP V + AIG ++ E
Sbjct: 264 AVGSQLT-LCNGGCEVIVDTGTSLITGPAAEVKALQKAIGAVPLIQGE 310
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 67/93 (72%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++ CD+IP++P ++F +G + ++L+ +QY+LK +C+SGFM D+P
Sbjct: 303 AVPLIQGEYMVSCDKIPSLPVITFNLGGRGYSLTGDQYVLKESHAGKTICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRDNDRVGFAKS 395
>gi|355751850|gb|EHH55970.1| Cathepsin D [Macaca fascicularis]
Length = 403
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 61/85 (71%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
+I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+PPP GPLWIL
Sbjct: 317 MIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWIL 376
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEAA 295
GDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 GDVFIGRYYTVFDRDNNRVGFAEAA 401
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 43/57 (75%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 57
+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +Y+ VT+K YWQ L
Sbjct: 226 LMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRL 282
>gi|431920733|gb|ELK18506.1| Napsin-A [Pteropus alecto]
Length = 760
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 74/117 (63%), Gaps = 5/117 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V QGL+ + VFSF+L RDP+ +GGE+V GG DP H+ TYVPVT YWQ + +
Sbjct: 192 LVAQGLLDKPVFSFYLTRDPEEADGGELVLGGSDPTHYIPPLTYVPVTVPAYWQIHMERV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVV 113
+G T +C GCAAI+D+GTSL+ GP+ + ++ AIGG ++ E C L+
Sbjct: 252 QVGTGLT-LCAHGCAAILDTGTSLITGPSEEIRALHRAIGGISLLVGEYLIQCSLIT 307
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 52/81 (64%)
Query: 207 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 266
+GE +I C I +P VSF +G FNL+ + Y+++ G VC+SGF + D+PP GP
Sbjct: 296 VGEYLIQCSLITELPPVSFNLGGVWFNLTAQDYVIQIARGGVRVCLSGFRSLDMPPSLGP 355
Query: 267 LWILGDVFMGVYHTVFDSGKL 287
LWILGDVF+ Y VFD G +
Sbjct: 356 LWILGDVFLRSYVPVFDRGNM 376
>gi|431910128|gb|ELK13201.1| Cathepsin D [Pteropus alecto]
Length = 375
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +FSF+LNRDP+A+ GGE++ GG D K++ G +Y+ VT+K YWQ + +
Sbjct: 181 LMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDSKYYTGSLSYLNVTRKAYWQVHMEQV 240
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN T +C+ GC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 241 DVGNSLT-LCKAGCEAIVDTGTSLVVGPVEEVRALQKAIGAVPLIQGE 287
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 82/141 (58%), Gaps = 3/141 (2%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
V E V G+S +C A A+V L +E + + + ++P GE +I C
Sbjct: 235 VHMEQVDVGNSLTLCKAGCEAIVDTGTSLVVGPVEE--VRALQKAIGAVPLIQGEYMIPC 292
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+++ ++P V+ +G K + L E Y LK +G +C+SGFM D+PPP GPLWILGDVF
Sbjct: 293 EKVSSLPEVTLKLGGKGYKLGAEDYTLKVSQGGKTICLSGFMGMDIPPPGGPLWILGDVF 352
Query: 275 MGVYHTVFDSGKLRIGFAEAA 295
+G Y+TVFD + R+G AEA
Sbjct: 353 IGRYYTVFDRDENRVGLAEAT 373
>gi|148232796|ref|NP_001083566.1| napsin A aspartic peptidase precursor [Xenopus laevis]
gi|38197533|gb|AAH61685.1| MGC68767 protein [Xenopus laevis]
Length = 392
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 71/106 (66%), Gaps = 1/106 (0%)
Query: 189 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 248
KE++ + +++ + P GE I+C RI ++P VSF +G +NL+ EQYILK +
Sbjct: 282 KEEIRA-LHKAIGAFPLFAGEYFINCKRIQSLPTVSFILGGVAYNLTGEQYILKISKFGH 340
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+C+SGFM D+ PP GP+WILGDVF+G Y+TVFD R+GFA A
Sbjct: 341 TICLSGFMGLDIRPPAGPIWILGDVFIGQYYTVFDRDHDRVGFATA 386
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 75/118 (63%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L+ E VFSF+L+RDP A GGE++ GG DP ++ G Y+ VT+ YWQ + ++
Sbjct: 195 MMQQNLLEENVFSFYLSRDPMATVGGELILGGTDPNYYTGDFHYLNVTRMAYWQIKADEV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
+ NQ +C+GGC AIVD+GTSL+ GP + ++ AIG + + E CK + S
Sbjct: 255 RVNNQLV-LCKGGCQAIVDTGTSLITGPKEEIRALHKAIGAFPLFAGEYFINCKRIQS 311
>gi|332241360|ref|XP_003269848.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 78/119 (65%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + +FSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 207 LVEQGLLDKPIFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 267 KVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHAAIGGISLLAGEYLIRCSEIPKL 324
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 56/91 (61%), Gaps = 4/91 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +I C IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+ P P+
Sbjct: 312 GEYLIRCSEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIASPPVPV 371
Query: 268 WILGDVFMGVYHTVFDSGKL----RIGFAEA 294
WILGDVF+G Y VFD G + R+G A A
Sbjct: 372 WILGDVFLGAYVAVFDRGDMKSGARVGLARA 402
>gi|18677168|gb|AAL78216.1| aspartic protease precursor Hgg-33 [Heterodera glycines]
Length = 334
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 68/114 (59%), Gaps = 7/114 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q V E VF+FWLNRDP +E GGEI GG D + F TY PVT K YWQF++ I
Sbjct: 61 MVTQQKVPEPVFAFWLNRDPTSEFGGEITIGGTDQRRFVAPITYTPVTHKAYWQFKMDSI 120
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAE 108
GN C+ GC AI D+GTSLLAGP + +I IG GE ++S E
Sbjct: 121 SGGNGKLA-CQNGCQAIADTGTSLLAGPKNDIEKIQKHIGAIPLFHGEYMISCE 173
>gi|12697815|dbj|BAB21620.1| cathepsin D [Bos taurus]
Length = 386
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + VFSF+LNRDP A+ GGE++ GG D K+++G + VT++ YWQ + +
Sbjct: 192 LMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQL 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+ T VC+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 252 DVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGE 298
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 65/94 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +I C+++ ++P V+ +G K + SPE Y LK + VC+SGFM D+P
Sbjct: 291 AVPLIQGEYMIPCEKVSSLPQVTVKLGGKDYAXSPEDYALKVSQAGTTVCLSGFMGMDIP 350
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 351 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 384
>gi|358255149|dbj|GAA56870.1| cathepsin D [Clonorchis sinensis]
Length = 425
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 72/100 (72%), Gaps = 2/100 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+ + P+T + YWQF++ +
Sbjct: 197 MISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSM 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 257 NVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ ++ P G IDC R+ T+P V F+I K+ L P YIL+ +CISGF
Sbjct: 289 LNDALGAIKIPGGTYYIDCSRVSTLPPVQFSISGKLMQLDPSDYILRMTSFGKTICISGF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+ P GPLWILGDVF+G Y+T+FD G R+GFA A
Sbjct: 349 MGIDI--PAGPLWILGDVFIGKYYTIFDVGNARVGFATA 385
>gi|315440803|gb|ADU20407.1| aspartic protease 1 [Clonorchis sinensis]
Length = 425
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 72/100 (72%), Gaps = 2/100 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+ + P+T + YWQF++ +
Sbjct: 197 MISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSM 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 257 NVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ ++ P G IDC R+ T+P V F+I K+ L P YIL+ +CISGF
Sbjct: 289 LNDALGAIKIPGGTYYIDCSRVSTLPPVQFSISGKLMQLDPSDYILRMTSFGKTICISGF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+ P GPLWILGDVF+G Y+T+FD G R+GFA A
Sbjct: 349 MGIDI--PAGPLWILGDVFIGKYYTIFDVGNARVGFATA 385
>gi|118429511|gb|ABK91803.1| aspartic protease precursor [Clonorchis sinensis]
Length = 425
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 72/100 (72%), Gaps = 2/100 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+ + P+T + YWQF++ +
Sbjct: 197 MISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSM 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 257 NVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ ++ P G IDC R+ T+P V F+I K+ L P YIL+ +CISGF
Sbjct: 289 LNDALGAIKIPGGTYYIDCSRVSTLPPVQFSISGKLMQLDPSDYILRMTSFGKTICISGF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D+ P GPLWILGDVF+G Y+T+FD G R+GFA A
Sbjct: 349 MGIDI--PAGPLWILGDVFIGKYYTIFDVGNARVGFATA 385
>gi|205364148|gb|ACI04532.1| aspartic protease 1 precursor [Ancylostoma duodenale]
Length = 446
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 69/108 (63%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+EQ V VF+FWLNR+PD+E GGEI GG+D + + T+ PVT++GYWQF++ D
Sbjct: 223 FIEQKKVPSPVFAFWLNRNPDSELGGEITLGGMDTRRYVEPITWTPVTRRGYWQFKM-DK 281
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ G ++ C GC AI D+GTSL+AGP V I IG E ++ E
Sbjct: 282 VQGGSTSIACPNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPLMKGE 329
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 62/99 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + + P GE +I CD++P++P +SF I + L E Y+L G +C+SGF
Sbjct: 316 IQKFIGAEPLMKGEYMIPCDKVPSLPELSFVIEGRTSTLKGEDYVLTVKAGGKSICLSGF 375
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
M D P G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 376 MGMDFPERIGELWILGDVFIGKYYTVFDIGQARLGFAQA 414
>gi|21907889|dbj|BAC05689.1| aspartic protease BmAsp-2 [Brugia malayi]
Length = 452
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 69/108 (63%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q +V + VF+FWL+R+P + GGEI FGG+D F TY PV++ GYWQF++ +
Sbjct: 227 MISQKVVHQPVFAFWLDRNPSDKIGGEITFGGIDANRFVSPITYTPVSRHGYWQFKMDRV 286
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
L ++ G C GC AI D+GTSL+AGP + +I IG E V + E
Sbjct: 287 LGRGKAIG-CGNGCQAIADTGTSLIAGPKSQIDKIQEYIGAEHVYAGE 333
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/87 (49%), Positives = 57/87 (65%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE II C ++P++P ++F I K + L Y+L A +C+SGFM DLP G L
Sbjct: 332 GEYIIPCYKVPSLPEITFVIAGKSYTLKGSDYVLNVTSKGATICLSGFMGIDLPKRVGEL 391
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+TVFD G +IGFA+A
Sbjct: 392 WILGDVFIGRYYTVFDVGNSQIGFAQA 418
>gi|187472089|gb|ACD11354.1| aspartic protease 2 [Wuchereria bancrofti]
Length = 166
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 69/108 (63%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q +V + VF+FWL+R+P + GGEI FGG+D F TY PV++ GYWQF++ +
Sbjct: 40 MISQKVVHQPVFAFWLDRNPSDKIGGEITFGGIDANRFVSPITYTPVSRHGYWQFKMDSV 99
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
G ++ G C GC AI D+GTSL+AGP + +I IG E V + E
Sbjct: 100 RGGGKAIG-CSNGCQAIADTGTSLIAGPKSQIDKIQEYIGAEHVYAGE 146
>gi|315274255|gb|ADU03675.1| putative cathepsin D3 [Ixodes ricinus]
Length = 398
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 67/101 (66%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV VFSF+LNR+ + GGEI FGG D + + G +YVPV+ KGYWQF + +I
Sbjct: 202 MVAQLLVPRPVFSFFLNRNTTSPTGGEITFGGTDDRFYSGDISYVPVSTKGYWQFTVDNI 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
++ N S +C GC AI D+GTSL+AGP+ + ++ IG
Sbjct: 262 VVKNSSFKLCAEGCEAIADTGTSLMAGPSLEIMKLQKLIGA 302
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 66/99 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ +L +LP G+ + C+ I +P++ F IG + + L+ Y+LK + +C+SGF
Sbjct: 296 LQKLIGALPFSHGQYTVRCEDIHKLPDIKFHIGGQEYVLTGSDYVLKITQFGRMICLSGF 355
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D+P PRGPLWILGDVF+G Y+TVFD G R+GFA+A
Sbjct: 356 VGLDIPEPRGPLWILGDVFIGRYYTVFDYGASRVGFAKA 394
>gi|241813645|ref|XP_002416518.1| aspartic protease, putative [Ixodes scapularis]
gi|215510982|gb|EEC20435.1| aspartic protease, putative [Ixodes scapularis]
Length = 392
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 71/114 (62%), Gaps = 4/114 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV VFSF+LNR+ + GGEI FGG D + + G +YVPV+ KGYWQF + +I
Sbjct: 196 MVAQLLVPRPVFSFFLNRNTTSPTGGEITFGGTDDRFYSGDISYVPVSTKGYWQFTVDNI 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECK 110
++ N S +C GC AI D+GTSL+AGP+ + ++ IG G + C+
Sbjct: 256 VVKNSSFKLCAEGCEAIADTGTSLMAGPSLEIMKLQKLIGALPFSHGQYTVRCQ 309
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 65/99 (65%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ +L +LP G+ + C I +P++ F IG + + L+ Y+LK + +C+SGF
Sbjct: 290 LQKLIGALPFSHGQYTVRCQDIHQLPDIKFHIGGQEYVLTGSDYVLKITQFGRMICLSGF 349
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D+P PRGPLWILGDVF+G Y+TVFD G R+GFA+A
Sbjct: 350 VGLDIPEPRGPLWILGDVFIGRYYTVFDYGASRVGFAKA 388
>gi|37779010|gb|AAP20165.1| cathepsin D [Pagrus major]
Length = 154
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 69/105 (65%), Gaps = 1/105 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G YV +T++ YWQ + +
Sbjct: 51 IMSQKKVEKNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNYVNITRQAYWQIHMDGM 110
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 105
+G Q + +C GC AIVDSGTSL+ GP+ V + AIG ++
Sbjct: 111 AVGTQLS-LCTSGCEAIVDSGTSLITGPSAEVRSLQKAIGATPLI 154
>gi|195997419|ref|XP_002108578.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589354|gb|EDV29376.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 383
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 73/119 (61%), Gaps = 8/119 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL--G 58
MV+QGLV + VFSF+LNR GGE++ GG DP ++ G+ YVP++++ YWQF + G
Sbjct: 194 MVQQGLVRKPVFSFYLNRKQGGAVGGELILGGSDPNYYSGQFNYVPLSRESYWQFAMDGG 253
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAECKLVVS 114
+ G T VC GGC AI D+GT+L+ GP V I AIG + G + +C + S
Sbjct: 254 KVATG---TTVCNGGCQAIADTGTTLIVGPPEDVQRIQQAIGAQNAGGQYTVDCSTISS 309
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 57/90 (63%), Gaps = 2/90 (2%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ +DC I ++P ++FTI + L+ EQYI + + E CISGF +
Sbjct: 295 NAGGQYTVDCSTISSLPTITFTINGVNYPLTGEQYIWQVTQQGQEQCISGFQGGVIGT-- 352
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GP WILGDVF+GVY+T FD G+ R+GFA+A
Sbjct: 353 GPQWILGDVFIGVYYTEFDMGQNRLGFAKA 382
>gi|86278345|gb|ABC88426.1| cathepsin D-like aspartic proteinase preproprotein [Meloidogyne
incognita]
Length = 454
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 67/114 (58%), Gaps = 6/114 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V E VFSFWLNRDP ++ GGEI GG D + + Y PVT+K YWQF++ +
Sbjct: 218 MISQQKVPEPVFSFWLNRDPYSKVGGEITIGGTDKRRYVEPLNYTPVTRKAYWQFKMEGV 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAE 108
C+ GC AI D+GTSL+AGP + EI H IG GE +VS E
Sbjct: 278 HNSKGEKIACQNGCEAIADTGTSLIAGPKAQIEEIQHYIGAVPLMHGEYMVSCE 331
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/109 (43%), Positives = 66/109 (60%), Gaps = 4/109 (3%)
Query: 186 KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGE 245
K E++ YI ++P GE ++ C+R+P +P+++ IG + L YIL
Sbjct: 306 KAQIEEIQHYIG----AVPLMHGEYMVSCERVPRLPDIALVIGGHSYVLKGSDYILNVTA 361
Query: 246 GIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+C+SGFM DLPP G LWILGDVF+G Y+TVFD G+ RIG A+A
Sbjct: 362 MGKSICLSGFMGIDLPPKVGELWILGDVFIGRYYTVFDVGQQRIGLAQA 410
>gi|342186743|emb|CBI83255.1| cathepsin D [Oreochromis mossambicus]
Length = 239
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G Y ++++ YWQ + +
Sbjct: 73 MMNQKKVEKNVFSFYLNRNPDTEPGGELLLGGTDPKYYDGDFHYANISRQAYWQVHMDGM 132
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G+Q + +C+GGC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 133 TVGSQLS-LCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPLIQGE 179
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 47/68 (69%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE +++C +IP++P ++F +G + + L+ EQY+L + +C+SGFM D+P
Sbjct: 172 AIPLIQGEYLVNCSKIPSLPVITFNVGGQSYTLTGEQYVLLESQAGKTICLSGFMGLDIP 231
Query: 262 PPRGPLWI 269
PP GPLWI
Sbjct: 232 PPAGPLWI 239
>gi|355703800|gb|EHH30291.1| hypothetical protein EGK_10923 [Macaca mulatta]
Length = 423
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 78/119 (65%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 207 LVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
++G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 267 MVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLLAGEYIIRCSEIPKL 324
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 61/103 (59%), Gaps = 4/103 (3%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
++E +P GE II C IP +P VS IG FNL+ + Y+++ +G +C+SGF
Sbjct: 300 LHEAIGGIPLLAGEYIIRCSEIPKLPTVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGF 359
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
A D+ P P+WILGDVF+G Y VFD G + R+G A A
Sbjct: 360 RALDIALPPVPVWILGDVFLGAYVAVFDRGDMKSGARVGLARA 402
>gi|74136511|ref|NP_001028152.1| gastricsin precursor [Monodelphis domestica]
gi|73621388|sp|Q689Z7.1|PEPC_MONDO RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|51534970|dbj|BAD36918.1| pepsinogen C [Monodelphis domestica]
Length = 391
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 47/102 (46%), Positives = 69/102 (67%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG +SE +FSF+ +R P + GGE++ GGVDP+ + G+ T+ PVT++ YWQ + +
Sbjct: 201 MLQQGQLSEPIFSFYFSRQPTHQYGGELILGGVDPQLYSGQITWTPVTQEVYWQIGIEEF 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IGNQ+TG C GC AIVD+GT LLA P ++ A G +
Sbjct: 261 AIGNQATGWCSQGCQAIVDTGTFLLAVPQQYMSAFLQATGAQ 302
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ +++C+ I MP ++F I F L P Y+ C G A LP P G P
Sbjct: 307 GDFMVNCNYIQDMPTITFVINGSQFPLPPSAYVFNNNG----YCRLGIEATYLPSPNGQP 362
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++V+D R+GFA +A
Sbjct: 363 LWILGDVFLKEYYSVYDMANNRVGFAYSA 391
>gi|344307517|ref|XP_003422427.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin D-like [Loxodonta
africana]
Length = 419
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 73/108 (67%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV + +FSF+LNRDP A+ GGE++ GG+D K++ G + VT++ YWQ + +
Sbjct: 225 LMAQKLVEKNMFSFYLNRDPTAQPGGELMLGGIDSKYYTGTLNFNKVTREAYWQIHMDRV 284
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN T +C+GGC AIVD+GTSL+ GP +TE+ A+G ++ E
Sbjct: 285 DVGNGLT-LCKGGCEAIVDTGTSLMVGPVEEITELQKALGAIPLIQGE 331
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 66/103 (64%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
++ + + ++P GE +I C+++ ++P VS +G + + LS E Y+LK + VC+
Sbjct: 315 ITELQKALGAIPLIQGEYMIPCEKVSSLPPVSLQLGGRSYTLSSEDYVLKVSQAGRSVCL 374
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
SGFM+ D+PPP PL L DVF+G Y+TVFD +GFAEAA
Sbjct: 375 SGFMSMDIPPPEEPLXDLSDVFIGRYYTVFDRDNNTVGFAEAA 417
>gi|403299330|ref|XP_003940442.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 425
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 78/119 (65%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 211 LVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 270
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G++ T +C GCAA++D+GT ++ GP + ++ AIGG +++ E + S+ L
Sbjct: 271 KVGSELT-LCARGCAAVLDTGTPVIIGPAEEIRALHKAIGGLPLLAGEYIIRCSEIPKL 328
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 61/103 (59%), Gaps = 4/103 (3%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+++ LP GE II C IP +P VS +G FNL+ + Y+++ +G C+SGF
Sbjct: 304 LHKAIGGLPLLAGEYIIRCSEIPKLPTVSLFLGGVWFNLTAQDYVIQFVQGDFRFCVSGF 363
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
D+P P GP+WILGDVF+G Y VFD G + R+G A A
Sbjct: 364 RGLDIPSPPGPMWILGDVFLGAYVAVFDRGDMKSGARVGLARA 406
>gi|342305186|dbj|BAK55647.1| cathepsin D [Oplegnathus fasciatus]
Length = 396
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 67/93 (72%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
++P GE ++ CD+IP++P ++F +G + + L+ EQY+LK + +C+SGFM D+P
Sbjct: 303 AIPFIQGEYMVSCDKIPSLPVITFNVGGQSYVLTGEQYVLKVSQAGKTICLSGFMGLDIP 362
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P GPLWILGDVF+G Y+TVFD ++GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRENNQVGFAKS 395
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G YV +T++ YWQ + +
Sbjct: 204 IMSQKKVEKNVFSFYLNRNPDTEPGGELLLGGTDPKYYSGDFHYVNITRQAYWQIHMDGM 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G Q +C GC AIVD+GTSL+ GP+ V + AIG + E
Sbjct: 264 AVGGQ-LNLCTSGCEAIVDTGTSLITGPSAEVRSLQKAIGAIPFIQGE 310
>gi|344312912|emb|CCC33063.1| cathepsin D-1 [Dermanyssus gallinae]
Length = 383
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 70/119 (58%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q +V + +FSF+L RDP G E+V GG+DPKH+KG TY P+T++ YWQF + +
Sbjct: 192 MVKQKVVEKAIFSFFLTRDPQHPIGSELVLGGIDPKHYKGDITYAPLTRESYWQFRVDKV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ ++ VC+ GC I D+GTSL GPT V + + + + + GDL
Sbjct: 252 TLNGKAAPVCQKGCEGIADTGTSLFVGPTADVAALASQLDAQETAPGLYLVDCEKAGDL 310
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 54/88 (61%), Gaps = 2/88 (2%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G ++DC++ +PN+ FTI + F L+P Y+++ + C+ F D+P P+
Sbjct: 298 GLYLVDCEKAGDLPNIEFTIAGRPFELTPLDYVVRLKQSGQTFCVLAFQGMDIP--DDPI 355
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGD+F+G Y TVFD R+GFA+AA
Sbjct: 356 WILGDIFIGKYFTVFDRENNRVGFADAA 383
>gi|315274244|gb|ADU03674.1| cathepsin D2 [Ixodes ricinus]
Length = 387
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG+ ++ VFS +L+R+ GGE++FGG+D H+ G +YVPV+K+GYWQ +
Sbjct: 195 MVAQGVAAKPVFSVFLDRNASDPAGGEVLFGGIDESHYTGNISYVPVSKRGYWQVHMDGT 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN + C GGC AI+D+GTSL+AGP+ + ++N IG S E
Sbjct: 255 RVGNNGS-FCSGGCEAILDTGTSLIAGPSDEIEKLNLLIGAAPFASGE 301
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 61/99 (61%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N L + P GE I+ C I +P ++FT+ K F L + Y+L+ +C+SGF
Sbjct: 288 LNLLIGAAPFASGEYIVSCKSIDKLPKITFTLAGKDFVLEGKDYVLQMSSAGVPLCLSGF 347
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D+P P GPLWILGDVF+G Y+T+FD G R+G A A
Sbjct: 348 IGLDVPAPLGPLWILGDVFIGRYYTIFDRGNDRVGLANA 386
>gi|307171409|gb|EFN63273.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 234
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 72/112 (64%), Gaps = 2/112 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ LVS +FSF+LNRD A+ GGE+ GG DP +++G TY+PVT+KGYWQ + I
Sbjct: 7 IIEQDLVSPHIFSFYLNRDTSADLGGELTLGGSDPAYYEGDFTYIPVTRKGYWQITIDRI 66
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 112
+ + +CE C AIVD+G+SL+ GP + +IN IG + +C +
Sbjct: 67 EM--KFDYLCEESCQAIVDTGSSLILGPKLDIAKINTFIGVDDSRIVDCNKI 116
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
I+DC++I +P + F +G K F+L+ + YI++ +CI F L W+L
Sbjct: 110 IVDCNKIFQLPTIRFIMGGKAFDLTGKDYIIRHPNN-ESICIIVFSKHYLKYHNEIKWVL 168
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEA 294
G F+G Y+T FD + R+GFA A
Sbjct: 169 GMPFIGRYYTEFDMERDRVGFALA 192
>gi|410968030|ref|XP_003990516.1| PREDICTED: pepsin B-like [Felis catus]
Length = 390
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 73/120 (60%), Gaps = 10/120 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG ++ +FSF+ +R P E GGE++ GG++ + + G+ + PVT++ YWQ + +
Sbjct: 200 MMQQGQLTSPIFSFYFSRQPTYEYGGELILGGMNSQFYSGEIVWTPVTRELYWQVAIDEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
L+GNQ TG+C GC AIVD+GT +LA P + A G E VSQYGD +
Sbjct: 260 LVGNQPTGLCSQGCQAIVDTGTYVLAVPQQYMNSFLQATGAE----------VSQYGDFV 309
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ +++C+ I +MP ++F I L P Y+L C G A LP P G P
Sbjct: 306 GDFVVNCNSIQSMPTITFVISGSPLPLPPSAYVLNNNG----YCTLGIEATYLPSPSGQP 361
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFA 292
LW LGDVF+ Y+T++D G R+GFA
Sbjct: 362 LWTLGDVFLKEYYTIYDMGNNRMGFA 387
>gi|426244096|ref|XP_004015868.1| PREDICTED: napsin-A [Ovis aries]
Length = 443
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 76/119 (63%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +WQ + +
Sbjct: 206 LVDQGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G T +C GCAAI+D+GTSL+ GPT + + AIG ++ E + S+ L
Sbjct: 266 QVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIGAVPLLMGEYYIKCSKIPTL 323
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 76/135 (56%), Gaps = 3/135 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + + + ++P MGE I C
Sbjct: 262 MERVQVGTGLTLCARGCA-AILDTGTSLITGPTEE--IRALQKAIGAVPLLMGEYYIKCS 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+IPT+P VSF +G FNL+ + Y+++ VC+SGFMA D+PPP GP WILGDVF+
Sbjct: 319 KIPTLPPVSFLLGGVWFNLTAQDYVIQITRSGFSVCLSGFMALDVPPPSGPFWILGDVFL 378
Query: 276 GVYHTVFDSGKLRIG 290
G Y VFD G + G
Sbjct: 379 GSYVAVFDRGDRKSG 393
>gi|25137349|emb|CAD18832.1| cyprosin [Cynara cardunculus]
Length = 107
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 80/110 (72%), Gaps = 8/110 (7%)
Query: 103 GVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVS 162
GV+S +CK +V QYG + ++L+S PEK+C Q+ LC+F+G+ S I++VV+K S
Sbjct: 1 GVMSQQCKSLVDQYGKSMIEMLLSEEQPEKICSQMKLCSFDGSHDTSMIIESVVDK---S 57
Query: 163 AGDSA-----VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 207
G S+ +C+ C+MAVVW+QNQ++Q +T+E +++Y+++LC+ LP+PM
Sbjct: 58 KGKSSGLHDEMCTMCQMAVVWMQNQIRQNETEENIINYVDKLCERLPSPM 107
>gi|205363469|gb|ACI04164.1| cathepsin D-like aspartic protease precursor [Fasciola hepatica]
Length = 429
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 66/101 (65%), Gaps = 2/101 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV E VFSF+LNR+ GGE++ GG+DP ++ G TYVPVT + YWQF++ I
Sbjct: 198 MIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYYTGDITYVPVTHEAYWQFKVDKI 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
S +C GC AI D+GTSL+AGP V +N IGG
Sbjct: 258 EFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQIGG 296
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 61/103 (59%), Gaps = 2/103 (1%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+K + +NE P G +++CD+I + ++F + + L + YI+K
Sbjct: 284 KKEVDALNEQIGGTWMPGGIYVVNCDKIDNLSAITFVVAGRKMVLEAKDYIMKLSNMGRT 343
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
VC++ F+ D+P GPLWILGDVF+G Y+TVFD G+ RIGFA
Sbjct: 344 VCVTSFIGIDVP--VGPLWILGDVFIGSYYTVFDMGQKRIGFA 384
>gi|332514729|gb|AEE69372.1| cathepsin D [Fasciola gigantica]
Length = 429
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 66/101 (65%), Gaps = 2/101 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV E VFSF+LNR+ GGE++ GG+DP ++ G TYVPVT + YWQF++ I
Sbjct: 198 MIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYYTGDITYVPVTHEAYWQFKVDKI 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
S +C GC AI D+GTSL+AGP V +N IGG
Sbjct: 258 EFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQIGG 296
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 61/103 (59%), Gaps = 2/103 (1%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+K + +NE P G +++CD+I + ++F + + L + YI+K
Sbjct: 284 KKEVDALNEQIGGTWMPGGIYVVNCDKIDNLSAITFVVAGRKMVLEAKDYIMKLSNMGRT 343
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
VC++ F+ D+P GPLWILGDVF+G Y+TVFD G+ RIGFA
Sbjct: 344 VCVTSFIGIDVP--VGPLWILGDVFIGSYYTVFDMGQKRIGFA 384
>gi|297705581|ref|XP_002829653.1| PREDICTED: napsin-A, partial [Pongo abelii]
Length = 392
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/129 (41%), Positives = 83/129 (64%), Gaps = 2/129 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + +FSF+LNRDP +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 178 LVKQGLLDKPIFSFYLNRDPKVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 237
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL- 119
+G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 238 KVGSGLT-LCARGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKLP 296
Query: 120 IWDLLVSGL 128
LL++G+
Sbjct: 297 AVSLLIAGV 305
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 57/96 (59%), Gaps = 4/96 (4%)
Query: 203 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 262
+P GE II C IP +P VS I FNL+ + Y+++ +G +C+SGF A D+
Sbjct: 278 IPLLAGEYIIRCSEIPKLPAVSLLIAGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIAS 337
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
P P+WILGDVF+G Y VFD G + R+G A A
Sbjct: 338 PPVPVWILGDVFLGAYVAVFDRGDMKSGARVGLARA 373
>gi|241275826|ref|XP_002406708.1| aspartic protease, putative [Ixodes scapularis]
gi|215496940|gb|EEC06580.1| aspartic protease, putative [Ixodes scapularis]
Length = 345
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QG+ ++ VFS +L+R+ GGE++FGG+D H+ G +YVPV+K+GYWQ +
Sbjct: 196 MVAQGVAAKPVFSVFLDRNASDPAGGEVLFGGIDESHYIGNISYVPVSKRGYWQVHMDGT 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+GN + C GGC AI+D+GTSL+AGP+ + ++N IG S E
Sbjct: 256 RVGNNGS-FCSGGCEAILDTGTSLIAGPSDEIEKLNLLIGAAPFASGE 302
>gi|74199699|dbj|BAE41511.1| unnamed protein product [Mus musculus]
Length = 419
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 201 MVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G +C GC+AI+D+GTSL+ GP+ + +N AIGG ++ +
Sbjct: 261 KVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIGGYPFLNGQ 307
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 255 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 312
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+ PT+P VSF +G FNL+ + Y++K + +C+ GF A D+P P GPLWILGDVF
Sbjct: 313 SKTPTLPPVSFHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPKPAGPLWILGDVF 372
Query: 275 MGVYHTVFDSGKL----RIGFAEA 294
+G Y VFD G R+G A A
Sbjct: 373 LGPYVAVFDRGDKNVGPRVGLARA 396
>gi|12832561|dbj|BAB22158.1| unnamed protein product [Mus musculus]
Length = 419
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 201 MVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G +C GC+AI+D+GTSL+ GP+ + +N AIGG ++ +
Sbjct: 261 KVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIGGYPFLNGQ 307
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 7/144 (4%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 255 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 312
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+ PT+P VSF +G FNL+ + Y++K + +C+ GF A D+P GPLWILGDVF
Sbjct: 313 SKTPTLPPVSFHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPNAAGPLWILGDVF 372
Query: 275 MGVYHTVFDSGKL----RIGFAEA 294
+G Y VFD G R+G A A
Sbjct: 373 LGPYVAVFDRGDKNVGPRVGLARA 396
>gi|6978973|dbj|BAA90785.1| aspartic proteinase family member similar to renin [Mus musculus]
Length = 419
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 201 MVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G +C GC+AI+D+GTSL+ GP+ + +N AIGG ++ +
Sbjct: 261 KVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIGGYPFLNGQ 307
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 7/144 (4%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 255 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 312
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+ PT+P VS +G FNL+ + Y++K + +C+ GF A D+P P GPLWILGDVF
Sbjct: 313 SKTPTLPPVSSHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPKPAGPLWILGDVF 372
Query: 275 MGVYHTVFDSGKL----RIGFAEA 294
+G Y VFD G R+G A A
Sbjct: 373 LGPYVAVFDRGDKNVGPRVGLARA 396
>gi|116282368|gb|ABJ97285.1| cathepsin D-like aspartic protease [Fasciola hepatica]
Length = 429
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 66/101 (65%), Gaps = 2/101 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV E VFSF+LNR+ GGE++ GG+DP ++ G TYVPVT + YWQF++ I
Sbjct: 198 MIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYYTGDITYVPVTHEAYWQFKVDKI 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
S +C GC AI D+GTSL+AGP V +N IGG
Sbjct: 258 EFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQIGG 296
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 2/103 (1%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+K + +NE P G +++ D+I + ++F + + + YI+K
Sbjct: 284 KKEVDALNEQIGGTWMPGGIYVVNWDKIDNLSAITFVVAGRKMVFEAKDYIMKLSNMGRT 343
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
VC++ F+ D+P GPLWILGDVF+G Y+TVFD G+ RIGFA
Sbjct: 344 VCVTSFIGIDVP--VGPLWILGDVFIGSYYTVFDMGQKRIGFA 384
>gi|6680552|ref|NP_032463.1| napsin-A precursor [Mus musculus]
gi|6016430|sp|O09043.1|NAPSA_MOUSE RecName: Full=Napsin-A; AltName: Full=KDAP-1; AltName:
Full=Kidney-derived aspartic protease-like protein;
Short=KAP; Flags: Precursor
gi|1906810|dbj|BAA19004.1| kidney-derived aspartic protease-like protein [Mus musculus]
gi|7340352|emb|CAB82907.1| Napsin [Mus musculus]
gi|15928694|gb|AAH14813.1| Napsin A aspartic peptidase [Mus musculus]
gi|74220342|dbj|BAE31398.1| unnamed protein product [Mus musculus]
Length = 419
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 201 MVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G +C GC+AI+D+GTSL+ GP+ + +N AIGG ++ +
Sbjct: 261 KVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIGGYPFLNGQ 307
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 255 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 312
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+ PT+P VSF +G FNL+ + Y++K + +C+ GF A D+P P GPLWILGDVF
Sbjct: 313 SKTPTLPPVSFHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPKPAGPLWILGDVF 372
Query: 275 MGVYHTVFDSGKL----RIGFAEA 294
+G Y VFD G R+G A A
Sbjct: 373 LGPYVAVFDRGDKNVGPRVGLARA 396
>gi|328869722|gb|EGG18099.1| cathepsin D [Dictyostelium fasciculatum]
Length = 476
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 66/105 (62%), Gaps = 1/105 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV++ +F+FWL++ GGE+ FG +D F G TYVP+T + YW+F + D+
Sbjct: 280 MMDQGLVAQPLFAFWLSKTASPTNGGELSFGSIDNSKFTGAITYVPLTNRTYWEFSMDDV 339
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 104
S G C + GC AI DSGTSLLAGPT + IN +G V
Sbjct: 340 QYDGNSLGYCGKTGCRAIADSGTSLLAGPTEQIEAINTKLGAVSV 384
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 57/88 (64%), Gaps = 1/88 (1%)
Query: 208 GESII-DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 266
GE+I C+ I ++P+V + F L+P YIL+ E C+SGFM D+P P GP
Sbjct: 386 GEAIFPSCNVISSLPDVQIVLAGTTFVLTPTDYILQITEFGKTTCLSGFMGIDIPAPIGP 445
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
L+ILGDVF+ Y+T+FD G R+GFA+A
Sbjct: 446 LYILGDVFISTYYTIFDFGNSRVGFAQA 473
>gi|402906426|ref|XP_003916003.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 423
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 77/119 (64%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 207 LVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 267 TVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLLAGEYIIRCSEIPKL 324
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ E V+ G +C+ A++ + T+E + ++E +P GE II C
Sbjct: 261 IHMERVTVGSGLTLCARGCAAILDTGTPVIIGPTEE--IRALHEAIGGIPLLAGEYIIRC 318
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+ P P+WILGDVF
Sbjct: 319 SEIPKLPTVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIALPPVPVWILGDVF 378
Query: 275 MGVYHTVFDSGKL----RIGFAEA 294
+G Y VFD G + R+G A A
Sbjct: 379 LGAYVAVFDRGDMKSGARVGLARA 402
>gi|148690790|gb|EDL22737.1| napsin A aspartic peptidase, isoform CRA_a [Mus musculus]
Length = 393
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 71/108 (65%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 176 MVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESV 235
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+G +C GC+AI+D+GTSL+ GP+ + +N AIGG ++ +
Sbjct: 236 KVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIGGYPFLNGQ 282
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 230 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 287
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+ PT+P VSF +G FNL+ + Y++K + +C+ GF A D+P P GPLWILGDVF
Sbjct: 288 SKTPTLPPVSFHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPKPAGPLWILGDVF 347
Query: 275 MGVYHTVFDSGKL----RIGFAEA 294
+G Y VFD G R+G A A
Sbjct: 348 LGPYVAVFDRGDKNVGPRVGLARA 371
>gi|351702766|gb|EHB05685.1| Napsin-A [Heterocephalus glaber]
Length = 417
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 75/115 (65%), Gaps = 1/115 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + +FSF+LNRD +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 203 LVEQGLLEKPIFSFYLNRDFAGADGGELVLGGADPAHYIPPLTFVPVTVPAYWQIHMERV 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+G T +C GCAAIVD+GTSL+ GP+ + ++ AIGG ++ E ++ S+
Sbjct: 263 KVGTGLT-LCAQGCAAIVDTGTSLITGPSEEIRALHRAIGGLPWLAGEHFILCSK 316
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 78/143 (54%), Gaps = 7/143 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A+V L ++E + ++ LP GE I C
Sbjct: 259 MERVKVGTGLTLCAQGCA-AIVDTGTSLITGPSEE--IRALHRAIGGLPWLAGEHFILCS 315
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+IPT+P VSF +G FNL+ + Y+++ +G C+SGF A D+PPP GPLWILGDVF+
Sbjct: 316 KIPTLPPVSFLLGGVWFNLTAQDYVIQISQGGFRFCLSGFHALDMPPPAGPLWILGDVFL 375
Query: 276 GVYHTVFDSGKL----RIGFAEA 294
G Y VFD G R+G A A
Sbjct: 376 GAYVAVFDRGSTSSGARVGLARA 398
>gi|355756059|gb|EHH59806.1| hypothetical protein EGM_10003 [Macaca fascicularis]
Length = 423
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 54/129 (41%), Positives = 82/129 (63%), Gaps = 2/129 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 207 LVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL- 119
+G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 267 TVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLLAGEYIIRCSEIPKLP 325
Query: 120 IWDLLVSGL 128
LL+ G+
Sbjct: 326 TVSLLIGGV 334
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ E V+ G +C+ A++ + T+E + ++E +P GE II C
Sbjct: 261 IHMERVTVGSGLTLCARGCAAILDTGTPVIIGPTEE--IRALHEAIGGIPLLAGEYIIRC 318
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+ P P+WILGDVF
Sbjct: 319 SEIPKLPTVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIALPPVPVWILGDVF 378
Query: 275 MGVYHTVFDSGKL----RIGFAEA 294
+G Y VFD G + R+G A A
Sbjct: 379 LGAYVAVFDRGDMKSGARVGLARA 402
>gi|297462061|ref|XP_001790669.2| PREDICTED: napsin-A [Bos taurus]
gi|297485858|ref|XP_002695173.1| PREDICTED: napsin-A [Bos taurus]
gi|296477597|tpg|DAA19712.1| TPA: napsin A aspartic peptidase [Bos taurus]
Length = 408
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 69/100 (69%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+QGL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +WQ + +
Sbjct: 206 LVDQGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G T +C GCAAI+D+GTSL+ GPT + + AIG
Sbjct: 266 QVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIG 304
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + + + ++P MG+ I+C
Sbjct: 262 MERVQVGTGLTLCARGCA-AILDTGTSLITGPTEE--IRALQKAIGAVPLLMGKYYIECS 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+IPT+P VSF +G FNL+ + Y+++ VC+SGFMA D+PPP GP WILGDVF+
Sbjct: 319 KIPTLPPVSFLLGGVWFNLTAQDYVIQITRSGFSVCLSGFMALDVPPPSGPFWILGDVFL 378
Query: 276 GVYHTVFDSGKLRIG 290
G Y VFD G + G
Sbjct: 379 GSYVAVFDRGDRKSG 393
>gi|71727523|gb|AAZ39883.1| cathepsin D-like aspartic protease [Opisthorchis viverrini]
Length = 425
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 70/100 (70%), Gaps = 2/100 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV E VFSF+L+R+ GGE++ GG DPK++KG+ + P+T + YWQF++ +
Sbjct: 197 MISQGLVPEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSM 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 257 SVGGMK--LCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 77/137 (56%), Gaps = 4/137 (2%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+++S G +C A+ L ++E + +N+ ++ P G I+CDR+
Sbjct: 254 DSMSVGGMKLCENGCQAIADTGTSLIAGPSEE--VGKLNDALGAIKLPGGTYYINCDRVS 311
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
T+P V F I K+ L P YIL+ +CISGFM D+ P GPLWILGDVF+G Y
Sbjct: 312 TLPLVQFNINGKLMELEPSDYILRMTSFGKTLCISGFMGIDI--PAGPLWILGDVFIGKY 369
Query: 279 HTVFDSGKLRIGFAEAA 295
+T+FD G R+GFA A+
Sbjct: 370 YTIFDVGNARVGFATAS 386
>gi|355706343|gb|AES02606.1| napsin A aspartic peptidase [Mustela putorius furo]
Length = 146
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 61/92 (66%), Gaps = 4/92 (4%)
Query: 207 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 266
+GE +I C +IPT+P VSF +G FNL+ + Y+++ G +C+SGF A D+PPP GP
Sbjct: 34 LGEYLIQCSKIPTLPPVSFLLGGVWFNLTAQDYVIQIARGGVRLCLSGFQALDMPPPAGP 93
Query: 267 LWILGDVFMGVYHTVFDSGKL----RIGFAEA 294
LWILGDVF+ Y VFD G L R+G A A
Sbjct: 94 LWILGDVFLRTYVAVFDRGNLTGGARVGLARA 125
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 26/36 (72%)
Query: 73 GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
GCAAI+D+GTSL+ GPT + ++ AIGG ++ E
Sbjct: 1 GCAAILDTGTSLITGPTEEIQALHAAIGGVSLLLGE 36
>gi|327271207|ref|XP_003220379.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 71/102 (69%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++ L+S+ VFSF+L+R P+++ GGE+VFGGVD + + G+ + PVT++ YWQ + +
Sbjct: 198 MLQENLLSQSVFSFYLSRQPNSQYGGEVVFGGVDTRLYSGEIYWAPVTQELYWQIGIQEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q+TG C GC AIVD+GTSLL P ++ A+G +
Sbjct: 258 SIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSNFLSAVGAQ 299
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++C+ + +P +SFTI F L P YIL C G LP
Sbjct: 301 NQYGQYAVNCNNVQNLPTISFTINGVSFPLPPSAYILNNNG----YCTVGIEPTYLPSQN 356
Query: 265 G-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G PLWILGD+F+ Y++V+D G R+GFA +A
Sbjct: 357 GQPLWILGDIFLREYYSVYDMGNNRVGFATSA 388
>gi|149061702|gb|EDM12125.1| cathepsin D, isoform CRA_b [Rattus norvegicus]
Length = 87
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 63/85 (74%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
+I C+++ ++P ++F +G + + L PE+YILK + +C+SGFM D+PPP GPLWIL
Sbjct: 1 MIPCEKVSSLPIITFKLGGQNYELHPEKYILKVSQAGKTICLSGFMGMDIPPPSGPLWIL 60
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEAA 295
GDVF+G Y+TVFD R+GFA+AA
Sbjct: 61 GDVFIGCYYTVFDREYNRVGFAKAA 85
>gi|327278613|ref|XP_003224055.1| PREDICTED: cathepsin E-like [Anolis carolinensis]
Length = 396
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 73/113 (64%), Gaps = 4/113 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +L+R+PD+ GGE++FGG DP F G ++PV+KKGYWQ +L +I
Sbjct: 200 MMAQNLVELPLFSVYLSRNPDSSIGGELIFGGYDPSLFSGNLNWIPVSKKGYWQIQLDNI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAECK 110
+G + C GC AIVD+GTSL+ GP+ + ++ + IG + G + EC
Sbjct: 260 QVGG-TIAFCAEGCQAIVDTGTSLITGPSDDIKQMQNLIGAQPVDGEYAVECS 311
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 55/85 (64%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FT+ ++L+PE Y L ++C SGF A ++ P GPL
Sbjct: 304 GEYAVECSNLSMMPSVTFTLNGIPYSLTPEAYTLMENSDGMQLCSSGFQALNMQTPEGPL 363
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y++VFD G R+G A
Sbjct: 364 WILGDVFIGQYYSVFDRGNDRVGLA 388
>gi|31559113|gb|AAP50847.1| cathepsin D [Bombyx mori]
gi|90992734|gb|ABE03014.1| aspartic protease [Bombyx mori]
Length = 385
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 73/116 (62%), Gaps = 7/116 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV + VFSF+LNRDP A GGE++ GG DP H++G VP+ + YW+F + +
Sbjct: 194 MVAQGLV-QPVFSFYLNRDPGATTGGELLLGGSDPAHYRGDLVRVPLLRDTYWEFHMDSV 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV----SAECKLV 112
+ ++ C GC+AI D+GTSL+AGP+ V +N A+G + + +C L+
Sbjct: 253 NV--NASRFCAQGCSAIADTGTSLIAGPSKEVEALNAAVGATAIAFGQYAVDCSLI 306
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 69/128 (53%), Gaps = 2/128 (1%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
++V+ S C+ A+ L +KE + +N + G+ +DC IP
Sbjct: 250 DSVNVNASRFCAQGCSAIADTGTSLIAGPSKE--VEALNAAVGATAIAFGQYAVDCSLIP 307
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
+P V+FTI F L Y+L+ + VC+SGFMA D+P P GPLWILGDVF+G Y
Sbjct: 308 HLPRVTFTIAGNDFTLEGNDYVLRVAQMGHTVCLSGFMALDVPKPMGPLWILGDVFIGKY 367
Query: 279 HTVFDSGK 286
+T FD+G
Sbjct: 368 YTEFDAGN 375
>gi|395531206|ref|XP_003767673.1| PREDICTED: cathepsin E [Sarcophilus harrisii]
Length = 395
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 71/113 (62%), Gaps = 4/113 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS ++ R+PD+ G E+VFGG D HF G +VPVTK+GYWQ L +I
Sbjct: 203 MIAQNLVDMPIFSVYMTRNPDSPTGSELVFGGYDHAHFTGSLNWVPVTKQGYWQIALDNI 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECK 110
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 263 QVGG-TIMFCAEGCQAIVDTGTSLITGPSDKIKQLQNAIGAVLTDGEYAMECN 314
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 53/87 (60%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C+ + MP+V+FTI + L P+ Y L E C SGF D+ PP GPL
Sbjct: 307 GEYAMECNNLNVMPDVTFTINGIPYTLPPKAYTLTDFVDGMEFCTSGFQGLDIHPPAGPL 366
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G +++VFD G +G A A
Sbjct: 367 WILGDVFIGQFYSVFDRGNNLVGLAPA 393
>gi|118082412|ref|XP_416090.2| PREDICTED: cathepsin E-A-like [Gallus gallus]
Length = 404
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 66/100 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ E + P+ GE ++DC R+ ++P++SFTIG + L+ EQY++K C+SGF
Sbjct: 300 LQEYIGASPSRSGEFLVDCRRLSSLPHISFTIGHHDYKLTAEQYVVKESIDDQTFCMSGF 359
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ D+P GPLWILGDVFM ++ +FD G R+GFA++A
Sbjct: 360 QSLDIPTHNGPLWILGDVFMSAFYCIFDRGNDRVGFAKSA 399
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 64/108 (59%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV E VFSF+L R D E GGE++ GG+D +KG +VPVT+K YWQ L +I
Sbjct: 207 IMNQKLVEEPVFSFYLKRGDDTENGGELILGGIDHSLYKGSIHWVPVTEKSYWQIHLNNI 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
I + C GC AIVDSGTSL+ GP+ + + IG S E
Sbjct: 267 KIQGRVV-FCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSRSGE 313
>gi|301618285|ref|XP_002938556.1| PREDICTED: cathepsin E-A-like [Xenopus (Silurana) tropicalis]
Length = 402
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 65/99 (65%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ EL + P GE I+DC R+ ++P V+FTIG + + L+PEQY +K ++ C++GF
Sbjct: 300 LQELLGATPTLFGEYILDCSRVSSLPRVTFTIGQRDYTLTPEQYTIKERSQKSDFCLTGF 359
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+ GPLWILGD+FM +++VFD RIG A++
Sbjct: 360 QAMDISTKDGPLWILGDIFMSKFYSVFDREHDRIGLAKS 398
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 74/119 (62%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V Q LV +++FSF LNRD D+E GGE++FGG+D +KG+ ++P+T+KGYWQ L ++
Sbjct: 207 IVNQKLVEQQLFSFHLNRDYDSEYGGELIFGGIDHSLYKGQIHWIPLTEKGYWQIRLDNV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ ++ C+ C IVDSGTSL+ GP + ++ +G + E L S+ L
Sbjct: 267 KVDGEAM-FCQSSCQVIVDSGTSLITGPKAEIKKLQELLGATPTLFGEYILDCSRVSSL 324
>gi|340373429|ref|XP_003385244.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 382
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/112 (45%), Positives = 70/112 (62%), Gaps = 5/112 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V+Q LV++ VF F+L+RD + GGE+ GG DP H+K YVP++ K YWQF+L I
Sbjct: 194 LVQQSLVAQPVFGFYLDRDENGTLGGELALGGTDPSHYKAPINYVPLSDKTYWQFKLDKI 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G T +C GC AI D+GTSLL GP+ V +I IG +GV +C
Sbjct: 254 KVG--GTTLCSNGCQAIADTGTSLLVGPSVDVQKIMKEIGAKNTDGVYMIDC 303
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 56/91 (61%), Gaps = 5/91 (5%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G +IDC + +P VSF IG + LSP+QYI+K C+ GF + D +
Sbjct: 294 NTDGVYMIDCGNMSNLPTVSFVIGGAQYLLSPQQYIMKEEAEGQTFCLVGFDSLD----Q 349
Query: 265 G-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
G PLWILGDVF+G Y+T FD G+ R+GFA A
Sbjct: 350 GEPLWILGDVFIGYYYTEFDVGQGRVGFAPA 380
>gi|307170097|gb|EFN62519.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 285
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQGLVS +FSF+LNRD A+ GG ++ GG DP +++G TY+PVT+KGYWQ + I
Sbjct: 96 IIEQGLVSSRIFSFYLNRDTSADLGGTLILGGSDPTYYEGDFTYIPVTRKGYWQITIDRI 155
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 99
+ S +CE C +VD+G+SL+ GP + + H +
Sbjct: 156 KM--TSEDLCEESCQVVVDTGSSLITGPELDIARLIHLL 192
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 2/83 (2%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
+DC+RI +P + F +G K F+L+ + YI++ +C S F +D WILG
Sbjct: 204 VDCNRIFQLPTIRFIMGGKAFDLTGKDYIIRHPNH-ESICTSIFFTYDSYNSEIK-WILG 261
Query: 272 DVFMGVYHTVFDSGKLRIGFAEA 294
F+G Y+T FD + R+GFA A
Sbjct: 262 MPFIGRYYTEFDMERDRVGFALA 284
>gi|307167892|gb|EFN61281.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 243
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 71/109 (65%), Gaps = 3/109 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V QGLVSE VFS +LN + AEEGGE++ GG DPK+++G TYV V+KKGYWQ+ L I
Sbjct: 62 IVNQGLVSEPVFSVYLNPNSSAEEGGELILGGSDPKYYEGNFTYVNVSKKGYWQYPLQSI 121
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 109
+G + + AI D+GTSL+ GP+ ++ IN I + S +C
Sbjct: 122 TVGGK---LVAEDIQAIADTGTSLIVGPSEIIDPINKDIHADSDGSVDC 167
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 47/83 (56%), Gaps = 5/83 (6%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
+DCD+I +P + F I +NL+ +YIL+ C SGF A +LP I+G
Sbjct: 165 VDCDKIDKLPVIDFNINGVSYNLTYHEYILQVNIFGFPRCTSGFQASNLP-----FLIMG 219
Query: 272 DVFMGVYHTVFDSGKLRIGFAEA 294
D F+ Y+TV+D G R+GFA +
Sbjct: 220 DRFLIRYYTVYDMGNNRVGFARS 242
>gi|68051036|emb|CAI46901.1| nothepsin [Podarcis siculus]
Length = 414
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 61/90 (67%)
Query: 204 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
P P GE I+DC R+ ++P ++FTIG + + ++ +QYI+K G C+SGF A DL P
Sbjct: 308 PAPHGEFIVDCRRLSSLPPITFTIGQREYTITSKQYIIKQTSGGEAFCLSGFQALDLGPR 367
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAE 293
P+WILGDVF+G Y+TVFD R+GFA
Sbjct: 368 SKPMWILGDVFIGQYYTVFDRANDRVGFAR 397
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 69/118 (58%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+ E VFSF LNR + E GGE++FGG+D +KG +VPVT++ YW+ + ++
Sbjct: 207 MLRQQLIEEPVFSFILNRGGNTENGGELIFGGIDHSLYKGSIHWVPVTEQKYWKIHMDNV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
I C+ GCAAIVDSGTSL+ GP + + IG G +C+ + S
Sbjct: 267 KIQGH-IAACKDGCAAIVDSGTSLITGPPSQIIRLQQKIGAHPAPHGEFIVDCRRLSS 323
>gi|77548262|gb|ABA91059.1| Aspartic proteinase precursor, putative [Oryza sativa Japonica
Group]
gi|125535484|gb|EAY81972.1| hypothetical protein OsI_37150 [Oryza sativa Indica Group]
gi|125578210|gb|EAZ19356.1| hypothetical protein OsJ_34907 [Oryza sativa Japonica Group]
Length = 76
Score = 102 bits (255), Expect = 2e-19, Method: Composition-based stats.
Identities = 45/76 (59%), Positives = 54/76 (71%)
Query: 33 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 92
+DPKHFKG HTYV V++KGYWQF D+L STG GCAAIVDS TSLL PT +V
Sbjct: 1 MDPKHFKGDHTYVHVSRKGYWQFNTRDLLTDGHSTGFYAKGCAAIVDSRTSLLTDPTAIV 60
Query: 93 TEINHAIGGEGVVSAE 108
++NHA EG++S E
Sbjct: 61 AQVNHATEAEGIISTE 76
>gi|116283893|gb|AAH48900.1| Ctsd protein [Mus musculus]
Length = 87
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 60/85 (70%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
+I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+PPP GPLWIL
Sbjct: 1 MIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIPPPSGPLWIL 60
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEAA 295
GDVF+G Y+TVFD R+GFA A
Sbjct: 61 GDVFIGSYYTVFDRDNNRVGFANAV 85
>gi|312097106|ref|XP_003148873.1| aspartic protease BmAsp-2 [Loa loa]
gi|307755962|gb|EFO15196.1| aspartic protease BmAsp-2 [Loa loa]
Length = 417
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 67/108 (62%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q V E +F+FWL+R+P+ E GGEI GG+D F Y P++K GYWQF++ D
Sbjct: 199 MIKQKTVKESLFAFWLDRNPNDEIGGEITLGGIDVNRFVAPLVYTPISKHGYWQFQM-DS 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ G+ C GC AI D+GTSL+AGP + +I IG E + + E
Sbjct: 258 IQGDGKAISCANGCQAIADTGTSLIAGPKSQIDKIQKYIGAEHLYADE 305
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 55/86 (63%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E II C ++P++P ++F I K + L Y+L +C+SGFM DLP G LW
Sbjct: 305 EYIIPCYKVPSLPEITFVIAGKSYTLKGSDYVLNVSAQGKSICLSGFMGIDLPERVGELW 364
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
ILGDVF+G Y+TVFD G +IGFA+A
Sbjct: 365 ILGDVFIGHYYTVFDVGNSQIGFAQA 390
>gi|194246055|gb|ACF35519.1| putative aspartic protease [Dermacentor variabilis]
Length = 125
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 62/87 (71%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ ++DC+ +P++P ++F + + F LSP+ Y+LK + C+SGF+ D PP GPL
Sbjct: 38 GQYLVDCESLPSLPKITFHLNKREFVLSPDDYVLKVTQEGTTFCLSGFIPMDFPPEMGPL 97
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGD+F+G Y+T+FD G R+GFAEA
Sbjct: 98 WILGDMFIGRYYTIFDRGNDRVGFAEA 124
>gi|440898030|gb|ELR49612.1| Napsin-A, partial [Bos grunniens mutus]
Length = 406
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 69/100 (69%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+V++GL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +WQ + +
Sbjct: 206 LVDRGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFWQIHMERV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G T +C GCAAI+D+GTSL+ GPT + + AIG
Sbjct: 266 QVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIG 304
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E+ V G + C A++ L T+E + + + ++P MG+ I+C
Sbjct: 262 MERVQVGTGLTLCARGCA-AILDTGTSLITGPTEE--IRALQKAIGAVPLLMGKYYIECS 318
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
+IPT+P VSF +G FNL+ + Y+++ VC+SGFMA D+PPP GP WILGDVF+
Sbjct: 319 KIPTLPPVSFLLGGVWFNLTAQDYVIQITRSGFSVCLSGFMALDVPPPSGPFWILGDVFL 378
Query: 276 GVYHTVFDSGKLRIG 290
G Y VFD G + G
Sbjct: 379 GSYVAVFDRGDRKSG 393
>gi|47210711|emb|CAF90003.1| unnamed protein product [Tetraodon nigroviridis]
Length = 361
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 85/142 (59%), Gaps = 6/142 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
V+ V DS S CE A+V L T++ + + +L + P +GE +IDC
Sbjct: 223 VDSVAVQGVDSFCPSGCE-AIVDTGTSLITGPTRD--ILRLQQLIGATPTNIGEFLIDCI 279
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGE---GIAEVCISGFMAFDLPPPRGPLWILGD 272
R+ ++P+V+F +G + + L+ E+Y+ + G+ G E C SGF A D+ P+GPLWILGD
Sbjct: 280 RLSSLPHVTFVLGGEEYTLTAERYVRRVGQEMLGEKEFCFSGFQAADMVTPKGPLWILGD 339
Query: 273 VFMGVYHTVFDSGKLRIGFAEA 294
VF+ Y++VFD G+ R+G A A
Sbjct: 340 VFLRQYYSVFDRGQDRVGLAPA 361
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 61/100 (61%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q ++ + VFSF+L+R ++ GE++ GG+D F G ++PVT KGYWQ ++ +
Sbjct: 167 MMAQKVLEKPVFSFYLSRKSGSKLQGELLLGGIDEALFTGPINWLPVTAKGYWQIKVDSV 226
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ + C GC AIVD+GTSL+ GPT + + IG
Sbjct: 227 AVQGVDS-FCPSGCEAIVDTGTSLITGPTRDILRLQQLIG 265
>gi|112983576|ref|NP_001037351.1| cathepsin D precursor [Bombyx mori]
gi|66269351|gb|AAY43135.1| CathD [Bombyx mori]
Length = 384
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 72/116 (62%), Gaps = 7/116 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV + VFSF+LNRDP A GGE++ GG DP H++G VP+ + YW+F + +
Sbjct: 194 MVAQGLV-QPVFSFYLNRDPGATTGGELLLGGSDPAHYRGDLVRVPLLRDTYWEFHMDSV 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 112
+ ++ C GC+AI D+GTSL+AGP+ V +N A+G + + C L+
Sbjct: 253 NV--NASRFCAQGCSAIADTGTSLIAGPSKEVEALNAAVGATAIAFGQYVVDCSLI 306
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 75/138 (54%), Gaps = 2/138 (1%)
Query: 157 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 216
++V+ S C+ A+ L +KE + +N + G+ ++DC
Sbjct: 248 HMDSVNVNASRFCAQGCSAIADTGTSLIAGPSKE--VEALNAAVGATAIAFGQYVVDCSL 305
Query: 217 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 276
IP +P V+FTI F L Y+L+ + VC+SGFMA D+P P PLWILGDVF+G
Sbjct: 306 IPHLPRVTFTIAGNDFTLEGHDYVLRVAQFGHTVCLSGFMALDVPKPMAPLWILGDVFIG 365
Query: 277 VYHTVFDSGKLRIGFAEA 294
Y+T FD+G ++GFA A
Sbjct: 366 KYYTEFDAGNRQLGFAPA 383
>gi|320163747|gb|EFW40646.1| cathepsin D [Capsaspora owczarzaki ATCC 30864]
Length = 382
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/87 (51%), Positives = 58/87 (66%), Gaps = 2/87 (2%)
Query: 2 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 61
+ QG+ + +F FWLNRDP A +GGEI FG +D H+ G Y PVT++GYWQF LG +
Sbjct: 191 ILQGVAAAPLFGFWLNRDPTAADGGEIDFGAIDDSHYTGPILYTPVTRQGYWQFALGAVT 250
Query: 62 IGNQSTGVCEGGCAAIVDSGTSLLAGP 88
+ ++ C GC AI DSGTSLL GP
Sbjct: 251 VSGKN--YCASGCQAIADSGTSLLVGP 275
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 54/92 (58%), Gaps = 3/92 (3%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N GE +DC +I ++PN+ FTI + F L+ Y+LK G C+ G M+ DL
Sbjct: 291 NIAGEYTLDCSKIASLPNLVFTISGQQFALTGADYVLKITSGSTTECLLGLMSMDL-SAE 349
Query: 265 GPLWILGDVFMGVYHTVFD--SGKLRIGFAEA 294
G WILGDVF+G ++TVFD R+GFA A
Sbjct: 350 GIQWILGDVFIGKFYTVFDFNGNAPRVGFATA 381
>gi|344257339|gb|EGW13443.1| Napsin-A [Cricetulus griseus]
Length = 532
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 74/119 (62%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ + +FSF+LNRD + +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 135 MVEQGLLQKPIFSFYLNRDAEGSDGGELVLGGSDPAHYIPPLTFIPVTIPAYWQVHMESV 194
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G +C GC I+D+GTSL+ GP+ + +N AIGG ++ + + S+ +L
Sbjct: 195 NVGT-GLSLCAQGCGVILDTGTSLITGPSEEIHALNKAIGGLPFLAGQYFIQCSKTPEL 252
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 71/136 (52%), Gaps = 4/136 (2%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E NV G S C + ++ L ++E + +N+ LP G+ I C
Sbjct: 191 MESVNVGTGLSLCAQGCGV-ILDTGTSLITGPSEE--IHALNKAIGGLPFLAGQYFIQCS 247
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+ P +P VSF +G FNL+ + Y++K +C+ GF A D+P P GPLWILGDVF
Sbjct: 248 KTPELPTVSFRLGGVWFNLTGQDYVIKILNSDDVGLCLLGFQALDIPKPAGPLWILGDVF 307
Query: 275 MGVYHTVFDSGKLRIG 290
+G Y VFD G +G
Sbjct: 308 LGPYVAVFDRGVKTVG 323
>gi|195583376|ref|XP_002081498.1| GD11051 [Drosophila simulans]
gi|194193507|gb|EDX07083.1| GD11051 [Drosophila simulans]
Length = 399
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ ++ CD +P +P ++FT+G + F L +Y+ + +C S F
Sbjct: 299 INESIGGTPSSFGQFLVACDSVPALPRITFTLGGRTFFLESHEYVFQDIYQDRRICSSAF 358
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 359 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 397
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 59/102 (57%), Gaps = 10/102 (9%)
Query: 1 MVEQGLVSEEVFSF-WLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+EQGL+++ +F+ + +P I FGG +P ++ G TYV V+ + YWQ ++
Sbjct: 213 MMEQGLLTKPIFNMARMMVEP-------IFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDS 265
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+I N +C+ GC I+D+GTS LA P IN +IGG
Sbjct: 266 AVIRNLE--LCQQGCEVIIDTGTSFLALPYDQAILINESIGG 305
>gi|118344572|ref|NP_001072053.1| cathepsin D2 precursor [Takifugu rubripes]
gi|55771084|dbj|BAD69802.1| cathepsin D2 [Takifugu rubripes]
Length = 386
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 69/96 (71%), Gaps = 1/96 (1%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
L+ + VFSF+LNRDP+A GG+++ GG++P+H+ G+ YV VT+K YWQ E+ I +G+Q
Sbjct: 199 LLPQNVFSFYLNRDPEAAIGGQLILGGLNPEHYAGELHYVNVTRKAYWQIEVNRINVGDQ 258
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+C+ C IVD+GTSL+ GP+ + +++AI G
Sbjct: 259 -LSLCKPSCQTIVDTGTSLITGPSEEIRALHNAIPG 293
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 93/178 (52%), Gaps = 19/178 (10%)
Query: 123 LLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV--VEKENVSAGDS-AVCSACEMAVVWV 179
L++ GL PE ++ YV+ K +E ++ GD ++C +V
Sbjct: 221 LILGGLNPEHYAGEL--------HYVNVTRKAYWQIEVNRINVGDQLSLCKPSCQTIVDT 272
Query: 180 QNQLKQKQTKEKVLSYINELCDSLP---NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 236
L ++E I L +++P E+IIDC++IP+MP +SF IG K+F L+P
Sbjct: 273 GTSLITGPSEE-----IRALHNAIPGMSRQKDENIIDCEQIPSMPVISFNIGGKLFPLNP 327
Query: 237 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
E YI K + C S FMA D+ PP PLW LGDVF+ Y+TVFD R+GFA A
Sbjct: 328 EDYIWKEMDRGTAFCQSRFMALDMGPPAAPLWNLGDVFIMKYYTVFDRDADRVGFALA 385
>gi|354497676|ref|XP_003510945.1| PREDICTED: napsin-A [Cricetulus griseus]
Length = 569
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 74/119 (62%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL+ + +FSF+LNRD + +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 201 MVEQGLLQKPIFSFYLNRDAEGSDGGELVLGGSDPAHYIPPLTFIPVTIPAYWQVHMESV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G +C GC I+D+GTSL+ GP+ + +N AIGG ++ + + S+ +L
Sbjct: 261 NVGT-GLSLCAQGCGVILDTGTSLITGPSEEIHALNKAIGGLPFLAGQYFIQCSKTPEL 318
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 74/144 (51%), Gaps = 8/144 (5%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+E NV G S C + ++ L ++E + +N+ LP G+ I C
Sbjct: 257 MESVNVGTGLSLCAQGCGV-ILDTGTSLITGPSEE--IHALNKAIGGLPFLAGQYFIQCS 313
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
+ P +P VSF +G FNL+ + Y++K +C+ GF A D+P P GPLWILGDVF
Sbjct: 314 KTPELPTVSFRLGGVWFNLTGQDYVIKILNSDDVGLCLLGFQALDIPKPAGPLWILGDVF 373
Query: 275 MGVYHTVFDSG----KLRIGFAEA 294
+G Y VFD G R+G A A
Sbjct: 374 LGPYVAVFDRGVKTVGPRVGLARA 397
>gi|195334342|ref|XP_002033842.1| GM21542 [Drosophila sechellia]
gi|194125812|gb|EDW47855.1| GM21542 [Drosophila sechellia]
Length = 398
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ ++ CD +P +P ++FT+G + F L +Y+ + +C S F
Sbjct: 298 INESIGGTPSSFGQFLVACDSVPALPRITFTLGGRTFFLESHEYVFQDIYQDRRICSSAF 357
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 358 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 396
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 65/101 (64%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+++ +FS +L+R + ++GG I FGG +P ++ G TYV V+ + YWQ ++
Sbjct: 207 MMEQGLLTKPIFSVYLSRHGE-KDGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSA 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+I N +C+ GC I+D+GTS LA P IN +IGG
Sbjct: 266 VIRNLE--LCQQGCEVIIDTGTSFLALPYDQAILINESIGG 304
>gi|395860891|ref|XP_003802735.1| PREDICTED: pepsin F-like [Otolemur garnettii]
Length = 470
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 132/297 (44%), Gaps = 36/297 (12%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
++G +SE +F+F+L+ + ++ GGVD ++ G+ +VP+TK+ YWQ L I +
Sbjct: 204 KKGRISENLFAFYLSNGGKGDS--MLMLGGVDHSYYSGELRWVPLTKQQYWQVALDSISM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWD 122
N + C GC AI+D+G+S++ GP V I + I S G + D
Sbjct: 262 -NGTIIACHDGCQAILDTGSSVVNGPNACVLNIQNVIHAHQ----------SFNGKYVID 310
Query: 123 LLVSGLLPEKVCQQIGLCAFNGAEY---VSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 179
+ LP+ V G Y + I+ V VS DS + + W+
Sbjct: 311 CNTTTHLPDIV------FVIGGVNYPVPARSYIRKVAFNTCVSTFDSFPDTMFN-SNTWI 363
Query: 180 QNQLKQKQTKEKVLSYINELCDSLPNPMGES--IIDCDRIPTMPNVSFTIGDKIFNLSPE 237
+ L + D N +G + +IDC+ +P++ F IG + +
Sbjct: 364 LGDV--------FLRLYFSVYDRANNRVGLASFVIDCNTTTHLPDIVFVIGGVSYPVPAR 415
Query: 238 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
YI K G C+S F + +WILGDVF+ +Y +V+D R+G A A
Sbjct: 416 SYIQKVAFG---TCVSTFKSLPNNVFSSKIWILGDVFLRLYFSVYDRANNRVGLAPA 469
>gi|24653643|ref|NP_610961.1| CG10104 [Drosophila melanogaster]
gi|7303185|gb|AAF58249.1| CG10104 [Drosophila melanogaster]
Length = 404
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ ++ CD +P +P ++FT+G + F L +Y+ + +C S F
Sbjct: 304 INESIGGTPSSFGQFLVPCDSVPDLPKITFTLGGRRFFLESHEYVFRDIYQDRRICSSAF 363
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 364 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 402
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 66/101 (65%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+++ +FS +L+R+ + ++GG I FGG +P ++ G TYV V+ + YWQ ++
Sbjct: 213 MMEQGLLTKPIFSVYLSRNGE-KDGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSA 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+I N +C+ GC I+D+GTS LA P IN +IGG
Sbjct: 272 VIRNLE--LCQQGCEVIIDTGTSFLALPYDQAILINESIGG 310
>gi|195560402|ref|XP_002077401.1| GD13241 [Drosophila simulans]
gi|194202512|gb|EDX16088.1| GD13241 [Drosophila simulans]
Length = 137
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ ++ CD +P +P ++FT+G + F L +Y+ + +C S F
Sbjct: 37 INESIGGTPSSFGQFLVACDSVPALPRITFTLGGRTFFLESHEYVFQDIYQDRRICSSAF 96
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 97 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 135
>gi|21063965|gb|AAM29212.1| AT05209p [Drosophila melanogaster]
Length = 404
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ ++ CD +P +P ++FT+G + F L +Y+ + +C S F
Sbjct: 304 INESIGGTPSSFGQFLVPCDSVPDLPKITFTLGGRRFFLESHEYVFRDIYQDRRICSSAF 363
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 364 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 402
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 66/101 (65%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+++ +FS +L+R+ + ++GG I FGG +P ++ G TYV V+ + YWQ ++
Sbjct: 213 MMEQGLLTKPIFSVYLSRNGE-KDGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSA 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+I N +C+ GC I+D+GTS LA P IN +IGG
Sbjct: 272 VIRNLE--LCQQGCEVIIDTGTSFLALPYDQAILINESIGG 310
>gi|210109642|gb|ACJ07131.1| cathepsin D-like protein, partial [Homarus gammarus]
Length = 231
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 67/103 (65%), Gaps = 4/103 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 57
MV QG V + +FSF+LN D D E GGE+V GG DP H++G+ YVPV+K GYWQ
Sbjct: 130 MVAQGAVDQPIFSFYLNHDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVSKVGYWQVTA 189
Query: 58 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I +G+ TG C C AIVD+GTSL+AGP V EI H +G
Sbjct: 190 EAIKVGDNVTGFCN-PCEAIVDTGTSLIAGPNAEVQEIVHMLG 231
>gi|449481456|ref|XP_002189698.2| PREDICTED: cathepsin E-A-like [Taeniopygia guttata]
Length = 405
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 62/91 (68%)
Query: 204 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
P+ GE ++DC R+ ++P++SFTIG + + L+ EQYI+K C+SGF + D+P
Sbjct: 308 PSNTGEFLVDCRRLSSLPHISFTIGHREYKLAAEQYIIKESIDDQTFCMSGFQSLDIPTR 367
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
G LWILGDVFM ++ +FD G R+GFA+A
Sbjct: 368 TGSLWILGDVFMSAFYCIFDRGNDRVGFAKA 398
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 1/101 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV E VFSF+L R D E GGE++ GG+D +KG +VPVT+K YWQ + +I
Sbjct: 207 IMNQHLVEEPVFSFYLKRGEDTENGGELILGGIDHSLYKGSIHWVPVTEKSYWQIHMNNI 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
I + T C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 267 KIQGRVT-FCSHGCEAIVDSGTSLITGPSSQIRRLQAYIGA 306
>gi|341887183|gb|EGT43118.1| hypothetical protein CAEBREN_03875 [Caenorhabditis brenneri]
Length = 397
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 68/104 (65%), Gaps = 1/104 (0%)
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV- 250
++ I +P GE + C +IP++PN++FT+G + FNL + YIL+ G +
Sbjct: 291 IIKKIQHKIGGIPLLNGEYEVVCSKIPSLPNITFTLGGQDFNLQGKDYILQLSNGNGGMT 350
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+SGFM D+P P GPLWILGDVF+G +++VFD G R+GFA +
Sbjct: 351 CLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHGNKRVGFATS 394
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 65/116 (56%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 202 FANSAICPNQLFAFWLSRDANDIANGGEITLCDTDSNHYSGNIAWEPLVSEDYWRIKLGA 261
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I Q T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 262 VSI--QGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLLNGEYEVVCSK 315
>gi|13928928|ref|NP_113858.1| napsin A aspartic peptidase precursor [Rattus norvegicus]
gi|6689137|emb|CAB65392.1| napsin [Rattus norvegicus]
gi|51260062|gb|AAH78790.1| Napsin A aspartic peptidase [Rattus norvegicus]
gi|149056039|gb|EDM07470.1| napsin A aspartic peptidase, isoform CRA_a [Rattus norvegicus]
Length = 420
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 75/119 (63%), Gaps = 1/119 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VEQ L+ + VFSF+LNRD + +GGE+V GG DP H+ T++PVT YWQ + +
Sbjct: 202 LVEQRLLEKPVFSFYLNRDSEGSDGGELVLGGSDPDHYVPPLTFIPVTIPAYWQVHMQSV 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G +C GC AI+D+GTSL+ GP+ + +N A+GG +++ + + S+ +L
Sbjct: 262 KVGT-GLNLCAQGCGAILDTGTSLITGPSEEIRALNKAVGGFPLLTGQYLIQCSKIPEL 319
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 59/95 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N+ P G+ +I C +IP +P VSF++G FNL+ + Y++K + +C+ GF
Sbjct: 295 LNKAVGGFPLLTGQYLIQCSKIPELPTVSFSLGGVWFNLTGQDYVIKILQSDVGLCLLGF 354
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIG 290
A D+P P GPLWILGDVF+G Y VFD G IG
Sbjct: 355 QALDIPKPEGPLWILGDVFLGSYVAVFDRGDKNIG 389
>gi|195430468|ref|XP_002063276.1| GK21477 [Drosophila willistoni]
gi|194159361|gb|EDW74262.1| GK21477 [Drosophila willistoni]
Length = 402
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 71/116 (61%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ L++ VFS +LNRD A+EGG + FGG +P+++ G TYVPV+++ YWQ +
Sbjct: 210 MMEQNLLASPVFSVYLNRDVAAKEGGALFFGGSNPQYYTGNFTYVPVSRRSYWQITMDSA 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + +CE GC I+D+GTS LA P IN +IGG G+ S C+ V
Sbjct: 270 HI--KDLNLCEQGCEVIIDTGTSFLAMPYDQAMLINKSIGGTPSSYGMFSIPCEQV 323
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 60/99 (60%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ P+ G I C+++P +P ++F +G + F+L YI K VC S
Sbjct: 302 INKSIGGTPSSYGMFSIPCEQVPHLPTMTFQLGGRKFHLEGRDYIFKDTYQDGIVCASAL 361
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD G RIGFA+A
Sbjct: 362 IAVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFADA 400
>gi|307168170|gb|EFN61449.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 382
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
++EQGLVS VFSF+LNRD DA GGE++ GG DP H++G TY+PV++KGYWQF L
Sbjct: 194 IIEQGLVSSPVFSFYLNRDFSDALNGGELILGGSDPTHYEGDFTYIPVSRKGYWQFTLDK 253
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I+ +C+ C A+ D + GP + IN IG
Sbjct: 254 IIA--SYINLCDENCQAVADVSADAIVGPKQHIVFINDLIG 292
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 59/100 (59%), Gaps = 4/100 (4%)
Query: 195 YINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISG 254
+IN+L ++ N GE ++C RI +P +SF +G K FNL+ E YI++ + +CIS
Sbjct: 286 FINDLIGTV-NINGEERVNCHRIDLLPTISFILGGKAFNLTGEDYIIQLPDNGNTICISR 344
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
F+ D R WILG F+G Y+T FD RIGFA A
Sbjct: 345 FVGLD---SREVEWILGVPFIGRYYTEFDMVNDRIGFALA 381
>gi|268579917|ref|XP_002644941.1| C. briggsae CBR-ASP-3 protein [Caenorhabditis briggsae]
Length = 397
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 69/104 (66%), Gaps = 1/104 (0%)
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV- 250
++ I ++P GE + C +IP++PN++FT+G + F+L + YIL+ G +
Sbjct: 291 IIKKIQHKIGAIPLLNGEYEVACSKIPSLPNITFTLGGQNFDLQGKDYILQLSNGNGGMT 350
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+SGFM D+P P GPLWILGDVF+G +++VFD G R+GFA +
Sbjct: 351 CLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHGNKRVGFATS 394
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 62/116 (53%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ YW+ +LG
Sbjct: 202 FANSAICPNQLFAFWLSRDANDIANGGEITLCDTDSNHYTGNIAWEPLVSMDYWRIKLGA 261
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I Q T G +IVD+GTSLL GP+ ++ +I H IG +++ E ++ S+
Sbjct: 262 VSI--QGTTFTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGAIPLLNGEYEVACSK 315
>gi|118102416|ref|XP_001235024.1| PREDICTED: cathepsin E [Gallus gallus]
Length = 397
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 67/104 (64%), Gaps = 1/104 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +PD+ GGE++FGG DP F G +VPVT++GYWQ +L ++
Sbjct: 205 MMAQDLVEMPIFSVYMSANPDSSLGGEVLFGGFDPSRFLGTLHWVPVTQQGYWQIQLDNV 264
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 104
+G + C GC AIVD+GTSLL GPT + E+ IG +
Sbjct: 265 QVGG-TVAFCADGCQAIVDTGTSLLTGPTKDIKEMQRYIGATAM 307
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 156 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPMGESII 212
++ +NV G + A C+ A+V L TK+ ++ YI GE I+
Sbjct: 259 IQLDNVQVGGTVAFCADGCQAIVDTGTSLLTGPTKDIKEMQRYIGATAMD-----GEYIV 313
Query: 213 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 272
DC R+ +MP V+FTI + LS + Y L ++C+SGF D+PPP GPLWILGD
Sbjct: 314 DCGRLSSMPIVTFTINGIPYVLSAQAYTLMEQSDGVDICLSGFQGMDVPPPAGPLWILGD 373
Query: 273 VFMGVYHTVFDSGKLRIGFAEAA 295
VF+ Y++VFD G R+GFA A
Sbjct: 374 VFIRQYYSVFDRGNNRVGFAPTA 396
>gi|281207795|gb|EFA81975.1| cathepsin D [Polysphondylium pallidum PN500]
Length = 390
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 74/118 (62%), Gaps = 8/118 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV + VF+FWL++ P A GGE+ FG +D + G TYVP+T + YW+F++ D
Sbjct: 195 MMAQGLVQQPVFAFWLSKVPGAN-GGELTFGSIDTTRYTGPITYVPLTNETYWEFKMDDF 253
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIG-----GEGVVSAECKLV 112
+ S G C GC AI DSGTSL+AGP+ + +N +G GEG+ ++ C ++
Sbjct: 254 ALNGNSLGYCGADGCHAICDSGTSLIAGPSAQINALNTKLGAVVMNGEGIFTS-CSVI 310
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 208 GESII-DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 266
GE I C I T+PN+ T+ + F L+P Y+L+ C+SGFM D+P P GP
Sbjct: 300 GEGIFTSCSVISTLPNIEITVAGRQFLLTPTDYVLQVTSMGQTECLSGFMGIDIPAPIGP 359
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ Y+ +FD G ++GFA A
Sbjct: 360 LWILGDVFISTYYAIFDYGNRQVGFATA 387
>gi|195382956|ref|XP_002050194.1| GJ22010 [Drosophila virilis]
gi|194144991|gb|EDW61387.1| GJ22010 [Drosophila virilis]
Length = 394
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 61/99 (61%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P G+ + C+++P +P +SFT+G + F L E Y+ VC S F
Sbjct: 294 INESIGGTPAAYGQFSVPCEQVPHLPTLSFTLGGRRFELKGEDYVFHDIFSDRTVCASAF 353
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD G RIGFA+A
Sbjct: 354 IAVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFADA 392
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 66/116 (56%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ L+ VFS +LNR + +GG + FGG P++++G TYVPVT + YWQ +L
Sbjct: 202 VMEQNLLKRPVFSVYLNRIASSRQGGYLFFGGSSPRYYRGNFTYVPVTHRAYWQVKLEAA 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
IG +C GC I+D+GTS LA P IN +IGG G S C+ V
Sbjct: 262 RIG--PLQLCLNGCQVIIDTGTSFLAVPYEQAILINESIGGTPAAYGQFSVPCEQV 315
>gi|21542388|sp|P55956.2|ASP3_CAEEL RecName: Full=Aspartic protease 3; Flags: Precursor
Length = 398
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 67/104 (64%), Gaps = 1/104 (0%)
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA-EV 250
V+ I +P GE ++C +IP++PN++F +G + F+L + YIL+ G
Sbjct: 291 VIKKIQHKIGGIPLFNGEYEVECSKIPSLPNITFNLGGQNFDLQGKDYILQMSNGNGGST 350
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+SGFM D+P P GPLWILGDVF+G +++VFD G R+GFA +
Sbjct: 351 CLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHGNKRVGFATS 394
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 61/119 (51%), Gaps = 7/119 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI DP H+ G + P+ + YW+ +L
Sbjct: 202 FANSAICKNQLFAFWLSRDANDITNGGEITLCETDPNHYVGNIAWEPLVSEDYWRIKLAS 261
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
++I T G +IVD+GTSLL GPT V+ +I H IGG G EC + S
Sbjct: 262 VVI--DGTTYTSGPIDSIVDTGTSLLTGPTDVIKKIQHKIGGIPLFNGEYEVECSKIPS 318
>gi|326933745|ref|XP_003212960.1| PREDICTED: cathepsin E-like [Meleagris gallopavo]
Length = 403
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%), Gaps = 4/117 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +PD+ GGE++FGG DP F G +VPVT +GYWQ +L ++
Sbjct: 211 MMAQDLVELPIFSVYMSANPDSSLGGEVLFGGFDPSRFLGTLHWVPVTVQGYWQIQLDNV 270
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
+G + C GC AIVD+GTSLL GPT + E+ IG +G +C L+ S
Sbjct: 271 QVGG-TVVFCANGCQAIVDTGTSLLTGPTKDIKEMQRYIGATPMDGEYVVDCSLLSS 326
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 81/144 (56%), Gaps = 10/144 (6%)
Query: 156 VEKENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPM-GESI 211
++ +NV G + V C+ A+V L TK+ ++ YI PM GE +
Sbjct: 265 IQLDNVQVGGTVVFCANGCQAIVDTGTSLLTGPTKDIKEMQRYIGA------TPMDGEYV 318
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
+DC + +MP V+FTI + LS + Y L ++C+SGF D+PPP GPLWILG
Sbjct: 319 VDCSLLSSMPIVTFTINGMPYLLSAQAYTLMEQSDGMDICLSGFQGMDVPPPAGPLWILG 378
Query: 272 DVFMGVYHTVFDSGKLRIGFAEAA 295
DVF+ Y++VFD G R+GFA AA
Sbjct: 379 DVFIRQYYSVFDRGNNRVGFAPAA 402
>gi|308512535|ref|XP_003118450.1| CRE-ASP-3 protein [Caenorhabditis remanei]
gi|308239096|gb|EFO83048.1| CRE-ASP-3 protein [Caenorhabditis remanei]
Length = 397
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 68/104 (65%), Gaps = 1/104 (0%)
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV- 250
V+ I +P GE ++C +IP +PN++FT+G + F+L + YIL+ G +
Sbjct: 291 VIKKIQHKIGGIPLFNGEYEVECSKIPKLPNITFTLGGQNFDLQGKDYILQMSNGNGGMT 350
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+SGFM D+P P GPLWILGDVF+G +++VFD G R+GFA +
Sbjct: 351 CLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHGNKRVGFATS 394
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ YW+ +LG
Sbjct: 202 FANSAICPNQLFAFWLSRDANDIANGGEITLCDTDSNHYTGNIAWEPLVSMDYWRIKLGA 261
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GPT V+ +I H IGG + + E ++ S+
Sbjct: 262 VSI--DGTTYTNGPIDSIVDTGTSLLTGPTDVIKKIQHKIGGIPLFNGEYEVECSK 315
>gi|453232811|ref|NP_509142.2| Protein ASP-3 [Caenorhabditis elegans]
gi|412984028|emb|CCD72415.2| Protein ASP-3 [Caenorhabditis elegans]
Length = 398
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 67/104 (64%), Gaps = 1/104 (0%)
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA-EV 250
V+ I +P GE ++C +IP++PN++F +G + F+L + YIL+ G
Sbjct: 291 VIKKIQHKIGGIPLFNGEYEVECSKIPSLPNITFNLGGQNFDLQGKDYILQMSNGNGGST 350
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+SGFM D+P P GPLWILGDVF+G +++VFD G R+GFA +
Sbjct: 351 CLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHGNKRVGFATS 394
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 61/119 (51%), Gaps = 7/119 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI DP H+ G + P+ + YW+ +L
Sbjct: 202 FANSAICKNQLFAFWLSRDANDITNGGEITLCDTDPNHYVGNIAWEPLVSEDYWRIKLAS 261
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
++I T G +IVD+GTSLL GPT V+ +I H IGG G EC + S
Sbjct: 262 VVI--DGTTYTSGPIDSIVDTGTSLLTGPTDVIKKIQHKIGGIPLFNGEYEVECSKIPS 318
>gi|324120787|dbj|BAJ78783.1| aspartic proteinase [Pinguicula macroceras]
Length = 80
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/56 (78%), Positives = 52/56 (92%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 56
MVEQGLV E+VFSFWLNR+ + E+GGE+VFGGVDPKHF G+HTYVPVT+KGYWQF+
Sbjct: 25 MVEQGLVKEQVFSFWLNRNGNDEDGGELVFGGVDPKHFIGEHTYVPVTQKGYWQFD 80
>gi|444731560|gb|ELW71913.1| Cathepsin D [Tupaia chinensis]
Length = 684
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 75/118 (63%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV + +F+F+LNRDP + GGE++ GGVD K++ G Y VT+K YWQ + +
Sbjct: 250 LMKQKLVEKNIFAFYLNRDPSGQPGGELMLGGVDTKYYTGSLDYYNVTRKAYWQIHMDKL 309
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
+G+ T +C+ GC IVD+GTSL+ GP V E++ A+G ++ E C+ V S
Sbjct: 310 EVGDGLT-LCQEGCEVIVDTGTSLIVGPVDEVRELHKAMGAVPLIQGEYMIPCEKVAS 366
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 64/98 (65%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+++ ++P GE +I C+++ ++P ++ +G+K ++L E+Y +K +G + +SGF
Sbjct: 343 LHKAMGAVPLIQGEYMIPCEKVASLPQITIRLGNKDYHLKGEEYTIKVSQGGKPLGLSGF 402
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 293
M +PPP GPLWILGDVF+G Y+ VFD R+G E
Sbjct: 403 MGMHIPPPAGPLWILGDVFIGCYYAVFDRDNNRVGPLE 440
>gi|253762217|gb|ACT35560.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 75/114 (65%), Gaps = 9/114 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ + +E+G E+VFGG+D H+ G+ T++P++ YWQ ++ +
Sbjct: 194 MVKQGLVSQPLFSVYLSSN--SEQGSEVVFGGIDSSHYTGQITWIPLSSATYWQIKMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAE 108
I Q T C GGC AI+D+GTSL+ GPT + +N +G GE VVS +
Sbjct: 252 TINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGASTNQYGEAVVSCQ 304
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 10/132 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + ++ +N + N GE+++ C I +MP V
Sbjct: 255 GQTVACSGGCQAIIDTGTSLIVGPTSD--INNMNAWVGASTNQYGEAVVSCQNIQSMPAV 312
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+FT+ + F + Y+ + G C +GF LWILGDVF+ Y+ VFD
Sbjct: 313 TFTLNGQAFTIPASAYVSQNSYG----CNTGFGQ----GGSDQLWILGDVFIREYYVVFD 364
Query: 284 SGKLRIGFAEAA 295
+ +G A +A
Sbjct: 365 AQAQYVGLASSA 376
>gi|395821502|ref|XP_003784077.1| PREDICTED: gastricsin-like [Otolemur garnettii]
Length = 390
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 67/102 (65%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q +++ +FSF+ +R P A+ GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 200 MLQQNQLTQPIFSFYFSRQPTAQYGGELILGGVDSQLYSGEIVWTPVTQEMYWQIAIQEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IGNQ+TG+C GC IVD+GTSLL P ++ A G +
Sbjct: 260 SIGNQATGLCSQGCQGIVDTGTSLLTVPQQYISSFVEATGAQ 301
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++ C + MP ++FTIG L P Y+L C G L G P
Sbjct: 306 GDFVVSCSNVQNMPTIAFTIGGAQLPLPPSTYVLNNNG----YCTLGIEPTYLSSQSGEP 361
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++VFD +GFA +A
Sbjct: 362 LWILGDVFLREYYSVFDMANNMVGFALSA 390
>gi|224085770|ref|XP_002189383.1| PREDICTED: cathepsin E [Taeniopygia guttata]
Length = 435
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 74/117 (63%), Gaps = 4/117 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +PD+ +GGE++FGG D F G +VPVT++GYWQ +L +I
Sbjct: 243 MMAQNLVELPIFSVYMSSNPDSPQGGEVLFGGFDTSRFTGTLNWVPVTQQGYWQIQLDNI 302
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
+G T C GC AIVD+GTSL+ GPT + ++ + IG +G + +C + S
Sbjct: 303 QLGGTVT-FCANGCQAIVDTGTSLITGPTKEIKKLQNLIGAVSVDGEYTVDCSNLSS 358
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 57/88 (64%), Gaps = 2/88 (2%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY-ILKTGEGIAEVCISGFMAFDLPPPRGP 266
GE +DC + +MP+++ TI + LS + Y +++ +G+A C SGF D+PPP GP
Sbjct: 347 GEYTVDCSNLSSMPDLTITINGLPYTLSAQAYTLMEYADGMA-FCTSGFQGSDIPPPTGP 405
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ +++VFD G +G A A
Sbjct: 406 LWILGDVFIRQFYSVFDRGNNMVGLAPA 433
>gi|326911558|ref|XP_003202125.1| PREDICTED: cathepsin E-A-like [Meleagris gallopavo]
Length = 404
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 65/100 (65%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ E + P+ GE ++DC R+ ++P++SFTIG + L+ EQY++K C+SGF
Sbjct: 300 LQEYIGASPSRSGEFLVDCRRLSSLPHISFTIGHHEYKLTAEQYVVKESIDDQTFCMSGF 359
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ D+P G LWILGDVFM ++ +FD G R+GFA++A
Sbjct: 360 QSLDIPTRSGSLWILGDVFMSAFYCIFDRGNDRVGFAKSA 399
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 64/108 (59%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV E VFSF+L R D E GGE++ GG+D +KG +VPVT+K YWQ L +I
Sbjct: 207 IMNQKLVEEPVFSFYLKRGDDTENGGELILGGIDHSLYKGSIHWVPVTEKSYWQIHLNNI 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
I + C GC AIVDSGTSL+ GP+ + + IG S E
Sbjct: 267 KIQGR-VAFCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSRSGE 313
>gi|253762215|gb|ACT35559.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 75/114 (65%), Gaps = 9/114 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ + +E+G E+VFGG+D H+ G+ T++P++ YWQ ++ +
Sbjct: 194 MVKQGLVSQPLFSVYLSSN--SEQGSEVVFGGIDSSHYTGQITWIPLSSATYWQIKMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAE 108
I Q T C GGC AI+D+GTSL+ GPT + +N +G GE VVS +
Sbjct: 252 TINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGASTNQYGEAVVSCQ 304
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 10/132 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + ++ +N + N GE+++ C I +MP+V
Sbjct: 255 GQTVACSGGCQAIIDTGTSLIVGPTSD--INNMNAWVGASTNQYGEAVVSCQNIQSMPDV 312
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+FT+ + F + Y+ + G C +GF LWILGDVF+ Y+ VFD
Sbjct: 313 TFTLNGQAFTIPASAYVSQNSYG----CNTGFGQ----GGSDQLWILGDVFIREYYVVFD 364
Query: 284 SGKLRIGFAEAA 295
+ +G A +A
Sbjct: 365 AHAQYVGLASSA 376
>gi|157423181|gb|AAI53793.1| Cathepsin E2 [Xenopus laevis]
Length = 397
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 66/102 (64%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++RDP++ GGE+VFGG D F G+ +VPVT +GYWQ +L +I
Sbjct: 200 MIAQNLVELPMFSVYMSRDPNSPVGGELVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I N C GGC AIVD+GTS++ GP+ + ++ IG
Sbjct: 260 QI-NGEVVFCSGGCQAIVDTGTSMITGPSSDIVQLQSIIGAS 300
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 56/85 (65%), Gaps = 3/85 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC + MP ++FTI + ++P+QY L+ +G VC SGF D+ PP GPL
Sbjct: 304 GDYEVDCTVLNKMPTMTFTINGIGYQMTPQQYTLQDDDG---VCSSGFQGLDISPPAGPL 360
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y++VFD G R+G A
Sbjct: 361 WILGDVFIGQYYSVFDRGNNRVGLA 385
>gi|253762219|gb|ACT35561.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 75/114 (65%), Gaps = 9/114 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ + +E+G E+VFGG+D H+ G+ T++P++ YWQ ++ +
Sbjct: 194 MVKQGLVSQPLFSVYLSSN--SEQGSEVVFGGIDSSHYTGQITWIPLSSATYWQIKMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAE 108
I Q T C GGC AI+D+GTSL+ GPT + +N +G GE VVS +
Sbjct: 252 TINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGASTNQYGEAVVSCQ 304
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 10/132 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + ++ +N + N GE+++ C I +MP+V
Sbjct: 255 GQTVACSGGCQAIIDTGTSLIVGPTSD--INNMNAWVGASTNQYGEAVVSCQNIQSMPDV 312
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+FT+ + F + Y+ + G C +GF LWILGDVF+ Y+ VFD
Sbjct: 313 TFTLNGQAFTIPASAYVSQNSYG----CNTGFGQGG----SDQLWILGDVFIREYYVVFD 364
Query: 284 SGKLRIGFAEAA 295
+ +G A +A
Sbjct: 365 AHAQYVGLASSA 376
>gi|193499293|gb|ACF18589.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 75/114 (65%), Gaps = 9/114 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ + +E+G E+VFGG+D H+ G+ T++P++ YWQ ++ +
Sbjct: 194 MVKQGLVSQPLFSVYLSSN--SEQGSEVVFGGIDSSHYTGQITWIPLSSATYWQIKMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAE 108
I Q T C GGC AI+D+GTSL+ GPT + +N +G GE VVS +
Sbjct: 252 TINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGASTNQYGEAVVSCQ 304
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 10/132 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + ++ +N + N GE+++ C I +MP+V
Sbjct: 255 GQTVACSGGCQAIIDTGTSLIVGPTSD--INNMNAWVGASTNQYGEAVVSCQNIQSMPDV 312
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+FT+ + F + Y+ + G C +GF LWILGDVF+ Y+ VFD
Sbjct: 313 TFTLNGQAFTIPASAYVFQNSYG----CNTGFGQ----GGSDQLWILGDVFIREYYVVFD 364
Query: 284 SGKLRIGFAEAA 295
+ +G A A
Sbjct: 365 AHAQYVGLASFA 376
>gi|324120785|dbj|BAJ78782.1| aspartic proteinase [Drosera capensis]
Length = 80
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/55 (80%), Positives = 50/55 (90%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 55
M+EQGLV+E VFSFWL+R+ + EEGGEIVFGGVDP HFKG HTYVPVT+KGYWQF
Sbjct: 25 MMEQGLVNEPVFSFWLDRNAENEEGGEIVFGGVDPSHFKGNHTYVPVTQKGYWQF 79
>gi|334322038|ref|XP_001372237.2| PREDICTED: cathepsin E-like [Monodelphis domestica]
Length = 352
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 71/113 (62%), Gaps = 4/113 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS ++ R+P++ G E++FGG D HF G +VPVTK+GYWQ L +I
Sbjct: 160 MMAQNLVDIPMFSVYMTRNPESTTGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDNI 219
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECK 110
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 220 QVGG-TIMFCAEGCQAIVDTGTSLITGPSEKIKQLQNAIGAVPTDGEYAMECN 271
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C+ + MP+++FTI + L P+ Y L E C SGF D+ PP GPL
Sbjct: 264 GEYAMECNNLNVMPDITFTINGIPYTLPPKAYTLTDFVDGMEFCTSGFQGLDIQPPAGPL 323
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G ++ VFD G +G A A
Sbjct: 324 WILGDVFLGQFYAVFDRGNNLVGLAPA 350
>gi|327279867|ref|XP_003224677.1| PREDICTED: cathepsin E-A-like [Anolis carolinensis]
Length = 406
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 63/94 (67%)
Query: 202 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 261
+ P GE ++DC R+ ++P V+F+IG++ F L+ E YI+K +G +C+SGF A D+
Sbjct: 306 AFPTATGEFLVDCRRVSSLPPVTFSIGEREFTLTAENYIIKEFDGKENLCLSGFQAQDIS 365
Query: 262 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PLWILGDVFM ++ VFD G R+GFA+ A
Sbjct: 366 SHNMPLWILGDVFMSAFYCVFDRGNDRVGFAKPA 399
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 68/118 (57%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++Q LV E +FSF LNR+ + + GG ++ GG+D F G + PVTKKGYWQ + +
Sbjct: 207 IIKQHLVEEPLFSFSLNREHNVDNGGVLILGGIDHSLFTGPIHWFPVTKKGYWQIHMNSV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 114
I Q T C GC AIVDSGTSL+ GP + + +IG + E C+ V S
Sbjct: 267 KIQGQVTS-CISGCEAIVDSGTSLITGPLSQIVRLQQSIGAFPTATGEFLVDCRRVSS 323
>gi|148236737|ref|NP_001079044.1| cathepsin E-B precursor [Xenopus laevis]
gi|46395760|sp|Q805F2.1|CATEB_XENLA RecName: Full=Cathepsin E-B; Flags: Precursor
gi|28460655|dbj|BAC57454.1| cathepsin E2 [Xenopus laevis]
Length = 397
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 66/100 (66%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++RDP++ GGE+VFGG D F G+ +VPVT +GYWQ +L +I
Sbjct: 200 MIAQNLVELPMFSVYMSRDPNSPVGGELVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I N C GGC AIVD+GTS++ GP+ + ++ IG
Sbjct: 260 QI-NGEVVFCSGGCQAIVDTGTSMITGPSSDIVQLQSIIG 298
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 56/85 (65%), Gaps = 3/85 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC + MP ++FTI + ++P+QY L+ +G VC SGF D+ PP GPL
Sbjct: 304 GDYEVDCTVLNKMPTMTFTINGIGYQMTPQQYTLQDDDG---VCSSGFQGLDISPPAGPL 360
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y++VFD G R+G A
Sbjct: 361 WILGDVFIGQYYSVFDRGNNRVGLA 385
>gi|345802472|ref|XP_854465.2| PREDICTED: pepsin B-like [Canis lupus familiaris]
Length = 390
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 63/88 (71%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QG +++ +FSF+ +R P E GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 200 MVQQGQLTQPIFSFYFSRQPTYEYGGELILGGVDTQFYSGEIVWAPVTREMYWQVAIDEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGP 88
L+ NQ+TG+C GC AIVD+GT +LA P
Sbjct: 260 LVNNQATGLCSQGCQAIVDTGTYVLAVP 287
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ +++C+ I +MP ++F I L P Y+ C G A LP P G P
Sbjct: 306 GDFVVNCNSIQSMPTITFVISGSPLPLPPSAYVFNNNG----YCTLGIEATYLPSPTGQP 361
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LW LGDVF+ Y+T++D ++GFA +A
Sbjct: 362 LWTLGDVFLKEYYTIYDLANNKMGFAPSA 390
>gi|413953307|gb|AFW85956.1| hypothetical protein ZEAMMB73_368303 [Zea mays]
Length = 342
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/71 (57%), Positives = 54/71 (76%)
Query: 197 NELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM 256
N+ C+ LP+P ES +DC +I MPN++FTI +K F L+PEQYI+K + +CISGFM
Sbjct: 19 NQFCERLPSPNDESTVDCHQISKMPNLAFTIANKTFTLTPEQYIVKLEQAGQTICISGFM 78
Query: 257 AFDLPPPRGPL 267
AFD+PPPRGPL
Sbjct: 79 AFDVPPPRGPL 89
>gi|125858582|gb|AAI29608.1| Ce1-A protein [Xenopus laevis]
Length = 394
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 67/100 (67%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++R+P++ GGE+VFGG D F G+ +VPVT +GYWQ +L ++
Sbjct: 197 MIAQNLVELPMFSIYMSRNPNSAVGGELVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNV 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I N C GGC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 257 QI-NGEVLFCSGGCQAIVDTGTSLITGPSSDIVQLQNIIG 295
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 57/85 (67%), Gaps = 3/85 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC + MP V+FTI + ++P+QY L+ G G VC SGF D+PPP GPL
Sbjct: 301 GDYEVDCSVLNEMPTVTFTINGIGYQMTPQQYTLQDGGG---VCSSGFQGLDIPPPAGPL 357
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y++VFD G R+G A
Sbjct: 358 WILGDVFIGQYYSVFDRGNNRVGLA 382
>gi|307177550|gb|EFN66654.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 328
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 69/116 (59%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV----TKKGYWQFE 56
M+EQGLVS +FSF+LNR+ A+ GG+++FGG DP ++G TY+PV T KGYWQF
Sbjct: 170 MIEQGLVSSRIFSFYLNRNTSADLGGKLIFGGSDPACYEGDFTYIPVLHIFTDKGYWQFI 229
Query: 57 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 112
+ I I N++ +CE C A VD+ + GP V+ IN I +C +
Sbjct: 230 IDSIQI-NENFTLCEASCYATVDTSAWKIIGPEKDVSSINRFIETNSQGRVDCDRI 284
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 4/53 (7%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK 242
EK +S IN ++ N G +DCDRI +P + F +G K FNL+ YI++
Sbjct: 261 EKDVSSINRFIET--NSQGR--VDCDRIFQLPTIRFNLGGKAFNLTGRDYIIR 309
>gi|301784222|ref|XP_002927531.1| PREDICTED: pepsin B-like [Ailuropoda melanoleuca]
Length = 390
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 65/102 (63%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q +++ +FSF+ +R P E GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 200 MVQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDSQFYSGEIVWTPVTREMYWQIAIDEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
L+ NQ+TG+C GC AIVD+GT +LA P + G +
Sbjct: 260 LVSNQATGLCSQGCQAIVDTGTYMLAVPQQFIGSFLQTTGAQ 301
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 49/86 (56%), Gaps = 5/86 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++DCD I +MP ++F I L P Y+L C G A LP P G P
Sbjct: 306 GDFVVDCDSIQSMPTITFVISWTALPLPPSAYVLNNNG----YCTLGIEATYLPSPTGQP 361
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFA 292
LWILGDVF+ Y+T++D G R+GFA
Sbjct: 362 LWILGDVFLKEYYTIYDIGNNRMGFA 387
>gi|148227998|ref|NP_001079043.1| cathepsin E-A precursor [Xenopus laevis]
gi|46395761|sp|Q805F3.1|CATEA_XENLA RecName: Full=Cathepsin E-A; Flags: Precursor
gi|28460653|dbj|BAC57453.1| cathepsin E1 [Xenopus laevis]
gi|213625998|gb|AAI69692.1| Cathepsin E1 [Xenopus laevis]
gi|213627772|gb|AAI69694.1| Cathepsin E1 [Xenopus laevis]
Length = 397
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 67/100 (67%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++R+P++ GGE+VFGG D F G+ +VPVT +GYWQ +L ++
Sbjct: 200 MIAQNLVELPMFSVYMSRNPNSAVGGELVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNV 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I N C GGC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 260 QI-NGEVLFCSGGCQAIVDTGTSLITGPSSDIVQLQNIIG 298
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 57/85 (67%), Gaps = 3/85 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC + MP V+FTI + ++P+QY L+ G G VC SGF D+PPP GPL
Sbjct: 304 GDYEVDCSVLNEMPTVTFTINGIGYQMTPQQYTLQDGGG---VCSSGFQGLDIPPPAGPL 360
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y++VFD G R+G A
Sbjct: 361 WILGDVFIGQYYSVFDRGNNRVGLA 385
>gi|281339613|gb|EFB15197.1| hypothetical protein PANDA_017311 [Ailuropoda melanoleuca]
Length = 318
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 65/102 (63%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q +++ +FSF+ +R P E GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 131 MVQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDSQFYSGEIVWTPVTREMYWQIAIDEF 190
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
L+ NQ+TG+C GC AIVD+GT +LA P + G +
Sbjct: 191 LVSNQATGLCSQGCQAIVDTGTYMLAVPQQFIGSFLQTTGAQ 232
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 49/86 (56%), Gaps = 5/86 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++DCD I +MP ++F I L P Y+L C G A LP P G P
Sbjct: 237 GDFVVDCDSIQSMPTITFVISWTALPLPPSAYVLNNNG----YCTLGIEATYLPSPTGQP 292
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFA 292
LWILGDVF+ Y+T++D G R+GFA
Sbjct: 293 LWILGDVFLKEYYTIYDIGNNRMGFA 318
>gi|195121164|ref|XP_002005091.1| GI20282 [Drosophila mojavensis]
gi|193910159|gb|EDW09026.1| GI20282 [Drosophila mojavensis]
Length = 392
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 61/99 (61%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P G+ + CD++ +P ++FT+G++ F L E Y+ VC S F
Sbjct: 292 INESIGGTPAAYGQFSVPCDQVAHLPTLTFTLGNRRFQLKGEDYVFHDIFPDRTVCASAF 351
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD G RIGFA+A
Sbjct: 352 IAVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFADA 390
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 65/117 (55%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ L+ VFS +LNR+ + +GG + FGG ++++G TYVPVT + YWQ +L
Sbjct: 200 IMEQNLLKRPVFSVYLNRELGSNQGGYLFFGGSSSRYYRGNFTYVPVTHRAYWQVKLETA 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVV 113
IG +C GC I+D+GTS LA P IN +IGG G S C V
Sbjct: 260 RIGKLQ--LCLNGCQVIIDTGTSFLAVPYEQAILINESIGGTPAAYGQFSVPCDQVA 314
>gi|89111566|dbj|BAE80442.1| pepsinogen B isozyme [Canis lupus familiaris]
Length = 374
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 65/102 (63%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QG +++ +FSF+ +R P E GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 184 MVQQGQLTQPIFSFYFSRQPTYEYGGELILGGVDTQFYSGEIVWAPVTREMYWQVAIDEF 243
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIGNQ+TG+C GC IVD+GT L P + A G +
Sbjct: 244 LIGNQATGLCSQGCQGIVDTGTFPLTVPQQYLDSFVKATGAQ 285
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G +++C+ I +MP ++F I L P Y+L C G LP P G P
Sbjct: 290 GNFVVNCNSIQSMPTITFVISGSPLPLPPSTYVLNNNG----YCTLGIEVTYLPSPNGQP 345
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y+TVFD R+GFA ++
Sbjct: 346 LWILGDVFLREYYTVFDMAANRVGFALSS 374
>gi|344236711|gb|EGV92814.1| Cathepsin E [Cricetulus griseus]
Length = 388
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 68/112 (60%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP G E+ FGG DP HF G ++PVTK+GYWQ L +
Sbjct: 196 MMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNWIPVTKQGYWQIALDGV 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G+ + C GC AIVD+GTSL+ GP+ + ++ AIG +G + +C
Sbjct: 256 QVGD-TVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIGATPMDGEYAVDC 306
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 54/90 (60%), Gaps = 1/90 (1%)
Query: 206 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
PM GE +DC + TMPNV+F + + LSP YIL + C SGF D+ PP
Sbjct: 297 PMDGEYAVDCANLNTMPNVAFILNGVSYTLSPTAYILPDLVDGMQFCGSGFQGLDIQPPS 356
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ ++ VFD G ++G A A
Sbjct: 357 GPLWILGDVFIRQFYAVFDRGNNQVGLAPA 386
>gi|2851407|sp|P16228.3|CATE_RAT RecName: Full=Cathepsin E; Flags: Precursor
gi|1113086|dbj|BAA08128.1| cathepsin E precursor [Rattus rattus]
gi|149058663|gb|EDM09820.1| cathepsin E, isoform CRA_a [Rattus norvegicus]
Length = 398
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 68/112 (60%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +L+ DP G E+ FGG DP HF G ++PVTK+GYWQ L I
Sbjct: 206 MMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQIALDGI 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G+ + C GC AIVD+GTSL+ GP + ++ AIG +G + +C
Sbjct: 266 QVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPMDGEYAVDC 316
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 205 NPM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
PM GE +DC + MPNV+F I + LSP YIL + C SGF D+ PP
Sbjct: 306 TPMDGEYAVDCATLNMMPNVTFLINGVSYTLSPTAYILPDLVDGMQFCGSGFQGLDIQPP 365
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G ++G A A
Sbjct: 366 AGPLWILGDVFIRKFYSVFDRGNNQVGLAPA 396
>gi|1507725|gb|AAB06575.1| aspartic protease, partial [Ancylostoma caninum]
Length = 442
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 58/91 (63%)
Query: 204 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
P GE +I CD++P P +SF I + F L E Y+L G +C+SGFM D P
Sbjct: 320 PTYEGEYMIPCDKVPFPPRLSFVIEARTFTLKGEDYVLTVKAGGKSICLSGFMGMDFPER 379
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 380 IGELWILGDVFIGKYYTVFDVGQARLGFAQA 410
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 60/91 (65%), Gaps = 4/91 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+EQ V VF+ WLNR+PD+E GGEI GG+D + + T+ PVT++GYWQF++ D
Sbjct: 216 FIEQKKVPSPVFALWLNRNPDSELGGEITLGGMDTRRYVEPITWTPVTRRGYWQFKM-DK 274
Query: 61 LIGNQSTGVCEG---GCAAIVDSGTSLLAGP 88
+ G ++ C GC AI D+GTSL+AGP
Sbjct: 275 VQGGSTSIACPNEFSGCQAIADTGTSLIAGP 305
>gi|38303893|gb|AAH62002.1| Ctse protein [Rattus norvegicus]
Length = 398
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 68/112 (60%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +L+ DP G E+ FGG DP HF G ++PVTK+GYWQ L I
Sbjct: 206 MMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQIALDGI 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G+ + C GC AIVD+GTSL+ GP + ++ AIG +G + +C
Sbjct: 266 QVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPMDGEYAVDC 316
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 205 NPM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
PM GE +DC + MPNV+F I + LSP YIL + C SGF D+ PP
Sbjct: 306 TPMDGEYAVDCATLNMMPNVTFLINGVSYTLSPTAYILPDLVDGMQFCGSGFQGLDIQPP 365
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G ++G A A
Sbjct: 366 AGPLWILGDVFIRKFYSVFDRGNNQVGLAPA 396
>gi|60600181|gb|AAX26634.1| unknown [Schistosoma japonicum]
gi|226476850|emb|CAX72341.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 262
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 27 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 86
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 87 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 140
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 80 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 137
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 138 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 195
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 196 IGKFYTIFDMGKNRVGFAKA 215
>gi|226476878|emb|CAX72319.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 262
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 27 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 86
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 87 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 140
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 80 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 137
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 138 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 195
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 196 IGKFYTIFDMGKNRVGFAKA 215
>gi|449282010|gb|EMC88940.1| Cathepsin E-B, partial [Columba livia]
Length = 387
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 65/108 (60%), Gaps = 1/108 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q LV E +FSF+L R+ D E GGE++ GG+D +KG +VPVT+K YWQ L +I
Sbjct: 189 IMNQQLVEEPIFSFYLKREDDTENGGELILGGIDHSLYKGSIHWVPVTEKSYWQIHLNNI 248
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
I + C GC AIVDSGTSL+ GP+ + + IG S E
Sbjct: 249 KIQGR-VAFCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSHSGE 295
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 64/99 (64%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ E + P+ GE ++DC R+ ++P++SFTIG + L+ EQY++K C+SGF
Sbjct: 282 LQEYIGASPSHSGEFLVDCRRLSSLPHISFTIGHHEYKLTAEQYVVKESIEDQTFCMSGF 341
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D+ GPLWILGDVFM ++ +FD G R+GFA++
Sbjct: 342 QSLDITTRAGPLWILGDVFMSAFYCIFDRGNDRVGFAKS 380
>gi|226476838|emb|CAX72335.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476906|emb|CAX72305.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|126309845|ref|XP_001370435.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 66/102 (64%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++ L++ VF+F+L+ + ++ GGE+VFGGVD + G + PVT++ YWQ +
Sbjct: 199 FLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYWAPVTEEAYWQIAINGF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 259 SIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 56/107 (52%), Gaps = 4/107 (3%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+++ S + + + + G ++ C +MP ++F I F L P Y+L + E
Sbjct: 287 QQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPLPPSAYVLPSNSNYCE 346
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
V G M LP G PLWILGDVF+ Y++++D G R+GFA A
Sbjct: 347 V---GIMPTYLPSQNGQPLWILGDVFLRNYYSIYDLGNNRVGFANLA 390
>gi|226476876|emb|CAX72320.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476810|emb|CAX72321.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476830|emb|CAX72331.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476812|emb|CAX72322.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476818|emb|CAX72325.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKSGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|351710945|gb|EHB13864.1| Cathepsin E, partial [Heterocephalus glaber]
Length = 391
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 48/104 (46%), Positives = 66/104 (63%), Gaps = 2/104 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +++ +P GGE+ FGG DP HF G +VPVTK+ YWQ L I
Sbjct: 200 MMAQNLVALPLFSVYMSSNPGGS-GGELTFGGYDPSHFSGSLNWVPVTKQAYWQIALDGI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 104
L+G+ S C GC AIVD+GTSL+ GP P + ++ A+G V
Sbjct: 259 LVGD-SVMFCSEGCQAIVDTGTSLITGPPPKIKQLQEALGATYV 301
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 74/147 (50%), Gaps = 18/147 (12%)
Query: 156 VEKENVSAGDSAV-CSACEMAVVWVQNQL------KQKQTKEKV-LSYINELCDSLPNPM 207
+ + + GDS + CS A+V L K KQ +E + +Y++E
Sbjct: 253 IALDGILVGDSVMFCSEGCQAIVDTGTSLITGPPPKIKQLQEALGATYVDE--------- 303
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
E ++C + M +V+F I ++ LSP Y L +VC +GF ++ PP GPL
Sbjct: 304 -EYAVECANLNMMQDVTFVINGVLYTLSPTAYTLLDYADGMQVCSTGFQGLEIQPPAGPL 362
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ ++ VFD G ++G A A
Sbjct: 363 WILGDVFIRQFYAVFDRGNNQVGLAPA 389
>gi|195485971|ref|XP_002091310.1| GE13586 [Drosophila yakuba]
gi|194177411|gb|EDW91022.1| GE13586 [Drosophila yakuba]
Length = 404
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 62/99 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ ++ C+ I +P ++FT+G + F L +Y+ + +C S F
Sbjct: 304 INESIGGTPSSFGQFLVPCENISALPKITFTLGGRTFFLESHEYVFRDIYQDRRICSSAF 363
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 364 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 402
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 65/101 (64%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+++ +FS +L+R + +EGG I FGG +P ++ G TYV V+ + YWQ ++
Sbjct: 213 MMEQGLLTKPIFSVYLSRHGE-KEGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSA 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+I N +C+ GC I+D+GTS LA P IN +IGG
Sbjct: 272 VIRNLE--LCQQGCEVIIDTGTSFLALPYDQAILINESIGG 310
>gi|2102722|gb|AAB63357.1| aspartic protease precursor, partial [Schistosoma japonicum]
Length = 428
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 193 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 253 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 306
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 246 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 303
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 304 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 361
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 362 IGKFYTIFDMGKNRVGFAKA 381
>gi|226476902|emb|CAX72307.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|189502972|gb|ACE06867.1| unknown [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|126309851|ref|XP_001370482.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 66/102 (64%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++ L++ VF+F+L+ + ++ GGE+VFGGVD + G + PVT++ YWQ +
Sbjct: 199 FLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYWAPVTEEAYWQIAINGF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 259 SIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 4/107 (3%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+++ S + + + + G ++ C +MP ++F I F L P Y+L + E
Sbjct: 287 QQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPLPPSAYVLPSNSNYCE 346
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
V G M LP G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 347 V---GIMPTYLPSQNGQPLWILGDVFLRNYYSVYDLGNNRVGFANLA 390
>gi|126309849|ref|XP_001370462.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 66/102 (64%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+++ L++ VF+F+L+ + ++ GGE+VFGGVD + G + PVT++ YWQ +
Sbjct: 199 FLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYWAPVTEEAYWQIAINGF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 259 SIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 4/107 (3%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+++ S + + + + G ++ C +MP ++F I F L P Y+L + E
Sbjct: 287 QQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPLPPSAYVLPSNSNYCE 346
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
V G M LP G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 347 V---GIMPTYLPSQNGQPLWILGDVFLRNYYSVYDLGNNRVGFANLA 390
>gi|432943847|ref|XP_004083297.1| PREDICTED: cathepsin E-A-like [Oryzias latipes]
Length = 412
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 66/99 (66%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+++L + P G+ ++DC R+ ++P+V+F +G+ + L+ E YI K E+C +GF
Sbjct: 303 LHQLIGATPTHFGDFVVDCARLSSLPHVTFVLGEVEYTLTSEHYIRKETFSSRELCFTGF 362
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
MA ++ GPLWILGDVF+ Y+T+FD G+ R+GFA A
Sbjct: 363 MAAEMFSADGPLWILGDVFLTQYYTIFDKGQDRVGFARA 401
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 68/132 (51%), Gaps = 12/132 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAE-EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q ++ E +FSF+L+R G+++ GG D + G +VPVT KGYWQ +
Sbjct: 209 MLAQKILDEPIFSFYLSRSKSKSVPEGQLLLGGTDESLYSGPINWVPVTIKGYWQIRMDS 268
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ + S+ +C GC AIVD+GTSL+AGP + ++ IG + +GD
Sbjct: 269 VSVQGVSS-LCRRGCEAIVDTGTSLIAGPPREILRLHQLIGA----------TPTHFGDF 317
Query: 120 IWDLLVSGLLPE 131
+ D LP
Sbjct: 318 VVDCARLSSLPH 329
>gi|2347147|gb|AAC37302.1| aspartic proteinase precursor [Schistosoma japonicum]
gi|226476814|emb|CAX72323.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476816|emb|CAX72324.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476820|emb|CAX72326.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476822|emb|CAX72327.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476824|emb|CAX72328.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476826|emb|CAX72329.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476834|emb|CAX72333.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476836|emb|CAX72334.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476840|emb|CAX72336.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476842|emb|CAX72337.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476844|emb|CAX72338.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476846|emb|CAX72339.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476852|emb|CAX72342.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476880|emb|CAX72318.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476882|emb|CAX72317.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476886|emb|CAX72315.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476890|emb|CAX72313.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476892|emb|CAX72312.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476894|emb|CAX72311.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476896|emb|CAX72310.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476898|emb|CAX72309.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476900|emb|CAX72308.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226482870|emb|CAX79402.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476888|emb|CAX72314.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476904|emb|CAX72306.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476854|emb|CAX72343.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 435
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 200 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 260 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 313
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 253 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 310
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 311 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 368
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 369 IGKFYTIFDMGKNRVGFAKA 388
>gi|1585311|prf||2124395A Asp protease
Length = 380
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 76/137 (55%), Gaps = 4/137 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGF 291
+G ++T+FD GK R+GF
Sbjct: 363 IGKFYTIFDMGKNRVGF 379
>gi|194218271|ref|XP_001501895.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 71/110 (64%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D E G ++FGG+DP ++ G +VPV+ +GYWQ + + +
Sbjct: 203 DQGLVSQDLFSVYLSSDD--ESGSVVMFGGIDPSYYTGSLHWVPVSNEGYWQITMDSVTV 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+S C GGC AIVD+GTSLLAGPT + I +G GEGV+S
Sbjct: 261 NGESIA-CSGGCQAIVDTGTSLLAGPTSAIDNIQSYLGFSEDSSGEGVIS 309
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + + SY+ DS GE +I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSAIDNIQSYLGFSEDS----SGEGVISCSSIYSLP 317
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FT+ F L P YIL+ + CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTLNGVEFPLRPSAYILEEDDS----CISGFEGMDLDTSSGELWILGDVFIRQYFTV 373
Query: 282 FDSGKLRIGFAEAA 295
FD +IG A A
Sbjct: 374 FDRANNQIGLASVA 387
>gi|226476856|emb|CAX72344.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|354478111|ref|XP_003501259.1| PREDICTED: cathepsin E-like isoform 1 [Cricetulus griseus]
Length = 396
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 68/112 (60%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP G E+ FGG DP HF G ++PVTK+GYWQ L +
Sbjct: 204 MMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNWIPVTKQGYWQIALDGV 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G+ + C GC AIVD+GTSL+ GP+ + ++ AIG +G + +C
Sbjct: 264 QVGD-TVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIGATPMDGEYAVDC 314
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 54/90 (60%), Gaps = 1/90 (1%)
Query: 206 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
PM GE +DC + TMPNV+F + + LSP YIL + C SGF D+ PP
Sbjct: 305 PMDGEYAVDCANLNTMPNVAFILNGVSYTLSPTAYILPDLVDGMQFCGSGFQGLDIQPPS 364
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ ++ VFD G ++G A A
Sbjct: 365 GPLWILGDVFIRQFYAVFDRGNNQVGLAPA 394
>gi|226476832|emb|CAX72332.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 75/116 (64%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DLP R LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVFKLGSEICLTGFIGMDLP--RKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|6978719|ref|NP_037070.1| cathepsin E precursor [Rattus norvegicus]
gi|1113084|dbj|BAA07285.1| cathepsin E precursor [Rattus norvegicus]
Length = 365
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 63/100 (63%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +L+ DP G E+ FGG DP HF G ++PVTK+GYWQ L I
Sbjct: 206 MMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQIALDGI 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 266 QVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIG 304
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 32/46 (69%)
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ C SGF D+ PP GPLWILGDVF+ +++VFD G ++G A A
Sbjct: 318 QFCGSGFQGLDIQPPAGPLWILGDVFIRKFYSVFDRGNNQVGLAPA 363
>gi|344277046|ref|XP_003410316.1| PREDICTED: cathepsin E [Loxodonta africana]
Length = 396
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 68/112 (60%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP G E++FGG D HF G +VPVTK+GYWQ L +I
Sbjct: 204 MMAQNLVDLPMFSVYMSSDPAGGMGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ AIG E G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSNNIKQLQRAIGAEPENGEYAVEC 314
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 51/87 (58%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L C SGF D+ PP GPL
Sbjct: 308 GEYAVECVNLNVMPDVTFTINGVSYTLSPTAYTLLDSADGMNFCSSGFQGLDIQPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G ++G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNQVGLAPA 394
>gi|195150257|ref|XP_002016071.1| GL10692 [Drosophila persimilis]
gi|194109918|gb|EDW31961.1| GL10692 [Drosophila persimilis]
Length = 399
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 60/99 (60%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ + CD++P +P ++F +G + F L YI + E+C S
Sbjct: 299 INESIGGTPSEYGQYSVPCDQVPQLPRLTFQLGSQQFFLDGSNYIFRDVYQDREICFSAI 358
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ DLP P GPLWILGDVF+G Y+T FD G RIGFAEA
Sbjct: 359 IGVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFAEA 397
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 69/116 (59%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ L+S VFS +LNR+ + EGG + FGG +P++++G TYVPV+++ YWQ +
Sbjct: 207 IMEQKLLSNPVFSVYLNREQEHPEGGALFFGGSNPRYYRGNFTYVPVSRRAYWQVRMEAA 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + +C+ GC I+D+GTS LA P IN +IGG G S C V
Sbjct: 267 TINDLR--LCQHGCEVIIDTGTSFLALPYDQAILINESIGGTPSEYGQYSVPCDQV 320
>gi|149058665|gb|EDM09822.1| cathepsin E, isoform CRA_c [Rattus norvegicus]
Length = 365
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 63/100 (63%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +L+ DP G E+ FGG DP HF G ++PVTK+GYWQ L I
Sbjct: 206 MMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQIALDGI 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 266 QVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIG 304
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 32/46 (69%)
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ C SGF D+ PP GPLWILGDVF+ +++VFD G ++G A A
Sbjct: 318 QFCGSGFQGLDIQPPAGPLWILGDVFIRKFYSVFDRGNNQVGLAPA 363
>gi|198457045|ref|XP_001360531.2| GA10074 [Drosophila pseudoobscura pseudoobscura]
gi|198135836|gb|EAL25106.2| GA10074 [Drosophila pseudoobscura pseudoobscura]
Length = 399
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 60/99 (60%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ + CD++P +P ++F +G + F L YI + E+C S
Sbjct: 299 INESIGGTPSEYGQYSVPCDQVPQLPRLTFQLGSQQFFLDGSNYIFRDVYQDREICFSAI 358
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ DLP P GPLWILGDVF+G Y+T FD G RIGFAEA
Sbjct: 359 IGVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFAEA 397
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 67/116 (57%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ L+S VFS +LNR + EGG + FGG +P++++G TYVPV+ + YWQ +
Sbjct: 207 IMEQKLLSNPVFSVYLNRQQEHPEGGALFFGGSNPRYYRGNFTYVPVSHRAYWQVRMEAA 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + +C+ GC I+D+GTS LA P IN +IGG G S C V
Sbjct: 267 TIND--LRLCQHGCEVIIDTGTSFLALPYDQAILINESIGGTPSEYGQYSVPCDQV 320
>gi|126723599|ref|NP_001075713.1| cathepsin E precursor [Oryctolagus cuniculus]
gi|1168791|sp|P43159.1|CATE_RABIT RecName: Full=Cathepsin E; Flags: Precursor
gi|402729|gb|AAC37308.1| procathepsin E [Oryctolagus cuniculus]
Length = 396
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 70/113 (61%), Gaps = 4/113 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LVS +FS +++ +P+ G E+ FGG D HF G +VPVTK+GYWQ L +I
Sbjct: 204 MMAQNLVSLPMFSVYMSSNPEGGSGSELTFGGYDSSHFSGSLNWVPVTKQGYWQIALDEI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECK 110
+G S C GC AIVD+GTSL+ GP+ + ++ AIG +G + EC+
Sbjct: 264 QVGG-SPMFCPEGCQAIVDTGTSLITGPSDKIIQLQAAIGATPMDGEYAVECE 315
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 1/91 (1%)
Query: 205 NPM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
PM GE ++C+ + MP+V+F I + LS Y L + C SGF D+ PP
Sbjct: 304 TPMDGEYAVECENLNIMPDVTFVINGVPYTLSATAYTLPDFVDGMQFCGSGFQGLDIQPP 363
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 364 AGPLWILGDVFIRQFYSVFDRGSNRVGLAPA 394
>gi|449280945|gb|EMC88160.1| Cathepsin E, partial [Columba livia]
Length = 374
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 69/113 (61%), Gaps = 4/113 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +L+ +P++ GGE++FGG DP F G +VPVT++GYWQ +L +I
Sbjct: 182 MMAQNLVELPIFSVYLSTNPESSLGGELLFGGFDPSRFMGTLNWVPVTQQGYWQIQLDNI 241
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECK 110
+ + C GC AIVD+GTSL+ GPT V + IG +G + EC
Sbjct: 242 QLAG-TVAFCTNGCQAIVDTGTSLITGPTKDVKVLQKYIGATPVDGEYAVECN 293
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 81/144 (56%), Gaps = 12/144 (8%)
Query: 156 VEKENVS-AGDSAVCSACEMAVVWVQNQLKQKQTKE-KVLS-YINELCDSLPNPM-GESI 211
++ +N+ AG A C+ A+V L TK+ KVL YI P+ GE
Sbjct: 236 IQLDNIQLAGTVAFCTNGCQAIVDTGTSLITGPTKDVKVLQKYIGA------TPVDGEYA 289
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYIL-KTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
++C+ + MP+V+FTI + LS + Y L + +G+A C SGF D+ PP GPLWIL
Sbjct: 290 VECNNLNVMPDVTFTINGLPYLLSAQAYTLVENSDGMA-FCTSGFQGLDIAPPYGPLWIL 348
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEA 294
GDVF+ +++VFD G R+G A A
Sbjct: 349 GDVFIRQFYSVFDRGNNRVGLAPA 372
>gi|221048011|gb|ACL98113.1| pepsinogen [Epinephelus coioides]
Length = 311
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 52/131 (39%), Positives = 77/131 (58%), Gaps = 13/131 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ E+G E+VFGG+D H+ G+ T+VP+T YWQ ++ +
Sbjct: 194 MVKQGLVSQPLFSVYLSSH--GEQGSEVVFGGIDSSHYTGQVTWVPLTSATYWQIKMDGV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I Q T C GGC AI+D+GTSL+ GPT + +N +G +QYG+
Sbjct: 252 KINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS----------TNQYGEST 300
Query: 121 WDLLVSGLLPE 131
+ G +PE
Sbjct: 301 VNCQNVGSMPE 311
>gi|345797646|ref|XP_545694.3| PREDICTED: cathepsin E [Canis lupus familiaris]
Length = 396
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 68/112 (60%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP+ G E++FGG D HF G +VPVTK+GYWQ L I
Sbjct: 204 MMAQNLVDIPMFSVYMSSDPEGGTGSELIFGGYDHSHFSGNLNWVPVTKQGYWQIALDAI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG E G EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDEIKQLQNAIGAEPMDGEYGVEC 314
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 206 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
PM GE ++C + MP+V+F I + L P Y L E C SGF D+ PP
Sbjct: 305 PMDGEYGVECANLNVMPDVTFIINGVSYTLQPTAYTLLDYVDGMEFCSSGFQGLDIQPPA 364
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 365 GPLWILGDVFIRKFYSVFDRGNNRVGLALA 394
>gi|194883084|ref|XP_001975634.1| GG20455 [Drosophila erecta]
gi|190658821|gb|EDV56034.1| GG20455 [Drosophila erecta]
Length = 404
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 62/99 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P+ G+ ++ C+ I +P ++FT+G + F L +Y+ + +C S F
Sbjct: 304 INESIGGTPSSFGQFLVPCESIAGLPKITFTLGGRRFFLESHEYVFRDIYQDRRICSSAF 363
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+G Y+T FD K RIGFA+A
Sbjct: 364 IAVDLPSPSGPLWILGDVFLGKYYTEFDMEKHRIGFADA 402
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 65/101 (64%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+++ +FS +L+R + +EGG I FGG +P ++ G TYV V+ + YWQ ++
Sbjct: 213 MMEQGLLTKPIFSVYLSRHGE-KEGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSA 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+I N +C+ GC I+D+GTS LA P IN +IGG
Sbjct: 272 VIRNLE--LCQQGCEVIIDTGTSFLALPYDQAILINESIGG 310
>gi|194756946|ref|XP_001960731.1| GF13504 [Drosophila ananassae]
gi|190622029|gb|EDV37553.1| GF13504 [Drosophila ananassae]
Length = 402
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 90/179 (50%), Gaps = 26/179 (14%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+EQGL+ VFS +LNR + EEGG + FGG +P++++G TYVPV+++ YWQ ++
Sbjct: 209 MIEQGLLPRAVFSVYLNRHLGNQEEGGVLFFGGSNPEYYRGNFTYVPVSRRAYWQVKMDA 268
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
I + +C+ GC I+D+GTS LA P IN +IGG + + Q DL
Sbjct: 269 ATI--RKLELCQNGCEVIIDTGTSFLALPYDQAILINKSIGGRPSAYGQFSVPCDQVSDL 326
Query: 120 IWDLLVSGLLPEKVCQQIGLCAF--NGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAV 176
++ +G AF G EYV I D +CS+ +AV
Sbjct: 327 -----------PRITFTMGGRAFFLEGHEYVFRDI----------FKDQRICSSAFVAV 364
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN+ P+ G+ + CD++ +P ++FT+G + F L +Y+ + +C S F
Sbjct: 302 INKSIGGRPSAYGQFSVPCDQVSDLPRITFTMGGRAFFLEGHEYVFRDIFKDQRICSSAF 361
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P+GPLWILGDVF+G Y+T FD + RIGFA++
Sbjct: 362 VAVDLPSPQGPLWILGDVFLGKYYTEFDMERHRIGFADS 400
>gi|414586467|tpg|DAA37038.1| TPA: hypothetical protein ZEAMMB73_448442 [Zea mays]
Length = 428
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 40/71 (56%), Positives = 53/71 (74%)
Query: 197 NELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM 256
N+ C+ LP+P GES +DC +I MPN++FTI + F L+PEQYI+K + +CISGFM
Sbjct: 19 NQFCERLPSPNGESTVDCHQISKMPNLAFTIANNTFTLTPEQYIVKLEQAGQTICISGFM 78
Query: 257 AFDLPPPRGPL 267
AFD+ PPRGPL
Sbjct: 79 AFDVAPPRGPL 89
>gi|402857430|ref|XP_003893258.1| PREDICTED: cathepsin E [Papio anubis]
Length = 396
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 71/112 (63%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G ++VPVTK+GYWQ L +I
Sbjct: 204 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLSWVPVTKQGYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 314
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|166361871|gb|ABY87034.1| pepsinogen A1 [Epinephelus coioides]
gi|166361875|gb|ABY87036.1| pepsinogen A1 [Epinephelus coioides]
Length = 376
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 52/131 (39%), Positives = 77/131 (58%), Gaps = 13/131 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ E+G E+VFGG+D H+ G+ T+VP+T YWQ ++ +
Sbjct: 194 MVKQGLVSQPLFSVYLSSH--GEQGSEVVFGGIDSSHYTGQVTWVPLTSATYWQIKMDGV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I Q T C GGC AI+D+GTSL+ GPT + +N +G +QYG+
Sbjct: 252 KINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS----------TNQYGEST 300
Query: 121 WDLLVSGLLPE 131
+ G +PE
Sbjct: 301 VNCQNVGSMPE 311
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 51/102 (50%), Gaps = 8/102 (7%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
++ +N + N GES ++C + +MP V+FT+ F + Y+ + G C
Sbjct: 282 INNMNSWVGASTNQYGESTVNCQNVGSMPEVTFTLNGHDFTIPASAYVSQNYYG----CN 337
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+GF LWILGDVF+ Y+ +FD+ IG A++
Sbjct: 338 TGFGQ----GGSDQLWILGDVFIREYYVIFDAQARYIGLAQS 375
>gi|426250269|ref|XP_004018860.1| PREDICTED: gastricsin [Ovis aries]
Length = 431
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 69/102 (67%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ +FSF+L+ +++GG ++FGGVD + + G+ + PVT++ YWQ + +
Sbjct: 200 MVQEGALTSPIFSFYLSSQQGSQDGGAVIFGGVDSRLYTGQIYWAPVTQELYWQIGIEEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG+Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 260 LIGDQATGWCSAGCQAIVDTGTSLLTVPQQFLSALLQATGAQ 301
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 70/140 (50%), Gaps = 8/140 (5%)
Query: 159 ENVSAGDSAV--CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 216
E GD A CSA A+V L ++ LS + + + + G+ +DC+
Sbjct: 257 EEFLIGDQATGWCSAGCQAIVDTGTSLLT--VPQQFLSALLQATGAQKDQYGQFPVDCNN 314
Query: 217 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-PLWILGDVFM 275
I +P ++F I F L P YIL G+ C+ G +P G PLWILGDVF+
Sbjct: 315 IQNLPTLTFVINGMQFPLPPASYILSNGD---SYCVLGVEVTYIPSQNGQPLWILGDVFL 371
Query: 276 GVYHTVFDSGKLRIGFAEAA 295
Y++V+D G R+GFA AA
Sbjct: 372 RSYYSVYDLGNNRVGFATAA 391
>gi|4589842|dbj|BAA76892.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 66/103 (64%), Gaps = 1/103 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++ L+ VFSF+L+ + +GGE+VFGGVDP + G+ T+ PVT+ YWQ + D
Sbjct: 199 MLQENLLDFPVFSFYLSGQ-EGSQGGELVFGGVDPNLYTGQITWTPVTQTTYWQIGIEDF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 103
+G QS+G C GC IVD+GTSLL P V TE+ IG +
Sbjct: 258 AVGGQSSGWCSQGCQGIVDTGTSLLTVPNQVFTELMQYIGAQA 300
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 77/176 (43%), Gaps = 14/176 (7%)
Query: 122 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQN 181
+L+ G+ P QI Y GI E V S CS +V
Sbjct: 224 ELVFGGVDPNLYTGQITWTPVTQTTYWQIGI----EDFAVGGQSSGWCSQGCQGIVDTGT 279
Query: 182 QLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
L Q +++ YI D G+ + C I MP ++F I F L P Y
Sbjct: 280 SLLTVPNQVFTELMQYIGAQADD----SGQYVASCSNIEYMPTITFVISGTSFPLPPSAY 335
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+L++ ++ C G + LP G PLWILGDVF+ VY++++D G ++GFA A
Sbjct: 336 MLQSN---SDYCTVGIESTYLPSQTGQPLWILGDVFLRVYYSIYDMGNNQVGFATA 388
>gi|198451348|ref|XP_001358330.2| GA19187 [Drosophila pseudoobscura pseudoobscura]
gi|198131448|gb|EAL27468.2| GA19187 [Drosophila pseudoobscura pseudoobscura]
Length = 393
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 70/119 (58%), Gaps = 3/119 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +QGLV + VFSF+L R+ A++GGE++FGG+D F TYVP+T GYWQF++ +
Sbjct: 207 MCDQGLVDQCVFSFYLKRNGSAQQGGELLFGGIDASRFTAPLTYVPLTHAGYWQFQMQSV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ ++ G AIVD+GTSLLA P IN +GG S E L S G L
Sbjct: 267 EVVGKT---ISQGRQAIVDTGTSLLAAPPREYLIINSLLGGLPTASGEYLLRCSDIGRL 322
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 55/101 (54%), Gaps = 6/101 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTG-EGIAEVCISG 254
IN L LP GE ++ C I +P V F IG + F L P Y+++ + + VC+S
Sbjct: 298 INSLLGGLPTASGEYLLRCSDIGRLPEVFFVIGGQRFGLQPRDYVMQVANDDGSSVCLSA 357
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
F D WILGDVF+G Y+T FD + RIGFA AA
Sbjct: 358 FTLMD-----ADFWILGDVFIGRYYTAFDVAQRRIGFAPAA 393
>gi|354478113|ref|XP_003501260.1| PREDICTED: cathepsin E-like isoform 2 [Cricetulus griseus]
Length = 363
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP G E+ FGG DP HF G ++PVTK+GYWQ L +
Sbjct: 204 MMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNWIPVTKQGYWQIALDGV 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G+ + C GC AIVD+GTSL+ GP+ + ++ AIG
Sbjct: 264 QVGD-TVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIG 302
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 31/46 (67%)
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ C SGF D+ PP GPLWILGDVF+ ++ VFD G ++G A A
Sbjct: 316 QFCGSGFQGLDIQPPSGPLWILGDVFIRQFYAVFDRGNNQVGLAPA 361
>gi|296230582|ref|XP_002760770.1| PREDICTED: cathepsin E isoform 1 [Callithrix jacchus]
Length = 396
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L DI
Sbjct: 204 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDDI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 264 QVGGTAM-FCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 314
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|254596794|gb|ACT75642.1| pepsinogen A [Channa argus]
Length = 361
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 71/118 (60%), Gaps = 13/118 (11%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLVS+ +FS +L+ + + +G E+VFGGVD H+ G+ ++P+T YWQ ++ +
Sbjct: 179 MVSQGLVSQPMFSVYLSSN--SAQGSEVVFGGVDSNHYTGQIAWIPLTSATYWQIKMDSV 236
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD 118
I Q T C GGC AI+D+GTSL+ GPT ++ IN +G QYGD
Sbjct: 237 SINGQ-TVACSGGCQAIIDTGTSLIVGPTSDISNINSWVGAS----------TDQYGD 283
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 63/132 (47%), Gaps = 10/132 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + +S IN + + G++ ++C I +MP V
Sbjct: 240 GQTVACSGGCQAIIDTGTSLIVGPTSD--ISNINSWVGASTDQYGDATVNCQNIQSMPEV 297
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+FT+ F + Y+ ++ G C +GF LWILGDVF+ Y+ VFD
Sbjct: 298 TFTLNGNAFTIPATAYVSQSYYG----CTTGFGQGG----SDQLWILGDVFIRQYYAVFD 349
Query: 284 SGKLRIGFAEAA 295
+ IG A++A
Sbjct: 350 TQGPYIGLAKSA 361
>gi|390477486|ref|XP_003735302.1| PREDICTED: cathepsin E isoform 2 [Callithrix jacchus]
Length = 401
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L DI
Sbjct: 209 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDDI 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 269 QVGGTAM-FCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 319
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 313 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 372
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 373 WILGDVFIRQFYSVFDRGNNRVGLAPA 399
>gi|190014572|dbj|BAG48264.1| pepsinogen 2 [Thunnus orientalis]
Length = 376
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 74/118 (62%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLVS+ +FS +L+ + +++G E+VFGG+D H+ GK T++P+T YWQ ++ +
Sbjct: 194 MVSQGLVSQPLFSVYLSSN--SQQGNEVVFGGIDSSHYTGKITWIPLTSATYWQIQMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
I Q T C GGC AI+D+GTSL+ GP+ + +N +G G + C+ + S
Sbjct: 252 TINGQ-TVACSGGCQAIIDTGTSLIVGPSRDIYNMNAWVGASTTQNGDATVNCQNIQS 308
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G++ ++C I +MP V+FT+ F + Y+ ++ G C +GF L
Sbjct: 297 GDATVNCQNIQSMPEVTFTLNGHAFTIPASAYVSQSYYG----CRTGFGG----EGNQQL 348
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++FD+ RIG A+A
Sbjct: 349 WILGDVFIRQYYSIFDTQGQRIGLAQA 375
>gi|45382395|ref|NP_990208.1| gastricsin precursor [Gallus gallus]
gi|4589840|dbj|BAA76893.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 66/103 (64%), Gaps = 1/103 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++ L+ VFSF+L+ + +GGE+VFGGVDP + G+ T+ PVT+ YWQ + D
Sbjct: 199 MLQENLLDFPVFSFYLSGQ-EGSQGGELVFGGVDPNLYTGQITWTPVTQTTYWQIGIEDF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 103
+G QS+G C GC IVD+GTSLL P V TE+ IG +
Sbjct: 258 AVGGQSSGWCSQGCQGIVDTGTSLLTVPNQVFTELMQYIGAQA 300
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 77/176 (43%), Gaps = 14/176 (7%)
Query: 122 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQN 181
+L+ G+ P QI Y GI E V S CS +V
Sbjct: 224 ELVFGGVDPNLYTGQITWTPVTQTTYWQIGI----EDFAVGGQSSGWCSQGCQGIVDTGT 279
Query: 182 QLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
L Q +++ YI D G+ + C I MP ++F I F L P Y
Sbjct: 280 SLLTVPNQVFTELMQYIGAQADD----SGQYVASCSNIEYMPTITFVISGTSFPLPPSAY 335
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+L++ ++ C G + LP G PLWILGDVF+ VY++++D G ++GFA A
Sbjct: 336 MLQSN---SDYCTVGIESTYLPSQTGQPLWILGDVFLRVYYSIYDMGNNQVGFATA 388
>gi|315440805|gb|ADU20408.1| aspartic protease 2 [Clonorchis sinensis]
Length = 385
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 76/118 (64%), Gaps = 6/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV + +FS WL+R+ + GGEI+FGG++ +H+ G +VP++ + YWQ +L I
Sbjct: 196 MISQGLVEKPLFSVWLDRNDVQDIGGEIMFGGINKEHYMGDMYFVPLSSETYWQIDLDGI 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSA-ECKLVVS 114
+ S +C GC AIVD+GT+L+ GPT V ++N A+G EG +S EC + +
Sbjct: 256 QV--TSLTLCAQGCQAIVDTGTTLIVGPTADVNQLNEALGAVSIEGGLSVLECSQIYT 311
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 56/100 (56%), Gaps = 2/100 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+NE ++ G S+++C +I T+P + F+I + L P Y+ + +C SGF
Sbjct: 288 LNEALGAVSIEGGLSVLECSQIYTLPPIEFSINGENLTLQPTDYVQEMSYRGGTICTSGF 347
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ P P WILGDVF+G Y+TVFD + R+GFA +
Sbjct: 348 SGMETP--GAPTWILGDVFIGAYYTVFDKEQRRVGFARST 385
>gi|270124457|dbj|BAI52799.1| pepsinogen [Anguilla japonica]
Length = 375
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 72/118 (61%), Gaps = 13/118 (11%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLVS+++FS +L+ + A G E+VFGGVDP H+ G ++P++ + YWQ + +
Sbjct: 194 MMKQGLVSQDLFSVYLSSNSQA--GSEVVFGGVDPSHYTGSINWIPLSSETYWQITMQSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD 118
I Q T C GC AIVD+GTSL+ GP+ + IN+ +G +QYGD
Sbjct: 252 TINGQ-TVACTSGCQAIVDTGTSLIVGPSSDIGNINYYVGAS----------TNQYGD 298
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 9/99 (9%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN + N G++ + C + +MP+V+F I F L Y+ ++ G C +GF
Sbjct: 285 INYYVGASTNQYGDATVQCGNVGSMPDVTFNINGVSFTLPASAYVSQSSYG----CRTGF 340
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ Y+T+FD +G A+A
Sbjct: 341 -----GNGNDQLWILGDVFIRQYYTIFDRTSNYVGLAQA 374
>gi|195144214|ref|XP_002013091.1| GL23572 [Drosophila persimilis]
gi|194102034|gb|EDW24077.1| GL23572 [Drosophila persimilis]
Length = 393
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 70/119 (58%), Gaps = 3/119 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +QGLV + VFSF+L R+ A++GGE++FGG+D F TYVP+T GYWQF++ +
Sbjct: 207 MCDQGLVDQCVFSFYLKRNGSAQQGGELLFGGIDASRFTAPLTYVPLTHAGYWQFQMQSV 266
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ ++ G AIVD+GTSLLA P IN +GG S E L S G L
Sbjct: 267 EVVGKT---ISQGRQAIVDTGTSLLAAPPREYLIINSLLGGLPTASGEYLLRCSDIGRL 322
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 56/101 (55%), Gaps = 6/101 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTG-EGIAEVCISG 254
IN L LP GE ++ C I +P V F IG + F L P Y+++ + + VC+S
Sbjct: 298 INSLLGGLPTASGEYLLRCSDIGRLPEVFFVIGGQRFGLQPRDYVMQVANDDGSSVCLSA 357
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
F D WILGDVF+G Y+T FD+ + RIGFA AA
Sbjct: 358 FTLMD-----ADFWILGDVFIGRYYTAFDAAQRRIGFAPAA 393
>gi|358333762|dbj|GAA52230.1| cathepsin D [Clonorchis sinensis]
Length = 408
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 76/118 (64%), Gaps = 6/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLV + +FS WL+R+ + GGEI+FGG++ +H+ G +VP++ + YWQ +L I
Sbjct: 219 MISQGLVEKPLFSVWLDRNDVQDIGGEIMFGGINKEHYMGDMYFVPLSSETYWQIDLDGI 278
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSA-ECKLVVS 114
+ S +C GC AIVD+GT+L+ GPT V ++N A+G EG +S EC + +
Sbjct: 279 QV--TSLTLCAQGCQAIVDTGTTLIVGPTADVNQLNEALGAVSIEGGLSVLECSQIYT 334
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 2/103 (1%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
++ +NE ++ G S+++C +I T+P + F+I + L P Y+ + +C
Sbjct: 308 VNQLNEALGAVSIEGGLSVLECSQIYTLPPIEFSINGENLTLQPTDYVQEMSYRGGTICT 367
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
SGF + P P WILGDVF+G Y+TVFD + R+GFA +
Sbjct: 368 SGFSGMETP--GAPTWILGDVFIGAYYTVFDKEQRRVGFARST 408
>gi|301786118|ref|XP_002928474.1| PREDICTED: cathepsin E-like [Ailuropoda melanoleuca]
Length = 396
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 67/112 (59%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP+ G E++FGG D HF G +VPVTK+GYWQ L I
Sbjct: 204 MMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGNLHWVPVTKQGYWQIALDAI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ V ++ AIG E G EC
Sbjct: 264 QVGG-AVMFCSEGCQAIVDTGTSLITGPSDKVKQLQKAIGAEPMDGEYGVEC 314
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 206 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
PM GE ++C + MP+V+FTI + L P Y L E C SGF D+ PP
Sbjct: 305 PMDGEYGVECANLNVMPDVTFTINGISYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPA 364
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 365 GPLWILGDVFIRRFYSVFDRGNNRVGLAPA 394
>gi|355681644|gb|AER96811.1| cathepsin E [Mustela putorius furo]
Length = 375
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 67/112 (59%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP+ G E++FGG D HF G +VPVTK+GYWQ L I
Sbjct: 183 MMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGNLNWVPVTKQGYWQIALDAI 242
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ AIG E G EC
Sbjct: 243 QVGG-AVMFCSEGCQAIVDTGTSLITGPSDKIKQLQKAIGAEPMDGEYGVEC 293
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 206 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
PM GE ++C + MP+V+FTI + L P Y L E C SGF D+ PP
Sbjct: 284 PMDGEYGVECANLNVMPDVTFTINGVSYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPA 343
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 344 GPLWILGDVFIRQFYSVFDRGNNRVGLAPA 373
>gi|195159706|ref|XP_002020719.1| GL15694 [Drosophila persimilis]
gi|194117669|gb|EDW39712.1| GL15694 [Drosophila persimilis]
Length = 401
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQGL+ E F F+L R+ +EEGG+++ GGVD G TYVPV+++GYWQF + +
Sbjct: 216 LYEQGLIDEPTFGFYLARNGSSEEGGQLLLGGVDETLMAGDLTYVPVSQEGYWQFSVNN- 274
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
I T +C+ GC AI D+GTSLLA P V T+IN IG
Sbjct: 275 -ISWNGTVLCD-GCQAIADTGTSLLACPQAVYTQINQLIGA 313
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 63/134 (47%), Gaps = 9/134 (6%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
N+S + +C C+ + L Q V + IN+L ++ G + I C +
Sbjct: 273 NNISWNGTVLCDGCQAIADTGTSLLACPQ---AVYTQINQLIGAVL-IEGSNYIPCATLD 328
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
++P +SF IG F+L YI + C+S F W+LGDVF+G Y
Sbjct: 329 SLPVLSFNIGGTTFDLPASAYISVFHDEGYTSCMSTFTDIGTD-----FWVLGDVFLGQY 383
Query: 279 HTVFDSGKLRIGFA 292
+T FD G+ R+GFA
Sbjct: 384 YTQFDFGQNRVGFA 397
>gi|281339451|gb|EFB15035.1| hypothetical protein PANDA_018433 [Ailuropoda melanoleuca]
Length = 388
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 67/112 (59%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP+ G E++FGG D HF G +VPVTK+GYWQ L I
Sbjct: 196 MMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGNLHWVPVTKQGYWQIALDAI 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ V ++ AIG E G EC
Sbjct: 256 QVGG-AVMFCSEGCQAIVDTGTSLITGPSDKVKQLQKAIGAEPMDGEYGVEC 306
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 206 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
PM GE ++C + MP+V+FTI + L P Y L E C SGF D+ PP
Sbjct: 297 PMDGEYGVECANLNVMPDVTFTINGISYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPA 356
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 357 GPLWILGDVFIRRFYSVFDRGNNRVGLAPA 386
>gi|198475392|ref|XP_001357030.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
gi|198138802|gb|EAL34096.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
Length = 401
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQGL+ E F F+L R+ +EEGG+++ GGVD G TYVPV+++GYWQF + +
Sbjct: 216 LYEQGLIDEPTFGFYLARNGSSEEGGQLLLGGVDETLMAGDLTYVPVSQEGYWQFSVNN- 274
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
I T +C+ GC AI D+GTSLLA P V T+IN IG
Sbjct: 275 -ISWNGTVLCD-GCQAIADTGTSLLACPQAVYTQINQLIGA 313
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 63/134 (47%), Gaps = 9/134 (6%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
N+S + +C C+ + L Q V + IN+L ++ G + I C +
Sbjct: 273 NNISWNGTVLCDGCQAIADTGTSLLACPQ---AVYTQINQLIGAVL-IEGSNYIPCATLD 328
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
++P +SF IG F+L YI + C+S F W+LGDVF+G Y
Sbjct: 329 SLPVLSFNIGGTTFDLPASAYISVFHDEGYTSCMSTFTDIGTD-----FWVLGDVFLGQY 383
Query: 279 HTVFDSGKLRIGFA 292
+T FD G+ R+GFA
Sbjct: 384 YTQFDFGQNRVGFA 397
>gi|9858101|gb|AAG00993.1|AF286865_1 heme-binding aspartic proteinase, partial [Rhipicephalus microplus]
Length = 354
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 70/118 (59%), Gaps = 6/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QG+V + VFS +L++ P + GGEI FGG++ + + G YVPV++ +WQ + +I
Sbjct: 164 MVKQGVVRQAVFSLYLSKQPSEQNGGEIYFGGINAQRYTGAIHYVPVSQAAHWQVVMDNI 223
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
+ Q T +C GGC +VDSGTS L+GP+ V +N IG G C + S
Sbjct: 224 NV--QGTTLCVGGCPTVVDSGTSFLSGPSADVETLNRVIGATKTAAGYFEVNCATISS 279
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 9/103 (8%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK----TGEGIAEVC 251
+N + + G ++C I ++P ++F + K F L E Y ++ TG E C
Sbjct: 256 LNRVIGATKTAAGYFEVNCATISSLPPITFNLNGKSFPLQGEAYTIRIPLTTG---GEQC 312
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ D LWILG VF Y+TVFD + R+GFA A
Sbjct: 313 FTRISESDASGTN--LWILGAVFTQTYYTVFDRAQNRVGFATA 353
>gi|432116085|gb|ELK37212.1| Cathepsin E [Myotis davidii]
Length = 396
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 67/112 (59%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP+ G E++FGG D HF G +VPVTK+GYWQ L I
Sbjct: 204 MMAQNLVDVPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDTI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP + ++ AIG E G + EC
Sbjct: 264 QVGG-AVMFCSEGCQAIVDTGTSLITGPPAEIKQLQKAIGAEPVDGEYAVEC 314
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++CD + MP+V+FTI + L P Y L E C SGF D+ PP GPL
Sbjct: 308 GEYAVECDNLNVMPDVTFTINGVPYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGDNRVGLAPA 394
>gi|335955136|gb|AEH76574.1| pepsinogen [Epinephelus bruneus]
Length = 375
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/131 (39%), Positives = 77/131 (58%), Gaps = 13/131 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ D +G E+VFGG+D H+ G+ T+VP+T YWQ ++ +
Sbjct: 194 MVKQGLVSQPLFSVYLSSHGD--QGSEVVFGGIDNSHYTGQVTWVPLTSATYWQIKMDGV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I Q T C GGC AI+D+GTSL+ GPT + +N +G +QYG+
Sbjct: 252 KINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS----------TNQYGEST 300
Query: 121 WDLLVSGLLPE 131
+ G +PE
Sbjct: 301 VNCQNVGSMPE 311
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 85/194 (43%), Gaps = 17/194 (8%)
Query: 102 EGVVSAEC-KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN 160
+G+VS + +S +GD +++ G+ Q+ A Y + V K N
Sbjct: 197 QGLVSQPLFSVYLSSHGDQGSEVVFGGIDNSHYTGQVTWVPLTSATYWQIKMDGV--KIN 254
Query: 161 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 220
G + C+ A++ L T + ++ +N + N GES ++C + +M
Sbjct: 255 ---GQTVACAGGCQAIIDTGTSLIVGPTND--INNMNSWVGASTNQYGESTVNCQNVGSM 309
Query: 221 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 280
P V+FT+ F L Y+ + G C +GF LWILGDVF+ Y+
Sbjct: 310 PEVTFTLNGHDFTLPASAYVSQNYYG----CNTGF-----GQGGSELWILGDVFIREYYA 360
Query: 281 VFDSGKLRIGFAEA 294
+FD+ IG A++
Sbjct: 361 IFDAQARYIGLAQS 374
>gi|410986349|ref|XP_003999473.1| PREDICTED: cathepsin E [Felis catus]
Length = 396
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP++ G E++FGG D HF G +VPVTK+GYWQ L I
Sbjct: 204 MMAQNLVDIPMFSVYMSSDPESGVGSELIFGGYDHSHFSGTLNWVPVTKQGYWQIALDVI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ AIG E G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQKAIGAEPMDGEYAVEC 314
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 206 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
PM GE ++C + MP+V+F I + L P Y L E C SGF D+ PP
Sbjct: 305 PMDGEYAVECANLNVMPDVTFIINGVSYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPA 364
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 365 GPLWILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|402580152|gb|EJW74102.1| aspartic protease BmAsp-2 [Wuchereria bancrofti]
Length = 170
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/87 (49%), Positives = 57/87 (65%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE II C ++P++P ++F I K + L Y+L A +C+SGFM DLP G L
Sbjct: 50 GEYIIPCYKVPSLPEITFVIAGKSYTLKGSDYVLNVTSKGATICLSGFMGIDLPKRAGEL 109
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+TVFD G +IGFA+A
Sbjct: 110 WILGDVFIGRYYTVFDVGNSQIGFAQA 136
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 70 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
C GC AI D+GTSL+AGP + +I IG E V + E
Sbjct: 13 CSNGCQAIADTGTSLIAGPKSQIDKIQEYIGAEHVYAGE 51
>gi|307187508|gb|EFN72561.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 275
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 68/115 (59%), Gaps = 5/115 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGLVS +FS +L+RD E GG ++ GG DP ++G TY+PVT+KGYWQF + +I
Sbjct: 45 MIEQGLVSSPIFSVYLHRDVSEENGGVLILGGSDPAFYEGNLTYIPVTRKGYWQFTIDEI 104
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI---GGEGVVSAECKLV 112
I +C C AI D+GT + GP + IN+ I +G + +C +
Sbjct: 105 KI--DYINLCIESCQAIADTGTPWIIGPISEINRINNFIEVFNDDGYETVDCDRI 157
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 49/90 (54%), Gaps = 4/90 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G +DCDRI +P +SF + K + ++ L E ++C+S F+ +P
Sbjct: 145 NDDGYETVDCDRISELPTISFILDGKA-GIRSYRHRLYHSEDGTKLCVSTFVGCHIP--- 200
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
G WILGD F+G ++T FD R+GFA A
Sbjct: 201 GINWILGDPFIGRFYTEFDMKNDRVGFALA 230
>gi|166235886|ref|NP_031825.2| cathepsin E preproprotein [Mus musculus]
gi|341940308|sp|P70269.2|CATE_MOUSE RecName: Full=Cathepsin E; Flags: Precursor
gi|5748654|emb|CAA08880.2| cathepsin E protein [Mus musculus]
gi|74146932|dbj|BAE25449.1| unnamed protein product [Mus musculus]
gi|74192082|dbj|BAE34257.1| unnamed protein product [Mus musculus]
gi|74219155|dbj|BAE26716.1| unnamed protein product [Mus musculus]
gi|74222421|dbj|BAE38113.1| unnamed protein product [Mus musculus]
gi|148707758|gb|EDL39705.1| cathepsin E [Mus musculus]
Length = 397
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 67/112 (59%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +L+ DP G E+ FGG DP HF G ++PVTK+ YWQ L I
Sbjct: 205 MMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQAYWQIALDGI 264
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G+ + C GC AIVD+GTSL+ GP + ++ AIG +G + +C
Sbjct: 265 QVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKQLQEAIGATPIDGEYAVDC 315
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 55/87 (63%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +DC + TMPNV+F I + + L+P YIL + C SGF D+PPP GPL
Sbjct: 309 GEYAVDCATLDTMPNVTFLINEVSYTLNPTDYILPDLVEGMQFCGSGFQGLDIPPPAGPL 368
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G ++G A A
Sbjct: 369 WILGDVFIRQFYSVFDRGNNQVGLAPA 395
>gi|2288908|emb|CAA71859.1| cathepsin E [Mus musculus]
Length = 397
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 67/112 (59%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +L+ DP G E+ FGG DP HF G ++PVTK+ YWQ L I
Sbjct: 205 MMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQAYWQIALDGI 264
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G+ + C GC AIVD+GTSL+ GP + ++ AIG +G + +C
Sbjct: 265 QVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKQLQEAIGATPIDGEYAVDC 315
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 55/87 (63%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +DC + TMPNV+F I + + L+P YIL + C SGF D+PPP GPL
Sbjct: 309 GEYAVDCATLDTMPNVTFLINEVSYTLNPTDYILPDLVDGMQFCGSGFQGLDIPPPAGPL 368
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G ++G A A
Sbjct: 369 WILGDVFIRQFYSVFDRGNNQVGLAPA 395
>gi|71159493|gb|AAZ29603.1| pepsinogen, partial [Pagrus pagrus]
Length = 205
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 73/118 (61%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLVS+ +FS +L+ + +E+G E+VFGG D H+ G+ T++P++ YWQ + +
Sbjct: 75 MIKQGLVSQPMFSVYLSGN--SEQGSEVVFGGTDSNHYTGQITWIPLSSATYWQISMDSV 132
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
I Q T C GGC AI+D+GTSL+ GPT + +N +G G + C+ V S
Sbjct: 133 TINGQ-TVACSGGCQAIIDTGTSLIVGPTNDINNMNSWVGASTNQYGEATVNCQNVQS 189
>gi|355745980|gb|EHH50605.1| hypothetical protein EGM_01462 [Macaca fascicularis]
Length = 401
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 70/112 (62%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+GYWQ L +I
Sbjct: 209 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLDWVPVTKQGYWQIALDNI 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 269 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 319
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 313 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 372
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 373 WILGDVFIRQFYSVFDRGNNRVGLAPA 399
>gi|149707989|ref|XP_001491088.1| PREDICTED: cathepsin E [Equus caballus]
Length = 396
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 67/112 (59%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ DP+ G E++FGG D HF G +VPVTK+GYWQ L I
Sbjct: 204 MMAQNLVDVPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDAI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP + ++ AIG + G + EC
Sbjct: 264 QVGG-TVMFCSQGCQAIVDTGTSLITGPPDKIKQLQEAIGAQPMDGEYAVEC 314
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 52/91 (57%), Gaps = 1/91 (1%)
Query: 205 NPM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
PM GE ++C + MP+V+FTI + L P Y L + C SGF D+ PP
Sbjct: 304 QPMDGEYAVECVNLNVMPDVTFTINGVPYTLQPTAYTLLDFVDGMQFCSSGFQGLDIQPP 363
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
GPLWILGDVF+ +++VFD G +G A A
Sbjct: 364 AGPLWILGDVFIRQFYSVFDRGNNLVGLAPA 394
>gi|46395759|sp|Q800A0.1|CATE_RANCA RecName: Full=Cathepsin E; Flags: Precursor
gi|29647357|dbj|BAC75398.1| cathepsin E [Rana catesbeiana]
Length = 397
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/103 (46%), Positives = 66/103 (64%), Gaps = 8/103 (7%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
E++ +YI + N GE + C + MP+V+FTI ++L+PEQY+L+ G G
Sbjct: 291 EQLQNYI-----GVTNTNGEYGVSCSTLSLMPSVTFTINGLDYSLTPEQYMLEDGGG--- 342
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
C SGF D+ PP GPLWILGDVF+G Y++VFD G R+GFA
Sbjct: 343 YCSSGFQGLDISPPSGPLWILGDVFIGQYYSVFDRGNNRVGFA 385
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 64/100 (64%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +F ++NRDP++ +GGE+V GG D F G+ +VP+T +GYWQ ++ I
Sbjct: 200 MIAQNLVELPLFGVYMNRDPNSADGGELVLGGFDTSRFSGQLNWVPITVQGYWQIQVDSI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ Q C GC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 260 QVAGQVI-FCSDGCQAIVDTGTSLITGPSGDIEQLQNYIG 298
>gi|162423778|gb|ABX89619.1| pepsinogen [Diplodus sargus]
Length = 376
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 67/102 (65%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ + +E+G E+VFGG D H+ G+ T++P++ YWQ + +
Sbjct: 194 MVKQGLVSQPMFSVYLSGN--SEQGSEVVFGGTDSSHYTGQITWIPLSSATYWQISMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 252 TINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNSWVGAS 292
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 51/99 (51%), Gaps = 8/99 (8%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N + N GE+ ++C I MP+V+FT+ F + Y+ ++ G C++GF
Sbjct: 285 MNSWVGASTNEYGEATVNCQNIQGMPDVTFTLNGHAFTVPASAYVSQSSYG----CMTGF 340
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ Y+ VFD+ IG A++
Sbjct: 341 G----QGGSQQLWILGDVFIREYYAVFDTQAQYIGLAKS 375
>gi|355558837|gb|EHH15617.1| hypothetical protein EGK_01732 [Macaca mulatta]
Length = 401
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 70/112 (62%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+GYWQ L +I
Sbjct: 209 MMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDNI 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 269 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 319
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 313 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 372
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 373 WILGDVFIRQFYSVFDRGNNRVGLAPA 399
>gi|149725292|ref|XP_001501875.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 70/112 (62%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ + I
Sbjct: 201 MWDQGLVSQDLFSVYLSSDD--ESGSVVMFGGIDSSYYSGSLNWVPVSNEGYWQITMDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ +S C GGC AIVD+GTSLLAGPT + I IG GE V+S
Sbjct: 259 TMNGESIA-CSGGCQAIVDTGTSLLAGPTSAIDNIQSYIGASEDSSGESVIS 309
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 68/134 (50%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + + SYI DS GES+I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSAIDNIQSYIGASEDS----SGESVISCSSIDSLP 317
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FT+ F LSP YIL+ + CISGF D+ G LWILGDVF+ Y TV
Sbjct: 318 DIVFTLNGVEFPLSPSAYILQEDDS----CISGFEGMDVDTSSGELWILGDVFIRQYFTV 373
Query: 282 FDSGKLRIGFAEAA 295
FD ++G A A
Sbjct: 374 FDRANNQVGLAPVA 387
>gi|109018632|ref|XP_001090284.1| PREDICTED: cathepsin E isoform 4 [Macaca mulatta]
Length = 396
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 70/112 (62%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+GYWQ L +I
Sbjct: 204 MMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 314
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|395534115|ref|XP_003769093.1| PREDICTED: gastricsin-like [Sarcophilus harrisii]
Length = 392
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 65/102 (63%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++++ L++ VF+F+L+ + ++ GGE+ FGGVD F G + PVT++ YWQ +
Sbjct: 201 LLQENLINAPVFAFYLSGNENSNNGGEVTFGGVDTSMFTGDIYWAPVTQEAYWQIAINGF 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q+TG C GC A+VD+GTSLL P + +E+ IG +
Sbjct: 261 SIGGQATGWCSEGCQAVVDTGTSLLTAPQQIFSELMQYIGAQ 302
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 55/107 (51%), Gaps = 4/107 (3%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+++ S + + + N G ++ C + M ++F I F L P Y+L + E
Sbjct: 289 QQIFSELMQYIGAQQNENGAYLVSCSNVQNMSTITFNINGVNFPLPPSAYVLPSNSNYCE 348
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
V G M LP G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 349 V---GIMPTYLPSQNGQPLWILGDVFLRNYYSVYDLGNNRVGFANLA 392
>gi|170577100|ref|XP_001893881.1| aspartic protease BmAsp-2, identical [Brugia malayi]
gi|158599838|gb|EDP37282.1| aspartic protease BmAsp-2, identical [Brugia malayi]
Length = 170
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/87 (49%), Positives = 57/87 (65%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE II C ++P++P ++F I K + L Y+L A +C+SGFM DLP G L
Sbjct: 50 GEYIIPCYKVPSLPEITFVIAGKSYTLKGSDYVLNVTSKGATICLSGFMGIDLPKRVGEL 109
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+TVFD G +IGFA+A
Sbjct: 110 WILGDVFIGRYYTVFDVGNSQIGFAQA 136
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 28/50 (56%)
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
D ++G C GC AI D+GTSL+AGP + +I IG E V + E
Sbjct: 2 DRVLGRGKAIGCGNGCQAIADTGTSLIAGPKSQIDKIQEYIGAEHVYAGE 51
>gi|327271205|ref|XP_003220378.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 63/102 (61%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QG +SE +FSF+ +R P + GGE++ GGVD + F G ++ PVT++ YWQ + +
Sbjct: 199 MLNQGQLSEPIFSFYFSRQPTVQYGGELILGGVDTQLFSGDVSWAPVTREVYWQIGVEEF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IGN++TG C GC AIVD+GT L P A+G E
Sbjct: 259 AIGNEATGWCSEGCQAIVDTGTCQLTIPRQYFDTFLQAVGAE 300
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 3/88 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +++C+ + MP ++F I F L P Y+ V ++A PL
Sbjct: 304 GELLVNCNNVQNMPTITFVINGAQFPLPPSAYVANNDGYCTVVVEPTYLA---SQSGEPL 360
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+ Y++VFD R+GFA +A
Sbjct: 361 WILGDVFLKEYYSVFDMANNRVGFALSA 388
>gi|1657354|emb|CAA66056.1| procathepsin E [Mus musculus]
gi|13529380|gb|AAH05432.1| Cathepsin E [Mus musculus]
gi|71059833|emb|CAJ18460.1| Ctse [Mus musculus]
Length = 397
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 66/112 (58%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +L+ DP G E+ FGG DP HF G ++PVTK+ YWQ L I
Sbjct: 205 MMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQAYWQIALDGI 264
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G+ + C GC AIVD+GTSL+ GP + + AIG +G + +C
Sbjct: 265 QVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKHLQEAIGATPIDGEYAVDC 315
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 55/87 (63%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +DC + TMPNV+F I + + L+P YIL + C SGF D+PPP GPL
Sbjct: 309 GEYAVDCATLDTMPNVTFLINEVSYTLNPTDYILPDLVDGMQFCGSGFQGLDIPPPAGPL 368
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G ++G A A
Sbjct: 369 WILGDVFIRQFYSVFDRGNNQVGLAPA 395
>gi|226476848|emb|CAX72340.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 74/116 (63%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q +V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++
Sbjct: 194 MIKQRVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
I + S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 254 TISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|1246039|gb|AAB35843.1| pepsinogen 2 [tuna, Peptide, 360 aa]
Length = 360
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 74/118 (62%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLVS+ +FS +L+ + +++G E+VFGG+D H+ G+ T++P+T YWQ ++ +
Sbjct: 178 MVSQGLVSQPLFSVYLSSN--SQQGSEVVFGGIDSSHYTGQITWIPLTSATYWQIQMDSV 235
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
I Q T C GGC AI+D+GTSL+ GP+ + +N +G G + C+ + S
Sbjct: 236 TINGQ-TVACSGGCQAIIDTGTSLIVGPSRDIYNMNAWVGASTTQNGDATVNCQNIQS 292
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G++ ++C I +MP V+FT+ F + Y+ ++ G C +GF L
Sbjct: 281 GDATVNCQNIQSMPEVTFTLNGHAFTIPASAYVSQSYYG----CRTGFGG----EGNQQL 332
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++FD+ RIG A+A
Sbjct: 333 WILGDVFIRQYYSIFDTQGQRIGLAQA 359
>gi|355329699|dbj|BAL14143.1| pepsinogen 2 [Pagrus major]
Length = 377
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 73/118 (61%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLVS+ +FS +L+ + +E+G E+VFGG D H+ G+ T++P++ YWQ + +
Sbjct: 195 MIKQGLVSQPMFSVYLSGN--SEQGSEVVFGGTDSNHYTGQITWIPLSSATYWQISMDSV 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
I Q T C GGC AI+D+GTSL+ GPT + +N +G G + C+ + S
Sbjct: 253 TINGQ-TVACSGGCQAIIDTGTSLIVGPTNDINNMNSWVGASTNQYGEATVNCQNIQS 309
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 52/100 (52%), Gaps = 8/100 (8%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N + N GE+ ++C I +MP+V+FT+ F + Y+ ++ G C +GF
Sbjct: 286 MNSWVGASTNQYGEATVNCQNIQSMPDVTFTLNGHAFTVPASAYVSQSYYG----CSTGF 341
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y+ VF++ IG A++A
Sbjct: 342 G----QGGSQQLWILGDVFIREYYAVFNAQSQYIGLAKSA 377
>gi|195399279|ref|XP_002058248.1| GJ15983 [Drosophila virilis]
gi|194150672|gb|EDW66356.1| GJ15983 [Drosophila virilis]
Length = 372
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 68/114 (59%), Gaps = 3/114 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV + VFSF+L RD + +GGE++FGG D + G TYVP++++GYWQF +
Sbjct: 195 MVSQGLVDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDLTYVPISEQGYWQFTMAGA 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 114
I QS +C+ C AI D+GTSL+ P ++N + + +C V S
Sbjct: 255 SIDGQS--LCD-NCQAIADTGTSLIVAPANAYMQLNDILNVDDQGLVDCSSVSS 305
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 52/90 (57%), Gaps = 15/90 (16%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGF--MAFDLPPPRG 265
+ ++DC + +MP ++F IG F+L P QYI+++ GE C S F M D
Sbjct: 295 QGLVDCSSVSSMPVITFNIGGTNFDLEPAQYIIQSDGE-----CQSSFEYMGTDF----- 344
Query: 266 PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G Y+T FD G RIGFA A
Sbjct: 345 --WILGDVFIGQYYTEFDLGNNRIGFAPVA 372
>gi|256072901|ref|XP_002572772.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043052|emb|CCD78464.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 428
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 72/113 (63%), Gaps = 6/113 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ +
Sbjct: 194 MIQQGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKL 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 109
I + + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 254 TISDMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 304
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 73/140 (52%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + + ++ D C +A+ + T E + IN + P G + C
Sbjct: 247 LFKMDKLTISDMTACPDGCLAIADTGTSMIAGPTDE--IQKINAKLGATRLPGGIYTVSC 304
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
I +P + F I K L P Y+LK + +E+C++GFM DL P+ LWILGD+F
Sbjct: 305 GNINNLPTIDFVINGKAMTLEPTDYLLKVSKMGSEICLTGFMGLDL--PKRKLWILGDIF 362
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++TVFD GK R+GFA+A
Sbjct: 363 IGKFYTVFDMGKNRVGFAKA 382
>gi|300835014|gb|ADK37836.1| putative aspartic endopeptidase [Entomophthora planchoniana]
Length = 282
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 75/118 (63%), Gaps = 8/118 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQGL E +FSFWLN + ++ GGE+V GG+D KHFKGK T+VPV++KGYW+ L +
Sbjct: 97 MVEQGL--EPLFSFWLNTNGGSDHGGEMVLGGIDKKHFKGKITWVPVSRKGYWEVSLDKV 154
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
G++ + + G A +D+G+SLL P+ + IN IG + G + +C V S
Sbjct: 155 AFGDEEVELPKTGAA--IDTGSSLLVVPSDLADMINRFIGAKKGFGGQYTIDCAQVPS 210
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 61/100 (61%), Gaps = 4/100 (4%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN + G+ IDC ++P++P+++ T F L YIL+ + CISGF
Sbjct: 187 INRFIGAKKGFGGQYTIDCAQVPSLPDLTLTFAGHPFTLQGSDYILQ----VQNQCISGF 242
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
D+PPP GPLWI+GDVF+ Y++V+D G R+GFA++A
Sbjct: 243 TGLDVPPPLGPLWIIGDVFLRKYYSVYDLGNNRVGFAKSA 282
>gi|149725197|ref|XP_001502028.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 70/110 (63%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPVT +GYWQ + I I
Sbjct: 203 DQGLVSQDLFSVYLSSD--DESGSVVMFGGIDSSYYTGSLHWVPVTTEGYWQIAVDSITI 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+S C GGC AIVD+GTSLLAGPT + I IG GEGV+S
Sbjct: 261 NGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLLGEGVIS 309
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + + SYI D L GE +I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLL----GEGVISCSAIDSLP 317
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FT+ F L P YILK + CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTMNGVEFPLPPSAYILKEDDS----CISGFEGVDLDTSSGELWILGDVFIRQYFTV 373
Query: 282 FDSGKLRIGFAEAA 295
FD ++G A A
Sbjct: 374 FDRANNQVGLAPVA 387
>gi|410990039|ref|XP_004001257.1| PREDICTED: pepsin B-like [Felis catus]
Length = 346
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 63/102 (61%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG ++ +FSF+ +R P + GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 200 MMQQGQLTRPIFSFYFSRQPTYQYGGELILGGVDTQFYSGEIVWTPVTRELYWQIAIDEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIGNQ+TG C GC IVD+GT L P + A G +
Sbjct: 260 LIGNQATGFCSQGCQGIVDTGTFPLTVPQQYLDSFVKATGAQ 301
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 260 LPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LP P G PLWILGDVF+ Y+TV+D R+GFA +A
Sbjct: 310 LPSPNGQPLWILGDVFLREYYTVYDMAVNRVGFALSA 346
>gi|1778026|gb|AAB63442.1| aspartic proteinase [Schistosoma mansoni]
Length = 427
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 72/113 (63%), Gaps = 6/113 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ +
Sbjct: 193 MIQQGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKL 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 109
I + + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 253 TISDMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 303
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 73/140 (52%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + + ++ D C +A+ + T E + IN + P G + C
Sbjct: 246 LFKMDKLTISDMTACPDGCLAIADTGTSMIAGPTDE--IQKINAKLGATRLPGGIYTVSC 303
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
I +P + F I K L P Y+LK + +E+C++GFM DL P+ LWILGD+F
Sbjct: 304 GNINNLPTIDFVINGKAMTLEPTDYLLKVSKMGSEICLTGFMGLDL--PKRKLWILGDIF 361
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++TVFD GK R+GFA+A
Sbjct: 362 IGKFYTVFDMGKNRVGFAKA 381
>gi|256072903|ref|XP_002572773.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043053|emb|CCD78465.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 430
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 72/113 (63%), Gaps = 6/113 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ +
Sbjct: 196 MIQQGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKL 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 109
I + + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 256 TISDMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 306
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 73/140 (52%), Gaps = 4/140 (2%)
Query: 155 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 214
+ + + ++ D C +A+ + T E + IN + P G + C
Sbjct: 249 LFKMDKLTISDMTACPDGCLAIADTGTSMIAGPTDE--IQKINAKLGATRLPGGIYTVSC 306
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
I +P + F I K L P Y+LK + +E+C++GFM DL P+ LWILGD+F
Sbjct: 307 GNINNLPTIDFVINGKAMTLEPTDYLLKVSKMGSEICLTGFMGLDL--PKRKLWILGDIF 364
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
+G ++TVFD GK R+GFA+A
Sbjct: 365 IGKFYTVFDMGKNRVGFAKA 384
>gi|1619323|emb|CAA69878.1| aspartic protease [Trematomus bernacchii]
Length = 406
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 77/134 (57%), Gaps = 2/134 (1%)
Query: 161 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 220
V G S CS A++ L T E +L+ I +L + P +GE IIDC R+ ++
Sbjct: 269 VVQGVSPFCSHGCQAIIDTGTSLIAGPT-EDILN-IQQLIGASPTNIGEFIIDCARLISL 326
Query: 221 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 280
P ++F +G K + L+ E Y+ K G +C SGF A D+ GPLWILGDVF+ +++
Sbjct: 327 PRITFVLGGKEYTLTSEHYVRKEMLGDRMLCFSGFQAVDMVSSEGPLWILGDVFLTQFYS 386
Query: 281 VFDSGKLRIGFAEA 294
+FD G+ R+G A A
Sbjct: 387 IFDRGQDRVGLATA 400
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAE-EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q L+ + VFSF+L+R E + GE++ GG+D + G ++PVT KGYWQ ++ D
Sbjct: 208 MLAQKLLDQPVFSFYLSRKQLVEPQRGELLLGGIDEALYNGPINWLPVTAKGYWQIKM-D 266
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
L+ + C GC AI+D+GTSL+AGPT + I IG
Sbjct: 267 SLVVQGVSPFCSHGCQAIIDTGTSLIAGPTEDILNIQQLIGAS 309
>gi|363743175|ref|XP_003642787.1| PREDICTED: renin-like [Gallus gallus]
Length = 451
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 57/87 (65%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++DC+++P +PN+SF +G K + LS Y+L+ + ++C+ D+PPP GPL
Sbjct: 364 GEYVVDCEKVPQLPNISFHLGGKAYTLSGSAYVLRQTQYGEDICVVALSGLDIPPPAGPL 423
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILG F+G Y+T FD RIGFA A
Sbjct: 424 WILGASFIGHYYTKFDRRNNRIGFATA 450
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q ++ E+VFS + +R+ + GGEI+ GG DP ++ G Y+ +++ GYWQ + +
Sbjct: 259 ILSQQILKEDVFSVYYSRNSPLKPGGEIILGGTDPAYYTGDFHYLSISRSGYWQISMKGV 318
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
+G + C+ GC+ +D+G S + GP V+ + AIG EG +C+ V
Sbjct: 319 SVGAEML-FCKEGCSVAIDTGASYITGPAGPVSVLMKAIGAAEMTEGEYVVDCEKV 373
>gi|195029909|ref|XP_001987814.1| GH19747 [Drosophila grimshawi]
gi|193903814|gb|EDW02681.1| GH19747 [Drosophila grimshawi]
Length = 390
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 59/99 (59%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INE P G+ + C+++P +P +SF +G + F + E Y+ VC S F
Sbjct: 290 INESIGGTPAAYGQFSVPCEQVPHLPTLSFALGGRRFQMKGEDYVFHDIFADRTVCASAF 349
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+A DLP P GPLWILGDVF+ Y+T FD G RIGFA++
Sbjct: 350 IAVDLPSPSGPLWILGDVFLSKYYTEFDMGNHRIGFADS 388
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 67/116 (57%), Gaps = 6/116 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQ L+ VFS +LNRD + +GG + FGG ++++G TYVPVT + YWQ +L
Sbjct: 198 VMEQSLLERPVFSVYLNRDSTSLQGGYLFFGGSSRRYYRGNFTYVPVTHRAYWQVKLEAA 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 112
IG +C+ GC I+D+GTS +A P IN +IGG G S C+ V
Sbjct: 258 YIGKLQ--MCQKGCHVIIDTGTSFIAVPYEQAILINESIGGTPAAYGQFSVPCEQV 311
>gi|326933881|ref|XP_003213026.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 389
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 67/102 (65%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++ L+ +FSF+L+ + +GGE++FGGV+P + G+ ++ PVT+ YWQ + D
Sbjct: 199 MLQENLLDSPIFSFYLSGQ-EGSQGGELIFGGVNPNLYTGQISWTPVTQTTYWQIGIEDF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G QS+G C GC AIVD+GTSLL P V +E+ IG +
Sbjct: 258 TVGGQSSGWCSQGCQAIVDTGTSLLTVPNQVFSELMQYIGAQ 299
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 78/176 (44%), Gaps = 14/176 (7%)
Query: 122 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQN 181
+L+ G+ P QI Y GI E V S CS A+V
Sbjct: 224 ELIFGGVNPNLYTGQISWTPVTQTTYWQIGI----EDFTVGGQSSGWCSQGCQAIVDTGT 279
Query: 182 QLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 239
L Q +++ YI DS G+ + C I MP ++F I F L P Y
Sbjct: 280 SLLTVPNQVFSELMQYIGAQADS----NGQYVASCSNIEYMPTLTFVISGTSFPLPPSAY 335
Query: 240 ILKTGEGIAEVCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+L++ G C G + LP G PLWILGDVF+ VY++++D G R+GFA A
Sbjct: 336 MLQSNSG---YCTVGIESTYLPSETGQPLWILGDVFLRVYYSIYDMGNNRVGFATA 388
>gi|307187033|gb|EFN72332.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 206
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 62/101 (61%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+EQGLVS +FS +LNRD D GG ++ GG DP ++G TY+PVT+KGYWQF +
Sbjct: 1 MIEQGLVSSPIFSVYLNRDVSDEINGGVLILGGSDPAFYEGNLTYIPVTRKGYWQFTIDK 60
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I I +C C AI D+GT + GP ++ IN IG
Sbjct: 61 IKI--DYINLCIESCQAIADTGTPWIIGPISDISRINDLIG 99
>gi|73620985|sp|P81498.2|PEPC_SUNMU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C-1; Flags:
Precursor
gi|9798662|dbj|BAB11753.1| pepsinogen C [Suncus murinus]
Length = 389
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 73/118 (61%), Gaps = 4/118 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G ++ VFSF+L+ ++ GG ++FGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MLQEGALTSPVFSFYLSNQQGSQNGGAVIFGGVDNSLYTGQIFWAPVTQELYWQIGVEEF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
LIG Q+TG C+ GC AIVD+GTSLL P ++ + A G + G ++ C + S
Sbjct: 259 LIGGQATGWCQQGCQAIVDTGTSLLTVPQQFMSALQQATGAQQDQYGQLAVNCNSIQS 316
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ ++C+ I ++P ++F I F L P Y+L T
Sbjct: 287 QQFMSALQQATGAQQDQYGQLAVNCNSIQSLPTLTFIINGVQFPLPPSAYVLNTNG---- 342
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 YCFLGVEPTYLPSQNGQPLWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|229368735|gb|ACQ63017.1| progastricsin (predicted) [Dasypus novemcinctus]
Length = 276
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 74/118 (62%), Gaps = 4/118 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G ++ +FSF+L+ +++GGE+VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 86 MLQEGALTSPIFSFYLSSQEGSQDGGEVVFGGVDQNLYSGEIYWAPVTQELYWQIGIEEF 145
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
LIG Q++G C GC AIVD+GTSLL P ++ + A G + G + +C V S
Sbjct: 146 LIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSALMQATGAQENEYGEYAVDCSSVQS 203
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + N GE +DC + ++P ++F I F L P YIL +G
Sbjct: 174 QQYMSALMQATGAQENEYGEYAVDCSSVQSLPTLTFVINGVQFPLPPSAYILSN-DG--- 229
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP G PLWILGDVF+ Y++++D G +IGFA AA
Sbjct: 230 YCTVGVEPTYLPSQNGQPLWILGDVFLRSYYSIYDMGNNQIGFATAA 276
>gi|47215111|emb|CAG02535.1| unnamed protein product [Tetraodon nigroviridis]
Length = 375
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 65/94 (69%), Gaps = 1/94 (1%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
L+ + VFSF+LNRDP+A GGE++ GG +P+H+ G+ YV VT++ YWQ ++ I +G++
Sbjct: 214 LLPQNVFSFYLNRDPEAAVGGELMLGGANPEHYTGQLHYVSVTRQSYWQIQVNGIEVGDR 273
Query: 66 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 99
+C C A+VD+GTSL+ GP + ++ AI
Sbjct: 274 -LNLCRPSCQAVVDTGTSLITGPAEEIEALHEAI 306
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 40/62 (64%), Gaps = 1/62 (1%)
Query: 233 NLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
++ P +T EG VC+SGFMA D PP PLWILGDVF+ Y+TVFD R+GFA
Sbjct: 315 DVQPSPSRPETQEG-ESVCVSGFMAVDAGPPEEPLWILGDVFIMKYYTVFDRSADRVGFA 373
Query: 293 EA 294
A
Sbjct: 374 PA 375
>gi|45643446|gb|AAS72876.1| aspartyl protease [Triatoma infestans]
Length = 387
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/97 (48%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF-KGKHTYVPVTKKGYWQFELGD 59
M++QGL+ + VFS +LNR+PD E GGEI+FGGVD K F K T VP+T YW F++ +
Sbjct: 197 MIDQGLLDKPVFSVYLNRNPDEEVGGEIIFGGVDEKRFNKESLTTVPLTNPTYWMFKMDE 256
Query: 60 ILI-GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 95
+ G C+ GC A D+GTS + GPT V EI
Sbjct: 257 VSTSGTNGKSWCQNGCRATADTGTSFIVGPTKEVAEI 293
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 5/85 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G + CD + +P+++F + K + L E Y+L+ E + CI GF + LP P
Sbjct: 304 GVGYVPCDELHKLPDITFHLNGKGYTLKAEDYVLEMTEAGEKACIVGFAS--LP---QPF 358
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y+T+F+ + FA
Sbjct: 359 WILGDVFLGKYYTIFNVEDRTVSFA 383
>gi|195134382|ref|XP_002011616.1| GI11126 [Drosophila mojavensis]
gi|193906739|gb|EDW05606.1| GI11126 [Drosophila mojavensis]
Length = 421
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 63/100 (63%), Gaps = 3/100 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGL++ VFSF+L R+ +++GGE++FGG DP +KG TY +T++ YWQF +
Sbjct: 208 MLAQGLIANPVFSFYLARNGTSQQGGELIFGGSDPSLYKGSMTYADITQQNYWQFNMDSA 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ Q V CAAI D+GTSLL PT + +I +G
Sbjct: 268 TLNGQ---VLCTNCAAIADTGTSLLVAPTDIYNKIKVVLG 304
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/87 (41%), Positives = 45/87 (51%), Gaps = 9/87 (10%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
+ IDC MP FTIG K+F + YI+ T G CI G + W
Sbjct: 308 DDTIDCSNTSNMPTFLFTIGGKVFGVPNSAYIISTDTG----CILGVSGME-----SQFW 358
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGDVF+G Y++ FD GK RIGFA +
Sbjct: 359 ILGDVFLGQYYSEFDLGKNRIGFASVS 385
>gi|5921649|gb|AAD56283.1|AF156787_1 pepsinogen A form IIa [Pseudopleuronectes americanus]
Length = 378
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 80/128 (62%), Gaps = 11/128 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLV +++FS +L+ P+A++G + FGGVDP H+ G T++P++ + YWQ + +
Sbjct: 197 MMTEGLVDQDLFSVYLS--PNAQQGSVVTFGGVDPNHYNGAITWIPLSSELYWQITVDSV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG-----GEGVVSAECKLVVSQ 115
+ Q C GGC AIVD+GTSL+ GP ++ IN+ +G G+ VV+ +SQ
Sbjct: 255 TVNGQVVA-CSGGCQAIVDTGTSLIVGPQSSISNINNYVGATSQNGDYVVNCNS---ISQ 310
Query: 116 YGDLIWDL 123
D+I+ +
Sbjct: 311 MPDVIFHI 318
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +++C+ I MP+V F I + F + Y+ ++ C +G L L
Sbjct: 299 GDYVVNCNSISQMPDVIFHIHGQQFTIPASAYVRQSQY---YGCSTG-----LGNGGDNL 350
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++F + +G A+A
Sbjct: 351 WILGDVFIRQYYSIFSRAQNMVGLAQA 377
>gi|432103960|gb|ELK30793.1| Gastricsin [Myotis davidii]
Length = 390
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 65/102 (63%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG ++ +FSF+ +R P + GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 200 MLQQGQLTLPIFSFYFSRQPTRQYGGELILGGVDQQLYSGQIVWAPVTQELYWQIAIQEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG+Q+TG C GC AIVD+GT LLA P + A G E
Sbjct: 260 AIGDQATGWCSQGCQAIVDTGTFLLAVPQQYMGSFLQATGAE 301
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++ C+ + ++P ++FTI F L P Y+L C G A LP P G P
Sbjct: 306 GDFVVACNSVESLPTITFTISGSQFPLPPSAYVLNNNG----YCRLGIEATYLPSPNGQP 361
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++V+D R+GFA AA
Sbjct: 362 LWILGDVFLKEYYSVYDMAHNRVGFAFAA 390
>gi|397504824|ref|XP_003822980.1| PREDICTED: cathepsin E [Pan paniscus]
Length = 396
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L +I
Sbjct: 204 MLAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 314
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|348514690|ref|XP_003444873.1| PREDICTED: pepsin A-like [Oreochromis niloticus]
Length = 377
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 65/102 (63%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLVSE +FS +L+ + +E+G E+VFGG D H+ G T++P++ YWQ + +
Sbjct: 195 MITEGLVSEPIFSVYLSGN--SEQGSEVVFGGTDSTHYTGTITWIPLSSATYWQINMDSV 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 253 TINGQ-TVACSGGCQAIIDTGTSLIVGPTTDINNLNSWVGAS 293
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 67/132 (50%), Gaps = 10/132 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + ++ +N + + G++I++C IP+MP+V
Sbjct: 256 GQTVACSGGCQAIIDTGTSLIVGPTTD--INNLNSWVGASTDQSGDAIVNCQNIPSMPDV 313
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+FT+ F + Y+ ++ G C++GF LWILGDVF+ Y+ VF+
Sbjct: 314 TFTLNGNAFTVPASAYVSQSSSG----CMTGFGQ----GGTMQLWILGDVFIREYYAVFN 365
Query: 284 SGKLRIGFAEAA 295
+ IG A++A
Sbjct: 366 AQTQNIGLAKSA 377
>gi|46397366|sp|P14091.2|CATE_HUMAN RecName: Full=Cathepsin E; Contains: RecName: Full=Cathepsin E form
I; Contains: RecName: Full=Cathepsin E form II; Flags:
Precursor
Length = 401
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L +I
Sbjct: 209 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNI 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 269 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 319
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 313 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 372
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 373 WILGDVFIRQFYSVFDRGNNRVGLAPA 399
>gi|4503145|ref|NP_001901.1| cathepsin E isoform a preproprotein [Homo sapiens]
gi|114572172|ref|XP_001163151.1| PREDICTED: cathepsin E isoform 2 [Pan troglodytes]
gi|181194|gb|AAA52130.1| cathepsin E precursor [Homo sapiens]
gi|181205|gb|AAA52300.1| cathepsin E [Homo sapiens]
gi|7339520|emb|CAB82850.1| procathepsin E [Homo sapiens]
gi|27502799|gb|AAH42537.1| Cathepsin E [Homo sapiens]
gi|61358295|gb|AAX41543.1| cathepsin E [synthetic construct]
gi|119611998|gb|EAW91592.1| cathepsin E, isoform CRA_a [Homo sapiens]
gi|158257546|dbj|BAF84746.1| unnamed protein product [Homo sapiens]
gi|325463731|gb|ADZ15636.1| cathepsin E [synthetic construct]
Length = 396
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L +I
Sbjct: 204 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 314
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|60816208|gb|AAX36374.1| cathepsin E [synthetic construct]
Length = 396
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L +I
Sbjct: 204 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 314
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 51/87 (58%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLTPA 394
>gi|374431137|gb|AEZ51819.1| pepsin, partial [Oreochromis niloticus]
Length = 339
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 65/102 (63%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLVSE +FS +L+ + +E+G E+VFGG D H+ G T++P++ YWQ + +
Sbjct: 157 MITEGLVSEPIFSVYLSGN--SEQGSEVVFGGTDSTHYTGTITWIPLSSATYWQINMDSV 214
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 215 TINGQ-TVACSGGCQAIIDTGTSLIVGPTTDINNLNSWVGAS 255
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 67/132 (50%), Gaps = 10/132 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + ++ +N + + G++I++C IP+MP+V
Sbjct: 218 GQTVACSGGCQAIIDTGTSLIVGPTTD--INNLNSWVGASTDQSGDAIVNCQNIPSMPDV 275
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+FT+ F + Y+ ++ G C++GF LWILGDVF+ Y+ VF+
Sbjct: 276 TFTLNGNAFTVPASAYVSQSSSG----CMTGFGQ----GGTMQLWILGDVFIREYYAVFN 327
Query: 284 SGKLRIGFAEAA 295
+ IG A++A
Sbjct: 328 AQTQNIGLAKSA 339
>gi|307187094|gb|EFN72348.1| Cathepsin D [Camponotus floridanus]
Length = 308
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 68/110 (61%), Gaps = 4/110 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++EQGLVS +FSF+LNRD A+ GG+++FG DP ++G TY+PVT K YWQF + I
Sbjct: 189 IIEQGLVSSRIFSFYLNRDTSADLGGKLIFGSSDPACYEGDFTYIPVTSKQYWQFTIDSI 248
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH--AIGGEGVVSAE 108
+ + + C GGC AIVD+ + GP ++ I +G+VS +
Sbjct: 249 QMNDIT--WCLGGCEAIVDTNAWQITGPQSDISNIYDFTETNPQGIVSKQ 296
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 56/98 (57%), Gaps = 2/98 (2%)
Query: 16 LNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCA 75
L+RD A+ GGE+ GG +P +++G TYVPVT KGYWQF + I + + +C+ C
Sbjct: 10 LHRDTSADFGGELTLGGSNPTYYEGDFTYVPVTNKGYWQFIIDSIQV--KHITLCKESCQ 67
Query: 76 AIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 113
AIVD+G + GP + I I + + CK ++
Sbjct: 68 AIVDTGCWQIVGPKMDIQFIYLLIETDSLGRVNCKKIL 105
>gi|213688|gb|AAA49530.1| pepsinogen [Rana catesbeiana]
Length = 384
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 65/102 (63%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L+++ +F+F+L+ +++ GGE+ FGGVD ++ G+ + PVT + YWQ +
Sbjct: 193 MIQQNLINQPLFAFYLSGQQNSQNGGEVAFGGVDQNYYSGQIYWTPVTSETYWQIGIQGF 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ Q+TG C GC IVD+GTSLL P V + + +IG +
Sbjct: 253 SVNGQATGWCSQGCQGIVDTGTSLLTAPQSVFSSLMQSIGAQ 294
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 58/107 (54%), Gaps = 4/107 (3%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+ V S + + + + G+ + C I ++P +SFTI F L P Y+L+ G
Sbjct: 281 QSVFSSLMQSIGAQQDQNGQYAVSCSNIQSLPTISFTISGVSFPLPPSAYVLQQNSG--- 337
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G M LP G PLWILGDVF+ Y++V+D G ++GFA AA
Sbjct: 338 YCTIGIMPTYLPSQNGQPLWILGDVFLRQYYSVYDLGNNQVGFAAAA 384
>gi|50978660|ref|NP_001003028.1| pepsin B precursor [Canis lupus familiaris]
gi|73621387|sp|Q8SQ41.1|PEPB_CANFA RecName: Full=Pepsin B; Flags: Precursor
gi|19911571|dbj|BAB86888.1| pepsinogen B [Canis lupus familiaris]
Length = 390
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 64/102 (62%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG +++ +FSF+ + P E GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 200 MMQQGQLTQPIFSFYFSPQPTYEYGGELILGGVDTQFYSGEIVWAPVTREMYWQVAIDEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIGNQ+TG+C GC IVD+GT L P + A G +
Sbjct: 260 LIGNQATGLCSQGCQGIVDTGTFPLTVPQQYLDSFVKATGAQ 301
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G +++C+ I +MP ++F I L P Y+L C G LP P G P
Sbjct: 306 GNFVVNCNSIQSMPTITFVISGSPLPLPPSTYVLNNNG----YCTLGIEVTYLPSPNGQP 361
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y+TVFD R+GFA ++
Sbjct: 362 LWILGDVFLREYYTVFDMAANRVGFALSS 390
>gi|187608619|ref|NP_001120469.1| cathepsin E precursor [Xenopus (Silurana) tropicalis]
gi|170284872|gb|AAI61297.1| LOC100145572 protein [Xenopus (Silurana) tropicalis]
Length = 397
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 65/100 (65%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++R+P++ GGE+VFGG D F G+ +V VT +GYWQ +L +I
Sbjct: 200 MMTQNLVELPMFSVYMSRNPNSPVGGELVFGGFDASRFSGQLNWVSVTNQGYWQIQLDNI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I N C GGC AIVD+GTSL+ GP+ + ++ IG
Sbjct: 260 QI-NGEVVFCTGGCQAIVDTGTSLITGPSSDIVQLQSIIG 298
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 56/85 (65%), Gaps = 3/85 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC + MP V+FTI + ++P+QY L+ G GI C SGF D+ PP GPL
Sbjct: 304 GDYEVDCSVLNEMPTVTFTINGIGYQMTPQQYTLQDGGGI---CSSGFQGLDISPPAGPL 360
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y++VFD G R+G A
Sbjct: 361 WILGDVFIGQYYSVFDRGNNRVGLA 385
>gi|426333516|ref|XP_004028322.1| PREDICTED: cathepsin E isoform 1 [Gorilla gorilla gorilla]
Length = 396
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L +I
Sbjct: 204 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGSAPVDGEYAVEC 314
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|126309841|ref|XP_001370380.1| PREDICTED: gastricsin-like isoform 1 [Monodelphis domestica]
Length = 388
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 66/102 (64%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q +++ +FSF+L+ ++ GGE++FGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 MLQQNVLTNPIFSFYLSNQQSSQSGGEVIFGGVDNNLYSGQIYWAPVTQELYWQIGIQEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q+TG C GC AIVD+GTSLL P ++ A GG+
Sbjct: 258 SIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGGQ 299
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++DC+ I +P +SF I F LSP YIL C G L G P
Sbjct: 304 GQYVVDCNSIQNLPTISFLINGVQFPLSPSAYILNNNG----YCTVGIEPTYLASQNGQP 359
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 360 LWILGDVFLRSYYSVYDMGNNRVGFATAA 388
>gi|57233525|gb|AAW48311.1| pepsinogen [Oreochromis mossambicus]
Length = 260
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 65/102 (63%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLVSE +FS +L+ + +E+G E+VFGG D H+ G T++P++ YWQ + +
Sbjct: 102 MITEGLVSEPIFSVYLSGN--SEQGSEVVFGGTDSTHYTGTITWIPLSSATYWQINMDSV 159
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 160 TINGQ-TVACSGGCQAIIDTGTSLIVGPTTDINNLNSWVGAS 200
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/108 (25%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + ++ +N + + G++I++C IP+MP+V
Sbjct: 163 GQTVACSGGCQAIIDTGTSLIVGPTTD--INNLNSWVGASTDQSGDAIVNCQNIPSMPDV 220
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
+FT+ F + Y+ ++ + C++GF LWILG
Sbjct: 221 TFTLNGNAFTVPASAYVSQS----SSSCMTGFGQ----GGTMQLWILG 260
>gi|126309843|ref|XP_001370404.1| PREDICTED: gastricsin-like isoform 2 [Monodelphis domestica]
Length = 389
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 66/102 (64%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q +++ +FSF+L+ ++ GGE++FGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 MLQQNVLTNPIFSFYLSNQQSSQSGGEVIFGGVDNNLYSGQIYWAPVTQELYWQIGIQEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q+TG C GC AIVD+GTSLL P ++ A GG+
Sbjct: 258 SIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGGQ 299
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 50/89 (56%), Gaps = 4/89 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++DC+ I +P +SF I F LSP YIL G C G L G P
Sbjct: 304 GQYVVDCNSIQNLPTISFLINGVQFPLSPSAYILNQNNG---YCTVGIEPTYLASQNGQP 360
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 361 LWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|5921651|gb|AAD56284.1|AF156788_1 pepsinogen A form IIb precursor [Pseudopleuronectes americanus]
Length = 376
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 72/114 (63%), Gaps = 9/114 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LVS+ +FS +L+ + A++G E++FGG D H+ G+ +++P+T YWQ ++ +
Sbjct: 195 MIKQNLVSQPLFSVYLSSN--AQQGSEVIFGGTDSSHYTGQISWIPLTSATYWQIKMDSV 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAE 108
I Q T C GGC AI+D+GTSL+ GP ++ +N +G GE VS +
Sbjct: 253 TINGQ-TVACSGGCQAIIDTGTSLIVGPNSDISNMNSWVGASTNQYGEATVSCQ 305
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 9/102 (8%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
+S +N + N GE+ + C I +MP+V+FT+ K F + Y+ ++ G C
Sbjct: 283 ISNMNSWVGASTNQYGEATVSCQNIQSMPDVTFTLNGKAFTVPASAYVSQSNYG----CS 338
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+GF LWILGDVF+ Y+ VFD+ IG A++
Sbjct: 339 TGF-----GQGGSNLWILGDVFIREYYAVFDAPSKYIGLAKS 375
>gi|430811193|emb|CCJ31368.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 411
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 61/88 (69%), Gaps = 4/88 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ I+DC++IPT+P+++F G F+L P +YI+K I C++ F D+PPP GPL
Sbjct: 327 GDYILDCNKIPTLPDITFGFGHHNFSLGPNEYIIK----IQSKCMTTFTGMDIPPPAGPL 382
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WI+GDVF+ Y++V+D GK +G A+A
Sbjct: 383 WIIGDVFLRKYYSVYDLGKNMVGLAKAT 410
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 62/110 (56%), Gaps = 16/110 (14%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ------ 54
MVEQ ++E VF+FW+ EGGE FGG+DP H++G TY+PV +K YW+
Sbjct: 219 MVEQNAINEPVFAFWMGNIEKDIEGGECTFGGIDPMHYEGDLTYIPVRRKAYWEAFCLVD 278
Query: 55 ---FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
F G IG ++ G AI+D+GTSL+ P + +N+AIG
Sbjct: 279 LSFFAYGKDFIGMENVG-------AILDTGTSLIVMPKNIADLLNNAIGA 321
>gi|73535294|pdb|1TZS|A Chain A, Crystal Structure Of An Activation Intermediate Of
Cathepsin E
Length = 351
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L +I
Sbjct: 151 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNI 210
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 211 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 261
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 255 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 314
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 315 WILGDVFIRQFYSVFDRGNNRVGLAPA 341
>gi|444725492|gb|ELW66056.1| Gastricsin [Tupaia chinensis]
Length = 389
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 68/102 (66%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G+++ VFSF+L+ +E+GG ++FGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MLQEGVLTSPVFSFYLSNQQGSEDGGAVIFGGVDNSLYSGQIYWAPVTQELYWQIGIEEF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 259 LIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSTLLQATGAQ 300
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++CD I ++P +F I F L P YIL
Sbjct: 287 QQYMSTLLQATGAQEDEYGQFLVNCDNIQSLPTFTFIINGVQFPLPPSAYILSNNGA--- 343
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C+ G A LP G PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 344 -CMVGVEATYLPSQNGQPLWILGDVFLRSYYSVYDMSNNRVGFATAA 389
>gi|387915422|gb|AFK11320.1| cathepsin E-A-like protein [Callorhinchus milii]
Length = 401
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 75/118 (63%), Gaps = 5/118 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV E +FS +NR+ D++ GGE++ GG++ + + G +VPVT++GYWQ + ++
Sbjct: 208 MMHQNLVVEPIFSVLINREMDSDYGGELLLGGINHECYTGSINWVPVTERGYWQIRMDNV 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----GEGVVSAECKLVVS 114
I T +C GCAAIVD+GTSL+ GP + +++ +G G+G +CK + S
Sbjct: 268 KIDGMLT-LCINGCAAIVDTGTSLITGPEKEIRKLHKQLGAMSVGDGEYVVDCKRISS 324
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 68/105 (64%), Gaps = 1/105 (0%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
EK + +++ ++ GE ++DC RI +M +V+FTIG+ F+LSP Y+ K +G
Sbjct: 295 EKEIRKLHKQLGAMSVGDGEYVVDCKRISSMASVTFTIGEVEFSLSPNDYV-KKFQGDHS 353
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+C+SGF D+ GPLWILGDVF+ ++T+FD G R+GFA +
Sbjct: 354 LCLSGFQEMDMVTRAGPLWILGDVFLTKFYTIFDRGNDRVGFARS 398
>gi|307178902|gb|EFN67430.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 377
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 67/111 (60%), Gaps = 2/111 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQGL+S +FSF LNR+ GGE + GG DP +KG TY+P+++KG+WQ + I
Sbjct: 222 MIEQGLLSSHIFSFHLNRNSSELNGGEFILGGSDPAFYKGDFTYIPLSRKGFWQLSVDKI 281
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
LI + +C GC A + +G S + GP + IN+ + G V+ + +L
Sbjct: 282 LI--EDINLCGKGCQANIVTGDSAIIGPEKHIQFINNIVIGTVNVNGDERL 330
>gi|332024604|gb|EGI64802.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 361
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 69/113 (61%), Gaps = 6/113 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
MV+QGLVS +FSF+LNR+ D+ G ++ GG DP + G+ TYV VT KGYWQF +
Sbjct: 169 MVQQGLVSSPIFSFYLNRNLLDSSAGSVLILGGSDPALYDGELTYVNVTHKGYWQFTMDK 228
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
I + N++ +C GC AI D+G S LAGP + I I + GVV +C
Sbjct: 229 IQMENET--LCVNGCQAIADTGFSRLAGPPTDIAIITSRIAIDDFNGVVYVDC 279
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 50/89 (56%), Gaps = 3/89 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYIL--KTGEGIAEVCISGFMAFDLPPPRG 265
G +DCD+I +PNV+F + K F L+ E YI+ K + VC S F G
Sbjct: 273 GVVYVDCDQISNLPNVTFFLSGKPFVLTAEDYIIVRKIDKKGTPVCYSAF-EIAAQSEFG 331
Query: 266 PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+W+LGD F+G Y+T FD G R+GFA A
Sbjct: 332 IMWVLGDSFLGRYYTEFDMGNDRVGFAPA 360
>gi|195575783|ref|XP_002077756.1| GD23099 [Drosophila simulans]
gi|194189765|gb|EDX03341.1| GD23099 [Drosophila simulans]
Length = 405
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 68/117 (58%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ E+GL+ E VF F+L R+ A EGG++ GG D G+ TY PVT++GYWQF + +I
Sbjct: 221 LYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQNLIAGEMTYTPVTQQGYWQFSVNNI 280
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
+ V GC AI D+GTSL+A P+ ++N+ IGG +G C V S
Sbjct: 281 ---TWNGTVISSGCQAIADTGTSLIAAPSAAYIQLNNLIGGVLIQGEYYVPCSTVSS 334
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE + C + ++P ++ IG F L P YI EG C+S F
Sbjct: 323 GEYYVPCSTVSSLPVLTINIGGTNFYLPPSVYIQTYTEGNYTTCMSTFTDIGTG-----F 377
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G Y++ FD G+ R+GFA A
Sbjct: 378 WILGDVFLGQYYSEFDFGQNRVGFATLA 405
>gi|195350353|ref|XP_002041705.1| GM16820 [Drosophila sechellia]
gi|194123478|gb|EDW45521.1| GM16820 [Drosophila sechellia]
Length = 405
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 68/117 (58%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ E+GL+ E VF F+L R+ A EGG++ GG D G+ TY PVT++GYWQF + +I
Sbjct: 221 LYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQNLIAGEMTYTPVTQQGYWQFSVNNI 280
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
+ V GC AI D+GTSL+A P+ ++N+ IGG +G C V S
Sbjct: 281 ---TWNGTVISSGCQAIADTGTSLIAAPSAAYIQLNNLIGGVLIQGEYYVPCSTVSS 334
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE + C + ++P ++ IG F L P YI EG C+S F
Sbjct: 323 GEYYVPCSTVSSLPVLTINIGGTNFYLPPSVYIQTYTEGNYTTCMSTFTDIGTG-----F 377
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G Y++ FD G+ R+GFA A
Sbjct: 378 WILGDVFLGQYYSEFDFGQNRVGFATLA 405
>gi|307190308|gb|EFN74391.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 98
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 3/93 (3%)
Query: 7 VSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQS 66
V + +FSF+L+RDP A EGGE++ GG DP H++G TYVPVT+KGYWQF + I + S
Sbjct: 1 VEQPIFSFYLSRDPSAMEGGELILGGSDPNHYEGNFTYVPVTQKGYWQFTMDRIEM---S 57
Query: 67 TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 99
V +I D+GTSL+ GP + IN I
Sbjct: 58 DYVVTENKQSIADTGTSLIVGPNSDIDIINEFI 90
>gi|335287195|ref|XP_003355296.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 391
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 71/119 (59%), Gaps = 5/119 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q +++ +FSF+ +R P E GGE++ GGVD + + G+ + PVT++ YWQ + +
Sbjct: 200 MLQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDTQLYSGQIVWTPVTRELYWQIAIQEF 259
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
IG+Q+TG C GC AIVD+GT LLA P + A G + G +C LV S
Sbjct: 260 AIGDQATGWCFSQGCQAIVDTGTFLLAVPQQYLASFLQATGAQEAQNGDFVVDCDLVQS 318
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++DCD + +MP ++F IG F L P Y+ + C G A LP G P
Sbjct: 307 GDFVVDCDLVQSMPTITFIIGGSQFPLPPSAYVFSNNDS----CRLGIEASYLPSSSGEP 362
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++V+D R+GFA +A
Sbjct: 363 LWILGDVFLKEYYSVYDMANNRVGFALSA 391
>gi|403294878|ref|XP_003938389.1| PREDICTED: cathepsin E [Saimiri boliviensis boliviensis]
Length = 396
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L +I
Sbjct: 204 MMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVEC 314
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L E C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMEFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|190576563|gb|ACE79054.1| gastricsin precursor (predicted) [Sorex araneus]
Length = 389
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 70/120 (58%), Gaps = 10/120 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++ G + VFSF+L+ +++GG +VFGGVD + G+ + PVT++ YWQ +
Sbjct: 199 MLQAGALDSPVFSFYLSNQQSSQDGGAVVFGGVDNSLYTGQIFWTPVTQELYWQIGVEQF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG Q+TG C GC AIVD+GTSLL P ++ + A G + + QYG ++
Sbjct: 259 LIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSALQQATGAQ----------LDQYGQMV 308
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++C+ I +P ++F I F L P Y+L
Sbjct: 287 QQYMSALQQATGAQLDQYGQMVVNCNNIQNLPTLTFVINGVQFPLLPSAYVLNNNG---- 342
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP P G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 YCTLGVEPTYLPSPTGQPLWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|301786583|ref|XP_002928700.1| PREDICTED: pepsin A-like isoform 2 [Ailuropoda melanoleuca]
Length = 393
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/119 (41%), Positives = 69/119 (57%), Gaps = 3/119 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGLVS+++FS +L+ D + G ++FGG+D +F G +VPV+ +GYWQ + +
Sbjct: 200 MWNQGLVSQDLFSVYLSSD--DQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQITMDSV 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
I Q+ C GC AIVD+GTSLLAGPT + I IG + E + S DL
Sbjct: 258 TINGQAIA-CSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSAINDL 315
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 63/137 (45%), Gaps = 9/137 (6%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G + CS A+V L T + SYI DS GE I C I +P
Sbjct: 261 GQAIACSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDS----NGEMTISCSAINDLP 316
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIA---EVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
++ FTI + L P YIL+ A + C+SGF +LP G LWILGD+F+ Y
Sbjct: 317 DIVFTINGIQYPLPPSAYILQVSGLWASRLQDCVSGFQGMNLPTASGELWILGDIFIRQY 376
Query: 279 HTVFDSGKLRIGFAEAA 295
VFD ++G A A
Sbjct: 377 FAVFDRANNQVGLAPVA 393
>gi|301786581|ref|XP_002928699.1| PREDICTED: pepsin A-like isoform 1 [Ailuropoda melanoleuca]
gi|281347483|gb|EFB23067.1| hypothetical protein PANDA_018738 [Ailuropoda melanoleuca]
Length = 385
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/119 (41%), Positives = 69/119 (57%), Gaps = 3/119 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGLVS+++FS +L+ D + G ++FGG+D +F G +VPV+ +GYWQ + +
Sbjct: 200 MWNQGLVSQDLFSVYLSSD--DQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQITMDSV 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
I Q+ C GC AIVD+GTSLLAGPT + I IG + E + S DL
Sbjct: 258 TINGQAIA-CSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSAINDL 315
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 62/134 (46%), Gaps = 11/134 (8%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G + CS A+V L T + SYI DS GE I C I +P
Sbjct: 261 GQAIACSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDS----NGEMTISCSAINDLP 316
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI + L P YIL+ + C+SGF +LP G LWILGD+F+ Y V
Sbjct: 317 DIVFTINGIQYPLPPSAYILQNQD-----CVSGFQGMNLPTASGELWILGDIFIRQYFAV 371
Query: 282 FDSGKLRIGFAEAA 295
FD ++G A A
Sbjct: 372 FDRANNQVGLAPVA 385
>gi|57619120|ref|NP_001009868.1| pregnancy-associated glycoprotein precursor [Felis catus]
gi|2689727|gb|AAB91422.1| pregnancy-associated glycoprotein [Felis catus]
Length = 388
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 68/106 (64%), Gaps = 3/106 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGL+S+E+F+F+L++ EEG ++FGGVD ++ G +VPV+K+ YWQ + I +
Sbjct: 203 KQGLISQELFAFYLSKKD--EEGSVVMFGGVDHSYYSGDLNWVPVSKRLYWQLSMDSISM 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
N C+GGC AI+D+GTSLL GP+ VV I IG S E
Sbjct: 261 -NGEVIACDGGCQAIIDTGTSLLIGPSHVVFNIQMIIGANQSYSGE 305
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 44/87 (50%), Gaps = 3/87 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++DCD T+P++ FTI + + YI EG C SGF
Sbjct: 304 GEYVVDCDAANTLPDIVFTINGIDYPVPASAYI---QEGPQGTCYSGFDESGDSLLVSDS 360
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +Y TVFD RIG A A
Sbjct: 361 WILGDVFLRLYFTVFDRENNRIGLALA 387
>gi|384490965|gb|EIE82161.1| hypothetical protein RO3G_06866 [Rhizopus delemar RA 99-880]
Length = 403
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 69/121 (57%), Gaps = 4/121 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
M+ + L+ +E+FSFWL+ D GGE+ FGG+D F G T+ PVT+KGYW+ EL
Sbjct: 214 MINKKLIDQEIFSFWLSDTNDGNNNLGGELAFGGIDEARFSGNITWSPVTRKGYWEIELQ 273
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD 118
+ +Q + G A +D+GTSLL PT V +N+ IGG+ + + S G+
Sbjct: 274 NTKFNDQPMNM--GSIGAAIDTGTSLLIAPTAVAEFVNNQIGGQADAYGQYTVDCSSVGN 331
Query: 119 L 119
L
Sbjct: 332 L 332
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 4/103 (3%)
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
V ++N + G+ +DC + +P F K F L + YIL + C
Sbjct: 304 VAEFVNNQIGGQADAYGQYTVDCSSVGNLPEFCFQFSGKDFCLQGKDYILD----VDGQC 359
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+SGF+A D+PPP GPLWI+GDVF+ +++++D R+GFA++
Sbjct: 360 MSGFVALDIPPPAGPLWIVGDVFLRKFYSIYDLQNHRVGFAQS 402
>gi|2687645|gb|AAB88862.1| cathepsin D [Sparus aurata]
Length = 399
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 63/99 (63%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+++ +LP GE +DC T+G ++FNL+ E Y++K + +C+SGF
Sbjct: 300 LHKAIGALPLIDGEYGLDCSGSHRCLLSLSTLGGRMFNLTGEDYVMKESQMGMSICVSGF 359
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
MA D+PPP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 360 MAMDIPPPAGPLWILGDVFIGKYYTVFDRNADRVGFAPA 398
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 72/129 (55%), Gaps = 12/129 (9%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK-GKHTYVPVTKKGYWQFELGDILIGN 64
L+ + +FSF+L RDP A GGE+ GG DP G YV VT+K YW + + +GN
Sbjct: 211 LLPQNIFSFYLTRDPKAAVGGELTLGGTDPHVLTLGDLHYVNVTRKAYWHIGMDGLQVGN 270
Query: 65 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----------CKLVVS 114
Q + +C+ GC AIVD+GTSL+ GP V ++ AIG ++ E C L +S
Sbjct: 271 QLS-LCKAGCEAIVDTGTSLIVGPVEEVRALHKAIGALPLIDGEYGLDCSGSHRCLLSLS 329
Query: 115 QYGDLIWDL 123
G +++L
Sbjct: 330 TLGGRMFNL 338
>gi|162423776|gb|ABX89618.1| pepsinogen [Sparus aurata]
Length = 376
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 73/118 (61%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+QGLVS+ +FS +L+ + +E+G E+VFGG D H+ G+ T++P++ Y+Q + +
Sbjct: 194 MVQQGLVSQPMFSVYLSGN--SEQGSEVVFGGTDSNHYTGQITWIPLSSATYYQISMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
I Q T C GGC AI+D+GTSL+ GPT + +N +G G + C+ + S
Sbjct: 252 TINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNSWVGASTNEYGEATVNCQNIQS 308
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 53/100 (53%), Gaps = 8/100 (8%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N + N GE+ ++C I +MP+V+FT+ F + Y+ ++ G C++GF
Sbjct: 285 MNSWVGASTNEYGEATVNCQNIQSMPDVTFTLNGHAFTVPASAYVSQSSYG----CMTGF 340
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y+ VFD+ +G A++A
Sbjct: 341 G----QGGSQQLWILGDVFIREYYAVFDTQAQSVGLAKSA 376
>gi|195433875|ref|XP_002064932.1| GK15196 [Drosophila willistoni]
gi|194161017|gb|EDW75918.1| GK15196 [Drosophila willistoni]
Length = 415
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 77/130 (59%), Gaps = 11/130 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
++ QGL+ E +FS +LNR+ +A GGE++ GG+DP + G TYVPV+++GYWQFE+
Sbjct: 229 LISQGLIPEPIFSIYLNRNGTNATNGGELILGGIDPALYSGCLTYVPVSQQGYWQFEMTS 288
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLVV 113
+ +Q C+ C AI+D GTSL+ P + EIN +G G +C +
Sbjct: 289 ATLNDQE--FCD-NCQAILDVGTSLIVVPNSEIKEINQILGVTNPNATSGAFLVDCA-TI 344
Query: 114 SQYGDLIWDL 123
S+ D+I+ +
Sbjct: 345 SKLPDIIFTI 354
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 90/200 (45%), Gaps = 37/200 (18%)
Query: 120 IWDLLVSGLLPEKV----CQQIGLCAFNGAEYVSTGIKTVV------------------E 157
+++L+ GL+PE + + G A NG E + GI + E
Sbjct: 226 LYNLISQGLIPEPIFSIYLNRNGTNATNGGELILGGIDPALYSGCLTYVPVSQQGYWQFE 285
Query: 158 KENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES---IIDC 214
+ + D C C+ A++ V L E + IN++ + NP S ++DC
Sbjct: 286 MTSATLNDQEFCDNCQ-AILDVGTSLIVVPNSE--IKEINQIL-GVTNPNATSGAFLVDC 341
Query: 215 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 274
I +P++ FTI K F L YILK G C+SGF D WILG+VF
Sbjct: 342 ATISKLPDIIFTIARKEFALKSTDYILKYGN----TCVSGFSTLD----GIDFWILGEVF 393
Query: 275 MGVYHTVFDSGKLRIGFAEA 294
MG Y+TVFD G +IG A A
Sbjct: 394 MGAYYTVFDIGYNQIGIATA 413
>gi|16119024|gb|AAL14708.1|AF420068_1 aspartic protease [Clonorchis sinensis]
Length = 419
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 75/136 (55%), Gaps = 4/136 (2%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
++++ G +C A+ L ++E + +N+ ++ P G IDC R+
Sbjct: 248 DSMNVGSMKLCENGCQAIADTGTSLIAGPSEE--VGKLNDALGAINIPGGTYYIDCSRVS 305
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
T+P V F+I K+ L P YIL+ +CISGFM ++ P GPLWI G+VF+G Y
Sbjct: 306 TLPPVQFSISGKLMQLDPSDYILRMTWFGKTICISGFMGINI--PGGPLWIFGEVFIGKY 363
Query: 279 HTVFDSGKLRIGFAEA 294
+T+FD G R+GFA A
Sbjct: 364 YTIFDVGNARVGFATA 379
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 65/100 (65%), Gaps = 8/100 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QG F F L+R+ GGE++ GG DPK++KG+ + P+T + YWQF++ +
Sbjct: 197 MISQG------FGFRLDRNRSDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSM 250
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 251 NVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 288
>gi|195470499|ref|XP_002087544.1| GE17593 [Drosophila yakuba]
gi|194173645|gb|EDW87256.1| GE17593 [Drosophila yakuba]
Length = 404
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 69/117 (58%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ E+GL+ E VF F+L R+ A EGG++ GG D + G+ TY PVT++GYWQF + +I
Sbjct: 220 LYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQELIAGEMTYTPVTEQGYWQFAVNNI 279
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
+ V GC AI D+GTSL+A P+ ++N+ IGG +G C V S
Sbjct: 280 ---TWNGTVISSGCQAIADTGTSLIAAPSAAYIQLNNLIGGILIQGDYYVPCSTVSS 333
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ + C + ++P ++ IG F L P YI EG C+S F
Sbjct: 322 GDYYVPCSTVSSLPVLTINIGGTDFYLPPSVYIQTYTEGNYTTCMSTFTDIGTG-----F 376
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G Y++ FD G+ R+GFA A
Sbjct: 377 WILGDVFLGQYYSEFDFGQNRVGFATLA 404
>gi|440905526|gb|ELR55898.1| Gastricsin [Bos grunniens mutus]
Length = 391
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 67/102 (65%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G ++ VFSF+L+ +++GG ++FGGVD + G+ + PVT++ YWQ +
Sbjct: 200 MLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDSCLYTGQIYWAPVTQELYWQIGFEEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG+Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 260 LIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALLQATGAQ 301
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 4/106 (3%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LS + + + + G+ +DC+ I +P ++F I F L P YIL +
Sbjct: 288 QQFLSALLQATGAQEDQYGQFPVDCNNIQNLPTLTFVINGVQFPLPPASYILNNDD---S 344
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
CI G +P G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 345 YCILGVEVTYVPSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATA 390
>gi|431892878|gb|ELK03306.1| Cathepsin E [Pteropus alecto]
Length = 396
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 67/112 (59%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ D + G E++FGG D HF G +VPVTK+GYWQ L I
Sbjct: 204 MMAQNLVDVPMFSVYMSSDLEGGAGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDTI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ AIG E G + EC
Sbjct: 264 QVGG-AVIFCSEGCQAIVDTGTSLITGPSEEIKQLQKAIGAEPTNGEYAVEC 314
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++CD + MP+V+FTI + L P Y L E C SGF D+ PP GPL
Sbjct: 308 GEYAVECDNLNVMPDVTFTINGVPYTLQPTAYTLPDSVDETEFCFSGFQGLDIQPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|332247693|ref|XP_003272996.1| PREDICTED: cathepsin E [Nomascus leucogenys]
Length = 396
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 68/112 (60%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV +FS +++ +P+ G E++FGG D HF G +VPVTK+ YWQ L +I
Sbjct: 204 MMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNI 263
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G + C GC AIVD+GTSL+ GP+ + ++ + IG +G + EC
Sbjct: 264 QVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNTIGAAPVDGEYAVEC 314
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|195134378|ref|XP_002011614.1| GI11124 [Drosophila mojavensis]
gi|193906737|gb|EDW05604.1| GI11124 [Drosophila mojavensis]
Length = 373
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 60/85 (70%), Gaps = 3/85 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV E VFSF+L RD + EGGE++FGG D + G+ TYVP++++GYWQF + I
Sbjct: 196 MVSQGLVDESVFSFYLARDGTSNEGGELIFGGSDSSLYTGELTYVPISQQGYWQFAVDSI 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLL 85
I Q+ +C+ C AI D+GTSLL
Sbjct: 256 SIDGQT--LCD-NCQAIADTGTSLL 277
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 55/100 (55%), Gaps = 13/100 (13%)
Query: 198 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF-- 255
++ ++L N + ++DC + +MP ++ IG F L P QYI+++ C SGF
Sbjct: 285 DILNNLLNVDEDGLVDCSAVDSMPVLNLNIGGTKFTLEPAQYIIQSDGD----CQSGFEF 340
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D WILGDVF+G Y+T FD G RIGFA A
Sbjct: 341 MGTDF-------WILGDVFIGKYYTEFDLGNNRIGFAPVA 373
>gi|119567604|gb|ABL84270.1| aspartic protease [Musca domestica]
Length = 379
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 68/119 (57%), Gaps = 8/119 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGLV +FSF LNRD + +GG+++ GGVD + G TYVPV+ +GYWQFE+
Sbjct: 197 MFAQGLVDANMFSFLLNRDGTSTDGGQMILGGVDSSLYTGDITYVPVSSQGYWQFEVTSG 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-----EGVVSAECKLVVS 114
I QS +C+ C AI D+GTSL+ P+ +N IG +G +C V S
Sbjct: 257 AIKGQS--ICD-NCQAIADTGTSLIVAPSDAYNTLNAEIGATYNEDDGNYYVDCSAVDS 312
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 48/89 (53%), Gaps = 13/89 (14%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF--MAFDLPPPRG 265
G +DC + ++P+V+F IG F L YI+ + C+S F M D
Sbjct: 301 GNYYVDCSAVDSLPDVTFVIGGTTFTLPASAYIVT----VDGNCMSSFTYMGTDF----- 351
Query: 266 PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+G Y+TVFD R+GFAEA
Sbjct: 352 --WILGDVFIGKYYTVFDFANNRVGFAEA 378
>gi|226821415|gb|ACO82383.1| cathepsin D [Lutjanus argentimaculatus]
Length = 166
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 58/82 (70%), Gaps = 1/82 (1%)
Query: 6 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 65
L+ + +FSF+++RDP A GGE++ GG DP+++ G YV VT+K YWQ ++ + +G Q
Sbjct: 86 LLPQNIFSFYISRDPTAAVGGELILGGTDPQYYTGDLHYVNVTRKAYWQIQMNGVAVGEQ 145
Query: 66 STGVCEGGCAAIVDSGTSLLAG 87
T +C G C AIVD+GTSL+ G
Sbjct: 146 LT-LCXGSCQAIVDTGTSLIVG 166
>gi|4099023|gb|AAD00524.1| aspartic protease [Onchocerca volvulus]
Length = 422
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 58/87 (66%), Gaps = 1/87 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q ++ + VF+FWL+R+P E GGEI GG+D F TY PV++ GYWQF++ I
Sbjct: 228 MISQKVLQQPVFAFWLDRNPSDEVGGEITLGGIDTNRFVSPITYTPVSRHGYWQFKMDSI 287
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAG 87
+++ G C GC AI D+GTSL+AG
Sbjct: 288 QGKDEAIG-CANGCQAIADTGTSLIAG 313
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 64/106 (60%), Gaps = 1/106 (0%)
Query: 189 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 248
K K++ + N L ++ P GE II C ++ ++P ++F I K + L Y+ +
Sbjct: 315 KVKLIKFSNILVLNMCMP-GEYIIPCYKVSSLPEITFVIAGKSYTLKGSDYVFECNNKGK 373
Query: 249 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+C+SG M DLP G LWILGDVF+G Y+TVFD G +IGFA+A
Sbjct: 374 SICLSGSMGIDLPERLGELWILGDVFIGRYYTVFDVGNSQIGFAQA 419
>gi|34740274|dbj|BAC87742.1| pepsinogen [Paralichthys olivaceus]
Length = 377
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 73/118 (61%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LVS+ +FS +L+ + ++G E+VFGG+D H+ G+ +++P+T YWQ ++ +
Sbjct: 195 MIKQNLVSQPLFSVYLSSN--NQQGSEVVFGGIDGNHYTGQVSWIPLTSATYWQIKMDSV 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
I Q T C GGC AI+D+GTSL+ GPT + +N +G G + C+ + S
Sbjct: 253 TINGQ-TVACSGGCQAIIDTGTSLIVGPTNDINNMNSWVGASTNQYGEATVNCQNIQS 309
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 63/131 (48%), Gaps = 10/131 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G + CS A++ L T + ++ +N + N GE+ ++C I +MP V
Sbjct: 256 GQTVACSGGCQAIIDTGTSLIVGPTND--INNMNSWVGASTNQYGEATVNCQNIQSMPEV 313
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+FT+ K F + Y+ ++ G +GF LWILGDVF+ Y+T+FD
Sbjct: 314 TFTLNGKAFTVPASAYVSQSYYGRN----TGFGQ----GGTDQLWILGDVFIREYYTIFD 365
Query: 284 SGKLRIGFAEA 294
+ IG A++
Sbjct: 366 AQSKYIGLAKS 376
>gi|196015458|ref|XP_002117586.1| hypothetical protein TRIADDRAFT_61606 [Trichoplax adhaerens]
gi|190579908|gb|EDV19996.1| hypothetical protein TRIADDRAFT_61606 [Trichoplax adhaerens]
Length = 397
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 65/104 (62%), Gaps = 1/104 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+ ++VFSF+ +RD + GGEI+FGG D ++++GK Y V KG W ++
Sbjct: 195 MLAQDLIKKKVFSFYFSRDEEGTAGGEIIFGGSDSRYYEGKFHYTNVIHKGSWIIKVDSG 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 104
+ N+ C GC AI+++GTSL+ GP+ + I HAIG + +
Sbjct: 255 TV-NRGVKFCTHGCTAIIETGTSLIFGPSKDIQRIQHAIGAQKI 297
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G++ IDC RI ++P ++FTI + L PE Y+ + + CISGF+ +
Sbjct: 299 GQNFIDCTRIKSLPKITFTIDKIRYTLDPEHYVHQYTLKGNKHCISGFLELE---EEEDT 355
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAE 293
WI GDVF+ Y+T FD GK RIGFA+
Sbjct: 356 WIFGDVFLRSYYTEFDVGKDRIGFAK 381
>gi|329665035|ref|NP_001192720.1| gastricsin precursor [Bos taurus]
Length = 391
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 67/102 (65%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G ++ VFSF+L+ +++GG ++FGGVD + G+ + PVT++ YWQ +
Sbjct: 200 MLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDNCLYTGQIYWAPVTQELYWQIGFEEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG+Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 260 LIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALLQATGAQ 301
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 4/106 (3%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LS + + + + G+ +DC+ I +P ++ I F L P YIL +
Sbjct: 288 QQFLSALLQATGAQEDQYGQFPVDCNNIQNLPTLTLVINGVQFPLPPASYILNNDD---S 344
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
CI G +P G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 345 YCILGVEVTYVPSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATA 390
>gi|296474377|tpg|DAA16492.1| TPA: progastricsin (pepsinogen C) [Bos taurus]
Length = 421
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 67/102 (65%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G ++ VFSF+L+ +++GG ++FGGVD + G+ + PVT++ YWQ +
Sbjct: 200 MLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDNCLYTGQIYWAPVTQELYWQIGFEEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG+Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 260 LIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALLQATGAQ 301
>gi|67003878|gb|AAY60835.1| pepsinogen A [Melanogrammus aeglefinus]
Length = 269
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 77/133 (57%), Gaps = 13/133 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MVEQ LV++++FS +L+R +EG +VFGGVD G+ T++P+T YWQ ++ +
Sbjct: 109 MVEQNLVAQQMFSVYLSRG--GQEGSVVVFGGVDESLMSGQITWIPLTSATYWQIKMDSV 166
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I N + C GGC AI+D+GTSL+ GP+ ++ +N +G +QYGD
Sbjct: 167 TI-NGNPVACNGGCQAIIDTGTSLIVGPSSDISNMNSWVGAS----------TNQYGDAT 215
Query: 121 WDLLVSGLLPEKV 133
+ G +PE V
Sbjct: 216 VNCQNIGSMPEVV 228
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 9/82 (10%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
+S +N + N G++ ++C I +MP V FT+ F + Y+ + G C
Sbjct: 197 ISNMNSWVGASTNQYGDATVNCQNIGSMPEVVFTLSGHSFTVPASAYVSQYSYG----CQ 252
Query: 253 SGFMAFDLPPPRGPLWILGDVF 274
+GF LWILGDVF
Sbjct: 253 TGFGG-----SNEQLWILGDVF 269
>gi|307167891|gb|EFN61280.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 431
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 71/152 (46%), Gaps = 40/152 (26%)
Query: 1 MVEQGLVSEEVFSFWLNR------------------------------------DPDAEE 24
+++Q LV + VFSF+LNR DP A+
Sbjct: 199 LIKQKLVPQPVFSFYLNRHVFSYSIFKSISNKYIYNKKKYIYIAILKRIYNVYRDPSAKV 258
Query: 25 GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG----NQSTGVCEGGCAAIVDS 80
GGE++ GG DP ++ G YV VTKKGYWQF + + I N+ +C GGC AI D+
Sbjct: 259 GGELILGGSDPAYYTGHFKYVDVTKKGYWQFLMDRVRITRTKFNKGRTLCMGGCQAIADT 318
Query: 81 GTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 112
G SL+ GPT + IN IG + ++
Sbjct: 319 GMSLIVGPTSEIDIINKYIGANKTTDSSGNII 350
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 58/98 (59%), Gaps = 7/98 (7%)
Query: 197 NELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM 256
N+ DS N + ++++C+ I +P + F +G K F L+ YILK E C SGF+
Sbjct: 340 NKTTDSSGNII--NVVNCNTIHKLPIIRFILGGKRFPLNSNNYILKNTEYGITTCTSGFV 397
Query: 257 AFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
PLWILGDVF+G Y+T FD GK R+GFA++
Sbjct: 398 G-----SNSPLWILGDVFIGRYYTEFDLGKNRVGFAQS 430
>gi|194218276|ref|XP_001501986.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 69/110 (62%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPVT +GYWQ + I I
Sbjct: 203 DQGLVSQDLFSVYLSSD--DESGSVVMFGGIDSSYYTGSLHWVPVTTEGYWQIAVDSITI 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+S C GGC AIVD+GTSLLAGPT + I IG GE V+S
Sbjct: 261 NGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLLGEEVIS 309
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + + SYI D L GE +I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLL----GEEVISCSAIDSLP 317
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FT+ F L P YILK + CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTMNGVEFPLPPSAYILKEDDS----CISGFEGVDLDTSSGELWILGDVFIRQYFTV 373
Query: 282 FDSGKLRIGFAEAA 295
FD ++G A A
Sbjct: 374 FDRANNQVGLAPVA 387
>gi|118344566|ref|NP_001072055.1| nothepsin precursor [Takifugu rubripes]
gi|55771088|dbj|BAD69804.1| nothepsin [Takifugu rubripes]
Length = 414
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 65/100 (65%), Gaps = 2/100 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGE-GIAEVCISG 254
+ +L + P +G + DC R+ ++P V+F +G + + L+PE+YI + G E C SG
Sbjct: 310 LQQLIGATPTNIG-VVTDCVRLSSLPRVTFVLGGEEYTLTPERYIRRVEMLGDKEFCFSG 368
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
F A D+ P+GPLWILGDVF+ Y++VFD G RIGFA A
Sbjct: 369 FQAADILSPKGPLWILGDVFLTQYYSVFDRGHDRIGFALA 408
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 62/104 (59%), Gaps = 4/104 (3%)
Query: 1 MVEQGLVSEEVFSFWLN---RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 57
M+ Q +V E +FSF+L+ R ++ GE++ GG+D F G ++PVT KGYWQ ++
Sbjct: 214 MLAQQMVEEPIFSFYLSKYERFSGSKLQGELLLGGMDQDLFTGPINWLPVTTKGYWQIKV 273
Query: 58 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+ + T C GC AIVD+GTSL+AGPT + + IG
Sbjct: 274 DSVAVQGVDT-FCPEGCQAIVDTGTSLIAGPTRDILRLQQLIGA 316
>gi|28573989|ref|NP_787961.1| CG33128 [Drosophila melanogaster]
gi|7296076|gb|AAF51371.1| CG33128 [Drosophila melanogaster]
gi|17862850|gb|AAL39902.1| LP12231p [Drosophila melanogaster]
gi|220956466|gb|ACL90776.1| CG33128-PA [synthetic construct]
Length = 405
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ E+GL+ E VF F+L R+ A +GG++ GG D G+ TY PVT++GYWQF + +I
Sbjct: 221 LYEEGLIDEPVFGFYLARNGSAVDGGQLTLGGTDQNLIAGEMTYTPVTQQGYWQFAVNNI 280
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
+ V GC AI D+GTSL+A P+ ++N+ IGG +G C V S
Sbjct: 281 ---TWNGTVISSGCQAIADTGTSLIAAPSAAYIQLNNLIGGVPIQGDYYVPCSTVSS 334
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 50/100 (50%), Gaps = 6/100 (6%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+N L +P G+ + C + ++P ++ IG F L P YI EG C+S F
Sbjct: 312 LNNLIGGVP-IQGDYYVPCSTVSSLPVLTINIGGTNFYLPPSVYIQTYTEGNYTTCMSTF 370
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G Y++ FD G+ R+GFA A
Sbjct: 371 TDIGTG-----FWILGDVFLGQYYSEFDFGQNRVGFATLA 405
>gi|195161645|ref|XP_002021673.1| GL26637 [Drosophila persimilis]
gi|194103473|gb|EDW25516.1| GL26637 [Drosophila persimilis]
Length = 387
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 69/119 (57%), Gaps = 9/119 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGL+ E V SF+L R A +GGE++ GGVDP + G TYVPV+ GYWQF++ +
Sbjct: 199 MIAQGLLDEPVISFYLQRQGTAVQGGELILGGVDPSLYTGNLTYVPVSVAGYWQFKVNSV 258
Query: 61 LIGNQSTGVCE-GGCAAIVDSGTSLLAGPTPVVTEINHAIG----GEGVVSAECKLVVS 114
+S G+ GC AI D+GTSL+ P +IN +G GEG +C V S
Sbjct: 259 ----KSGGILLCSGCQAIADTGTSLIVVPEAAYAKINSLLGATDNGEGEAFVKCADVSS 313
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 70/137 (51%), Gaps = 7/137 (5%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+V +G +CS C+ A+ L E + IN L + N GE+ + C +
Sbjct: 256 NSVKSGGILLCSGCQ-AIADTGTSLIV--VPEAAYAKINSLLGATDNGEGEAFVKCADVS 312
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
++P V+ IG IF L+P+ Y++K E C+S F + LWILGDVF+G +
Sbjct: 313 SLPKVNLNIGGTIFTLAPKDYVVKLTEAGQTRCMSSFTSMS----GNTLWILGDVFIGKF 368
Query: 279 HTVFDSGKLRIGFAEAA 295
+TVFD G IGFA A
Sbjct: 369 YTVFDKGNNTIGFARVA 385
>gi|297662235|ref|XP_002809619.1| PREDICTED: renin [Pongo abelii]
Length = 406
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G YV + K G WQ ++
Sbjct: 213 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYVNLIKTGVWQIQMK 272
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 273 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 315
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|73621391|sp|Q9GMY4.1|PEPC_SORUN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798664|dbj|BAB11754.1| pepsinogen C [Sorex unguiculatus]
Length = 389
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 64/102 (62%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++ G + VFSF+L+ +++GG +VFGGVD + G+ + PVT++ YWQ +
Sbjct: 199 MLQAGALDSPVFSFYLSNQQSSKDGGAVVFGGVDNSLYTGQIFWTPVTQELYWQIGVEQF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 259 LIGGQATGWCSQGCQAIVDTGTSLLTVPQQYLSALQQATGAQ 300
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LS + + + + G+ +++C+ I +P ++F I F L P Y+L
Sbjct: 287 QQYLSALQQATGAQLDQDGQMVVNCNNIQNLPTLTFVINGVQFPLLPSAYVLNNNG---- 342
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP P G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 YCTLGVEPTYLPSPTGQPLWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|328771090|gb|EGF81130.1| hypothetical protein BATDEDRAFT_16209 [Batrachochytrium
dendrobatidis JAM81]
Length = 400
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 75/143 (52%), Gaps = 5/143 (3%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
VE E V+ G + A + + L E IN + N G+ I+DC
Sbjct: 260 VELEGVTIGGKKLAIKSSRAAIDTGSSLFALPVAEA--DAINGILGGKKNWNGQFIVDCA 317
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGI---AEVCISGFMAFDLPPPRGPLWILGD 272
I ++P ++ G + F ++ YIL+ G + CISGFM D+P P GPLWI+GD
Sbjct: 318 TIDSLPELTLQFGGQKFVITGSDYILQVSAGPVGGGDQCISGFMGLDIPAPAGPLWIVGD 377
Query: 273 VFMGVYHTVFDSGKLRIGFAEAA 295
VF+ ++T++D G R+GFAEAA
Sbjct: 378 VFLRKFYTIYDVGNARVGFAEAA 400
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 65/137 (47%), Gaps = 21/137 (15%)
Query: 28 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 87
IVFG V+ HFKG T+VPV +K YW+ EL + IG + + A +D+G+SL A
Sbjct: 233 IVFGAVNHDHFKGAVTWVPVVRKAYWEVELEGVTIGGKKLAIKS--SRAAIDTGSSLFAL 290
Query: 88 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 147
P IN +GG+ + G I D LPE Q F G ++
Sbjct: 291 PVAEADAINGILGGKK----------NWNGQFIVDCATIDSLPELTLQ------FGGQKF 334
Query: 148 VSTGIKTVVEKENVSAG 164
V TG +++ VSAG
Sbjct: 335 VITGSDYILQ---VSAG 348
>gi|99031884|pdb|2BKS|A Chain A, Crystal Structure Of Renin-Pf00074777 Complex
gi|99031885|pdb|2BKS|B Chain B, Crystal Structure Of Renin-Pf00074777 Complex
gi|99031886|pdb|2BKT|A Chain A, Crystal Structure Of Renin-Pf00257567 Complex
gi|99031887|pdb|2BKT|B Chain B, Crystal Structure Of Renin-Pf00257567 Complex
gi|119390207|pdb|2IKO|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390208|pdb|2IKO|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390209|pdb|2IKU|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitors
gi|119390210|pdb|2IKU|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitors
gi|119390211|pdb|2IL2|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390212|pdb|2IL2|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|151568107|pdb|2V0Z|C Chain C, Crystal Structure Of Renin With Inhibitor 10 (Aliskiren)
gi|151568108|pdb|2V0Z|O Chain O, Crystal Structure Of Renin With Inhibitor 10 (Aliskiren)
gi|151568109|pdb|2V10|C Chain C, Crystal Structure Of Renin With Inhibitor 9
gi|151568110|pdb|2V10|O Chain O, Crystal Structure Of Renin With Inhibitor 9
gi|151568111|pdb|2V11|C Chain C, Crystal Structure Of Renin With Inhibitor 6
gi|151568112|pdb|2V11|O Chain O, Crystal Structure Of Renin With Inhibitor 6
gi|151568113|pdb|2V12|C Chain C, Crystal Structure Of Renin With Inhibitor 8
gi|151568114|pdb|2V12|O Chain O, Crystal Structure Of Renin With Inhibitor 8
gi|157830213|pdb|1BBS|A Chain A, X-Ray Analyses Of Peptide Inhibitor Complexes Define The
Structural Basis Of Specificity For Human And Mouse
Renins
gi|157830214|pdb|1BBS|B Chain B, X-Ray Analyses Of Peptide Inhibitor Complexes Define The
Structural Basis Of Specificity For Human And Mouse
Renins
gi|157833710|pdb|1RNE|A Chain A, The Crystal Structure Of Recombinant Glycosylated Human
Renin Alone And In Complex With A Transition State
Analog Inhibitor
gi|157836332|pdb|2REN|A Chain A, Structure Of Recombinant Human Renin, A Target For
Cardiovascular- Active Drugs, At 2.5 Angstroms
Resolution
gi|193885216|pdb|2V13|A Chain A, Crystal Structure Of Renin With Inhibitor 7
gi|193885217|pdb|2V16|C Chain C, Crystal Structure Of Renin With Inhibitor 3
gi|193885218|pdb|2V16|O Chain O, Crystal Structure Of Renin With Inhibitor 3
gi|242556522|pdb|3G72|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556523|pdb|3G72|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|308388162|pdb|3OQF|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazine
Inhibitor Complexes
gi|308388163|pdb|3OQF|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazine
Inhibitor Complexes
gi|310689956|pdb|3OOT|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689957|pdb|3OOT|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689958|pdb|3OQK|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689959|pdb|3OQK|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|342350963|pdb|3Q3T|A Chain A, Alkyl Amine Renin Inhibitors: Filling S1 From S3
gi|342350964|pdb|3Q3T|B Chain B, Alkyl Amine Renin Inhibitors: Filling S1 From S3
gi|345110923|pdb|3SFC|A Chain A, Structure-Based Optimization Of Potent 4- And
6-Azaindole-3- Carboxamides As Renin Inhibitors
gi|345110924|pdb|3SFC|B Chain B, Structure-Based Optimization Of Potent 4- And
6-Azaindole-3- Carboxamides As Renin Inhibitors
gi|358439749|pdb|3Q4B|A Chain A, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439750|pdb|3Q4B|B Chain B, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439751|pdb|3Q5H|A Chain A, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439752|pdb|3Q5H|B Chain B, Clinically Useful Alkyl Amine Renin Inhibitors
gi|400261138|pdb|3VSW|A Chain A, Human Renin In Complex With Compound 8
gi|400261139|pdb|3VSW|B Chain B, Human Renin In Complex With Compound 8
gi|400261140|pdb|3VSX|A Chain A, Human Renin In Complex With Compound 18
gi|400261141|pdb|3VSX|B Chain B, Human Renin In Complex With Compound 18
gi|430800765|pdb|3VYD|A Chain A, Human Renin In Complex With Inhibitor 6
gi|430800766|pdb|3VYD|B Chain B, Human Renin In Complex With Inhibitor 6
gi|430800767|pdb|3VYE|A Chain A, Human Renin In Complex With Inhibitor 7
gi|430800768|pdb|3VYE|B Chain B, Human Renin In Complex With Inhibitor 7
gi|430800769|pdb|3VYF|A Chain A, Human Renin In Complex With Inhibitor 9
gi|430800770|pdb|3VYF|B Chain B, Human Renin In Complex With Inhibitor 9
gi|449802496|pdb|4GJ8|A Chain A, Crystal Structure Of Renin In Complex With Pkf909-724
(compound 3)
gi|449802497|pdb|4GJ8|B Chain B, Crystal Structure Of Renin In Complex With Pkf909-724
(compound 3)
gi|449802498|pdb|4GJ9|A Chain A, Crystal Structure Of Renin In Complex With Gp055321
(compound 4)
gi|449802499|pdb|4GJ9|B Chain B, Crystal Structure Of Renin In Complex With Gp055321
(compound 4)
gi|449802500|pdb|4GJA|A Chain A, Crystal Structure Of Renin In Complex With Nvp-ayl747
(compound 5)
gi|449802501|pdb|4GJA|B Chain B, Crystal Structure Of Renin In Complex With Nvp-ayl747
(compound 5)
gi|449802502|pdb|4GJB|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bbv031
(compound 6)
gi|449802503|pdb|4GJB|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bbv031
(compound 6)
gi|449802504|pdb|4GJC|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bch965
(compound 9)
gi|449802505|pdb|4GJC|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bch965
(compound 9)
gi|449802506|pdb|4GJD|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bgq311
(compound 12)
gi|449802507|pdb|4GJD|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bgq311
(compound 12)
Length = 340
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 147 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 206
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 207 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 249
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 237 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 296
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 297 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 339
>gi|309319873|pdb|2X0B|A Chain A, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319875|pdb|2X0B|C Chain C, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319877|pdb|2X0B|E Chain E, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319879|pdb|2X0B|G Chain G, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
Length = 383
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 190 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 249
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 250 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 292
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 280 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 339
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 340 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 382
>gi|35952|emb|CAA24937.1| unnamed protein product [Homo sapiens]
Length = 300
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 106 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 165
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 166 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 208
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 49/86 (56%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
+ ++ C+ PT+P++SF +G K + L+ YI + ++C A D+PPP GP W
Sbjct: 214 DYVVKCNEGPTLPDISFHLGGKEYTLTSADYIFQESYSSKKLCTLAIHAMDIPPPTGPTW 273
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
LG F+ ++T FD RIGFA A
Sbjct: 274 ALGATFIRKFYTEFDRRNNRIGFALA 299
>gi|195159708|ref|XP_002020720.1| GL15705 [Drosophila persimilis]
gi|194117670|gb|EDW39713.1| GL15705 [Drosophila persimilis]
Length = 408
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 70/121 (57%), Gaps = 10/121 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+V QGL+ VFS +LNR+ +A +GGE+V GG+D F G TYVPV+++GYWQF +
Sbjct: 222 LVAQGLIKHPVFSIYLNRNGTNATDGGELVLGGIDATLFSGCLTYVPVSQQGYWQFVMTS 281
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLVV 113
++G ++ C C AI+D GTSLL PT + +IN + GV C +
Sbjct: 282 AVLGGKT--FCT-HCQAILDVGTSLLVAPTAAIKKINQLLAVLNPKDASGVFLVNCSTIA 338
Query: 114 S 114
S
Sbjct: 339 S 339
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 69/134 (51%), Gaps = 15/134 (11%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES---IIDCDRIPTM 220
G C+ C+ A++ V L T + IN+L L NP S +++C I ++
Sbjct: 285 GGKTFCTHCQ-AILDVGTSLLVAPTA--AIKKINQLLAVL-NPKDASGVFLVNCSTIASL 340
Query: 221 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 280
P + FTI K F L P YIL+ GE C+S F + LWILG+VFMG Y+T
Sbjct: 341 PTMVFTIARKEFPLQPSDYILQYGE----TCVSSFTSL----AGSDLWILGEVFMGAYYT 392
Query: 281 VFDSGKLRIGFAEA 294
V+D G +IG A A
Sbjct: 393 VYDMGYNQIGLATA 406
>gi|110590169|pdb|2G24|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590170|pdb|2G24|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590171|pdb|2G26|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590172|pdb|2G26|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590173|pdb|2G27|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590174|pdb|2G27|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591465|pdb|2FS4|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591466|pdb|2FS4|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591524|pdb|2G1N|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591525|pdb|2G1N|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591526|pdb|2G1O|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591527|pdb|2G1O|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591528|pdb|2G1R|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591529|pdb|2G1R|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591530|pdb|2G1S|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591531|pdb|2G1S|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591532|pdb|2G1Y|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591533|pdb|2G1Y|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591534|pdb|2G20|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591535|pdb|2G20|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591536|pdb|2G21|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591537|pdb|2G21|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591538|pdb|2G22|A Chain A, Ketopiperazine-based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591539|pdb|2G22|B Chain B, Ketopiperazine-based Renin Inhibitors: Optimization Of The
"c" Ring
Length = 333
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 140 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 199
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 200 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 242
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 230 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 289
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 290 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 332
>gi|397504905|ref|XP_003823019.1| PREDICTED: renin [Pan paniscus]
Length = 406
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 213 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 272
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 273 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 315
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|1065326|pdb|1HRN|A Chain A, High Resolution Crystal Structures Of Recombinant Human
Renin In Complex With Polyhydroxymonoamide Inhibitors
gi|1065327|pdb|1HRN|B Chain B, High Resolution Crystal Structures Of Recombinant Human
Renin In Complex With Polyhydroxymonoamide Inhibitors
gi|1310896|pdb|1BIM|A Chain A, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310897|pdb|1BIM|B Chain B, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310898|pdb|1BIL|A Chain A, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310899|pdb|1BIL|B Chain B, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|241913388|pdb|3GW5|A Chain A, Crystal Structure Of Human Renin Complexed With A Novel
Inhibitor
gi|241913389|pdb|3GW5|B Chain B, Crystal Structure Of Human Renin Complexed With A Novel
Inhibitor
gi|283807203|pdb|3KM4|A Chain A, Optimization Of Orally Bioavailable Alkyl Amine Renin
Inhibitors
gi|283807204|pdb|3KM4|B Chain B, Optimization Of Orally Bioavailable Alkyl Amine Renin
Inhibitors
Length = 337
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 144 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 203
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 204 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 246
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 234 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 293
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 294 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 336
>gi|402855684|ref|XP_003892446.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Papio anubis]
Length = 377
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 62/101 (61%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG +++ FSF+ P + GGE++ GGVDP+ + G+ PVT++ YWQ + +
Sbjct: 187 MLQQGQITQPDFSFYFTHQPTRQYGGELILGGVDPQLYSGQIIXTPVTRELYWQIPIEEF 246
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+GNQ+TG+C GC AIV +GT LLA P + A G
Sbjct: 247 AVGNQATGLCSEGCQAIVVTGTFLLAVPQQYMGSFLQATGA 287
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++ C I +MP ++F IG F L P Y+ G + I A LP P G P
Sbjct: 293 GDFVVHCSYIQSMPTITFIIGGAQFPLPPSAYVFNN-NGYCRLRIE---ATXLPLPSGQP 348
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++V+D R+GFA +A
Sbjct: 349 LWILGDVFLKEYYSVYDMANNRLGFAFSA 377
>gi|190613737|pdb|3D91|A Chain A, Human Renin In Complex With Remikiren
gi|190613738|pdb|3D91|B Chain B, Human Renin In Complex With Remikiren
gi|242556515|pdb|3G6Z|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556516|pdb|3G6Z|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556519|pdb|3G70|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556520|pdb|3G70|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|290560276|pdb|3K1W|A Chain A, New Classes Of Potent And Bioavailable Human Renin
Inhibitors
gi|290560277|pdb|3K1W|B Chain B, New Classes Of Potent And Bioavailable Human Renin
Inhibitors
gi|315113750|pdb|3OWN|A Chain A, Potent Macrocyclic Renin Inhibitors
gi|315113751|pdb|3OWN|B Chain B, Potent Macrocyclic Renin Inhibitors
Length = 341
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 147 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 206
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 207 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 249
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 237 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 296
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 297 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 339
>gi|118138205|pdb|2I4Q|A Chain A, Human ReninPF02342674 COMPLEX
gi|118138206|pdb|2I4Q|B Chain B, Human ReninPF02342674 COMPLEX
Length = 336
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 143 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 202
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 203 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 245
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 233 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 292
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 293 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 335
>gi|73621390|sp|Q9GMY3.1|PEPC_RHIFE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798666|dbj|BAB11755.1| pepsinogen C [Rhinolophus ferrumequinum]
Length = 389
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 67/102 (65%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G ++ VFSF+L+ ++ GG ++FGGVD ++G+ + PVT++ YWQ + +
Sbjct: 199 MLQEGALTSPVFSFYLSNQQGSQNGGAVIFGGVDNSLYQGQIYWAPVTQELYWQIGIEEF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 259 LIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 300
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 55/107 (51%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ ++C+ I +P +F I F L P YIL
Sbjct: 287 QQYMSALLQATGAQEDQYGQFFVNCNYIQNLPTFTFIINGVQFPLPPSSYILNNNG---- 342
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 YCTVGVEPTYLPSQNGQPLWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|326933695|ref|XP_003212936.1| PREDICTED: protein phosphatase 1 regulatory subunit 15B-like
[Meleagris gallopavo]
Length = 571
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 54/82 (65%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
++DC+++P +PN+SF +G K + LS Y+L+ + ++C+ D+PPP GPLWIL
Sbjct: 13 VVDCEKVPQLPNISFHLGGKAYTLSGSAYVLRQTQYGEDICVVALSGLDVPPPAGPLWIL 72
Query: 271 GDVFMGVYHTVFDSGKLRIGFA 292
G F+G Y+T FD RIGFA
Sbjct: 73 GASFIGHYYTKFDRRNNRIGFA 94
>gi|444728681|gb|ELW69127.1| Napsin-A [Tupaia chinensis]
Length = 191
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 39/80 (48%), Positives = 53/80 (66%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
++ C IP +P+VSF +G FNL+ + Y+++ +C+SGF A D+PPP GPLWIL
Sbjct: 85 LVRCSDIPKLPSVSFLLGGVWFNLTAQDYVIQVARAGVHLCLSGFQALDMPPPVGPLWIL 144
Query: 271 GDVFMGVYHTVFDSGKLRIG 290
GDVF+G Y TVFD G G
Sbjct: 145 GDVFLGTYVTVFDRGDTNSG 164
>gi|4506475|ref|NP_000528.1| renin preproprotein [Homo sapiens]
gi|57114109|ref|NP_001009122.1| renin precursor [Pan troglodytes]
gi|132326|sp|P00797.1|RENI_HUMAN RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|38503275|sp|P60016.1|RENI_PANTR RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|11118368|gb|AAG30305.1|AF193456_1 renin [Pan troglodytes]
gi|190994|gb|AAA60363.1| renin [Homo sapiens]
gi|337340|gb|AAD03461.1| renin [Homo sapiens]
gi|29126911|gb|AAH47752.1| Renin [Homo sapiens]
gi|49168484|emb|CAG38737.1| REN [Homo sapiens]
gi|54311156|gb|AAH33474.1| Renin [Homo sapiens]
gi|166706825|gb|ABY87560.1| renin [Homo sapiens]
gi|208967276|dbj|BAG73652.1| renin [synthetic construct]
gi|312153236|gb|ADQ33130.1| renin [synthetic construct]
Length = 406
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 213 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 272
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 273 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 315
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|125984612|ref|XP_001356070.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
gi|54644388|gb|EAL33129.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
Length = 387
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 68/118 (57%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGL+ E V SF+L R A +GGE++ GGVDP + G TYVPV+ GYWQF++ +
Sbjct: 199 MIAQGLLDEPVISFYLQRQGTAVQGGELILGGVDPSLYTGNLTYVPVSVAGYWQFKVNSV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----GEGVVSAECKLVVS 114
G +C GC AI D+GTSL+ P +IN +G GEG +C V S
Sbjct: 259 KSGGFL--LCS-GCQAIADTGTSLIVVPEAAYAKINSLLGATDNGEGEAFVKCADVSS 313
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 70/137 (51%), Gaps = 7/137 (5%)
Query: 159 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 218
+V +G +CS C+ A+ L E + IN L + N GE+ + C +
Sbjct: 256 NSVKSGGFLLCSGCQ-AIADTGTSLIV--VPEAAYAKINSLLGATDNGEGEAFVKCADVS 312
Query: 219 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 278
++P V+ IG IF L+P+ Y++K E C+S F LWILGDVF+G +
Sbjct: 313 SLPKVNLNIGGTIFTLAPKDYVVKLTEAGQTRCMSSFTTMS----GNTLWILGDVFIGKF 368
Query: 279 HTVFDSGKLRIGFAEAA 295
+TVFD G RIGFA A
Sbjct: 369 YTVFDKGNNRIGFARVA 385
>gi|291409611|ref|XP_002721072.1| PREDICTED: pepsin-3-like isoform 2 [Oryctolagus cuniculus]
Length = 387
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 70/112 (62%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVSE++FS +L+ D D+ G ++FGGVD ++ G +VPV+ +GYWQ + I
Sbjct: 201 MWNEGLVSEDLFSVYLSSDDDS--GSVVMFGGVDSSYYTGSLNWVPVSYEGYWQITVDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 259 TMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADGCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + + YIL+ + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDA----CISGFEGMNLDTYTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEAA 295
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|291409609|ref|XP_002721071.1| PREDICTED: pepsin-3-like isoform 1 [Oryctolagus cuniculus]
Length = 387
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 70/112 (62%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVSE++FS +L+ D D+ G ++FGGVD ++ G +VPV+ +GYWQ + I
Sbjct: 201 MWNEGLVSEDLFSVYLSSDDDS--GSVVMFGGVDSSYYTGSLNWVPVSYEGYWQITVDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 259 TMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADGCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + + YIL+ + C+SGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDD----CLSGFDGMNLDTSYGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEAA 295
++G A AA
Sbjct: 376 RANNQVGLAAAA 387
>gi|149390983|gb|ABR25509.1| retotransposon sine subfamily [Oryza sativa Indica Group]
Length = 52
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/46 (89%), Positives = 43/46 (93%)
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
VCISGFMAFD+PPPRGPLWILGDVFMG YHTVFD GK RIGFAE+A
Sbjct: 7 VCISGFMAFDVPPPRGPLWILGDVFMGAYHTVFDFGKNRIGFAESA 52
>gi|291409620|ref|XP_002721076.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 69/112 (61%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ L I
Sbjct: 201 MWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 259 TMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADGCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + + YIL+ + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDA----CISGFEGMNLDTYTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEAA 295
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|195386060|ref|XP_002051722.1| GJ17077 [Drosophila virilis]
gi|194148179|gb|EDW63877.1| GJ17077 [Drosophila virilis]
Length = 404
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 70/117 (59%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQGL+ E F F+L R A++GG++V GGVD + F G TYVPV+++GYWQF +
Sbjct: 220 LYEQGLIDEPSFGFYLARTGSAQDGGQLVLGGVDYQLFSGNLTYVPVSQEGYWQFVVTSA 279
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
++ VC C AI D+GTSLLA P T++N IGG +G +C V S
Sbjct: 280 VM--NGFVVCS-NCQAIADTGTSLLACPGSSYTQLNQLIGGYLMDGDYYVDCSTVDS 333
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC + ++P +SF IG IFNL YI E C+S F +
Sbjct: 322 GDYYVDCSTVDSLPVLSFNIGGTIFNLPASAYISSFTENNTTFCMSSFTYINTD-----F 376
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G ++T FD G+ R+GFA A
Sbjct: 377 WILGDVFIGQFYTQFDFGENRVGFAPVA 404
>gi|195433873|ref|XP_002064931.1| GK19045 [Drosophila willistoni]
gi|194161016|gb|EDW75917.1| GK19045 [Drosophila willistoni]
Length = 411
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 69/117 (58%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQGL+ E VF F+L R+ A +GGE++ GG D + G +YVPV+++GYWQF + +I
Sbjct: 227 LYEQGLIDEPVFGFYLGRNGTATDGGELILGGSDESLYVGNLSYVPVSQQGYWQFAVNNI 286
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
T VC+ C AI D+GTSL+A P +++N IG EG C V S
Sbjct: 287 TW--NGTVVCD-NCQAIADTGTSLIACPFSAYSQLNQLIGALYVEGSYYVSCSTVDS 340
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 50/88 (56%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G + C + ++P +SF+IG+ IF L P YI G+G + C+S F
Sbjct: 329 GSYYVSCSTVDSLPVLSFSIGNGIFELPPSAYISTFGDGNSTYCMSTFTYIGT-----DF 383
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G ++T FD G+ R+GFA A
Sbjct: 384 WILGDVFIGQFYTEFDFGENRVGFAPVA 411
>gi|296230510|ref|XP_002760737.1| PREDICTED: renin isoform 1 [Callithrix jacchus]
gi|50401196|sp|Q9TSZ1.1|RENI_CALJA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|6687184|emb|CAB64879.1| preprorenin [Callithrix jacchus]
Length = 400
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 69/104 (66%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QGL+ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + + G WQ +
Sbjct: 207 IISQGLLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIRTGLWQIPMK 266
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 267 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 309
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 297 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 356
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD G RIGFA A
Sbjct: 357 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRGNNRIGFALA 399
>gi|149725191|ref|XP_001501954.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 69/110 (62%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+++ YWQ + I +
Sbjct: 203 DQGLVSQDLFSVYLSSDD--ESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQITVDSITM 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+S C GGC AIVD+GTSLLAGPT + I IG GE V+S
Sbjct: 261 NGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGASEDSSGEAVIS 309
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 68/134 (50%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + + SYI DS GE++I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGASEDS----SGEAVISCSSIYSLP 317
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI F LSP YIL+ + CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTINGVEFPLSPSAYILEEDDS----CISGFEGMDLDTSSGELWILGDVFIRQYFTV 373
Query: 282 FDSGKLRIGFAEAA 295
FD +IG A A
Sbjct: 374 FDRANNQIGLAPVA 387
>gi|291223845|ref|XP_002731921.1| PREDICTED: expressed hypothetical protein-like [Saccoglossus
kowalevskii]
Length = 959
Score = 90.9 bits (224), Expect = 5e-16, Method: Composition-based stats.
Identities = 43/101 (42%), Positives = 64/101 (63%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+SE VFS ++ D + GE++ GG D H+ G+ TY+PVT KGYWQF + I
Sbjct: 769 MLAQHLISEPVFSVYVRGDGNK---GELILGGSDQHHYSGEFTYLPVTIKGYWQFTMDSI 825
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+ ++ + C GC A+VD+GTS++AGP + +N IG
Sbjct: 826 HVYDKPSQYCLDGCQAVVDTGTSVIAGPMEDIETLNTEIGA 866
Score = 71.2 bits (173), Expect = 5e-10, Method: Composition-based stats.
Identities = 33/82 (40%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
+I+C + ++P++SF +G K+F L P YI + G +E+C+S + GP+WIL
Sbjct: 875 VINCHLVDSLPDISFVLGGKLFALEPRDYIEQDNTGDSEICLSNLVGHG--NGIGPIWIL 932
Query: 271 GDVFMGVYHTVFDSGKLRIGFA 292
G VF Y+ FD GK R+GFA
Sbjct: 933 GAVFTRKYYVEFDRGKDRVGFA 954
>gi|193499291|gb|ACF18588.1| pepsinogen A1 precursor [Siniperca scherzeri]
gi|193499301|gb|ACF18593.1| pepsinogen A1 precursor [Siniperca scherzeri]
Length = 378
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 67/103 (65%), Gaps = 3/103 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLV++++FS +L+ ++++G + FGG+DP H+ G T++P++ + YWQ + +
Sbjct: 197 MMNEGLVNQDLFSVYLS--ANSQQGSVVTFGGIDPNHYYGSITWIPLSSELYWQITVDSV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 103
+ Q C GGC AIVD+GTSL+ GP ++ IN +G G
Sbjct: 255 TVNGQVV-ACSGGCQAIVDTGTSLIVGPQSSISNINSGVGASG 296
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +++C+ + MP+V+F I + F L Y+ ++ C +GF L
Sbjct: 299 GDYVVNCNNVAQMPDVTFHIHGQEFTLPASAYVRQSQY---YGCRTGF-----GNGGDSL 350
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++F + +G A +
Sbjct: 351 WILGDVFIRQYYSIFSRAQNMVGLARS 377
>gi|283806592|ref|NP_001164549.1| pepsin II-1 precursor [Oryctolagus cuniculus]
gi|129777|sp|P28712.1|PEPA1_RABIT RecName: Full=Pepsin II-1; AltName: Full=Pepsin A; Flags: Precursor
gi|22218074|dbj|BAC07514.1| pepsinogen II-1 [Oryctolagus cuniculus]
Length = 387
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 70/112 (62%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVSE++FS +L+ + E+G ++FGG+D ++ G +VPV+ +GYWQ + I
Sbjct: 201 MWNEGLVSEDLFSVYLSSN--GEKGSMVMFGGIDSSYYTGSLNWVPVSHEGYWQITMDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
I N T C C A+VD+GTSLLAGPT +++I IG GE ++S
Sbjct: 259 TI-NGETIACADSCQAVVDTGTSLLAGPTSAISKIQSYIGASKNLLGENIIS 309
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 69/134 (51%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G++ C+ AVV L T K+ SYI + N +GE+II C I ++P
Sbjct: 262 GETIACADSCQAVVDTGTSLLAGPTSAISKIQSYIG----ASKNLLGENIISCSAIDSLP 317
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI + + L YILK + C+SGF +L G LWILGDVF+ Y TV
Sbjct: 318 DIVFTINNVQYPLPASAYILKEDDD----CLSGFDGMNLDTSYGELWILGDVFIRQYFTV 373
Query: 282 FDSGKLRIGFAEAA 295
FD ++G A AA
Sbjct: 374 FDRANNQVGLAAAA 387
>gi|193499289|gb|ACF18587.1| pepsinogen A1 precursor [Siniperca chuatsi]
gi|193499299|gb|ACF18592.1| pepsinogen A1 precursor [Siniperca chuatsi]
gi|210076783|gb|ACJ06745.1| pepsinogen [Siniperca chuatsi]
gi|312369796|gb|ADQ74489.1| pepsinogen [Siniperca chuatsi]
Length = 378
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 67/103 (65%), Gaps = 3/103 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLV++++FS +L+ ++++G + FGGVDP H+ G T++P++ + YWQ + +
Sbjct: 197 MMNEGLVNQDLFSVYLS--ANSQQGSVVTFGGVDPNHYYGSITWIPLSSELYWQITVDSV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 103
+ Q C GGC AIVD+GTSL+ GP ++ IN +G G
Sbjct: 255 TVNGQVV-ACSGGCQAIVDTGTSLIVGPQNSISNINSGVGASG 296
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +++C+ I MP+V+F I + F L Y+ ++ C +GF L
Sbjct: 299 GDYVVNCNNIAQMPDVTFHIHGQEFTLPASAYVRQSQY---YGCRTGF-----GNGGDSL 350
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++F + +G A +
Sbjct: 351 WILGDVFIRQYYSIFSRAQNMVGLARS 377
>gi|73621386|sp|Q9GMY8.1|PEPA_SORUN RecName: Full=Pepsin A; Flags: Precursor
gi|9798656|dbj|BAB11750.1| pepsinogen A [Sorex unguiculatus]
Length = 387
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 70/116 (60%), Gaps = 7/116 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGLVS+++FS +L+ + + G ++FGG+D ++ G +VP++ +GYWQ + I
Sbjct: 201 MWNQGLVSQDLFSVYLSSN--DQSGSVVMFGGIDSSYYTGSLNWVPLSSEGYWQITVDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
+ QS C GGC AIVD+GTSLL+GPT + I IG +G ++ C +
Sbjct: 259 TMNGQSI-ACNGGCQAIVDTGTSLLSGPTNAIANIQSKIGASQNSQGQMAVSCSSI 313
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 49/91 (53%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ + C I +P++ FTI + L YIL++ EG C SGF D+P
Sbjct: 301 NSQGQMAVSCSSIKNLPDIVFTINGIQYPLPASAYILQSQEG----CSSGFQGMDIPTSS 356
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 357 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 387
>gi|332248100|ref|XP_003273199.1| PREDICTED: renin isoform 1 [Nomascus leucogenys]
Length = 391
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 198 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 257
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 258 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 300
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 288 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 347
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 348 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 390
>gi|311260416|ref|XP_003128442.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 394
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 74/120 (61%), Gaps = 9/120 (7%)
Query: 4 QGLVSEE-----VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
QGL+ E+ VFSF+L+ +++GGE+V GGVD + G+ + PVT++ YWQ +
Sbjct: 202 QGLLQEDALTSPVFSFYLSNQQSSQDGGELVLGGVDSSLYTGQIYWAPVTQELYWQIGIE 261
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
+ LIG++++G C GC AIVD+GTSLL P ++++ A G E G +CK + S
Sbjct: 262 EFLIGDEASGWCSEGCQAIVDTGTSLLTVPQDYLSDLVQATGAEENEYGEFLVDCKDIQS 321
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 52/92 (56%), Gaps = 5/92 (5%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N GE ++DC I ++P +F I F L P YIL+ +G C+ G +
Sbjct: 307 NEYGEFLVDCKDIQSLPTFTFIINGVEFPLPPSAYILEE-DGF---CMVGVEPTYVSSQN 362
Query: 265 G-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G PLWILGDVF+ Y++VFD G R+GFA AA
Sbjct: 363 GQPLWILGDVFLRSYYSVFDLGNNRVGFATAA 394
>gi|125986537|ref|XP_001357032.1| GA16570 [Drosophila pseudoobscura pseudoobscura]
gi|54645358|gb|EAL34098.1| GA16570 [Drosophila pseudoobscura pseudoobscura]
Length = 408
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+V QGL+ VFS +LNR +A +GGE+V GG+D F G TYVPV+++GYWQF +
Sbjct: 222 LVAQGLIKHPVFSIYLNRSGTNATDGGELVLGGIDATLFSGCLTYVPVSQQGYWQFVMTS 281
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLVV 113
++G ++ C C AI+D GTSLL PT + +IN + GV C +
Sbjct: 282 AVLGGKT--FCT-HCQAILDVGTSLLVAPTAAIKKINQLLAVLNPQDSSGVFLVNCSTIA 338
Query: 114 S 114
S
Sbjct: 339 S 339
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 69/134 (51%), Gaps = 15/134 (11%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES---IIDCDRIPTM 220
G C+ C+ A++ V L T + IN+L L NP S +++C I ++
Sbjct: 285 GGKTFCTHCQ-AILDVGTSLLVAPTA--AIKKINQLLAVL-NPQDSSGVFLVNCSTIASL 340
Query: 221 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 280
P + FTI K F L P YIL+ G+ C+S F + LWILG+VFMG Y+T
Sbjct: 341 PTMVFTIARKEFPLQPSDYILQYGD----TCVSSFTSL----AGSDLWILGEVFMGAYYT 392
Query: 281 VFDSGKLRIGFAEA 294
V+D G +IG A A
Sbjct: 393 VYDMGYNQIGLATA 406
>gi|126310959|ref|XP_001372683.1| PREDICTED: chymosin-like [Monodelphis domestica]
Length = 383
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ + LV++++FS +++RD +G ++ G +DP ++ G +VPVT++GYWQF + I
Sbjct: 202 MMNKNLVAQDLFSVYMSRD---SQGSMLILGAIDPSYYTGSLHWVPVTEQGYWQFSVDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+ Q CEGGC AI+D+GTSLL GP+ + I IG
Sbjct: 259 TVNGQVVA-CEGGCQAILDTGTSLLVGPSYDIANIQSIIGA 298
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE I+C + +MP V I + + L P Y +G+ C SGF + L
Sbjct: 304 GEYDINCSNLSSMPTVVVHINGRQYPLPPSAYT-NQDQGL---CSSGFQS----EGSDQL 355
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y++VFD G R+G A A
Sbjct: 356 WILGDVFIREYYSVFDRGNNRVGLATA 382
>gi|122938522|gb|ABM69085.1| aspartic proteinase AspMD02 [Musca domestica]
Length = 379
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 66/101 (65%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+SE+VFSF+L R ++EGG +V GG D ++++G YVPV+++GYWQFE+ +
Sbjct: 196 MISQHLISEKVFSFYLARRGTSDEGGVMVVGGNDDRYYEGDFHYVPVSEQGYWQFEMAEA 255
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+ +C+ C AI D+GTSL+A PT EI IG
Sbjct: 256 HV--NGVRICD-RCQAIADTGTSLIAVPTDKYEEIQKEIGA 293
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 47/86 (54%), Gaps = 8/86 (9%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E ++DC +I +P V+F +GD F L Y++K+ + C S F W
Sbjct: 301 EYMLDCSKIDDLPVVTFRLGDGTFTLEGRDYVIKSDDN---QCSSAF-----EDGGTDFW 352
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
ILGDVF+G Y+T FD+ R+GFA A
Sbjct: 353 ILGDVFIGKYYTTFDAEHNRVGFALA 378
>gi|2832610|emb|CAA11580.1| cathepsin [Chionodraco hamatus]
Length = 402
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 62/100 (62%), Gaps = 1/100 (1%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMP-NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISG 254
I +L + P GE I+DC R+ P + F +G K + L+ +QYI K G ++C SG
Sbjct: 302 IQQLIGATPTNFGEFIVDCARLSNFPQHQHFVLGGKEYTLTSDQYIRKEMLGDRKLCFSG 361
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
F A D+ GPLWILGDVF+ Y+++FD G+ R+GFA A
Sbjct: 362 FQAVDMISSEGPLWILGDVFLTQYYSIFDRGQDRVGFAIA 401
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/182 (29%), Positives = 88/182 (48%), Gaps = 19/182 (10%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+ + +FSF+L+R + GE++ GG D + + ++PVT K YWQ ++ +
Sbjct: 209 MLAQNLLDKPIFSFYLSRKLNGSPEGELLLGGTDERLYDLPINWLPVTAKAYWQIKIDSV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
++ + C GC AIVD+GTSL+ GPT + +I IG + +G+ I
Sbjct: 269 VVQGVNP-FCPHGCQAIVDTGTSLITGPTDDILDIQQLIGA----------TPTNFGEFI 317
Query: 121 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQ 180
D P+ +G G EY T + KE + GD +C + AV +
Sbjct: 318 VDCARLSNFPQHQHFVLG-----GKEYTLTS-DQYIRKEML--GDRKLCFSGFQAVDMIS 369
Query: 181 NQ 182
++
Sbjct: 370 SE 371
>gi|402857516|ref|XP_003893299.1| PREDICTED: renin [Papio anubis]
gi|62287423|sp|Q6DLS0.1|RENI_MACFA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|50346961|gb|AAT75162.1| renin [Macaca fascicularis]
Length = 406
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 213 ILSQGVLKEDVFSFYYNRDSENAQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 272
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 273 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 315
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|443286990|dbj|BAM76489.1| pepsinogen, partial [Gadus macrocephalus]
Length = 326
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 68/100 (68%), Gaps = 3/100 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLV++++FS +L+++ + +G + FGGVDP H+ G T++P++ + YWQ + +
Sbjct: 145 MMKQGLVTQDMFSVYLSKN--SAQGSVVTFGGVDPHHYSGSITWIPLSSELYWQITVHSV 202
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ + C GGC AIVD+GTSL+ GP ++ IN A+G
Sbjct: 203 SVNGRPV-ACVGGCQAIVDTGTSLIVGPQSDISSINSAVG 241
Score = 44.3 bits (103), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 8/84 (9%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
+++C+ I MP V F I F+L Y+ ++ C SG + LWIL
Sbjct: 250 VVNCNGIQQMPAVVFHINGHKFSLPASAYVRQSKY---YGCRSG-----MASGGSNLWIL 301
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEA 294
GDVF+ Y+++F + +G A A
Sbjct: 302 GDVFIREYYSIFSRAQNMVGLATA 325
>gi|166361873|gb|ABY87035.1| pepsinogen A2 [Epinephelus coioides]
gi|166361877|gb|ABY87037.1| pepsinogen A2 [Epinephelus coioides]
Length = 377
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 65/100 (65%), Gaps = 3/100 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QGLVS+++FS +L+ + + G + FGG+DP H+ G +++P++ + YWQ + +
Sbjct: 197 MMKQGLVSQDLFSVYLSSN--SNRGSVVTFGGIDPNHYSGSISWIPLSSELYWQITVDSV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ Q C GGC AIVD+GTSL+ GP ++ IN +G
Sbjct: 255 TVNGQVV-ACNGGCQAIVDTGTSLIVGPQSSISNINQVVG 293
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+ +S IN++ + G ++ C+ I MP+V+F I + F L YI ++ G
Sbjct: 282 QSSISNINQVVGAYSQN-GNDMVSCNNIGQMPDVTFHIQGQEFTLPSSAYIRQSYYG--- 337
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C SGF LWILGDVF+ Y ++FD G+ R+G A AA
Sbjct: 338 -CHSGF-----GNGGSSLWILGDVFIRQYFSIFDRGQNRVGLARAA 377
>gi|118102563|ref|XP_425832.2| PREDICTED: gastricsin-like [Gallus gallus]
Length = 388
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 60/102 (58%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ + + +FSF+ +R+P GGE+V GGVD + F G + PVT++ YWQ + +
Sbjct: 199 MLEQNQLKQPIFSFYFSRNPTYNYGGELVLGGVDSRLFTGDVVWAPVTQELYWQVAIDEF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG G C GC AIVD+GT LL P ++ A+G +
Sbjct: 259 AIGQSVMGWCSQGCQAIVDTGTFLLTVPQQYLSRFLQAVGAQ 300
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-PLWIL 270
+DC+ I ++P ++F I LSP Y+LK C G LP G PLWIL
Sbjct: 308 VDCNDINSLPTITFIISGARLPLSPSAYVLKNNG----YCTVGIEVTYLPSQNGQPLWIL 363
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEA 294
GDVF+ Y+T+FD RIGFA++
Sbjct: 364 GDVFLKEYYTIFDMAYNRIGFAKS 387
>gi|194854120|ref|XP_001968292.1| GG24793 [Drosophila erecta]
gi|190660159|gb|EDV57351.1| GG24793 [Drosophila erecta]
Length = 404
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 63/101 (62%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ E+GL+ E VF F+L R+ A EGG++ GG D G+ TY PVT++GYWQF + +I
Sbjct: 220 LYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQNLIAGEMTYTPVTQQGYWQFAVNNI 279
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+ + GC AI D+GTSL+A P+ ++N+ IGG
Sbjct: 280 ---TWNGTLISSGCQAIADTGTSLIAVPSAAYIQLNNLIGG 317
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ + C ++P ++ IG F L P YI G C+S F
Sbjct: 322 GDYYVPCSTASSLPVLTLNIGGTDFYLPPSVYIQSYTYGNLTTCMSTFTDIGTG-----F 376
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G Y++ FD G+ R+GFA A
Sbjct: 377 WILGDVFLGQYYSEFDFGQNRVGFASLA 404
>gi|30685656|ref|NP_193936.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332659144|gb|AEE84544.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 354
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 48/104 (46%), Positives = 65/104 (62%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
MV QG +++ VFS WL R ++ E GGE+VFGG+ P HF G HTYV V G + F +
Sbjct: 172 MVFQGKIAKNVFSIWLRRFSNSGEINGGEVVFGGIIPAHFSGDHTYVDVEGPGNF-FAMS 230
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+I +G ++T +C GC AIVDSG+S + P EI+ IG E
Sbjct: 231 NIWVGGKNTNICSSGCKAIVDSGSSNINVPMDSADEIHRYIGVE 274
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 11/86 (12%)
Query: 213 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 272
+C+ T+P+V+FTIG K F L+P YI ++ C S F+ W LG
Sbjct: 276 NCNNFETLPDVTFTIGGKAFVLTPLDYIRRS----RSQCTSKFVG----KTNRSHWTLGI 327
Query: 273 VFMGVYHTVFD---SGKLRIGFAEAA 295
FM V+HTVFD + +++GFA++
Sbjct: 328 PFMRVFHTVFDYQNTLAVKVGFAKST 353
>gi|407726061|dbj|BAM46128.1| pepsinogen C [Cynops pyrrhogaster]
Length = 383
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 67/112 (59%), Gaps = 7/112 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGL+S+ +F F+L + ++ GGE+VFGGVD ++ G+ T+ PVT++ YWQ +
Sbjct: 194 MMNQGLLSQNIFGFYLGQQ-GSQSGGELVFGGVDSNYYTGQITWTPVTQQMYWQIGISGF 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ Q TG C GC IVD+GTSLL P + + IG GE VVS
Sbjct: 253 GVNGQPTGWCGQGCQGIVDTGTSLLTAPGQYIAALMQEIGATQDSNGEYVVS 304
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 52/90 (57%), Gaps = 7/90 (7%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFMAFDLPPPRG- 265
GE ++ C I ++P +SFTIG L P YIL+ GE C G M LP G
Sbjct: 299 GEYVVSCSNIDSLPTLSFTIGGTSLPLPPSAYILQNNGE-----CSVGIMPTYLPSQNGQ 353
Query: 266 PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PLWILGDVF+ Y++++D ++GFA AA
Sbjct: 354 PLWILGDVFLRQYYSIYDVTNNQVGFATAA 383
>gi|340374170|ref|XP_003385611.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 389
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 45/108 (41%), Positives = 68/108 (62%), Gaps = 2/108 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +F +L+ D A GG++ GG + K++ G YVP+T K Y+ ++ +
Sbjct: 200 MISQNLVNAGMFGVFLSADTSASLGGDLNLGGPNEKYYTGDFNYVPLTSKTYYMIKVDGM 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
GN S +C+GGC IVD+GTSL+AGPT VT+I AIG + ++ E
Sbjct: 260 NAGNLS--LCDGGCNGIVDTGTSLIAGPTAEVTKIATAIGAKSTLAGE 305
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 77/149 (51%), Gaps = 8/149 (5%)
Query: 146 EYVSTGIKT--VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL 203
YV KT +++ + ++AG+ ++C +V L T E ++ I +
Sbjct: 242 NYVPLTSKTYYMIKVDGMNAGNLSLCDGGCNGIVDTGTSLIAGPTAE--VTKIATAIGAK 299
Query: 204 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 263
GE IDC ++P++P+V+ TI + + L+ + Y+L + C+ GFM +LP
Sbjct: 300 STLAGEYTIDCTKVPSLPDVTITIAGQKYTLTGKDYVLN----VEGQCLLGFMGINLPDQ 355
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDV + VY+TVFD R+GFA
Sbjct: 356 LKNSWILGDVLIRVYYTVFDYSGGRVGFA 384
>gi|291409613|ref|XP_002721073.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 70/118 (59%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ L I
Sbjct: 201 MWNEGLVSQDLFSVYLSSDD--ESGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
+ + T C GC AIVD+GTSLLAGPT ++ I IG EG + C + S
Sbjct: 259 TMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENYEGEMIVSCSSMYS 315
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADGCQAIVDTGTSLLAGPTS--AISNIQSYIGASENYEGEMIVSCSSMYSLPNI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + + YIL+ VC SGF D+ G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDS----VCTSGFEGMDVDTSTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEAA 295
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|183180254|gb|ACC44394.1| ASP-3 [Caenorhabditis remanei]
gi|183180258|gb|ACC44396.1| ASP-3 [Caenorhabditis remanei]
gi|183180266|gb|ACC44400.1| ASP-3 [Caenorhabditis remanei]
gi|183180268|gb|ACC44401.1| ASP-3 [Caenorhabditis remanei]
Length = 225
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 62/98 (63%), Gaps = 1/98 (1%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
V+ I +P GE ++C +IP +PN++FT+G + F+L + YIL+ G
Sbjct: 128 SDVIKKIQHKIGGIPLFNGEYEVECSKIPKLPNITFTLGGQNFDLQGKDYILQMSNGNGG 187
Query: 250 V-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 286
+ C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 188 MTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHGN 225
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 61/116 (52%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ YW+ +LG
Sbjct: 41 FANSAICPNQLFAFWLSRDANDIANGGEITLCDTDSNHYTGNIAWEPLVSMDYWRIKLGA 100
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ V+ +I H IGG + + E ++ S+
Sbjct: 101 VSI--DGTTYTNGPIDSIVDTGTSLLTGPSDVIKKIQHKIGGIPLFNGEYEVECSK 154
>gi|426333405|ref|XP_004028268.1| PREDICTED: renin [Gorilla gorilla gorilla]
Length = 406
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 70/104 (67%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ ++
Sbjct: 213 IISQGVLKEDVFSFYYNRDSENFQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMK 272
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 273 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 315
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|147905812|ref|NP_001079036.1| gastricsin precursor [Xenopus laevis]
gi|12082174|dbj|BAB20797.1| pepsinogen C [Xenopus laevis]
gi|213625030|gb|AAI69665.1| Pepsinogen C [Xenopus laevis]
gi|213626584|gb|AAI69663.1| Pepsinogen C [Xenopus laevis]
Length = 383
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L+++ +F F+L+ ++ GGE+ FGGVD ++ G+ + PVT + YWQ +
Sbjct: 193 MMQQNLLNQPIFGFYLSGQ-SSQNGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGF 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q+TG C GC AIVD+GTSLL P V + + +IG +
Sbjct: 252 SINGQATGWCSQGCQAIVDTGTSLLTAPQSVFSSLIQSIGAQ 293
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 51/89 (57%), Gaps = 4/89 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++ C I +P +SFTI F L P Y+L+ G C G M LP G P
Sbjct: 298 GQYVVSCSNIQNLPTISFTISGVSFPLPPSAYVLQQSSG---YCTIGIMPTYLPSQNGQP 354
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++V+D G ++GFA AA
Sbjct: 355 LWILGDVFLREYYSVYDLGNNQVGFATAA 383
>gi|291409618|ref|XP_002721075.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 69/112 (61%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVS+++FS +L+ D E+G ++FGG+D ++ G +VPV+ +GYWQ + I
Sbjct: 201 MWNEGLVSQDLFSVYLSSDD--EKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
I N T C C AIVD+GTSLLAGPT ++ I IG GE V+S
Sbjct: 259 SI-NGETIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNLLGENVIS 309
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 64/131 (48%), Gaps = 6/131 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N +GE++I C I ++P++
Sbjct: 262 GETIACADSCQAIVDTGTSLLAGPTS--AISNIQSYIGASKNLLGENVISCSAIDSLPDI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + L YILK + CISG +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILKEDDD----CISGLEGMNLDTSTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEA 294
++G A A
Sbjct: 376 RANNQLGLAAA 386
>gi|183180242|gb|ACC44388.1| ASP-3 [Caenorhabditis remanei]
gi|183180244|gb|ACC44389.1| ASP-3 [Caenorhabditis remanei]
gi|183180246|gb|ACC44390.1| ASP-3 [Caenorhabditis remanei]
gi|183180248|gb|ACC44391.1| ASP-3 [Caenorhabditis remanei]
gi|183180250|gb|ACC44392.1| ASP-3 [Caenorhabditis remanei]
gi|183180252|gb|ACC44393.1| ASP-3 [Caenorhabditis remanei]
gi|183180256|gb|ACC44395.1| ASP-3 [Caenorhabditis remanei]
gi|183180260|gb|ACC44397.1| ASP-3 [Caenorhabditis remanei]
gi|183180262|gb|ACC44398.1| ASP-3 [Caenorhabditis remanei]
gi|183180264|gb|ACC44399.1| ASP-3 [Caenorhabditis remanei]
gi|183180270|gb|ACC44402.1| ASP-3 [Caenorhabditis remanei]
gi|183180272|gb|ACC44403.1| ASP-3 [Caenorhabditis remanei]
Length = 225
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 62/96 (64%), Gaps = 1/96 (1%)
Query: 192 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV- 250
V+ I +P GE ++C +IP +PN++FT+G + F+L + YIL+ G +
Sbjct: 130 VIKKIQHKIGGIPLFNGEYEVECSKIPKLPNITFTLGGQNFDLQGKDYILQMSNGNGGMT 189
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 286
C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 190 CLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHGN 225
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ YW+ +LG
Sbjct: 41 FANSAICPNQLFAFWLSRDANDIANGGEITLCDTDSNHYTGNIAWEPLVSMDYWRIKLGA 100
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GPT V+ +I H IGG + + E ++ S+
Sbjct: 101 VSI--DGTTYTNGPIDSIVDTGTSLLTGPTDVIKKIQHKIGGIPLFNGEYEVECSK 154
>gi|2961350|emb|CAA18108.1| aspartic proteinase like protein [Arabidopsis thaliana]
gi|7269050|emb|CAB79160.1| aspartic proteinase like protein [Arabidopsis thaliana]
Length = 336
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 48/104 (46%), Positives = 65/104 (62%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
MV QG +++ VFS WL R ++ E GGE+VFGG+ P HF G HTYV V G + F +
Sbjct: 172 MVFQGKIAKNVFSIWLRRFSNSGEINGGEVVFGGIIPAHFSGDHTYVDVEGPGNF-FAMS 230
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+I +G ++T +C GC AIVDSG+S + P EI+ IG E
Sbjct: 231 NIWVGGKNTNICSSGCKAIVDSGSSNINVPMDSADEIHRYIGVE 274
>gi|194218273|ref|XP_001501915.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+++ YWQ + I +
Sbjct: 203 DQGLVSQDLFSVYLSSDD--ESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQITVDSITM 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+S C GGC AIVD+GTSLLAGP + I IG GEG +S
Sbjct: 261 NGESI-ACSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDSSGEGAIS 309
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 64/134 (47%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L + + SYI DS GE I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDS----SGEGAISCSSIDSLP 317
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI F L+P YIL+ CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTINGVEFPLTPSAYILEEDGS----CISGFEGMDLDTSSGELWILGDVFIRQYFTV 373
Query: 282 FDSGKLRIGFAEAA 295
FD +IG A A
Sbjct: 374 FDRANNQIGLAPVA 387
>gi|112950081|gb|ABI26643.1| aspartic proteinase [Cucumis sativus]
Length = 399
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 62/117 (52%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LVS VFS + R A GE++FGG D ++G YVPVT++GYWQF + +
Sbjct: 197 MVSQSLVSNPVFSVYFGRSKAANNNGEVIFGGSDSTVYQGPINYVPVTQQGYWQFTMDGV 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
+ Q AI D+GTSLLA PT +N AIG EG +C V S
Sbjct: 257 YVNGQQ---VISSAQAIADTGTSLLAAPTSAFYTLNEAIGATYQEGDYFVDCSSVSS 310
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 49/85 (57%), Gaps = 9/85 (10%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC + ++PN+ F+IG ++L P YI++ I C+S A D
Sbjct: 299 GDYFVDCSSVSSLPNIQFSIGGINYSLPPSAYIVE----IEGECMSATTAMDQEQ----- 349
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G Y+T FD G R+GFA
Sbjct: 350 WILGDVFLGSYYTEFDLGNNRVGFA 374
>gi|301030231|gb|ADK47877.1| cathepsin D [Triatoma infestans]
Length = 390
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 64/104 (61%), Gaps = 2/104 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF-KGKHTYVPVTKKGYWQFELGD 59
M++Q L+ + VFS +LNR+PD E GGEI+FGGVD + + K T VP+T YW F++
Sbjct: 197 MIKQELLDKPVFSVYLNRNPDEEVGGEIIFGGVDEELYNKESMTTVPLTSTSYWMFQMDG 256
Query: 60 ILIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I + T C+ GC I D+GTS + GP+ V EI +G E
Sbjct: 257 ISTSAEDGTSWCQNGCPGIADTGTSFIVGPSSDVDEIMELVGAE 300
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 2/85 (2%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G + CD + +P+++F I K + + E YILK + CI GF LP P
Sbjct: 304 GIGFVSCDDLDKLPDITFHINGKGYTIKAEDYILKVTQAGETACIVGFTT--LPSAPQPF 361
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILGDVF+G +TVF+ + FA
Sbjct: 362 WILGDVFLGKVYTVFNVEDRTVSFA 386
>gi|45384244|ref|NP_990385.1| embryonic pepsinogen precursor [Gallus gallus]
gi|129801|sp|P16476.1|PEPE_CHICK RecName: Full=Embryonic pepsinogen; Flags: Precursor
gi|222853|dbj|BAA00153.1| pepsinogen [Gallus gallus]
Length = 383
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 69/118 (58%), Gaps = 8/118 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV + L+ + +FS +L+R+P G +VFGG+D +F G ++PV+ +GYWQ + I
Sbjct: 202 MVNESLLEQNLFSVYLSREP---MGSMVVFGGIDESYFTGSINWIPVSYQGYWQISMDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
++ Q C GC AI+D+GTSL+AGP + +I A+G G S C +++
Sbjct: 259 IVNKQEIA-CSSGCQAIIDTGTSLVAGPASDINDIQSAVGANQNTYGEYSVNCSHILA 315
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 8/90 (8%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N GE ++C I MP+V F IG + + Y + G+G C+S F
Sbjct: 301 NTYGEYSVNCSHILAMPDVVFVIGGIQYPVPALAYTEQNGQG---TCMSSFQN-----SS 352
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ VY+++FD R+G A+A
Sbjct: 353 ADLWILGDVFIRVYYSIFDRANNRVGLAKA 382
>gi|395534129|ref|XP_003769100.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Sarcophilus
harrisii]
Length = 391
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 63/102 (61%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q +++ +FSF+L + GGE++FGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 MLQQNILTNPIFSFYLGNQQSSXNGGEVIFGGVDNNLYTGQIYWAPVTQELYWQIGIQEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q+TG C GC AIVD+GTSLL P ++ A G +
Sbjct: 258 SIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGAQ 299
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + G+ ++DC+ I ++P +SF I F LSP YIL
Sbjct: 286 QQYMSAFLQATGAQQDQYGQYVVDCNNIQSLPTISFLINGVQFPLSPSAYILNNNG---- 341
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP G PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 342 YCTVGTEPTYLPFQNGQPLWILGDVFLRSYYSVYDMNNNRVGFATAA 388
>gi|360431|prf||1403354A pepsinogen
Length = 383
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 69/118 (58%), Gaps = 8/118 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV + L+ + +FS +L+R+P G +VFGG+D +F G ++PV+ +GYWQ + I
Sbjct: 202 MVNESLLEQNLFSVYLSREP---MGSMVVFGGIDESYFTGSINWIPVSYQGYWQISMDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
++ Q C GC AI+D+GTSL+AGP + +I A+G G S C +++
Sbjct: 259 IVNKQEIA-CSSGCQAIIDTGTSLVAGPASDINDIQSAVGANQNTYGEYSVNCSHILA 315
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 8/90 (8%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N GE ++C I MP+V F IG + + Y + G+G C+S F
Sbjct: 301 NTYGEYSVNCSHILAMPDVVFVIGGIQYPVPALAYTQQNGQG---TCMSSFQN-----SS 352
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ VY+++FD R+G A+A
Sbjct: 353 ADLWILGDVFIRVYYSIFDRANNRVGLAKA 382
>gi|321250483|ref|XP_003191823.1| endopeptidase [Cryptococcus gattii WM276]
gi|317458290|gb|ADV20036.1| Endopeptidase, putative [Cryptococcus gattii WM276]
Length = 432
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 88/167 (52%), Gaps = 9/167 (5%)
Query: 131 EKVCQQIGLCAFNGA-EYVSTGIKTV--VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQ 187
E + I A++G+ YV K VE E++S GD + A + L
Sbjct: 271 EAIFGGIDKSAYSGSLHYVPVRRKGYWEVELESISFGDDELELENTGAAIDTGTSLIVMP 330
Query: 188 TKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGI 247
T V +N+ + + G+ +DC+ +P++P ++FT K + LS E YIL G
Sbjct: 331 TD--VAEMLNKEIGAEKSWNGQYTVDCNTVPSLPELAFTFDGKAYKLSGEDYILNAGG-- 386
Query: 248 AEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
CIS F D+PPP GPL+I+GDVF+ Y+TV+D G+ +GFA++
Sbjct: 387 --TCISSFTGMDIPPPMGPLYIVGDVFLRKYYTVYDLGRNAVGFAKS 431
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 64/118 (54%), Gaps = 8/118 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+ + VFSF L + +GGE +FGG+D + G YVPV +KGYW+ EL I
Sbjct: 247 MLNQDLLDDPVFSFRLGSSEN--DGGEAIFGGIDKSAYSGSLHYVPVRRKGYWEVELESI 304
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
G+ + G A +D+GTSL+ PT V +N IG E G + +C V S
Sbjct: 305 SFGDDELELENTGAA--IDTGTSLIVMPTDVAEMLNKEIGAEKSWNGQYTVDCNTVPS 360
>gi|403294825|ref|XP_003938364.1| PREDICTED: renin [Saimiri boliviensis boliviensis]
Length = 400
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 69/104 (66%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + + G WQ +
Sbjct: 207 IISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIRTGLWQIPMK 266
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 267 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 309
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P+++F +G K + L+ Y+ + ++C
Sbjct: 297 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDIAFHLGGKEYTLTSADYVFQESYSSKKLC 356
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 357 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 399
>gi|326933879|ref|XP_003213025.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 390
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 61/102 (59%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q + + +FSF+ +R+P GGE+V GGVD + F G + PVT++ YWQ + +
Sbjct: 198 MLQQNQLKQPIFSFYFSRNPTYNYGGELVLGGVDSRLFTGDIVWAPVTQELYWQVAIDEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG G C GC AIVD+GT LL P ++ + A+G +
Sbjct: 258 AIGQSVMGWCSQGCQAIVDTGTFLLTVPQQYLSRLLKAVGAQ 299
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-PL 267
E +DC+ + ++P +SF I L+P Y+LK C G LP G PL
Sbjct: 307 EYAVDCNVVHSLPTISFIINGVQLPLTPSAYVLKNNG----YCTVGIEVTYLPSQNGQPL 362
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++FD RIGFA++
Sbjct: 363 WILGDVFLKEYYSIFDMAYNRIGFAKS 389
>gi|332024606|gb|EGI64804.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 785
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 66/110 (60%), Gaps = 12/110 (10%)
Query: 1 MVEQGLVSEEVFSFWLNR---------DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 51
M++QGLV E VFSF++NR + +E GE++ GG DP H+ G+ TYV VT KG
Sbjct: 154 MIKQGLV-EPVFSFYINRHVLFFMHYENSVSEFAGELILGGSDPNHYLGELTYVNVTHKG 212
Query: 52 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
YWQ + I IG+ S +C C AIVD+GT++L G + IN IG
Sbjct: 213 YWQITMDKIQIGHNS--LCSNSCEAIVDTGTTMLIGSRLDIEIINELIGA 260
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 71/145 (48%), Gaps = 18/145 (12%)
Query: 156 VEKENVSAGDSAVCS-ACEMAV-----VWVQNQLKQKQTKEKV-LSYINELCDSLPNPMG 208
+ + + G +++CS +CE V + + ++L + E + +YIN+
Sbjct: 216 ITMDKIQIGHNSLCSNSCEAIVDTGTTMLIGSRLDIEIINELIGATYIND---------- 265
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR-GPL 267
E I+DC+RI +PN+SF IG K F ++ YI E C+S F + + +
Sbjct: 266 EIIVDCNRIHYLPNISFFIGGKSFEVTHADYIYVGTENGTTKCMSAFESLNFDDEYVESI 325
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WILG VF+ Y FD RIGFA
Sbjct: 326 WILGSVFIHRYFIEFDMKNDRIGFA 350
>gi|354493821|ref|XP_003509038.1| PREDICTED: gastricsin-like [Cricetulus griseus]
gi|344238302|gb|EGV94405.1| Gastricsin [Cricetulus griseus]
Length = 391
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 67/102 (65%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++++G +S+ +F +L GG+IVFGGVD + G+ T++PVT++ YWQ + D
Sbjct: 202 LLQEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLYTGEITWIPVTQELYWQITIDDF 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG+Q +G C GCA IVD+GTSLL P+ ++++ IG +
Sbjct: 261 LIGDQVSGWCSQGCAGIVDTGTSLLTMPSQYLSDLLQTIGAQ 302
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
GE + CD + ++P +F + F LSP YIL+ VC+ G + L G
Sbjct: 307 GEYFVSCDSVSSLPTFNFVLNGVEFPLSPSFYILQE----DGVCMVGLESSPLTSESGQS 362
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+WILGDVF+ Y+ +FD G R+GFA A
Sbjct: 363 MWILGDVFLRSYYAIFDMGNNRVGFATA 390
>gi|395756876|ref|XP_003780197.1| PREDICTED: napsin-A-like, partial [Pongo abelii]
Length = 108
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 54/88 (61%), Gaps = 4/88 (4%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
II C IP +P VSF G FNL+ + Y+++T +C+SGF D+PPP GP WIL
Sbjct: 2 IILCSEIPKLPAVSFLFGGVWFNLAGQDYVIQTTRNGVRLCLSGFQDLDVPPPAGPFWIL 61
Query: 271 GDVFMGVYHTVFDSGKL----RIGFAEA 294
GDVF+G Y VFD G + R+G A A
Sbjct: 62 GDVFLGTYVAVFDRGDMKSSARVGLARA 89
>gi|225907444|gb|ACO36148.1| aspartic protease [Perca flavescens]
Length = 283
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 74/131 (56%), Gaps = 2/131 (1%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G S+ C A+V L T + + + +L + P+ + E +IDC R+ ++P+V
Sbjct: 142 GVSSFCPHGCQAIVDTGTSLIAGPTTD--ILKLQQLIGATPSNINEYLIDCARLSSLPHV 199
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+F +G + L+ QY+ K G C SGF A D+ GPLWILGDVF+ ++++FD
Sbjct: 200 TFVLGGIEYTLTAAQYVRKEMLGNRMFCFSGFQAVDMFSADGPLWILGDVFLTEFYSIFD 259
Query: 284 SGKLRIGFAEA 294
G+ R+GFA A
Sbjct: 260 RGQDRVGFASA 270
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 62/101 (61%), Gaps = 2/101 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEG-GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q V E VFSF+L+R + GE++ GG+D + G ++PVT KGYWQ ++
Sbjct: 78 MLAQRTVDEPVFSFYLSRKTSSSNPEGELLLGGIDKLLYSGPINWLPVTAKGYWQIKMDS 137
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ + S+ C GC AIVD+GTSL+AGPT + ++ IG
Sbjct: 138 VAVQGVSS-FCPHGCQAIVDTGTSLIAGPTTDILKLQQLIG 177
>gi|195399277|ref|XP_002058247.1| GJ15982 [Drosophila virilis]
gi|194150671|gb|EDW66355.1| GJ15982 [Drosophila virilis]
Length = 374
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 64/99 (64%), Gaps = 8/99 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV + VFSF+L RD + +GGE++FGG D + G TYVP++++GYWQF +
Sbjct: 197 MVSQGLVDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDLTYVPISEQGYWQFTMAGA 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 99
I QS +C+ C AI D+GTSLL VV+E + I
Sbjct: 257 SIDGQS--LCD-NCQAIADTGTSLL-----VVSEAAYDI 287
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 54/101 (53%), Gaps = 15/101 (14%)
Query: 198 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGF- 255
++ +++ N ++DC + +P ++ IG F L P QYI+++ G+ C S F
Sbjct: 286 DILNNVLNVDENGLVDCSTVDKLPVLNLNIGGGKFTLEPAQYIIQSDGQ-----CQSSFE 340
Query: 256 -MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D WILGDVF+G Y+T FD G RIGFA A
Sbjct: 341 YMGTDF-------WILGDVFIGKYYTEFDLGNNRIGFAPVA 374
>gi|307167627|gb|EFN61155.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 208
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 64/112 (57%), Gaps = 2/112 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q LVS +FSF LNR+ A+ GG+ +FGG DP ++GK TYVPVT +G WQ + I
Sbjct: 21 MIQQRLVSSRIFSFNLNRNVSADFGGKFIFGGSDPAFYEGKITYVPVTHRGLWQITIDSI 80
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 112
+ N + CE C A VD+ T + GP V+ IN I S C +
Sbjct: 81 RLNNFTW--CEKSCQANVDTSTWKIIGPEKDVSLINRLIEINSHGSINCSRI 130
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 59/105 (56%), Gaps = 6/105 (5%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
EK +S IN L + N G I+C RI +P ++F +G K F+L+ + YI++ +
Sbjct: 107 EKDVSLINRLIE--INSHGS--INCSRIFHLPTITFNLGGKAFDLTGKDYIIRVND--KS 160
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+C++ F DL WILG FMG Y+T FD K R+GFA A
Sbjct: 161 ICVTVFWKHDLIYDDKMKWILGIPFMGRYYTEFDMEKNRVGFAFA 205
>gi|291416270|ref|XP_002724368.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 69/112 (61%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVS+++FS +L+ D E+G ++FGG+D ++ G +VPV+ +GYWQ + +
Sbjct: 201 MWNEGLVSQDLFSVYLSSDD--EKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
I N T C C AIVD+GTSLLAGPT ++ I IG GE V+S
Sbjct: 259 SI-NGETIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNLLGENVIS 309
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 64/131 (48%), Gaps = 6/131 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N +GE++I C I ++P++
Sbjct: 262 GETIACADSCQAIVDTGTSLLAGPTS--AISNIQSYIGASKNLLGENVISCSAISSLPDI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + L YILK + CISG +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILKEDDD----CISGLEGMNLDTSTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEA 294
++G A A
Sbjct: 376 RANNQLGLAAA 386
>gi|388326405|pdb|3VCM|A Chain A, Crystal Structure Of Human Prorenin
gi|388326406|pdb|3VCM|B Chain B, Crystal Structure Of Human Prorenin
Length = 335
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 68/102 (66%), Gaps = 4/102 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ E+VFSF+ NRD GG+IV GG DP+H++G Y+ + K G WQ ++ +
Sbjct: 147 IISQGVLKEDVFSFYYNRD---SLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGV 203
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 204 SVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 244
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 232 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 291
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 292 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 334
>gi|327278828|ref|XP_003224162.1| PREDICTED: pepsin A-like isoform 2 [Anolis carolinensis]
Length = 386
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 64/102 (62%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLVS+++FS +L+ D + G ++FGGVD ++ G +VP++ + YWQ L I
Sbjct: 200 MMSEGLVSQDLFSVYLSSDD--QSGSFVMFGGVDTSYYSGSLNWVPLSSESYWQITLDSI 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ QS C GGC AIVD+GTSLLAGP + I + IG
Sbjct: 258 TLNGQSI-ACSGGCQAIVDTGTSLLAGPPNGIANIQYYIGAS 298
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 46/88 (52%), Gaps = 4/88 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G +I C+ + ++P++ FTI F L YIL G C GF D+P G L
Sbjct: 303 GGYMISCNAMNSLPDIIFTINGIEFPLPASAYIL----GQNGYCTPGFEGIDIPTQSGEL 358
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+ Y+ VFD ++G A A
Sbjct: 359 WILGDVFIRQYYCVFDRANNQVGLAPVA 386
>gi|410959160|ref|XP_003986180.1| PREDICTED: gastricsin [Felis catus]
Length = 389
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 67/122 (54%), Gaps = 10/122 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ G++S VFSF+L R + ++G ++FGG+D +G + PVT++ YWQ +
Sbjct: 199 MLRAGVLSSPVFSFYLGRQMNPQKGAVLIFGGIDHSLHRGPIHWAPVTQERYWQIGFEEF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG +TG C GC AIVD+GTSLL P ++ + A G + QYG +
Sbjct: 259 LIGGHATGWCSQGCEAIVDTGTSLLTVPQQYLSYLLQATGAQA----------DQYGQFM 308
Query: 121 WD 122
D
Sbjct: 309 VD 310
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 61/107 (57%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LSY+ + + + G+ ++DC+ + ++P ++F I F+L Y+ + +
Sbjct: 287 QQYLSYLLQATGAQADQYGQFMVDCNNVQSLPTLTFLINRVQFSLPYSSYLFRGND---- 342
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+C A LP G PLWILGDVF+ Y+++FD G R+GFA AA
Sbjct: 343 ICAIRVQATYLPSSSGQPLWILGDVFLRSYYSIFDIGNKRVGFAVAA 389
>gi|327278826|ref|XP_003224161.1| PREDICTED: pepsin A-like isoform 1 [Anolis carolinensis]
Length = 387
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 64/102 (62%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLVS+++FS +L+ D + G ++FGGVD ++ G +VP++ + YWQ L I
Sbjct: 200 MMSEGLVSQDLFSVYLSSDD--QSGSFVMFGGVDTSYYSGSLNWVPLSSESYWQITLDSI 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ QS C GGC AIVD+GTSLLAGP + I + IG
Sbjct: 258 TLNGQSI-ACSGGCQAIVDTGTSLLAGPPNGIANIQYYIGAS 298
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 46/88 (52%), Gaps = 3/88 (3%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G +I C+ + ++P++ FTI F L YI + G C GF D+P G L
Sbjct: 303 GGYMISCNAMNSLPDIIFTINGIEFPLPASAYIRQGQNG---YCTPGFEGIDIPTQSGEL 359
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+ Y+ VFD ++G A A
Sbjct: 360 WILGDVFIRQYYCVFDRANNQVGLAPVA 387
>gi|22218078|dbj|BAC07516.1| pepsinogen III [Oryctolagus cuniculus]
Length = 387
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 68/112 (60%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ L I
Sbjct: 201 MWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ + T C C AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 259 TMDGE-TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADSCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + + YIL+ + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDA----CISGFEGMNLDTYTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEAA 295
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|283806594|ref|NP_001164550.1| pepsin-3 precursor [Oryctolagus cuniculus]
gi|129783|sp|P27822.1|PEPA3_RABIT RecName: Full=Pepsin-3; AltName: Full=Pepsin A; AltName:
Full=Pepsin III; Flags: Precursor
gi|165598|gb|AAA85370.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 68/112 (60%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ L I
Sbjct: 201 MWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ + T C C AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 259 TMDGE-TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADSCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + + YIL+ + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDA----CISGFEGMNLDTYTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEAA 295
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|326523981|dbj|BAJ97001.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 63/105 (60%), Gaps = 6/105 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEV---- 250
IN + + N +G+ +DC + ++P ++FT G F L+P Y+L+ +G I
Sbjct: 323 INTMLGATKNFLGQWTLDCATLDSLPTLTFTFGGHKFPLAPTDYVLQVSGSPIGGGGGEA 382
Query: 251 -CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
CISGFM D+PP G LWI+GDVF+ Y TV+D G R+GFA A
Sbjct: 383 QCISGFMGIDMPPQLGQLWIVGDVFLRRYFTVYDKGNNRVGFATA 427
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 63/102 (61%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q L+ + +F+FWL + + DAE GGE+VFG +D H++G Y PV +KGYW+ + +
Sbjct: 229 MINQKLIDKPLFTFWLGDTNKDAENGGELVFGEIDKDHYEGDIVYAPVVRKGYWEVKFNE 288
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+LI ++ G A +D+GTSL+A PT IN +G
Sbjct: 289 LLINDEPADFL-GNATAAIDTGTSLIACPTEAAETINTMLGA 329
>gi|443286988|dbj|BAM76488.1| pepsinogen, partial [Gadus macrocephalus]
Length = 323
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 77/133 (57%), Gaps = 13/133 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LVS+ +FS +L+ ++EG +VFGG++ ++ G+ T++P+T YWQ ++ +
Sbjct: 142 MVQQNLVSQPLFSVYLSSH--SQEGSVVVFGGIEESYYTGQITWIPLTSATYWQIKMDSV 199
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I N + C GGC AI+D+GTSL+ GP+ ++ +N +G QYGD
Sbjct: 200 TI-NGNPVACNGGCQAIIDTGTSLIVGPSSDISNMNSWVGAS----------TDQYGDAT 248
Query: 121 WDLLVSGLLPEKV 133
+ G +PE V
Sbjct: 249 VNCQNIGSMPEVV 261
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 9/102 (8%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
+S +N + + G++ ++C I +MP V FT+ F + Y+ ++ G C
Sbjct: 230 ISNMNSWVGASTDQYGDATVNCQNIGSMPEVVFTLSGHSFTVPASAYVSQSSYG----CN 285
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+GF LWILGDVF+ ++ +FDS IG A++
Sbjct: 286 TGFGG-----GNDQLWILGDVFIRQFYVIFDSQNKNIGLAQS 322
>gi|327270926|ref|XP_003220239.1| PREDICTED: embryonic pepsinogen-like [Anolis carolinensis]
Length = 382
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 64/100 (64%), Gaps = 4/100 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLV E +FS +L R G I FGG+D ++ G ++PVT++GYWQ EL I
Sbjct: 201 MMNEGLVQENLFSVYLGR---GGTGSIITFGGIDESYYTGSINWIPVTEQGYWQIELDSI 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
L+ ++ C GC AIVD+GTSL+AGP ++ + +AIG
Sbjct: 258 LVNGEAI-ACSDGCQAIVDTGTSLVAGPPSDISNLQNAIG 296
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 62/131 (47%), Gaps = 10/131 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L + +S + + P G+ I+C + MP+V
Sbjct: 261 GEAIACSDGCQAIVDTGTSLVAGPPSD--ISNLQNAIGATPGQYGQYDINCGNLGNMPDV 318
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
F I F L+P Y L+ + E C SGF G LWILGDVF+ Y+++FD
Sbjct: 319 VFVINGIQFPLTPTAYTLEESQ---EECHSGFQNMS-----GYLWILGDVFIREYYSIFD 370
Query: 284 SGKLRIGFAEA 294
++G A+A
Sbjct: 371 RANNQVGLAKA 381
>gi|290543422|ref|NP_001166408.1| cathepsin E precursor [Cavia porcellus]
gi|115721|sp|P25796.1|CATE_CAVPO RecName: Full=Cathepsin E; Flags: Precursor
gi|191295|gb|AAA37052.1| procathepsin E [Cavia porcellus]
gi|1246041|gb|AAB35844.1| procathepsin E [Cavia]
Length = 391
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 63/104 (60%), Gaps = 2/104 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q LV+ +FS +++ +P G E+ FGG DP HF G +VPVTK+ YWQ L I
Sbjct: 200 MMAQNLVALPMFSVYMSSNPGGS-GSELTFGGYDPSHFSGSLNWVPVTKQAYWQIALDGI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 104
+G+ S C GC AIVD+GTSL+ GP + ++ A+G V
Sbjct: 259 QVGD-SVMFCSEGCQAIVDTGTSLITGPPGKIKQLQEALGATYV 301
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 73/147 (49%), Gaps = 18/147 (12%)
Query: 156 VEKENVSAGDSAV-CSACEMAVVWVQNQL------KQKQTKEKV-LSYINELCDSLPNPM 207
+ + + GDS + CS A+V L K KQ +E + +Y++E
Sbjct: 253 IALDGIQVGDSVMFCSEGCQAIVDTGTSLITGPPGKIKQLQEALGATYVDE--------- 303
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G S+ C + M +V+F I + L+P Y L +VC +GF ++ PP GPL
Sbjct: 304 GYSV-QCANLNMMLDVTFIINGVPYTLNPTAYTLLDFVDGMQVCSTGFEGLEIQPPAGPL 362
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ ++ VFD G R+G A A
Sbjct: 363 WILGDVFIRQFYAVFDRGNNRVGLAPA 389
>gi|149725185|ref|XP_001501907.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D E G ++F G+D ++ G +VPV+++ YWQ + I +
Sbjct: 203 DQGLVSQDLFSVYLSSDD--ESGSMVIFSGIDSSYYSGSLCWVPVSEEAYWQITVDSITM 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+S C GGC AIVD+GTSLLAGP + I IG S+E + S L
Sbjct: 261 NGESIA-CSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDYSSEAVISCSSIDSL 316
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 64/134 (47%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L + + SYI D E++I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDY----SSEAVISCSSIDSLP 317
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI F LSP YIL+ + CISGF DL G LWILGDVF+ Y T+
Sbjct: 318 DIVFTINGVEFPLSPSAYILEEDDS----CISGFEGMDLDTSSGELWILGDVFIRQYFTI 373
Query: 282 FDSGKLRIGFAEAA 295
FD +I A A
Sbjct: 374 FDRANNQICLAPVA 387
>gi|126352481|ref|NP_001075412.1| pregnancy-associated glycoprotein precursor [Equus caballus]
gi|2499824|sp|Q28389.1|PAG_HORSE RecName: Full=Pregnancy-associated glycoprotein; Short=PAG; Flags:
Precursor
gi|601912|gb|AAC14885.1| pregnancy-associated glycoprotein [Equus caballus]
Length = 388
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 65/106 (61%), Gaps = 3/106 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGL+S+++F+F+L+R ++G ++FGGVDP ++ G+ +VPV+K YWQ + I I
Sbjct: 202 NQGLLSQKLFAFYLSRK--GKKGSVVMFGGVDPSYYTGELHWVPVSKPLYWQISMDSISI 259
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
N C+GGC AIVD+GTSLL GP V I I S E
Sbjct: 260 -NGKVIACDGGCQAIVDTGTSLLLGPQDAVLNIQEIIQARRSTSGE 304
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 50/99 (50%), Gaps = 2/99 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I E+ + + GE IDCD + T+P++ FTI + + YI K + +C S F
Sbjct: 291 IQEIIQARRSTSGEYFIDCDAVNTLPDILFTIDGIGYPVPANAYIQK--DAALGICFSSF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ WILGDVF+ +Y TVFD RIG A A
Sbjct: 349 EGNEDISNNSEEWILGDVFLRLYFTVFDRENDRIGLATA 387
>gi|327271277|ref|XP_003220414.1| PREDICTED: renin-like [Anolis carolinensis]
Length = 398
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 57/86 (66%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
+ +++C +I +P++SF +GD+ + LS Y+L+ + E+C F AFD+PPP GP+W
Sbjct: 312 DYVVECKKIHLLPDISFHLGDRSYTLSGYAYVLQYSDYGKELCAVAFSAFDIPPPLGPIW 371
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
ILG F+G Y+T FD RIGFA +
Sbjct: 372 ILGATFIGQYYTEFDRQNNRIGFARS 397
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 66/116 (56%), Gaps = 5/116 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ + ++SEEVFS + +R+ + GGEI+ GG DP ++ G YV ++ GYW +L +
Sbjct: 206 IISERVLSEEVFSVYYSRNSEMNTGGEIILGGSDPSYYTGDFHYVSISTPGYWHIDLKGV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 112
+G++ C GC A VD+G+S + GP V+ + +IG E ECK +
Sbjct: 266 SLGSEML-FCHEGCTAAVDTGSSFITGPASAVSILMKSIGATLLEERDYVVECKKI 320
>gi|338712318|ref|XP_001501960.2| PREDICTED: pepsin II-1-like [Equus caballus]
Length = 397
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + E G ++FGG+D ++ G +VPV+ +GYWQ + I +
Sbjct: 213 DQGLVSQDLFSVYLSSND--ESGSVVMFGGIDSSYYTGSLHWVPVSHEGYWQITVDSITV 270
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+S C GGC A+VD+GTSLL GPT + I IG GE V+S
Sbjct: 271 NGESIA-CSGGCQAVVDTGTSLLTGPTSAIDNIQSYIGARKDLLGEAVIS 319
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 69/134 (51%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS AVV L T + + SYI D L GE++I C I ++P
Sbjct: 272 GESIACSGGCQAVVDTGTSLLTGPTSAIDNIQSYIGARKDLL----GEAVISCSSIDSLP 327
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI F L+P YIL+ + +CISGF +L G LWILGDVF+ Y TV
Sbjct: 328 DIVFTINGVEFPLTPSAYILEEDD----ICISGFKGMNLDTSSGELWILGDVFIRQYFTV 383
Query: 282 FDSGKLRIGFAEAA 295
FD ++G A A
Sbjct: 384 FDRANNQVGLASVA 397
>gi|195134380|ref|XP_002011615.1| GI11125 [Drosophila mojavensis]
gi|193906738|gb|EDW05605.1| GI11125 [Drosophila mojavensis]
Length = 371
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 68/116 (58%), Gaps = 7/116 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q L+ + VFSF+L RD + +GGE++FGG D + G TYVP++++GYWQF +
Sbjct: 194 MVSQDLIDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDFTYVPISQEGYWQFTMAGA 253
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH--AIGGEGVVSAECKLVVS 114
+ + +C+ C AI D+GTSLL P +N + EG+V +C V S
Sbjct: 254 SV--EGYSLCD-NCQAIADTGTSLLVAPANAYELLNEILNVNDEGLV--DCSTVSS 304
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 11/99 (11%)
Query: 198 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFM 256
EL + + N E ++DC + ++P ++F IG F+LSP YI++T GE ++ V +M
Sbjct: 283 ELLNEILNVNDEGLVDCSTVSSLPVITFNIGGTNFDLSPSAYIIQTDGECMSSV---QYM 339
Query: 257 AFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
D WILGDVF+G Y+T FD G RIGFA A
Sbjct: 340 GTDF-------WILGDVFIGQYYTEFDLGNNRIGFAPVA 371
>gi|74136391|ref|NP_001028088.1| renin precursor [Macaca mulatta]
gi|67461396|sp|Q6DLW5.2|RENI_MACMU RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|61699710|gb|AAT74864.2| prorenin [Macaca mulatta]
Length = 406
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 69/104 (66%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+ + K G WQ +
Sbjct: 213 ILSQGVLKEDVFSFYYNRDSENAQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIPMK 272
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 273 GVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 315
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|426251840|ref|XP_004019629.1| PREDICTED: pepsin A-like [Ovis aries]
Length = 386
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/106 (42%), Positives = 66/106 (62%), Gaps = 3/106 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + E G ++FGG+D ++ G +VPV+ +GYWQ + I +
Sbjct: 202 DQGLVSQDLFSVYLSSN--EESGSVVMFGGIDSSYYSGSLNWVPVSVEGYWQITVDSITM 259
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+S C GC AIVD+GTSLLAGPT ++ I IG S E
Sbjct: 260 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGE 304
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 261 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEEVISCSSIDSLP 316
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI + + P YIL+ + VC SGF D+P G LWILGDVF+ Y TV
Sbjct: 317 DIVFTINGVQYPVPPSAYILQNDD----VCSSGFEGMDIPTSSGDLWILGDVFIRQYFTV 372
Query: 282 FDSGKLRIGFAEAA 295
FD +IG A A
Sbjct: 373 FDRANNQIGLAPVA 386
>gi|49019530|emb|CAD80097.1| pepsin A3 [Trematomus bernacchii]
Length = 378
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 66/100 (66%), Gaps = 3/100 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ + LV++++FS +L+ D A++G + FGGVDP H+ G +++P++++ YWQ + +
Sbjct: 197 MMSENLVNKDMFSVYLSSD--AQQGSVVTFGGVDPNHYSGSISWIPLSRELYWQITVDSV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ N C GGC AIVD+GTSL+ GP ++ IN +G
Sbjct: 255 TV-NGEVVACNGGCQAIVDTGTSLIVGPQSSISNINSKVG 293
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +++C+ I +P+V F I + F + YI ++ + +GF L
Sbjct: 299 GDYLVNCNSIAQLPDVIFHIHGETFTIPGSAYIRQSQSYGSR---TGF-----GNGGSNL 350
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+ +F+ + +G A+A
Sbjct: 351 WILGDVFIRQYYAIFNRSQNMVGLAKA 377
>gi|344295434|ref|XP_003419417.1| PREDICTED: pepsin A-2/A-3-like [Loxodonta africana]
Length = 384
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 64/99 (64%), Gaps = 5/99 (5%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIV-FGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 61
+QGLVS+++FS +L+ D EEGG +V FGG+D ++ G +VPV+ +GYWQ L +
Sbjct: 202 DQGLVSQDLFSVYLSSD---EEGGSVVIFGGIDSSYYTGSLNWVPVSYEGYWQITLDSVS 258
Query: 62 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I +S C C AI+D+GTSLLAGPT + I +G
Sbjct: 259 IDGESVA-CSDTCQAIIDTGTSLLAGPTTAIANIQEYLG 296
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E + C ++PN+ FTI + +SP YI++ + C+ G DL G LW
Sbjct: 302 EEEVSCSTADSLPNIVFTINGVQYPVSPSSYIVEEDQS----CVVGLEGMDLDTYSGELW 357
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGDVF+ Y+TVFD ++G A A
Sbjct: 358 ILGDVFIRQYYTVFDRANNQVGLASVA 384
>gi|56269596|gb|AAH86835.1| Nots protein [Danio rerio]
Length = 443
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 62/100 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ + + P GE ++DC R+ ++P VSF I ++LS EQYI + ++C SGF
Sbjct: 334 LQQFIGATPTANGEFVVDCVRVSSLPVVSFLINSVEYSLSGEQYIRRETLNNKQICFSGF 393
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ ++P P GP+WILGDVF+ ++++D G+ R+G A +
Sbjct: 394 QSIEVPSPAGPMWILGDVFLSQVYSIYDRGENRVGLARLS 433
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 62/110 (56%), Gaps = 3/110 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ ++ + VFSF+L + + GGE+VFGG+D F ++PVT+KGYWQ +L +
Sbjct: 239 MMEQNMLDQPVFSFYLTNN-GSGFGGELVFGGMDESRFLPPINWIPVTQKGYWQIKLDAV 297
Query: 61 LIGNQSTGVCEG--GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + GC AIVD+GTSL+ GP + + IG + E
Sbjct: 298 KVQGALSFCYRSVQGCQAIVDTGTSLIGGPARDILILQQFIGATPTANGE 347
>gi|195443988|ref|XP_002069667.1| GK11645 [Drosophila willistoni]
gi|194165752|gb|EDW80653.1| GK11645 [Drosophila willistoni]
Length = 388
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 61/101 (60%), Gaps = 6/101 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 254
IN L P GE +++C+ I +P++SF IG + F L P YI++ T + + +C+S
Sbjct: 293 INRLIGGFPTTSGEYVVNCNHIDRLPDISFIIGGQRFALQPRDYIIRLTDDDGSALCLSA 352
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
F D WILGDVF+G ++T FD G+LRIGFA AA
Sbjct: 353 FTGMDTE-----FWILGDVFIGRFYTAFDMGQLRIGFAPAA 388
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 59/108 (54%), Gaps = 3/108 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +Q LV + VFSF+L R+ GGE++FGG D F G+ TYVP++ GYWQFE+ I
Sbjct: 202 MCDQKLVDQCVFSFYLKRNGSERSGGELLFGGTDSTKFTGQLTYVPLSNPGYWQFEMQGI 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
I Q AI D+GTSLL P IN IGG S E
Sbjct: 262 EIDGQRIAEHR---QAIADTGTSLLVAPPREHLIINRLIGGFPTTSGE 306
>gi|197247086|gb|AAI65335.1| Nots protein [Danio rerio]
Length = 416
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 62/100 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ + + P GE ++DC R+ ++P VSF I ++LS EQYI + ++C SGF
Sbjct: 307 LQQFIGATPTANGEFVVDCVRVSSLPVVSFLINSVEYSLSGEQYIRRETLNNKQICFSGF 366
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ ++P P GP+WILGDVF+ ++++D G+ R+G A +
Sbjct: 367 QSIEVPSPAGPMWILGDVFLSQVYSIYDRGENRVGLARLS 406
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 62/110 (56%), Gaps = 3/110 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ ++ + VFSF+L + + GGE+VFGG+D F ++PVT+KGYWQ +L +
Sbjct: 212 MMEQNMLDQPVFSFYLTNN-GSGFGGELVFGGMDESRFLPPINWIPVTQKGYWQIKLDAV 270
Query: 61 LIGNQSTGVCEG--GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + GC AIVD+GTSL+ GP + + IG + E
Sbjct: 271 KVQGALSFCYRSVQGCQAIVDTGTSLIGGPARDILILQQFIGATPTANGE 320
>gi|405117936|gb|AFR92711.1| endopeptidase [Cryptococcus neoformans var. grubii H99]
Length = 438
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 57/87 (65%), Gaps = 4/87 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC+ + ++P ++FT G K + LS + YIL G CIS F D+P P GPL
Sbjct: 355 GQYTVDCNTVSSLPELAFTFGGKDYTLSADDYILNAGG----TCISSFTGMDIPAPIGPL 410
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WI+GDVF+ Y+TV+D G+ +GFAE+
Sbjct: 411 WIVGDVFLRKYYTVYDLGRNAVGFAES 437
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 66/118 (55%), Gaps = 8/118 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L+ E VFSF L E+GGE +FGG+D + GK YVPV +KGYW+ EL I
Sbjct: 253 MLNQHLLDEPVFSFRLGSSD--EDGGEAIFGGIDDSAYSGKLAYVPVRRKGYWEVELESI 310
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
G++ + G A +D+GTSL+ PT V +N IG E G + +C V S
Sbjct: 311 SFGDEELELENTGAA--IDTGTSLIVMPTDVAELLNKEIGAEKSWNGQYTVDCNTVSS 366
>gi|337347|gb|AAA60364.1| renin [Homo sapiens]
Length = 403
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 69/102 (67%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ E+VFSF+ NR+ + GG+IV GG DP+H++G Y+ + K G WQ ++ +
Sbjct: 213 IISQGVLKEDVFSFYYNRNSQSL-GGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 272 SVGS-STLLCEDGCLALVDTGASYISGSTSCIEKLMEALGAK 312
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 49/86 (56%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
+ ++ C+ PT+P++SF +G K + L+ Y+ + ++C A D+PPP GP W
Sbjct: 317 DYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTW 376
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
LG F+ ++T FD RIGFA A
Sbjct: 377 ALGATFIRKFYTEFDRRNNRIGFALA 402
>gi|195114666|ref|XP_002001888.1| GI14567 [Drosophila mojavensis]
gi|193912463|gb|EDW11330.1| GI14567 [Drosophila mojavensis]
Length = 402
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 66/117 (56%), Gaps = 6/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQ L+ E F F+L RD A+ GG++V GG+D + F G TYV V ++GYWQF +
Sbjct: 218 LYEQRLIDEPTFGFYLARDGSAQAGGQLVLGGIDSQLFSGNLTYVSVVQQGYWQFVVNSA 277
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
+G VC C AI D+GTSLLA P T +N IGG +G +C V S
Sbjct: 278 EMGGYV--VCY-NCQAIADTGTSLLACPGSAYTMLNQLIGGYLMDGDYYVDCSTVSS 331
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC + ++P + F IG IF+L P YI E C+S F +
Sbjct: 320 GDYYVDCSTVSSLPALKFNIGGTIFSLPPSAYISSFTEYNTTYCMSSFTYINTD-----F 374
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G ++T FD G+ R+GFA A
Sbjct: 375 WILGDVFIGQFYTQFDFGENRVGFAPVA 402
>gi|37790800|gb|AAR03502.1| renin [Homo sapiens]
gi|119611911|gb|EAW91505.1| renin [Homo sapiens]
Length = 403
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 69/102 (67%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ E+VFSF+ NR+ + GG+IV GG DP+H++G Y+ + K G WQ ++ +
Sbjct: 213 IISQGVLKEDVFSFYYNRNSQSL-GGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGV 271
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 272 SVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 312
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 300 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 359
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 360 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 402
>gi|348521340|ref|XP_003448184.1| PREDICTED: cathepsin E-A-like [Oreochromis niloticus]
Length = 406
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 60/99 (60%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ +L + P+ +GE +IDC R+ ++P V+F + + L+ EQYI K G E C SGF
Sbjct: 302 LQQLIGATPSNLGEFVIDCARLSSLPQVTFVLNGTEYTLTSEQYIRKETLGKNEFCFSGF 361
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A ++ P WILGDVF+ Y+++FD G R+G A A
Sbjct: 362 QAVEIFSSTDPQWILGDVFLTEYYSIFDKGYDRVGLARA 400
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 83/177 (46%), Gaps = 21/177 (11%)
Query: 1 MVEQGLVSEEVFSFWLNRDPD-AEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
M+ Q LV + VFSF+L+R + GE++ GG + + G + PVT KGYWQ ++
Sbjct: 208 MIAQKLVDQPVFSFYLSRKSSRTKPAGELLLGGTNEALYIGPINWHPVTAKGYWQIKMDS 267
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ + + C GC AIVD+GTSL+AGPT + + IG S G+
Sbjct: 268 VAVQGVNL-FCPSGCQAIVDTGTSLIAGPTNDILRLQQLIGA----------TPSNLGEF 316
Query: 120 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAV 176
+ D LP+ NG EY T + + KE + G + C + AV
Sbjct: 317 VIDCARLSSLPQVT------FVLNGTEYTLTS-EQYIRKETL--GKNEFCFSGFQAV 364
>gi|328860092|gb|EGG09199.1| hypothetical protein MELLADRAFT_42703 [Melampsora larici-populina
98AG31]
Length = 429
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 58/87 (66%), Gaps = 4/87 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DCD+IPT+P+++FT K F ++ E YIL+ + CIS F D+PP G L
Sbjct: 346 GQYTVDCDKIPTLPDLTFTFAGKDFTITAEDYILQ----VQGTCISSFSGLDMPPNVGEL 401
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WI+GD F+ ++TV+D G+ +GFA+A
Sbjct: 402 WIIGDTFLRKWYTVYDLGRNAVGFAKA 428
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 63/105 (60%), Gaps = 5/105 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDA---EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 57
M+EQGL+ E VF+F+L ++ +GGE +FGG+D H++G Y PV ++GYW+ L
Sbjct: 239 MIEQGLLDEPVFAFYLGTSHESGVDNQGGEAIFGGIDEAHYEGDIHYAPVRRRGYWEVAL 298
Query: 58 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ G + + G A +D+GTSL+A PT IN ++G +
Sbjct: 299 EGVRFGKEEMKLVNVGAA--IDTGTSLIALPTDTAEIINASLGAK 341
>gi|49019802|emb|CAD80095.2| pepsin A1 [Trematomus bernacchii]
Length = 375
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 71/118 (60%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++ LVS+ +FS +L+ + + +G E+VFGG+D H+ G+ ++P++ YWQ + +
Sbjct: 194 MVKERLVSQPLFSVYLSSN--SAQGSEVVFGGIDSSHYTGEIAWIPLSSATYWQINMDSV 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
I Q T C GGC AI+D+GTSL+ GP ++ +N +G G + C+ + S
Sbjct: 252 TINGQ-TVACSGGCQAIIDTGTSLIVGPNSDISSMNSRVGASTNQYGEATVNCRDIQS 308
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 52/102 (50%), Gaps = 9/102 (8%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
+S +N + N GE+ ++C I +MP V+FT+ F + Y+ ++ G C
Sbjct: 282 ISSMNSRVGASTNQYGEATVNCRDIQSMPEVTFTLNGHAFTIPASAYVSQSSYG----CS 337
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+GF LWILGDVF+ Y+ +FDS IG A++
Sbjct: 338 TGF-----GQGGQQLWILGDVFIRQYYAIFDSSTQTIGLAKS 374
>gi|407726059|dbj|BAM46127.1| pepsinogen C [Cynops pyrrhogaster]
Length = 385
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 62/100 (62%), Gaps = 1/100 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGL+S+ +F F+++ + + GGE++FGGVD ++ G+ T+ PVT++ YWQ +
Sbjct: 194 MMNQGLLSQNIFGFYMSEE-GTQPGGELIFGGVDSNYYTGEITWTPVTQQMYWQIGIQGF 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ Q TG C GC IVD+GTSLL P + + IG
Sbjct: 253 AVNGQETGWCSQGCQGIVDTGTSLLTAPGQYMAALMQDIG 292
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 51/88 (57%), Gaps = 5/88 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++ C + ++P +SFTIG L P YI++ + C G MA LP G P
Sbjct: 299 GQYVVTCSSVTSLPTLSFTIGGTSLPLPPSAYIVQG----SAACTVGIMATYLPSQDGQP 354
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ Y++++D R+GFA +
Sbjct: 355 LWILGDVFLRQYYSIYDVTNNRVGFATS 382
>gi|311621984|gb|ADP95548.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 208
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 95/179 (53%), Gaps = 28/179 (15%)
Query: 109 CKLVVSQY-GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSA 167
C + Y G++ W+ LVS + ++G + +G Y + + ++V+ G S
Sbjct: 56 CDTDSNHYSGNIAWEPLVS---EDYWRIKLGAVSISGTTYTNGPMDSIVD-----TGTSL 107
Query: 168 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 227
+ ++ +K+ Q K + IN GE + C +IP++PN++FT+
Sbjct: 108 LTGPSDI--------IKKIQHKIGGIPLIN----------GEYEVVCSKIPSLPNITFTL 149
Query: 228 GDKIFNLSPEQYILKTGEGIAEV-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
G + F+L + YIL+ G + C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 150 GGQNFDLQGKDYILQLPNGNGGMTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHG 208
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 25 FANSAICPNQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGNIAWEPLVSEDYWRIKLGA 84
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 85 VSI--SGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 138
>gi|283806612|ref|NP_001164557.1| pepsin II-2/3 precursor [Oryctolagus cuniculus]
gi|129781|sp|P27821.1|PEPA2_RABIT RecName: Full=Pepsin II-2/3; AltName: Full=Pepsin A; Flags:
Precursor
gi|165600|gb|AAA85369.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 68/112 (60%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVS+++FS +L+ D E+G ++FGG+D ++ G +VPV+ +GYWQ + +
Sbjct: 201 MWNEGLVSQDLFSVYLSSDD--EKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
I N T C C AIVD+GTSLL GPT ++ I IG GE V+S
Sbjct: 259 SI-NGETIACADSCQAIVDTGTSLLTGPTSAISNIQSYIGASKNLLGENVIS 309
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 63/131 (48%), Gaps = 6/131 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N +GE++I C I ++P++
Sbjct: 262 GETIACADSCQAIVDTGTSLLTGPTS--AISNIQSYIGASKNLLGENVISCSAIDSLPDI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + L YILK + C SG ++ G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILKEDDD----CTSGLEGMNVDTYTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEA 294
++G A A
Sbjct: 376 RANNQLGLAAA 386
>gi|291409616|ref|XP_002721074.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 71/118 (60%), Gaps = 7/118 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M + LVSE++FS +L+ D ++G ++FGG+D ++KG +VPV+ +GYWQF + +
Sbjct: 201 MWDHRLVSEDLFSVYLSSDD--KKGSLVMFGGIDESYYKGSLHWVPVSYEGYWQFTMDSV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI----GGEGVVSAECKLVVS 114
I N T C C AI+D+GTSLLAGPT +++I I EG +C V S
Sbjct: 259 TI-NGKTIACADSCQAIIDTGTSLLAGPTNAISKIQRHIRAYDNSEGEAIVKCSDVKS 315
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 54/104 (51%), Gaps = 4/104 (3%)
Query: 191 KVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 250
+S I + N GE+I+ C + ++P+V FTI + L YILK + V
Sbjct: 287 NAISKIQRHIRAYDNSEGEAIVKCSDVKSLPDVVFTIHGVKYPLPASAYILKEDD----V 342
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C SGF DL G LWILGDVF+ Y TVFD ++G A A
Sbjct: 343 CTSGFEGMDLDTSSGELWILGDVFIRKYFTVFDRANNKLGLAPA 386
>gi|311621962|gb|ADP95537.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621968|gb|ADP95540.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621976|gb|ADP95544.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621986|gb|ADP95549.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|375581440|gb|AFA55976.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581442|gb|AFA55977.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581468|gb|AFA55990.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581470|gb|AFA55991.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 224
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 95/179 (53%), Gaps = 28/179 (15%)
Query: 109 CKLVVSQY-GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSA 167
C + Y G++ W+ LVS + ++G + +G Y + + ++V+ G S
Sbjct: 72 CDTDSNHYSGNIAWEPLVS---EDYWRIKLGAVSISGTTYTNGPMDSIVD-----TGTSL 123
Query: 168 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 227
+ ++ +K+ Q K + IN GE + C +IP++PN++FT+
Sbjct: 124 LTGPSDI--------IKKIQHKIGGIPLIN----------GEYEVVCSKIPSLPNITFTL 165
Query: 228 GDKIFNLSPEQYILKTGEGIAEV-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
G + F+L + YIL+ G + C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 166 GGQNFDLQGKDYILQLPNGNGGMTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHG 224
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 41 FANSAICPNQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGNIAWEPLVSEDYWRIKLGA 100
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 101 VSI--SGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 154
>gi|444724657|gb|ELW65256.1| Gastricsin [Tupaia chinensis]
Length = 403
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 71/131 (54%), Gaps = 17/131 (12%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD- 59
M++QG +++ +FSF+ +R P + GGE++ GGVD + + G+ + PVT++ YWQ + D
Sbjct: 200 MLQQGQLTQPIFSFYFSRQPTRQYGGELILGGVDTQLYSGQIVWTPVTRELYWQIAIQDH 259
Query: 60 ------------ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----G 103
+I +STG+C GC AIVD+GT LLA P + A G + G
Sbjct: 260 WLLLPGRAGYRAFIIETKSTGLCSQGCQAIVDTGTFLLAIPQQFMGSFLQATGAQQAQNG 319
Query: 104 VVSAECKLVVS 114
+C V S
Sbjct: 320 DFVVDCNYVQS 330
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++DC+ + +MP ++F I F L P Y+ C G A LP P G P
Sbjct: 319 GDFVVDCNYVQSMPTITFIISGSQFPLPPSAYVFNNNG----YCRLGIEATYLPSPNGQP 374
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILGDVF+ Y++V+D R+GFA +A
Sbjct: 375 LWILGDVFLKEYYSVYDMANNRVGFAFSA 403
>gi|195471992|ref|XP_002088286.1| GE18491 [Drosophila yakuba]
gi|194174387|gb|EDW87998.1| GE18491 [Drosophila yakuba]
Length = 392
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 64/117 (54%), Gaps = 9/117 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGL+ E V SF+L R A GGE++ GG+D +KG TYVPV+ YWQF + I
Sbjct: 204 MVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYKGSLTYVPVSVPAYWQFAVNTI 263
Query: 61 LIGNQSTGVCE-GGCAAIVDSGTSLLAGPTPVVTEINHAIG----GEGVVSAECKLV 112
++ G+ GC AI D+GTSL+ P +IN +G G+G C V
Sbjct: 264 ----KTNGIVLCNGCQAIADTGTSLIVAPLAAYRKINRQLGATDNGDGEAFVSCSRV 316
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 4/100 (4%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN + N GE+ + C R+ T+P V+ IG IF L+P YI++ + C+S F
Sbjct: 295 INRQLGATDNGDGEAFVSCSRVSTLPKVNLNIGGTIFTLAPRDYIVRLTQNGRTYCMSAF 354
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ WILGDVF+G ++TVFD G RIGFA A
Sbjct: 355 TYMEGL----SFWILGDVFIGKFYTVFDKGNERIGFARVA 390
>gi|344295436|ref|XP_003419418.1| PREDICTED: pepsin F-like [Loxodonta africana]
Length = 386
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 77/127 (60%), Gaps = 12/127 (9%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGL+S+ +F+F+L D +G ++FGGVDP ++KG+ +VPVT+ +WQF + I +
Sbjct: 202 NQGLISQGLFAFYLTAQED--KGSVVMFGGVDPAYYKGELNWVPVTRPFFWQFTIDSISM 259
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN------HAIGGEGVVSAECKLVVSQY 116
+ G C GGC I+D+GTSL+ GP + I H+ GGE +++ E V+
Sbjct: 260 NGKVIG-CNGGCDTILDTGTSLVTGPHRSILNIQNLINAKHSYGGEYIMNCEA---VNTL 315
Query: 117 GDLIWDL 123
D+I+++
Sbjct: 316 PDIIFNI 322
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 10/103 (9%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I L ++ + GE I++C+ + T+P++ F I + + YI K G+GI C S F
Sbjct: 291 IQNLINAKHSYGGEYIMNCEAVNTLPDIIFNINGIDYPMPASAYIRKEGKGI---CYSNF 347
Query: 256 MAFDLPPP---RGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
A P +WILGDVF+ +Y +VFD RIG A AA
Sbjct: 348 QA----TPGNLSQEIWILGDVFLRLYFSVFDRENNRIGLAPAA 386
>gi|307167628|gb|EFN61156.1| Cathepsin E-A [Camponotus floridanus]
Length = 164
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 64/112 (57%), Gaps = 5/112 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L++ F +LNR A+ GG+++FGG D ++K YVPVT++GYWQF + I
Sbjct: 39 MIDQWLMAPRAFRIYLNRSSSADLGGKLIFGGFDSAYYKQILAYVPVTRRGYWQFTIDSI 98
Query: 61 LIGNQSTG----VCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ NQ G C GGC AI+D+ T + GP V+ IN VS +
Sbjct: 99 EM-NQVRGNTYEWCRGGCQAIIDTSTWKIMGPEEDVSAINRLFLSNIYVSKQ 149
>gi|283806610|ref|NP_001164556.1| pepsin II-4 precursor [Oryctolagus cuniculus]
gi|129787|sp|P28713.1|PEPA4_RABIT RecName: Full=Pepsin II-4; AltName: Full=Pepsin A; Flags: Precursor
gi|22218076|dbj|BAC07515.1| pepsinogen II-4 [Oryctolagus cuniculus]
Length = 387
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 68/112 (60%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVS+++FS +L+ D E+G ++FGG+D ++ G +VPV+ +GYWQ + +
Sbjct: 201 MWNEGLVSQDLFSVYLSSDD--EKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSV 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
I N T C C AIVD+GTSLL GPT ++ I IG GE V+S
Sbjct: 259 SI-NGETIACADSCQAIVDTGTSLLTGPTSAISNIQSYIGASKNLLGENVIS 309
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 63/131 (48%), Gaps = 6/131 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ A+V L T +S I + N +GE++I C I ++P++
Sbjct: 262 GETIACADSCQAIVDTGTSLLTGPTS--AISNIQSYIGASKNLLGENVISCSAIDSLPDI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + L YILK + C SG ++ G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILKEDDD----CTSGLEGMNVDTYTGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEA 294
++G A A
Sbjct: 376 RANNQLGLAAA 386
>gi|441648777|ref|XP_003266334.2| PREDICTED: LOW QUALITY PROTEIN: gastricsin [Nomascus leucogenys]
Length = 388
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 65/102 (63%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ VFSF+L+ + GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MVQEGALTSPVFSFYLSNQ-EGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 258 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G A LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 342 YCTVGVEATYLPSQSGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|19921120|ref|NP_609458.1| CG17134 [Drosophila melanogaster]
gi|7297766|gb|AAF53016.1| CG17134 [Drosophila melanogaster]
gi|17944939|gb|AAL48533.1| RE02351p [Drosophila melanogaster]
gi|220947772|gb|ACL86429.1| CG17134-PA [synthetic construct]
gi|220957078|gb|ACL91082.1| CG17134-PA [synthetic construct]
Length = 391
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 60/100 (60%), Gaps = 3/100 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGL+ E V SF+L R A GGE++ GG+D ++G TYVPV+ YWQF++
Sbjct: 204 MISQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTYVPVSVPAYWQFKVNT- 262
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I T +C GC AI D+GTSL+A P +IN +G
Sbjct: 263 -IKTNGTLLCN-GCQAIADTGTSLIAVPLAAYRKINRQLG 300
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 51/88 (57%), Gaps = 4/88 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE+ + C R+ ++P V+ IG +F L+P YI+K + C+S F +
Sbjct: 306 GEAFVRCGRVSSLPKVNLNIGGTVFTLAPRDYIVKVTQNGQTYCMSAFTYMEGLS----F 361
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G ++TVFD G RIGFA A
Sbjct: 362 WILGDVFIGKFYTVFDKGNERIGFARVA 389
>gi|73620984|sp|P81497.2|PEPA_SUNMU RecName: Full=Pepsin A; Flags: Precursor
gi|9798654|dbj|BAB11749.1| pepsinogen A [Suncus murinus]
Length = 387
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 70/114 (61%), Gaps = 5/114 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGLVS+++FS +L+ + + G ++FGG+D ++ G +VP++ +GYWQ + I
Sbjct: 201 MWNQGLVSQDLFSVYLSSN--DQSGSVVIFGGIDSSYYTGNLNWVPLSSEGYWQITVDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 114
+ Q+ C G C AIVD+GTSLL+GP + I +IG +A ++VVS
Sbjct: 259 TMNGQAIA-CSGSCQAIVDTGTSLLSGPNNAIANIQKSIGASQ--NANGQMVVS 309
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 50/91 (54%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + L YIL+ + C SGF D+P P
Sbjct: 301 NANGQMVVSCSSIQSLPDIVFTINGIQYPLPASAYILQNQQD----CTSGFQGMDIPTPS 356
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y VFD G R+G A A
Sbjct: 357 GELWILGDVFIRQYFAVFDRGNNRVGLAPVA 387
>gi|298706992|emb|CBJ29800.1| aspartyl protease [Ectocarpus siliculosus]
Length = 410
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 88/174 (50%), Gaps = 15/174 (8%)
Query: 122 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV-VEKENVSAGDSAVCSACEMAVVWVQ 180
+L++ G P+ +I Y + V V +V++ SA+ + +V +
Sbjct: 250 ELIIGGTDPDHYLHEINYVPVTKKGYWQIDMDNVDVSGSSVTSVKSAILDSGTSLLVGPK 309
Query: 181 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 240
+K+ +K +S++N GE ++ C +P ++FTIG K + L ++Y+
Sbjct: 310 EDVKKIASKVGAISFMN----------GEYLMPCSS--DLPPLTFTIGGKEYTLEGDEYV 357
Query: 241 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ G +VCI M D+P P GPLWILGDVFM Y+TVFD G +IG A +
Sbjct: 358 ISAGND--KVCILAIMGMDIPEPMGPLWILGDVFMRKYYTVFDYGNAQIGLATS 409
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 65/108 (60%), Gaps = 7/108 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+VE+G + + VF+F+L ++ GE++ GG DP H+ + YVPVTKKGYWQ ++ ++
Sbjct: 228 LVEEGELDDAVFAFYLGN----QKEGELIIGGTDPDHYLHEINYVPVTKKGYWQIDMDNV 283
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ S + +AI+DSGTSLL GP V +I +G ++ E
Sbjct: 284 DVSGSSVTSVK---SAILDSGTSLLVGPKEDVKKIASKVGAISFMNGE 328
>gi|291409605|ref|XP_002721070.1| PREDICTED: pepsin II-1-like [Oryctolagus cuniculus]
Length = 387
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 69/112 (61%), Gaps = 9/112 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGLVS+++FS +L+ D ++G ++FGG+D ++ G +VPV+ +GYWQ + I
Sbjct: 201 MWNQGLVSQDLFSVYLSSD--EQKGSLVMFGGIDSSYYTGSLNWVPVSHEGYWQITVDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ + T C C A+VD+GTSLLAGPT ++ I IG GE ++S
Sbjct: 259 TMDGE-TIACADSCQAVVDTGTSLLAGPTSAISNIQSYIGASKNLLGENIIS 309
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 66/131 (50%), Gaps = 6/131 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ C+ AVV L T +S I + N +GE+II C I ++P++
Sbjct: 262 GETIACADSCQAVVDTGTSLLAGPTS--AISNIQSYIGASKNLLGENIISCSAIDSLPDI 319
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + + L YILK + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINNVQYPLPASAYILKEDDD----CISGFEGMNLDTSYGELWILGDVFIRQYFTVFD 375
Query: 284 SGKLRIGFAEA 294
++G A A
Sbjct: 376 RANNQVGLAAA 386
>gi|24580865|ref|NP_722705.1| CG31928, isoform A [Drosophila melanogaster]
gi|442625179|ref|NP_001259869.1| CG31928, isoform B [Drosophila melanogaster]
gi|7296077|gb|AAF51372.1| CG31928, isoform A [Drosophila melanogaster]
gi|162944774|gb|ABY20456.1| IP19242p [Drosophila melanogaster]
gi|162944778|gb|ABY20458.1| IP19348p [Drosophila melanogaster]
gi|162951741|gb|ABY21732.1| IP19042p [Drosophila melanogaster]
gi|440213127|gb|AGB92406.1| CG31928, isoform B [Drosophila melanogaster]
Length = 418
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 70/119 (58%), Gaps = 10/119 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
++EQGL+ E +FS ++NR+ DA GG ++ GG DP + G TYVPV+K G+WQ +G
Sbjct: 220 ILEQGLIDEPIFSLYVNRNASDASNGGVLLLGGSDPTLYSGCLTYVPVSKVGFWQITVGQ 279
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLV 112
+ IG S +C C AI D GTSL+ P P + IN +G +GV +CK V
Sbjct: 280 VEIG--SKKLC-SNCQAIFDMGTSLIIVPCPALKIINKKLGIKETDRKDGVYIIDCKKV 335
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 45/94 (47%), Gaps = 13/94 (13%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAF--------- 258
G IIDC ++ +P + F IG K F L+P YIL + C+SGF +
Sbjct: 326 GVYIIDCKKVSHLPKIVFNIGWKDFTLNPSDYIL----NYSGTCVSGFSSLSDCNGTQTN 381
Query: 259 DLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
D +W+ GDVF G T+FD G +G A
Sbjct: 382 DDSEDLNNIWVFGDVFFGAIFTLFDFGLKLVGMA 415
>gi|441613408|ref|XP_004088137.1| PREDICTED: renin isoform 2 [Nomascus leucogenys]
Length = 388
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 69/102 (67%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ E+VFSF+ NR+ + GG+IV GG DP+H++G Y+ + K G WQ ++ +
Sbjct: 198 IISQGVLKEDVFSFYYNRNSQSL-GGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGV 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 257 SVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 297
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 194 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 285 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 344
Query: 252 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 345 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 387
>gi|162944764|gb|ABY20451.1| IP19142p [Drosophila melanogaster]
Length = 418
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 70/119 (58%), Gaps = 10/119 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
++EQGL+ E +FS ++NR+ DA GG ++ GG DP + G TYVPV+K G+WQ +G
Sbjct: 220 ILEQGLIDEPIFSLYVNRNASDASNGGVLLLGGSDPTLYSGCLTYVPVSKVGFWQITVGQ 279
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLV 112
+ IG S +C C AI D GTSL+ P P + IN +G +GV +CK V
Sbjct: 280 VEIG--SKKLC-SNCQAIFDMGTSLIIVPCPALKIINKKLGIKETDRKDGVYIIDCKKV 335
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 13/94 (13%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAF--------- 258
G IIDC ++ +P + F IG K F L+P YI + C+SGF +
Sbjct: 326 GVYIIDCKKVSHLPKIVFNIGWKDFTLNPSDYIF----NYSGTCVSGFSSLSDCNGTQTN 381
Query: 259 DLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
D +W+ GDVF G T+FD G +G A
Sbjct: 382 DDSEDLNNIWVFGDVFFGAIFTLFDFGLKLVGMA 415
>gi|345783299|ref|XP_540923.3| PREDICTED: pepsin F-like [Canis lupus familiaris]
Length = 388
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 66/106 (62%), Gaps = 3/106 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGL+++E+F+F+L++ EEG ++FG VD +++G +VPV+K YWQ + I +
Sbjct: 203 KQGLIAQELFAFYLSKK--EEEGSVVMFGDVDHSYYRGDLNWVPVSKPFYWQLSMDSISV 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
N + C GGC AI+D+GTSLL GP VV I I + S E
Sbjct: 261 -NGAVIACSGGCEAIIDTGTSLLIGPPSVVLNIQKIINAKRSYSGE 305
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 13/104 (12%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I ++ ++ + GE +IDC I T+P++ F I + + YI + EG C S F
Sbjct: 292 IQKIINAKRSYSGEFMIDCSTINTLPDIVFAINGVNYPVPASAYIREGQEG---TCYSNF 348
Query: 256 MAFDLPPPRGPL-----WILGDVFMGVYHTVFDSGKLRIGFAEA 294
RG L WILGDVF+ +Y TVF+ G IG A A
Sbjct: 349 -----DEGRGGLFVSDSWILGDVFLRLYFTVFNRGNDTIGLAPA 387
>gi|311621974|gb|ADP95543.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 217
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 94/179 (52%), Gaps = 28/179 (15%)
Query: 109 CKLVVSQY-GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSA 167
C + Y G + W+ LVS + ++G + +G Y + + ++V+ G S
Sbjct: 65 CDTDSNHYSGSIAWEPLVS---EDYWRIKLGAVSISGTTYTNGPMDSIVD-----TGTSL 116
Query: 168 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 227
+ ++ +K+ Q K + IN GE + C +IP++PN++FT+
Sbjct: 117 LTGPSDI--------IKKIQHKIGGIPLIN----------GEYEVVCSKIPSLPNITFTL 158
Query: 228 GDKIFNLSPEQYILKTGEGIAEV-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
G + F+L + YIL+ G + C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 159 GGQNFDLQGKDYILQLPNGNGGMTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHG 217
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 34 FANSAICPNQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGA 93
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 94 VSI--SGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 147
>gi|18859121|ref|NP_571879.1| nothepsin [Danio rerio]
gi|12053847|emb|CAC20112.1| nothepsin [Danio rerio]
Length = 416
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 61/98 (62%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
+ + + P GE ++DC R+ ++P VSF I ++LS EQY+ + ++C SGF
Sbjct: 307 LQQFIGATPTANGEFVVDCVRVSSLPVVSFLINSVEYSLSGEQYVRRETLNNKQICFSGF 366
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 293
+ ++P P GP+WILGDVF+ ++++D G+ R+G A
Sbjct: 367 QSIEVPSPAGPVWILGDVFLSQVYSIYDRGENRVGLAR 404
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 62/114 (54%), Gaps = 11/114 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL--- 57
M+EQ ++ + VFSF+L + + GGE+VFG D F ++PVT+KGYWQ +L
Sbjct: 212 MMEQNMLDQPVFSFYLTNN-GSGFGGELVFGANDESRFLPPINWIPVTQKGYWQIKLDAV 270
Query: 58 ---GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
G + ++S GC AIVD+GTSL+ GP + + IG + E
Sbjct: 271 KVQGALSFSDRSV----QGCQAIVDTGTSLIGGPARDILILQQFIGATPTANGE 320
>gi|375581438|gb|AFA55975.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 216
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 94/179 (52%), Gaps = 28/179 (15%)
Query: 109 CKLVVSQY-GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSA 167
C + Y G + W+ LVS + ++G + +G Y + + ++V+ G S
Sbjct: 64 CDTDSNHYSGSIAWEPLVS---EDYWRIKLGAVSISGTTYTNGPMDSIVD-----TGTSL 115
Query: 168 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 227
+ ++ +K+ Q K + IN GE + C +IP++PN++FT+
Sbjct: 116 LTGPSDI--------IKKIQHKIGGIPLIN----------GEYEVVCSKIPSLPNITFTL 157
Query: 228 GDKIFNLSPEQYILKTGEGIAEV-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
G + F+L + YIL+ G + C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 158 GGQNFDLQGKDYILQLPNGNGGMTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHG 216
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 63/108 (58%), Gaps = 3/108 (2%)
Query: 9 EEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQST 67
++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG + I T
Sbjct: 41 NQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSI--SGT 98
Query: 68 GVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 99 TYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 146
>gi|148691635|gb|EDL23582.1| progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +G +S+ +F +L +GG+IVFGGVD + G+ T++PVT++ YWQ + D
Sbjct: 202 MLGEGALSQPLFGVYLGSQ-QGSDGGQIVFGGVDENLYTGELTWIPVTQELYWQITIDDF 260
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIGNQ++G C GC IVD+GTSLL P + E+ IG +
Sbjct: 261 LIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQTIGAQ 303
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 8/102 (7%)
Query: 195 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
Y+NEL ++ G+ + CD + ++P ++F + F LSP YI++ EG C
Sbjct: 292 YLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLSPSSYIIQE-EG---SC 347
Query: 252 ISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFA 292
+ G + L G PLWILGDVF+ Y+ VFD G R+G A
Sbjct: 348 MVGLESLSLNAESGQPLWILGDVFLRSYYAVFDMGNNRVGLA 389
>gi|375581456|gb|AFA55984.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 223
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ I +P GE + C +IP++PN++FT+G + F+L + YIL+ G
Sbjct: 127 SDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDLQGKDYILQLPNGNGG 186
Query: 250 V-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
+ C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 187 MTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHG 223
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 40 FANSAICPNQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGA 99
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 100 VSI--SGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 153
>gi|190014570|dbj|BAG48263.1| pepsinogen 1 [Thunnus orientalis]
Length = 378
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 74/133 (55%), Gaps = 14/133 (10%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++GLV++++FS +L+ + +G + FGG DP H+ G T++P++ + YWQ + +
Sbjct: 197 MMKEGLVNQDIFSVYLSSH--SAQGSVVTFGGTDPNHYTGPITWIPLSNELYWQITVDSV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
+ Q C GGC AIVD+GTSL+ GP ++ IN +G SQ GD I
Sbjct: 255 TVNGQVV-ACSGGCQAIVDTGTSLIVGPQSSISNINSFVGAS-----------SQNGDYI 302
Query: 121 WDLLVSGLLPEKV 133
+ G +P V
Sbjct: 303 VNCNNIGQMPHVV 315
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ I++C+ I MP+V F I + F + Y+ ++ C +GF L
Sbjct: 299 GDYIVNCNNIGQMPHVVFHIQGQEFTIPASGYVRQSRY---FGCRTGFGNGG-----DNL 350
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++F + +G A+A
Sbjct: 351 WILGDVFIRQYYSIFSRAQNMVGLAKA 377
>gi|157837066|pdb|5PEP|A Chain A, X-Ray Analyses Of Aspartic Proteases. Ii.
Three-Dimensional Structure Of The Hexagonal Crystal
Form Of Porcine Pepsin At 2.3 Angstroms Resolution
Length = 326
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 71/118 (60%), Gaps = 11/118 (9%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 142 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLVVS 114
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S C + S
Sbjct: 200 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS--CSSIAS 254
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIASLPDI 258
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|311621980|gb|ADP95546.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 194
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 94/179 (52%), Gaps = 28/179 (15%)
Query: 109 CKLVVSQY-GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSA 167
C + Y G + W+ LVS + ++G + +G Y + + ++V+ G S
Sbjct: 42 CDTDSNHYSGSIAWEPLVS---EDYWRIKLGAVSISGTTYTNGPMDSIVD-----TGTSL 93
Query: 168 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 227
+ ++ +K+ Q K + IN GE + C +IP++PN++FT+
Sbjct: 94 LTGPSDI--------IKKIQHKIGGIPLIN----------GEYEVVCSKIPSLPNITFTL 135
Query: 228 GDKIFNLSPEQYILKTGEGIAEV-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
G + F+L + YIL+ G + C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 136 GGQNFDLQGKDYILQLPNGNGGMTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHG 194
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 11 FANSAICPNQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGA 70
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 71 VSI--SGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 124
>gi|311621954|gb|ADP95533.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621956|gb|ADP95534.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621958|gb|ADP95535.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621960|gb|ADP95536.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621964|gb|ADP95538.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621966|gb|ADP95539.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621970|gb|ADP95541.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621978|gb|ADP95545.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621990|gb|ADP95551.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621994|gb|ADP95553.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621996|gb|ADP95554.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|375581434|gb|AFA55973.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581436|gb|AFA55974.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581444|gb|AFA55978.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581446|gb|AFA55979.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581448|gb|AFA55980.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581450|gb|AFA55981.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581452|gb|AFA55982.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581454|gb|AFA55983.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581460|gb|AFA55986.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581462|gb|AFA55987.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581464|gb|AFA55988.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581466|gb|AFA55989.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 224
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ I +P GE + C +IP++PN++FT+G + F+L + YIL+ G
Sbjct: 128 SDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDLQGKDYILQLPNGNGG 187
Query: 250 V-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
+ C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 188 MTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHG 224
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 41 FANSAICPNQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGA 100
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 101 VSI--SGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 154
>gi|301606846|ref|XP_002933025.1| PREDICTED: gastricsin isoform 1 [Xenopus (Silurana) tropicalis]
Length = 383
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 62/102 (60%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L++E VF F+L+ + + + GGE+ FGGVD ++ G+ + PVT + YWQ +
Sbjct: 193 MLQQNLLNEPVFGFYLSGE-NTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGF 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q++G C GC IVD+GTSLL P + + IG +
Sbjct: 252 SINGQASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGAQ 293
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 50/89 (56%), Gaps = 4/89 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
GE ++ C I +P +SFTI F L P Y+L+ G C G M L G P
Sbjct: 298 GEYVVSCSSIQNLPTISFTISGVSFPLPPSAYVLQQSSG---YCTIGIMPTYLSSQNGQP 354
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+WILGDVF+ Y++V+D G ++GFA AA
Sbjct: 355 MWILGDVFLRQYYSVYDLGNNQVGFASAA 383
>gi|194900440|ref|XP_001979765.1| GG22202 [Drosophila erecta]
gi|190651468|gb|EDV48723.1| GG22202 [Drosophila erecta]
Length = 395
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 64/119 (53%), Gaps = 3/119 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +Q LV E VFSF+L R+ GGE++FGGVD F G TYVP+T GYWQF L I
Sbjct: 209 MCDQQLVDECVFSFYLKRNGSERMGGELLFGGVDKTKFSGSLTYVPLTHAGYWQFPLDGI 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G + AI D+GTSLLA P IN +GG + E L S+ L
Sbjct: 269 ELGGTTISRHR---QAIADTGTSLLAAPPREYLIINSLLGGLPTSNNEYLLNCSEIDSL 324
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 58/101 (57%), Gaps = 6/101 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 254
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGRRFGLQPRDYVMSVTNDDGSRICLSA 359
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
F D WILGDVF+G Y+T FD+G+ +IGFA AA
Sbjct: 360 FTLMD-----AEFWILGDVFIGRYYTAFDAGQRQIGFAPAA 395
>gi|195034430|ref|XP_001988894.1| GH11416 [Drosophila grimshawi]
gi|193904894|gb|EDW03761.1| GH11416 [Drosophila grimshawi]
Length = 400
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/115 (42%), Positives = 66/115 (57%), Gaps = 6/115 (5%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGL+ + F F+L + AE GGE++ GGVD F+G T VPV++ GYWQF + + +
Sbjct: 218 NQGLIDKPTFGFYLTHNGSAELGGELILGGVDNTLFEGNLTSVPVSQMGYWQFAMAVVAM 277
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKLVVS 114
N V C AI D+GTSLLA P +T IN+ IG +G +C LV S
Sbjct: 278 DN---NVICSDCQAIADTGTSLLAVPANQLTYINNIIGAYQMDGDYFVDCSLVNS 329
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 68/132 (51%), Gaps = 9/132 (6%)
Query: 161 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 220
V+ ++ +CS C+ A+ L + L+YIN + + G+ +DC + ++
Sbjct: 275 VAMDNNVICSDCQ-AIADTGTSLLAVPANQ--LTYINNIIGAYQMD-GDYFVDCSLVNSL 330
Query: 221 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 280
P ++F IG+ +F+L+ +YI E + C+S F + D WILGD F+G Y+T
Sbjct: 331 PTLNFLIGESVFSLTSAEYITVIQESDTKYCMSSFTSIDT-----NFWILGDTFIGHYYT 385
Query: 281 VFDSGKLRIGFA 292
FD G + FA
Sbjct: 386 QFDFGHNSVSFA 397
>gi|166796432|gb|AAI59305.1| LOC496913 protein [Xenopus (Silurana) tropicalis]
Length = 381
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 62/102 (60%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L++E VF F+L+ + + + GGE+ FGGVD ++ G+ + PVT + YWQ +
Sbjct: 191 MLQQNLLNEPVFGFYLSGE-NTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGF 249
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q++G C GC IVD+GTSLL P + + IG +
Sbjct: 250 SINGQASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGAQ 291
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 50/89 (56%), Gaps = 4/89 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
GE ++ C I +P +SFTI F L P Y+L+ G C G M L G P
Sbjct: 296 GEYVVSCSSIQNLPTISFTISGVSFPLPPSAYVLQQSSG---YCTIGIMPTYLSSQNGQP 352
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+WILGDVF+ Y++V+D G ++GFA AA
Sbjct: 353 MWILGDVFLRQYYSVYDLGNNQVGFASAA 381
>gi|301606848|ref|XP_002933026.1| PREDICTED: gastricsin isoform 2 [Xenopus (Silurana) tropicalis]
Length = 382
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 62/102 (60%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L++E VF F+L+ + + + GGE+ FGGVD ++ G+ + PVT + YWQ +
Sbjct: 193 MLQQNLLNEPVFGFYLSGE-NTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGF 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q++G C GC IVD+GTSLL P + + IG +
Sbjct: 252 SINGQASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGAQ 293
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
GE ++ C I +P +SFTI F L P Y+L+ + C G M L G P
Sbjct: 298 GEYVVSCSSIQNLPTISFTISGVSFPLPPSAYVLQN----SGYCTIGIMPTYLSSQNGQP 353
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+WILGDVF+ Y++V+D G ++GFA AA
Sbjct: 354 MWILGDVFLRQYYSVYDLGNNQVGFASAA 382
>gi|56971213|gb|AAH88063.1| LOC496913 protein, partial [Xenopus (Silurana) tropicalis]
Length = 380
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 62/102 (60%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L++E VF F+L+ + + + GGE+ FGGVD ++ G+ + PVT + YWQ +
Sbjct: 190 MLQQNLLNEPVFGFYLSGE-NTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGF 248
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q++G C GC IVD+GTSLL P + + IG +
Sbjct: 249 SINGQASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGAQ 290
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 51/89 (57%), Gaps = 4/89 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
GE ++ C I +P +SFTI F L P Y+L+ G CI G M L G P
Sbjct: 295 GEYVVSCSSIQNLPTISFTISGVSFPLPPSAYVLQQSSG---YCIIGIMPTYLSSQNGQP 351
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+WILGDVF+ Y++V+D G ++GFA AA
Sbjct: 352 MWILGDVFLRQYYSVYDLGNNQVGFASAA 380
>gi|66865247|gb|AAY57527.1| pepsinogen A [Gadus morhua]
Length = 277
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 76/133 (57%), Gaps = 13/133 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+Q LVS+ +FS +L+ ++EG +VFGG++ ++ G+ ++P+T YWQ ++ +
Sbjct: 113 MVQQNLVSQPLFSVYLSSH--SQEGSVVVFGGIEASYYTGQIAWIPLTSATYWQIKMDSV 170
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I N + C GGC AI+D+GTSL+ GP+ ++ +N +G QYGD
Sbjct: 171 TI-NGNPVACNGGCQAIIDTGTSLIVGPSSDISNMNSWVGAS----------TDQYGDAT 219
Query: 121 WDLLVSGLLPEKV 133
+ G +PE V
Sbjct: 220 VNCQNIGSMPEVV 232
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 41/86 (47%), Gaps = 9/86 (10%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
+S +N + + G++ ++C I +MP V FT+ F + Y+ ++ G C
Sbjct: 201 ISNMNSWVGASTDQYGDATVNCQNIGSMPEVVFTLSGHSFTVPASAYVSQSSYG----CN 256
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVY 278
+GF LWILGDVF G +
Sbjct: 257 TGFGG-----GNDQLWILGDVFKGEF 277
>gi|224458280|ref|NP_001138943.1| gastricsin precursor [Pongo abelii]
gi|222425206|dbj|BAH20552.1| pepsinogen C [Pongo abelii]
Length = 388
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ VFSF+L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MVQEGALTSPVFSFYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 258 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 342 YCTVGVELTYLPSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|311621982|gb|ADP95547.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 189
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 94/179 (52%), Gaps = 28/179 (15%)
Query: 109 CKLVVSQY-GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSA 167
C + Y G + W+ LVS + ++G + +G Y + + ++V+ G S
Sbjct: 37 CDTDSNHYSGSIAWEPLVS---EDYWRIKLGAVSISGTTYTNGPMDSIVD-----TGTSL 88
Query: 168 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 227
+ ++ +K+ Q K + IN GE + C +IP++PN++FT+
Sbjct: 89 LTGPSDI--------IKKIQHKIGGIPLIN----------GEYEVVCSKIPSLPNITFTL 130
Query: 228 GDKIFNLSPEQYILKTGEGIAEV-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
G + F+L + YIL+ G + C+SGFM D+P P GPLWILGDVF+G +++VFD G
Sbjct: 131 GGQNFDLQGKDYILQLPNGNGGMTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFDHG 189
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 6 FANSAICPNQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGA 65
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 66 VSI--SGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 119
>gi|296471634|tpg|DAA13749.1| TPA: pepsin A precursor [Bos taurus]
Length = 367
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + E G ++FG +D ++ G +VPV+ +GYWQ + I +
Sbjct: 188 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 245
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+S C GC AIVD+GTSLLAGPT ++ I IG S E + S L
Sbjct: 246 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSL 301
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/129 (37%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 247 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 302
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI + + P YIL++ +C SGF D+ G LWILGDVF+ Y TV
Sbjct: 303 DIVFTINGVQYPVPPSAYILQSNG----ICSSGFEGMDISTSSGDLWILGDVFIRQYFTV 358
Query: 282 FDSGKLRIG 290
FD G +IG
Sbjct: 359 FDRGNNQIG 367
>gi|380865655|gb|AFF19538.1| pepsin F, partial [Camelus dromedarius]
Length = 354
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 68/108 (62%), Gaps = 3/108 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ +Q L+ E +F+F+L+ E G ++FGG+D ++KG+ +VPV+++ YWQ + I
Sbjct: 194 LKKQRLLKEPIFAFYLSTK--KENGSVVMFGGLDHSYYKGELKWVPVSQRLYWQISMDSI 251
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
+ + G C+GGC AIVD+GT++L GPT VVT I AI + E
Sbjct: 252 TMNGKILG-CKGGCQAIVDTGTAVLVGPTNVVTNIQKAINARPLTGYE 298
>gi|73620983|sp|P00792.2|PEPA_BOVIN RecName: Full=Pepsin A; Flags: Precursor
gi|24415088|emb|CAD55693.1| pepsinogen A [synthetic construct]
gi|37622272|gb|AAQ95219.1| pepsinogen A [Bos taurus]
Length = 372
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + E G ++FG +D ++ G +VPV+ +GYWQ + I +
Sbjct: 188 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 245
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+S C GC AIVD+GTSLLAGPT ++ I IG S E + S L
Sbjct: 246 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSL 301
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 247 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 302
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI + + P YIL++ +C SGF D+ G LWILGDVF+ Y TV
Sbjct: 303 DIVFTINGVQYPVPPSAYILQSNG----ICSSGFEGMDISTSSGDLWILGDVFIRQYFTV 358
Query: 282 FDSGKLRIGFAEAA 295
FD G +IG A A
Sbjct: 359 FDRGNNQIGLAPVA 372
>gi|51534968|dbj|BAD36917.1| pepsinogen C [Mus caroli]
Length = 377
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +G +S+ +F +L GG+IVFGGVD + G+ T++PVT++ YWQ + D
Sbjct: 187 MLGEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLYTGELTWIPVTQELYWQITIDDF 245
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIGNQ++G C GC IVD+GTSLL P ++E+ IG +
Sbjct: 246 LIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLSELLQTIGAQ 288
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 56/102 (54%), Gaps = 8/102 (7%)
Query: 195 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
Y++EL ++ G+ + CD + ++P ++F + F LSP YI++ EG C
Sbjct: 277 YLSELLQTIGAQEGEYGQYFVSCDTVSSLPTLTFVLNGVQFPLSPSSYIIQE-EG---SC 332
Query: 252 ISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFA 292
+ G + L G PLWILGDVF+ Y+ VFD G R+G A
Sbjct: 333 MVGLESLSLNAESGQPLWILGDVFLRSYYAVFDMGNNRVGLA 374
>gi|195575781|ref|XP_002077755.1| GD23098 [Drosophila simulans]
gi|194189764|gb|EDX03340.1| GD23098 [Drosophila simulans]
Length = 419
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
++EQGL+ E +FS ++NR+ DA GG ++ GG DP ++G TYVPV+K G+WQ +G
Sbjct: 221 ILEQGLIDEPIFSLYVNRNASDASNGGVLLLGGSDPTLYRGCLTYVPVSKVGFWQITVGQ 280
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ IG S +C C AI D GTSL+ P P + IN +G
Sbjct: 281 VEIG--SKKLCS-NCQAIFDVGTSLIIVPCPALKIINKKLG 318
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 47/94 (50%), Gaps = 13/94 (13%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAF--------- 258
G IIDC+++P +P + F IG K F L+P YIL + C+SGF +
Sbjct: 327 GAYIIDCNKVPRLPKIVFNIGWKDFTLNPSDYIL----NYSGTCVSGFSSLSDCNGAQTN 382
Query: 259 DLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
D +W+ GDVF G T+FD G +G A
Sbjct: 383 DDGEDLNNIWVFGDVFFGAVFTLFDFGLKLVGIA 416
>gi|494476|pdb|1PSA|A Chain A, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|494478|pdb|1PSA|B Chain B, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|67463919|pdb|1YX9|A Chain A, Effect Of Dimethyl Sulphoxide On The Crystal Structure Of
Porcine Pepsin
Length = 326
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 142 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 200 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS 248
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 258
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|292658855|ref|NP_001001600.2| pepsin A preproprotein [Bos taurus]
Length = 386
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + E G ++FG +D ++ G +VPV+ +GYWQ + I +
Sbjct: 202 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 259
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+S C GC AIVD+GTSLLAGPT ++ I IG S E + S L
Sbjct: 260 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSL 315
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 261 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 316
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI + + P YIL++ +C SGF D+ G LWILGDVF+ Y TV
Sbjct: 317 DIVFTINGVQYPVPPSAYILQSNG----ICSSGFEGMDISTSSGDLWILGDVFIRQYFTV 372
Query: 282 FDSGKLRIGFAEAA 295
FD G +IG A A
Sbjct: 373 FDRGNNQIGLAPVA 386
>gi|195350351|ref|XP_002041704.1| GM16818 [Drosophila sechellia]
gi|194123477|gb|EDW45520.1| GM16818 [Drosophila sechellia]
Length = 419
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
++EQGL+ E +FS ++NR+ DA GG ++ GG DP ++G TYVPV+K G+WQ +G
Sbjct: 221 ILEQGLIDEPIFSLYVNRNASDANNGGVLLLGGSDPTLYRGCLTYVPVSKVGFWQITVGQ 280
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ IG S +C C AI D GTSL+ P P + IN +G
Sbjct: 281 VEIG--SKKLCS-NCQAIFDVGTSLIIVPCPALKIINKKLG 318
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 13/94 (13%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAF--------- 258
G IIDC+++ +P + F IG K F L+P YIL + C+SGF +
Sbjct: 327 GAYIIDCNKVSRLPKIVFNIGWKDFTLNPSDYIL----NYSGTCVSGFSSLSDCNGAQTN 382
Query: 259 DLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
D +W+ GDVF G T+FD G +G A
Sbjct: 383 DDGEDLNNIWVFGDVFFGAVFTLFDFGLKLVGIA 416
>gi|151553998|gb|AAI49645.1| PGA5 protein [Bos taurus]
Length = 381
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + E G ++FG +D ++ G +VPV+ +GYWQ + I +
Sbjct: 197 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 254
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+S C GC AIVD+GTSLLAGPT ++ I IG S E + S L
Sbjct: 255 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSL 310
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 256 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 311
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI + + P YIL++ +C SGF D+ G LWILGDVF+ Y TV
Sbjct: 312 DIVFTINGVQYPVPPSAYILQSNG----ICSSGFEGMDISTSSGDLWILGDVFIRQYFTV 367
Query: 282 FDSGKLRIGFAEAA 295
FD G +IG A A
Sbjct: 368 FDRGNNQIGLAPVA 381
>gi|11990128|emb|CAC19555.1| pepsin A [Camelus dromedarius]
Length = 390
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 64/99 (64%), Gaps = 3/99 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
++GL+SE++FS +L+ + E G ++FGG+D ++ G +VPV+ +GYWQ + I +
Sbjct: 205 DEGLISEDLFSVYLSSND--ESGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQITVDSITM 262
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
+S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 263 EGESIA-CSSGCQAIVDTGTSLLAGPTDAISNIQSYIGA 300
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 66/134 (49%), Gaps = 11/134 (8%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS+ A+V L T + SYI DS G+ ++ C I ++P
Sbjct: 264 GESIACSSGCQAIVDTGTSLLAGPTDAISNIQSYIGASEDS----YGDMVVSCSSISSLP 319
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
N+ FTI + LSP YIL++ + C SGF DL LWILGDVF+ Y TV
Sbjct: 320 NIVFTINGVQYPLSPSAYILESDDS----CTSGFEGMDLSSSE-ELWILGDVFIRQYFTV 374
Query: 282 FDSGKLRIGFAEAA 295
FD ++G A A
Sbjct: 375 FDRANNQVGLAAVA 388
>gi|157836865|pdb|3PEP|A Chain A, Revised 2.3 Angstroms Structure Of Porcine Pepsin.
Evidence For A Flexible Subdomain
Length = 326
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 142 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 200 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS 248
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 258
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTIDGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|12843350|dbj|BAB25952.1| unnamed protein product [Mus musculus]
Length = 396
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 64/103 (62%), Gaps = 2/103 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +G +S+ +F +L GG+IVFGGVD + G+ T++PVT++ YWQ + D
Sbjct: 202 MLGEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLYTGELTWIPVTQELYWQITIDDF 260
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIGNQ++G C GC IVD+GTSLL P + E+ IG +
Sbjct: 261 LIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQTIGAQ 303
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 48/89 (53%), Gaps = 8/89 (8%)
Query: 195 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
Y+NEL ++ G+ + CD + ++P ++F + F LSP YI++ EG C
Sbjct: 292 YLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLSPSSYIIQE-EG---SC 347
Query: 252 ISGFMAFDLPPPRG-PLWILGDVFMGVYH 279
+ G + L G PLWILGDVF+ Y+
Sbjct: 348 MVGLESLSLNAESGQPLWILGDVFLRSYY 376
>gi|157836875|pdb|3PSG|A Chain A, The High Resolution Crystal Structure Of Porcine
Pepsinogen
Length = 370
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 186 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 243
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 244 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS 292
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 245 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 302
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 303 VFTIDGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 358
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 359 RANNKVGLAPVA 370
>gi|440893605|gb|ELR46308.1| Pepsin A, partial [Bos grunniens mutus]
Length = 388
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + E G ++FG +D ++ G +VPV+ +GYWQ + I +
Sbjct: 204 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+S C GC AIVD+GTSLLAGPT ++ I IG S E + S L
Sbjct: 262 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSL 317
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 68/134 (50%), Gaps = 10/134 (7%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 221
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 263 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 318
Query: 222 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 281
++ FTI + + P YIL++ +GI C SGF D+ G LWILGDVF+ Y TV
Sbjct: 319 DIVFTINGVQYPVPPSAYILQS-DGI---CSSGFEGMDISTSSGDLWILGDVFIRQYFTV 374
Query: 282 FDSGKLRIGFAEAA 295
FD G +IG A A
Sbjct: 375 FDRGNNQIGLAPVA 388
>gi|194760707|ref|XP_001962579.1| GF14369 [Drosophila ananassae]
gi|190616276|gb|EDV31800.1| GF14369 [Drosophila ananassae]
Length = 371
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 75/126 (59%), Gaps = 11/126 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV VFSF+L RD + +GGE++FGG D + G TYVP++++GYWQF +
Sbjct: 195 MVSQGLVDSSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDLTYVPISEQGYWQFTMD-- 252
Query: 61 LIGNQSTG--VCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG-GEGVVSAECKLVVSQYG 117
G+ G +CE C AI D+GTSL+ P ++ + +G V +C +VS
Sbjct: 253 --GSSFDGYTLCE-DCQAIADTGTSLIVAPYEAYITLSELLNVDDGYV--DCS-IVSSLP 306
Query: 118 DLIWDL 123
D+I+++
Sbjct: 307 DVIFNI 312
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 47/86 (54%), Gaps = 13/86 (15%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF--MAFDLPPPRGPLWI 269
+DC + ++P+V F IG F L P YIL++ C+S F M D WI
Sbjct: 297 VDCSIVSSLPDVIFNIGGSNFALKPSAYILQSDGN----CMSAFEYMGTDF-------WI 345
Query: 270 LGDVFMGVYHTVFDSGKLRIGFAEAA 295
LGDVF+G Y+T FD G RIGFA A
Sbjct: 346 LGDVFIGQYYTEFDLGNNRIGFAPVA 371
>gi|195470497|ref|XP_002087543.1| GE17582 [Drosophila yakuba]
gi|194173644|gb|EDW87255.1| GE17582 [Drosophila yakuba]
Length = 421
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 63/101 (62%), Gaps = 4/101 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
++EQGL+ E +FS ++NR+ DA GG ++ GG DP + G TYVP++K G+WQ +G
Sbjct: 221 ILEQGLIDEPIFSLYVNRNASDASNGGVLLLGGADPTLYSGCLTYVPLSKVGFWQITVGQ 280
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
+ IG S +C C AI D GTSL+ P P + IN +G
Sbjct: 281 VAIG--SKKLCS-NCQAIFDMGTSLIIVPCPALKIINRKLG 318
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 45/96 (46%), Gaps = 15/96 (15%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP- 266
G IIDC ++ +PNV IG K F L+P Y+L + C+SGF + G
Sbjct: 327 GVYIIDCSKVSRLPNVVLNIGWKDFTLTPSDYVLN----YSGTCVSGFSSLSACNGSGTP 382
Query: 267 ----------LWILGDVFMGVYHTVFDSGKLRIGFA 292
+W+ GDVF G T+FD G +G A
Sbjct: 383 INDDGDDLNNIWVFGDVFFGAIFTLFDFGLKLVGMA 418
>gi|253723303|pdb|2PSG|A Chain A, Refined Structure Of Porcine Pepsinogen At 1.8 Angstroms
Resolution
Length = 370
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 186 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 243
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 244 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS 292
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 245 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 302
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 303 VFTIDGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 358
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 359 RANNKVGLAPVA 370
>gi|29244579|ref|NP_080249.2| gastricsin precursor [Mus musculus]
gi|73921722|sp|Q9D7R7.1|PEPC_MOUSE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|12843461|dbj|BAB25990.1| unnamed protein product [Mus musculus]
gi|68534888|gb|AAH99409.1| Progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 64/103 (62%), Gaps = 2/103 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +G +S+ +F +L GG+IVFGGVD + G+ T++PVT++ YWQ + D
Sbjct: 202 MLGEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLYTGELTWIPVTQELYWQITIDDF 260
Query: 61 LIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIGNQ++G C GC IVD+GTSLL P + E+ IG +
Sbjct: 261 LIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQTIGAQ 303
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 8/102 (7%)
Query: 195 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
Y+NEL ++ G+ + CD + ++P ++F + F LSP YI++ EG C
Sbjct: 292 YLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLSPSSYIIQE-EG---SC 347
Query: 252 ISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFA 292
+ G + L G PLWILGDVF+ Y+ VFD G R+G A
Sbjct: 348 MVGLESLSLNAESGQPLWILGDVFLRSYYAVFDMGNNRVGLA 389
>gi|292658825|ref|NP_999038.2| pepsin A preproprotein [Sus scrofa]
gi|121073319|gb|ABM47074.1| pepsinogen A [Sus scrofa]
Length = 385
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 201 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 258
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 259 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS 307
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 65/129 (50%), Gaps = 6/129 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 260 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 317
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 318 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 373
Query: 284 SGKLRIGFA 292
++G A
Sbjct: 374 RANNKVGLA 382
>gi|194858591|ref|XP_001969211.1| GG24067 [Drosophila erecta]
gi|190661078|gb|EDV58270.1| GG24067 [Drosophila erecta]
Length = 372
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 57/88 (64%), Gaps = 3/88 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV VFSF+L RD + +GGE++FGG D + G TYVP++++GYWQF +
Sbjct: 195 MVSQGLVDSSVFSFYLARDGTSTKGGELIFGGSDSSLYSGSLTYVPISEQGYWQFNMASS 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGP 88
I S +C+ C AI D+GTSL+ P
Sbjct: 255 SIDGFS--LCD-DCQAIADTGTSLIVAP 279
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 13/86 (15%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF--MAFDLPPPRGPLWI 269
+DC + ++P+++F IG F LSP Y++++ C+S F M D WI
Sbjct: 298 LDCSTVSSLPDITFNIGGTDFVLSPSAYVIQSDGN----CMSAFEYMGTDF-------WI 346
Query: 270 LGDVFMGVYHTVFDSGKLRIGFAEAA 295
LGDVF+G Y+T FD G RIGFA A
Sbjct: 347 LGDVFIGQYYTEFDLGNNRIGFAPVA 372
>gi|194854124|ref|XP_001968293.1| GG24794 [Drosophila erecta]
gi|190660160|gb|EDV57352.1| GG24794 [Drosophila erecta]
Length = 412
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 69/127 (54%), Gaps = 11/127 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+V +GL+S VFS +LNR+ DA GGE++ GG D + G TYVPV+ GYWQF +
Sbjct: 226 LVAEGLISRAVFSIYLNRNGTDATHGGELILGGTDSGLYSGCLTYVPVSSAGYWQFTMTS 285
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLVV 113
+ CE C AI+D GTSL+ P PV+ IN +G GV C V
Sbjct: 286 ATVNGFQ--FCE-NCEAILDVGTSLIVVPEPVLAAINQILGVLDATPSNGVYQVNCS-TV 341
Query: 114 SQYGDLI 120
S+ D++
Sbjct: 342 SELPDVV 348
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 57/108 (52%), Gaps = 12/108 (11%)
Query: 190 EKVLSYINELC---DSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEG 246
E VL+ IN++ D+ P+ G ++C + +P+V TI + F L Y+L+ G
Sbjct: 312 EPVLAAINQILGVLDATPSN-GVYQVNCSTVSELPDVVLTIARRKFPLKFSDYVLRYGN- 369
Query: 247 IAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+SGF + L ILG++F+G Y+TV+D+ IG A A
Sbjct: 370 ---TCVSGFTSMK----GNNLLILGEIFLGAYYTVYDTVYKLIGLAPA 410
>gi|50978822|ref|NP_001003117.1| pepsin A preproprotein [Canis lupus familiaris]
gi|73621384|sp|Q9GMY6.1|PEPA_CANFA RecName: Full=Pepsin A; Flags: Precursor
gi|9798660|dbj|BAB11752.1| pepsinogen A [Canis lupus familiaris]
Length = 386
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 63/102 (61%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ + +
Sbjct: 200 MWNEGLVSQDLFSVYLSSD--DQSGSVVMFGGIDSSYYSGNLNWVPVSVEGYWQITVDSV 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ Q+ C GC AIVD+GTSLLAGPT + I IG
Sbjct: 258 TMNGQAIA-CSDGCQAIVDTGTSLLAGPTNAIANIQSYIGAS 298
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 52/91 (57%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ +I C I ++P++ FTI + L P YIL++ +G C+SGF +LP
Sbjct: 300 NSYGQMVISCSAINSLPDIVFTINGIQYPLPPSAYILQSQQG----CVSGFQGMNLPTAS 355
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y VFD ++G A A
Sbjct: 356 GELWILGDVFIRQYFAVFDRANNQVGLAPVA 386
>gi|195433871|ref|XP_002064930.1| GK15195 [Drosophila willistoni]
gi|194161015|gb|EDW75916.1| GK15195 [Drosophila willistoni]
Length = 431
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 62/103 (60%), Gaps = 4/103 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRD---PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 57
++ Q L+ E +F+ LNR+ A GG+++ GG DP + G TYVP+++ GYWQF +
Sbjct: 227 LIAQKLIDEPIFALILNRNGSASSASNGGQLILGGTDPTLYSGCLTYVPLSQVGYWQFTV 286
Query: 58 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
I++G+ +C C AI+D GTSL+ P+ + IN +G
Sbjct: 287 TSIVLGSNGKSLCS-QCEAILDVGTSLIVAPSAALATINQQLG 328
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 47/90 (52%), Gaps = 13/90 (14%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAF-----DLPPPRG- 265
IDC++ T+P++ TI K F L YIL+ G C+SGF + D+ G
Sbjct: 341 IDCNKTSTLPDLILTIARKDFVLPASSYILQYGS----TCVSGFTSLHEGGQDVSSVDGV 396
Query: 266 ---PLWILGDVFMGVYHTVFDSGKLRIGFA 292
LWILGDVF+G ++ FD G R+ A
Sbjct: 397 DYSNLWILGDVFLGSFYVEFDVGYKRVALA 426
>gi|395730069|ref|XP_003775659.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Pongo abelii]
Length = 390
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 57/88 (64%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++QG +++ VFSF+ P + GGE++ GGVDP+ + + PV+ + YWQ + +
Sbjct: 200 MLQQGQLTQPVFSFYFTCQPTCQYGGELILGGVDPQLYSDXVIWTPVSLELYWQIAIEEF 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGP 88
IGNQ+TG+C GC AIVDSGTS P
Sbjct: 260 AIGNQATGLCSDGCQAIVDSGTSXAGIP 287
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
G+ ++DC I +MP ++F IG F L P Y+ C G A LP G P
Sbjct: 306 GDFVVDCSYIQSMPTITFIIGRAQFPLPPSAYVFNNNG----YCRLGTEATYLPSHSGQP 361
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LW+LGDVF Y++V+D R+GFA +A
Sbjct: 362 LWVLGDVFFKGYYSVYDMANNRVGFAFSA 390
>gi|118572685|sp|P00791.3|PEPA_PIG RecName: Full=Pepsin A; Flags: Precursor
Length = 385
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 201 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 258
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 259 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS 307
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 260 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 317
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 318 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 373
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 374 RANNKVGLAPVA 385
>gi|164604|gb|AAA31096.1| pepsinogen A precursor [Sus scrofa]
Length = 385
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 201 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 258
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 259 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSYGEMVIS 307
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 260 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSYGEMVISCSSIDSLPDI 317
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 318 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 373
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 374 RANNKVGLAPVA 385
>gi|253723333|pdb|4PEP|A Chain A, The Molecular And Crystal Structures Of Monoclinic Porcine
Pepsin Refined At 1.8 Angstroms Resolution
Length = 326
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 142 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 200 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS 248
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 258
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTIDGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|13096225|pdb|1F34|A Chain A, Crystal Structure Of Ascaris Pepsin Inhibitor-3 Bound To
Porcine Pepsin
Length = 326
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +VPV+ +GYWQ L I +
Sbjct: 142 DQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
+ T C GGC AIVD+GTSLL GPT + I IG GE V+S
Sbjct: 200 DGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVIS 248
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 258
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 284 SGKLRIGFAEAA 295
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|194745302|ref|XP_001955127.1| GF16402 [Drosophila ananassae]
gi|190628164|gb|EDV43688.1| GF16402 [Drosophila ananassae]
Length = 300
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 60/101 (59%), Gaps = 3/101 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +Q LV + +FSF+L R+ +GGE++FGGVD F G TYVP+T GYWQF++ +
Sbjct: 114 MCDQHLVDQCLFSFYLKRNGSERKGGELLFGGVDSTKFSGTLTYVPLTHAGYWQFQMDGV 173
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
IG V AI D+GTSLLA P IN +GG
Sbjct: 174 EIGGT---VISRHRQAIADTGTSLLAAPPREYLIINSLLGG 211
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 56/101 (55%), Gaps = 6/101 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 254
IN L LP E +++C + +P + F IG + F L P+ Y+++ T + VC+S
Sbjct: 205 INSLLGGLPTTNNEYLLNCSGLDKLPEIVFIIGGQRFGLQPKDYVMQATDDDGTSVCLSA 264
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
F D WILGDVF+G Y+T FD G +IGFA AA
Sbjct: 265 FTLMD-----AEFWILGDVFIGRYYTAFDVGHRQIGFAPAA 300
>gi|301786579|ref|XP_002928698.1| PREDICTED: pepsin F-like [Ailuropoda melanoleuca]
Length = 388
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 65/106 (61%), Gaps = 3/106 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGL+SEE+F+F+L++ EEG ++FGGVD ++ G+ +VPV++ YWQ + I +
Sbjct: 203 RQGLISEELFAFYLSKK--EEEGSVVMFGGVDHSYYSGELNWVPVSRPFYWQLSMDSISM 260
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
N C+ GC AI+D+GTSLL GP+ +V I I S E
Sbjct: 261 -NGVVIACDSGCEAIIDTGTSLLIGPSDIVFNIQKIINANQSYSGE 305
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 52/99 (52%), Gaps = 3/99 (3%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I ++ ++ + GE IIDCD + T+P++ FTI + + YI + +GI C S F
Sbjct: 292 IQKIINANQSYSGEYIIDCDAVNTLPDIVFTINGINYPVPASAYIREGHQGI---CYSNF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +Y TVFD G IG A A
Sbjct: 349 DEGAGGSFLSDSWILGDVFLRLYFTVFDRGNSTIGLAPA 387
>gi|431910410|gb|ELK13483.1| Pepsin F [Pteropus alecto]
Length = 487
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 68/106 (64%), Gaps = 3/106 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGL+S+ +F+F+L+ + ++ ++FGGVDP ++KG +VPVT++G+WQ + + +
Sbjct: 304 KQGLISQNIFAFYLSSN--NKKRSVVLFGGVDPSYYKGDLKWVPVTQRGFWQISMDSVSM 361
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
N C+ GC AIVD+GTSL+ GP + EI I + + S E
Sbjct: 362 -NGKIIACKSGCQAIVDTGTSLVIGPNDPILEILKTINAQPISSGE 406
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I + ++ P GE I++C + T+P+ FTI + + YI K +++C S F
Sbjct: 393 ILKTINAQPISSGEYIVNCSSVDTLPDTIFTINGIKYPVPASAYIQKYN---SDICYSNF 449
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
+WILGDVF+ +Y TVFD RIG A
Sbjct: 450 EGM---VANSDVWILGDVFLRLYFTVFDRANDRIGLA 483
>gi|195438439|ref|XP_002067144.1| GK24836 [Drosophila willistoni]
gi|194163229|gb|EDW78130.1| GK24836 [Drosophila willistoni]
Length = 372
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 56/88 (63%), Gaps = 3/88 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV Q LV VFSF+L RD A +GGE++FGG D + G TYVP++++GYWQFE+
Sbjct: 195 MVSQSLVDSSVFSFYLARDGSATDGGELIFGGSDASLYTGDLTYVPISEQGYWQFEMTSA 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGP 88
+ +C+ C AI D+GTSL+ P
Sbjct: 255 SFDGYT--LCD-DCQAIADTGTSLIVAP 279
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 13/100 (13%)
Query: 198 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF-- 255
E+ L N + ++DC + ++P+++F IG F L P YI+++ C+S F
Sbjct: 284 EILSELLNVDDDGLVDCSTVSSLPDLTFNIGGTDFTLKPSAYIIQSDGN----CMSAFEY 339
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
M D WILGDVF+G Y+T FD G RIGFA A
Sbjct: 340 MGTDF-------WILGDVFIGQYYTEFDLGNNRIGFAPVA 372
>gi|355329701|dbj|BAL14144.1| pepsinogen 1 [Pagrus major]
Length = 369
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 66/102 (64%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLV++++FS +L+ + +++G + FGGVDP ++ G T++P++ + YWQ + +
Sbjct: 188 MMTEGLVNQDLFSVYLSSN--SQQGSVVTFGGVDPNYYTGSITWIPLSNELYWQITVDSV 245
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ Q C GGC AIVD+GTSL+ GP ++ IN +G
Sbjct: 246 TVNGQVV-ACNGGCQAIVDTGTSLIVGPQSSISNINSYVGAS 286
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ ++C+ I MP+V+F I + F L Y+ ++ C +GF L
Sbjct: 290 GDYTVNCNSIAQMPDVTFYIHGQQFTLPASAYVHQSQY---YGCRTGF-----GNGGDSL 341
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++F + +G A A
Sbjct: 342 WILGDVFIRQYYSIFSRAQNMVGLARA 368
>gi|449299914|gb|EMC95927.1| hypothetical protein BAUCODRAFT_34686 [Baudoinia compniacensis UAMH
10762]
Length = 376
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 70/114 (61%), Gaps = 7/114 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGL+ E+VF+F+L+ D + + E FGG+D H++GK T +P+ +K YW+ +L I
Sbjct: 190 MINQGLIDEQVFAFYLS-DTNKGDESEATFGGIDESHYEGKMTKIPLRRKAYWEVDLDAI 248
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECK 110
G+Q+ + G AI+D+GTSL+A PT + +N IG + G + EC
Sbjct: 249 TFGDQTAEIDSTG--AILDTGTSLIALPTTLAELLNREIGAKKSYNGQYTIECN 300
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 55/87 (63%), Gaps = 4/87 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ I+C++ ++P+++FT+ F + P YIL+ + CIS FM FD+P P GPL
Sbjct: 293 GQYTIECNKRDSLPDLTFTLTGYNFTIGPYDYILE----VQGSCISSFMGFDIPEPAGPL 348
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
ILGD F+ +++V+D G +G A+A
Sbjct: 349 AILGDAFLRKWYSVYDLGNNAVGLAKA 375
>gi|281347482|gb|EFB23066.1| hypothetical protein PANDA_018737 [Ailuropoda melanoleuca]
Length = 270
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 65/106 (61%), Gaps = 3/106 (2%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGL+SEE+F+F+L++ EEG ++FGGVD ++ G+ +VPV++ YWQ + I +
Sbjct: 137 RQGLISEELFAFYLSKK--EEEGSVVMFGGVDHSYYSGELNWVPVSRPFYWQLSMDSISM 194
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
N C+ GC AI+D+GTSLL GP+ +V I I S E
Sbjct: 195 -NGVVIACDSGCEAIIDTGTSLLIGPSDIVFNIQKIINANQSYSGE 239
>gi|169731523|gb|ACA64894.1| progastricsin (predicted) [Callicebus moloch]
Length = 388
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 63/102 (61%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G ++ VFSF+L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MLQEGALTSPVFSFYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ A G E
Sbjct: 258 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYLSAFLEATGAE 299
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LS E + + G+ +++CD I ++P ++F I F L P YIL + +G
Sbjct: 286 QQYLSAFLEATGAEEDEYGQFLVNCDSIQSLPTLTFIINGVEFPLPPSSYIL-SNDGYCT 344
Query: 250 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
V + + PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 345 VGVEP--TYLSSQNSQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|195339961|ref|XP_002036585.1| GM18746 [Drosophila sechellia]
gi|194130465|gb|EDW52508.1| GM18746 [Drosophila sechellia]
Length = 392
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/120 (41%), Positives = 66/120 (55%), Gaps = 11/120 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGL+ E V SF+L R A GGE++ GG+D ++G TYVPV+ YWQF + I
Sbjct: 204 MVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTYVPVSVPAYWQFTVNTI 263
Query: 61 LIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----GEGVVSAECKLVVS 114
++ G+ C GC AI D+GTSL+A P +IN +G G G C V S
Sbjct: 264 ----KTNGILLCN-GCQAIADTGTSLIAVPLAAYRKINRQLGATDNGGGEAFVRCGRVSS 318
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 55/100 (55%), Gaps = 4/100 (4%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN + N GE+ + C R+ ++P V+ IG +F L+P YI+K + C+S F
Sbjct: 295 INRQLGATDNGGGEAFVRCGRVSSLPKVNLNIGGTVFTLAPRDYIVKVTQYGQTYCMSAF 354
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ WILGDVF+G ++TVFD G RIGFA A
Sbjct: 355 TYMEGL----SFWILGDVFIGKFYTVFDKGNERIGFARVA 390
>gi|301606850|ref|XP_002933027.1| PREDICTED: gastricsin isoform 3 [Xenopus (Silurana) tropicalis]
Length = 380
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++Q L++E VF F+L + + GGE+ FGGVD ++ G+ + PVT + YWQ +
Sbjct: 193 MLQQNLLNEPVFGFYLK---NTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGF 249
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I Q++G C GC IVD+GTSLL P + + IG +
Sbjct: 250 SINGQASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGAQ 291
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 266
GE ++ C I +P +SFTI F L P Y+L+ I C G M L G P
Sbjct: 296 GEYVVSCSSIQNLPTISFTISGVSFPLPPSAYVLQ----INGYCTIGIMPTYLSSQNGQP 351
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+WILGDVF+ Y++V+D G ++GFA AA
Sbjct: 352 MWILGDVFLRQYYSVYDLGNNQVGFASAA 380
>gi|407260952|ref|XP_003946102.1| PREDICTED: renin-1-like [Mus musculus]
Length = 400
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 65/104 (62%), Gaps = 3/104 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
++ QG++ EEVFS + NR GGE+V GG DP+H++G YV ++K WQ +
Sbjct: 206 ILSQGVLKEEVFSVYYNRKTKGSHLLGGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMK 265
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ +G+ ST +CE GCA +VD+G+S ++ PT + I A+G +
Sbjct: 266 GVSVGS-STLLCEEGCAVVVDTGSSFISAPTSSLKLIMQALGAK 308
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 314 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPYRRDKLCTLALHAMDIPPPTGPVW 373
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
+LG F+ ++T FD RIGFA A
Sbjct: 374 VLGATFIRKFYTEFDRHNNRIGFALA 399
>gi|49019527|emb|CAD80096.1| pepsin A2 [Trematomus bernacchii]
Length = 373
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 68/111 (61%), Gaps = 3/111 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++GLVS+ +FS +L+ + + E+VFGG+D H+ G+ ++P++ YWQ ++ +
Sbjct: 192 MVKEGLVSQPLFSVYLSSK--SAQDSEVVFGGIDSSHYTGQIIWIPLSSDTYWQIKMDSV 249
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 111
I Q T C GGC AI+D+GTS + GPT ++ +N +G E ++
Sbjct: 250 TINGQ-TVACAGGCQAIIDTGTSQIVGPTSDISNMNSWVGASTNQYGEARV 299
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 9/102 (8%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
+S +N + N GE+ ++C I +MP V+FT+ F + Y+ ++ G C
Sbjct: 280 ISNMNSWVGASTNQYGEARVNCQNIQSMPEVTFTLNGNAFTIPASAYVSQSSYG----CS 335
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+GF LWILGDVF+ Y+ +FDS +IG A++
Sbjct: 336 TGF-----GQSGQQLWILGDVFIRQYYAIFDSSTPKIGLAKS 372
>gi|301622166|ref|XP_002940408.1| PREDICTED: renin-like [Xenopus (Silurana) tropicalis]
Length = 371
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 53/87 (60%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
GE +DCD+I +P+VSF +G + L YIL+ + E+C F D+PPP GPL
Sbjct: 284 GEYTVDCDKISQLPDVSFHMGGNEYTLKGPAYILQQSQFGEEICSVAFTPLDIPPPVGPL 343
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILG F+G Y+T FD RIGFA +
Sbjct: 344 WILGASFIGQYYTEFDRRNNRIGFATS 370
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 71/130 (54%), Gaps = 9/130 (6%)
Query: 1 MVEQGLVSEEVFSFWLNR---DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 57
++ + ++ E+VFS + +R D + GGEI+ GG DP ++ G Y+ + K+GYW +
Sbjct: 176 IISEQVLQEDVFSVYYSRSYRDSHLKPGGEIILGGSDPSYYTGSFQYLNLEKEGYWHIRM 235
Query: 58 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----GEGVVSAECKLVV 113
+ IG + C+ GC+ +D+G + + GP V+ + AIG EG + +C +
Sbjct: 236 KGVSIGAEIL-FCKDGCSVAIDTGAAYITGPASSVSVLMKAIGATELAEGEYTVDCD-KI 293
Query: 114 SQYGDLIWDL 123
SQ D+ + +
Sbjct: 294 SQLPDVSFHM 303
>gi|431910409|gb|ELK13482.1| Pepsin A [Pteropus alecto]
Length = 386
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 64/100 (64%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D D+ G ++FGG+D ++ G +VP++ + YWQ + +++
Sbjct: 202 DQGLVSQDLFSVYLSSDDDS--GSVVIFGGIDSSYYSGSLNWVPLSSETYWQITVDSVIL 259
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C C AIVD+GTSLLAGPT ++ I IG
Sbjct: 260 DGEAIA-CSATCQAIVDTGTSLLAGPTTAISSIQKYIGAS 298
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 67/132 (50%), Gaps = 6/132 (4%)
Query: 164 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 223
G++ CSA A+V L T +S I + + N G+ ++ C +PN+
Sbjct: 261 GEAIACSATCQAIVDTGTSLLAGPTT--AISSIQKYIGASENSDGDMVVSCSAASELPNI 318
Query: 224 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
FTI + L YIL++ + VCISGF DLP G LWILGDVF+ Y TVFD
Sbjct: 319 IFTINGVQYPLPSSAYILESDD----VCISGFQGMDLPTSSGDLWILGDVFIRQYFTVFD 374
Query: 284 SGKLRIGFAEAA 295
++G A AA
Sbjct: 375 RANNQVGLASAA 386
>gi|55741443|ref|NP_001001536.2| pregnancy-associated glycoprotein 2 precursor [Sus scrofa]
gi|55247976|gb|AAV48824.1| pregnancy-associated glycoprotein 10 [Sus scrofa]
Length = 389
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 64/102 (62%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ +Q +SE VF+F+L+ D EEG ++FGGVD K++KG +VP+T+ YWQ L I
Sbjct: 202 LKKQDQISEPVFAFYLS--SDKEEGSVVMFGGVDKKYYKGDLKWVPLTQTSYWQIALDRI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ G C GC AIVD+GTS+L GP+ V +I+ I +
Sbjct: 260 TCRGRVIG-CPRGCQAIVDTGTSMLHGPSKAVAKIHSLINAK 300
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E ++ C+ +P++ FTI + + + + YI K C S F + +W
Sbjct: 305 EYVVPCNARKALPDIVFTINNVDYPVPAQAYIRKNANN--NRCYSTFEEIMDTLNQREIW 362
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGDVF+ +Y TV+D G+ RIG A+A
Sbjct: 363 ILGDVFLRLYFTVYDEGQNRIGLAQAT 389
>gi|344263771|ref|XP_003403969.1| PREDICTED: gastricsin-like [Loxodonta africana]
Length = 389
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 65/101 (64%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++ ++ +FSF+L+ +++GG ++ GGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MLQENALTSPIFSFYLSNYQGSQDGGALILGGVDDSLYSGQIYWAPVTRELYWQIGIEEF 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
LIG Q++G C GC AIVD+GTSLL P ++++ A G
Sbjct: 259 LIGGQASGWCSQGCQAIVDTGTSLLTVPQQYLSDLLQATGA 299
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 5/106 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LS + + ++ + G+ +++C+ I +P +F I L P YIL
Sbjct: 287 QQYLSDLLQATGAVEDEYGQFLVNCEDIQDLPTFTFIINGAQLPLPPSSYILDNNG---- 342
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C G M+ L G PLWILGDVF+ Y++VFD G R+GFA A
Sbjct: 343 YCTVGLMSTYLASQDGQPLWILGDVFLRSYYSVFDMGNNRVGFATA 388
>gi|297838681|ref|XP_002887222.1| hypothetical protein ARALYDRAFT_894690 [Arabidopsis lyrata subsp.
lyrata]
gi|297333063|gb|EFH63481.1| hypothetical protein ARALYDRAFT_894690 [Arabidopsis lyrata subsp.
lyrata]
Length = 242
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 66/114 (57%), Gaps = 14/114 (12%)
Query: 1 MVEQGLVSEEVFSFWLN--------RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 52
MV+Q L+++ +FSFWL DP+ GG+IVFGG DPKHF G+H YV +
Sbjct: 110 MVKQKLITKPIFSFWLRSYKGDGKGEDPN---GGQIVFGGFDPKHFHGEHAYVSSDDR-- 164
Query: 53 WQFELGDIL-IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 105
W+ ++ I G T +C+ C AIVDSGT+ ++GP V I+ AIG V
Sbjct: 165 WKIKMSKIYNNGKPGTNLCDLQCKAIVDSGTTDISGPELSVRSIDEAIGAHRVT 218
>gi|8896138|gb|AAF81254.1| pregnancy-associated glycoprotein 4 [Sus scrofa]
Length = 389
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 64/102 (62%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ +Q +SE VF+F+L+ D EEG ++FGGVD K++KG +VP+T+ YWQ L I
Sbjct: 202 LKKQDQISEPVFAFYLS--SDKEEGSVVMFGGVDKKYYKGDLKWVPLTQTSYWQIALDRI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ G C GC AIVD+GTS+L GP+ V +I+ I +
Sbjct: 260 TCRGRVIG-CPRGCQAIVDTGTSMLHGPSKAVAKIHSLINAK 300
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E ++ C+ +P++ FTI + + + + YI K C S F + +W
Sbjct: 305 EYVVPCNARKALPDIVFTINNVDYPVPAQAYIRKNANN--NRCYSTFEDIMDTLNQREIW 362
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGDVF+ +Y TV+D G+ RIG A+A
Sbjct: 363 ILGDVFLRLYFTVYDEGQNRIGLAQAT 389
>gi|109287596|emb|CAJ55260.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 72/132 (54%), Gaps = 10/132 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ + ++SE VFS + +R ++ GGEI+ GG DP H+ G YV +++GYW +L +
Sbjct: 206 IMSEKVLSENVFSVYYSRHSESNTGGEIILGGSDPSHYTGDFHYVSTSREGYWHVDLKGV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECKL------ 111
I N+ +C GC A +D+GTS ++GP ++ + IG +G +CK
Sbjct: 266 SIENKIV-LCHDGCTATIDTGTSFISGPASSISVLMETIGATLSDGDYVIDCKKINLLPD 324
Query: 112 VVSQYGDLIWDL 123
+ GD+ + L
Sbjct: 325 ITFHLGDMTYSL 336
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 54/87 (62%), Gaps = 2/87 (2%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +IDC +I +P+++F +GD ++LS Y+LK + C FMA D+PPP GPL
Sbjct: 310 GDYVIDCKKINLLPDITFHLGDMTYSLSSSTYVLKFSDETE--CTVAFMAVDIPPPLGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
W+LG F+ Y+ FD RIGFA +
Sbjct: 368 WLLGATFIKQYYIEFDRQNNRIGFATS 394
>gi|158257160|dbj|BAF84553.1| unnamed protein product [Homo sapiens]
Length = 388
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 68/116 (58%), Gaps = 7/116 (6%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
++ C GC AIVD+GTSLL GPT +T I IG +G + C + S
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPITNIQSDIGASENSDGDMVVSCSAISS 316
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|444706401|gb|ELW47743.1| Cathepsin E [Tupaia chinensis]
Length = 396
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 66/112 (58%), Gaps = 4/112 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +V++ +FS +++ D + G E++FGG D HF G ++PVTK+G+WQ L +
Sbjct: 202 MIVHNMVAQPMFSVYMSSDIENGTGSELIFGGYDCSHFSGSLNWIPVTKQGFWQIALDGV 261
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAEC 109
+G+ + C GC AIVD+GTS + GP + ++ AIG G+ EC
Sbjct: 262 QVGD-TMMFCSKGCQAIVDTGTSRIIGPLNKIERLHRAIGATLVNGIYFVEC 312
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 8/92 (8%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDK----IFNLSPEQYILKT-GEGIAEVCISGFMAFDLPP 262
G ++C + MPNV+F I + LSP Y+L+ G+G+ +C SGF
Sbjct: 306 GIYFVECVNLTVMPNVTFIISGVPYFFFYTLSPTAYVLQALGDGM-RLCSSGFEGLHFLT 364
Query: 263 PRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
P WILGDVF+ +++VFD G R+G A A
Sbjct: 365 E--PSWILGDVFLRQFYSVFDRGNNRVGLAPA 394
>gi|195578345|ref|XP_002079026.1| GD23736 [Drosophila simulans]
gi|194191035|gb|EDX04611.1| GD23736 [Drosophila simulans]
Length = 564
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 61/102 (59%), Gaps = 7/102 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGL+ E V SF+L R A GGE++ GG+D ++G TYVPV+ YWQF + I
Sbjct: 376 MVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTYVPVSVPAYWQFTVNTI 435
Query: 61 LIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
++ G+ C GC AI D+GTSL+A P +IN +G
Sbjct: 436 ----KTNGILLCN-GCQAIADTGTSLIAVPLAAYRKINRQLG 472
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 54/100 (54%), Gaps = 4/100 (4%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN + N GE+ + C RI +P V+ IG +F L+P YI+K + C+S F
Sbjct: 467 INRQLGATDNGGGEAFVRCGRISALPKVNLNIGGTVFTLAPRDYIVKVTQYGQTYCMSAF 526
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ WILGDVF+G ++TVFD G RIGFA A
Sbjct: 527 TYME----GLSFWILGDVFIGKFYTVFDKGNERIGFARVA 562
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 39/62 (62%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGL+ E V SF+L R A GGE++ GG+D ++G TYVPV+ YWQF +I
Sbjct: 204 MVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTYVPVSVPAYWQFTRFNI 263
Query: 61 LI 62
L
Sbjct: 264 LF 265
>gi|109287598|emb|CAJ55261.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 55/87 (63%), Gaps = 2/87 (2%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +IDC++I +P++SF +GD ++LS Y+LK + C F A D+PPPRGPL
Sbjct: 310 GDYVIDCNQINLLPDISFHLGDMTYSLSSSTYVLKYSDETE--CTVAFSAIDIPPPRGPL 367
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
W+LG F+ Y+ FD RIGFA +
Sbjct: 368 WLLGATFIKQYYIEFDRQNNRIGFATS 394
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 61/101 (60%), Gaps = 1/101 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ + ++SE VFS + +R ++ GGEI+ GG DP H+ G YV +++GYW +L +
Sbjct: 206 IMSEKVLSENVFSVYYSRHSESNTGGEIILGGSDPSHYTGDFHYVSTSREGYWHVDLKGV 265
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
I N+ +C GC A +D+GTS ++GP ++ + IG
Sbjct: 266 SIENK-IALCHDGCTATIDTGTSFISGPASSISVLMETIGA 305
>gi|331215715|ref|XP_003320537.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
gi|309299527|gb|EFP76118.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 430
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 58/87 (66%), Gaps = 4/87 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC RIP +P+++F G K F ++ E YIL+ ++ C+S F D+PP G L
Sbjct: 347 GQYTVDCSRIPELPDLTFNFGGKEFTITGEDYILQ----VSGTCVSAFTGLDMPPNIGEL 402
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WI+GDVF+ ++TV+D G+ +GFA+A
Sbjct: 403 WIVGDVFLRKWYTVYDWGRDAVGFAKA 429
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 68/122 (55%), Gaps = 15/122 (12%)
Query: 1 MVEQGLVSEEVFSFWL------NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 54
M+E G++ + VF+F+L DP+ GGE+VFGGVD H++G+ Y PV ++GYW+
Sbjct: 240 MMENGMLDDPVFAFYLGSAQGNKADPN---GGEVVFGGVDEAHYEGEIFYAPVRRRGYWE 296
Query: 55 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECK 110
EL + G + + G A +D+GTSL+A PT IN IG G + +C
Sbjct: 297 VELKSVKFGKEEMKLHNVGAA--IDTGTSLIALPTDTAEIINAEIGATKSWSGQYTVDCS 354
Query: 111 LV 112
+
Sbjct: 355 RI 356
>gi|18959216|ref|NP_579818.1| gastricsin precursor [Rattus norvegicus]
gi|129798|sp|P04073.1|PEPC_RAT RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|56881|emb|CAA28305.1| unnamed protein product [Rattus norvegicus]
gi|206083|gb|AAA41827.1| pepsinogen [Rattus norvegicus]
gi|149069457|gb|EDM18898.1| progastricsin (pepsinogen C) [Rattus norvegicus]
Length = 392
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +G +S+ +F +L GG+IVFGGVD + G+ T+VPVT++ YWQ + D
Sbjct: 202 MLGEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDKNLYTGEITWVPVTQELYWQITIDDF 260
Query: 61 LIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG+Q++G C GC IVD+GTSLL P ++E+ IG +
Sbjct: 261 LIGDQASGWCSSQGCQGIVDTGTSLLVMPAQYLSELLQTIGAQ 303
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 8/104 (7%)
Query: 195 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 251
Y++EL ++ GE + CD + ++P +SF + F LSP YI++ C
Sbjct: 292 YLSELLQTIGAQEGEYGEYFVSCDSVSSLPTLSFVLNGVQFPLSPSSYIIQE----DNFC 347
Query: 252 ISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ G + L G PLWILGDVF+ Y+ +FD G ++G A +
Sbjct: 348 MVGLESISLTSESGQPLWILGDVFLRSYYAIFDMGNNKVGLATS 391
>gi|317419142|emb|CBN81179.1| Pepsinogen 1 [Dicentrarchus labrax]
Length = 378
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 65/102 (63%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLV++++FS +L+ + +++G + FGGVD H+ G T++P++ + YWQ + +
Sbjct: 197 MMNEGLVNQDLFSVYLSSN--SQQGSVVTFGGVDSNHYHGSITWIPLSNELYWQITVDSV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ Q C GGC AIVD+GTSL+ GP ++ IN +G
Sbjct: 255 TVNGQVV-ACNGGCQAIVDTGTSLVVGPQSSISNINSMVGAS 295
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +++C+ I MP V+F I + F L Y+ ++ C +GF L
Sbjct: 299 GDYVVNCNNIAQMPAVTFHIHGQEFTLPASAYVRQSQY---FGCRTGF-----GNGGDSL 350
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+++F + +G A+A
Sbjct: 351 WILGDVFIRQYYSIFSRAQNMVGLAKA 377
>gi|34576991|gb|AAQ75735.1| pregnancy-associated glycoprotein 8 [Sus scrofa]
Length = 342
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 64/102 (62%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ +Q +SE VF+F+L+ D EEG ++FGGVD K++KG +VP+T+ YWQ L I
Sbjct: 155 LKKQDQISEPVFAFYLS--SDKEEGSVVMFGGVDKKYYKGDLKWVPLTQTSYWQIALDRI 212
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ G C GC AIVD+GTS+L GP+ V +I+ I +
Sbjct: 213 TCRGRVIG-CPRGCQAIVDTGTSMLHGPSKAVAKIHSLINAK 253
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E ++ C+ +P++ FTI + + + + YI K C S F + +W
Sbjct: 258 EYVVPCNARKALPDIVFTINNVDYPVPAQAYIRKNANN--NRCYSTFEDIMDTLNQREIW 315
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGDVF+ +Y TV+D G+ RIG A+A
Sbjct: 316 ILGDVFLRLYFTVYDEGQNRIGLAQAT 342
>gi|1066345|gb|AAA81531.1| pregnancy-associated glycoprotein [Sus scrofa]
Length = 387
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%), Gaps = 3/99 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ +Q +SE VF+F+L+ D EEG ++FGGVD K++KG +VP+T+ YWQ L I
Sbjct: 202 LKKQDQISEPVFAFYLS--SDKEEGSVVMFGGVDKKYYKGDLKWVPLTQTSYWQIALDRI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 99
+ G C GC AIVD+GTS+L GP+ V +I+ I
Sbjct: 260 TCRGRVIG-CPRGCQAIVDTGTSMLHGPSKAVAKIHSLI 297
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E ++ C+ +P++ FTI + + + + YI K C S F + +W
Sbjct: 303 EYVVPCNARKALPDIVFTINNVDYPVPAQAYIRKNANN--NRCYSTFEDIMDTLNQREIW 360
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGDVF+ +Y TV+D G+ RIG A+A
Sbjct: 361 ILGDVFLRLYFTVYDEGQNRIGLAQAT 387
>gi|326933645|ref|XP_003212911.1| PREDICTED: embryonic pepsinogen-like [Meleagris gallopavo]
Length = 383
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 68/118 (57%), Gaps = 8/118 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV + L+ + +FS +L+R+ G ++FGG+D +F G ++PV+ +GYWQ + I
Sbjct: 202 MVNESLLEQNLFSVYLSRE---TMGSMVIFGGIDDSYFTGSINWIPVSYQGYWQISMDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
++ Q C GC AI+D+GTSL+AGP + +I A+G G + C V++
Sbjct: 259 IVNKQEIA-CSSGCQAIIDTGTSLVAGPASDINDIQSAVGANQNSYGEYNVNCSHVLA 315
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 8/90 (8%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N GE ++C + MP+V F IG + + Y + +G C+S F
Sbjct: 301 NSYGEYNVNCSHVLAMPDVVFVIGGIQYPVPALAYTEQNDQG---TCMSSFQN-----SS 352
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
LWILGDVF+ VY+++FD R+G A+A
Sbjct: 353 ADLWILGDVFIRVYYSIFDRANNRVGLAKA 382
>gi|2499826|sp|Q29079.1|PAG2_PIG RecName: Full=Pregnancy-associated glycoprotein 2; Short=PAG 2;
Flags: Precursor
gi|1223879|gb|AAA92055.1| pregnancy-associated glycoprotein [Sus scrofa]
Length = 420
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%), Gaps = 3/99 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ +Q +SE VF+F+L+ D EEG ++FGGVD K++KG +VP+T+ YWQ L I
Sbjct: 202 LKKQDQISEPVFAFYLS--SDKEEGSVVMFGGVDKKYYKGDLKWVPLTQTSYWQIALDRI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 99
+ G C GC AIVD+GTS+L GP+ V +I+ I
Sbjct: 260 TCRGRVIG-CPRGCQAIVDTGTSMLHGPSKAVAKIHSLI 297
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 45/85 (52%), Gaps = 2/85 (2%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
++ C+ +P++ FTI + + + + YI K C S F + +WIL
Sbjct: 338 VVPCNARKALPDIVFTINNVDYPVPAQAYIRKNANN--NRCYSTFEDIMDTLNQREIWIL 395
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEAA 295
GDVF+ +Y TV+D G+ RIG A+A
Sbjct: 396 GDVFLRLYFTVYDEGQNRIGLAQAT 420
>gi|392975293|gb|AFM95213.1| hypothetical protein, partial [Cynara cardunculus var. scolymus]
Length = 56
Score = 86.3 bits (212), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/54 (66%), Positives = 42/54 (77%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 54
M+ QGLV E FSFWLN + D EEGGE+VFGG+D HF+G HTYVPVT + YWQ
Sbjct: 3 MLNQGLVKERRFSFWLNGNVDEEEGGELVFGGLDHNHFRGDHTYVPVTYQYYWQ 56
>gi|195501958|ref|XP_002098019.1| GE10129 [Drosophila yakuba]
gi|194184120|gb|EDW97731.1| GE10129 [Drosophila yakuba]
Length = 396
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 63/119 (52%), Gaps = 3/119 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M EQ LV E VFSF+L R+ GGE++FGG+D F G TYVP+T YWQF L I
Sbjct: 210 MCEQQLVDECVFSFYLKRNGSERMGGELLFGGLDKTKFSGTLTYVPLTHAAYWQFPLDAI 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+G + AI D+GTSLLA P IN +GG + E L S+ L
Sbjct: 270 EVGGTAISHHR---QAIADTGTSLLAAPPREYLIINSLLGGLPTANNEYLLNCSEIDSL 325
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 58/101 (57%), Gaps = 6/101 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 254
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 301 INSLLGGLPTANNEYLLNCSEIDSLPEIVFIIGGQRFGLQPRDYVMSATNDDGSRICLSA 360
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
F + WILGDVF+G Y+T FD+G+ RIGFA AA
Sbjct: 361 FTLME-----AEFWILGDVFIGRYYTAFDAGQRRIGFAPAA 396
>gi|332267172|ref|XP_003282561.1| PREDICTED: pepsin A-5 [Nomascus leucogenys]
Length = 372
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 188 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYSGSLNWVPVTVEGYWQITVDSITM 245
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 246 -NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 284
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 286 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 341
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 342 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 372
>gi|426368717|ref|XP_004051349.1| PREDICTED: pepsin A-5-like isoform 2 [Gorilla gorilla gorilla]
Length = 388
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFEGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAAVA 388
>gi|426368715|ref|XP_004051348.1| PREDICTED: pepsin A-5-like isoform 1 [Gorilla gorilla gorilla]
Length = 388
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAAVA 388
>gi|311621972|gb|ADP95542.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 222
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 61/95 (64%), Gaps = 1/95 (1%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ I +P GE + C +IP++PN++FT+G + F+L + YIL+ G
Sbjct: 128 SDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDLQGKDYILQLPNGNGG 187
Query: 250 V-CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 283
+ C+SGFM D+P P GPLWILGDVF+G +++VFD
Sbjct: 188 MTCLSGFMGMDIPAPAGPLWILGDVFIGRFYSVFD 222
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+ ++F+FWL+RD D GGEI D H+ G + P+ + YW+ +LG
Sbjct: 41 FANSAICPNQLFAFWLSRDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGA 100
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 115
+ I T G +IVD+GTSLL GP+ ++ +I H IGG +++ E ++V S+
Sbjct: 101 VSI--SGTTYTNGPMDSIVDTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSK 154
>gi|195997415|ref|XP_002108576.1| hypothetical protein TRIADDRAFT_51616 [Trichoplax adhaerens]
gi|190589352|gb|EDV29374.1| hypothetical protein TRIADDRAFT_51616 [Trichoplax adhaerens]
Length = 362
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/121 (42%), Positives = 73/121 (60%), Gaps = 13/121 (10%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL--G 58
M+ QGLV + +FSF++NR+ + GE++ GG D K+++GK YVPV+ G WQF + G
Sbjct: 168 MILQGLV-KPIFSFYINREKEGPIAGELILGGSDSKYYRGKLNYVPVSVIGLWQFAIDGG 226
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-----GVVSAECKLVV 113
I G C GGC AI D+G+SL+ GP + I +AIG + GVV +CK +
Sbjct: 227 KIASGES---FCIGGCQAIADTGSSLIFGPPKDIIPILYAIGADPRDPNGVV--DCKTIS 281
Query: 114 S 114
S
Sbjct: 282 S 282
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 52/84 (61%), Gaps = 2/84 (2%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
++DC I ++P ++FTI ++LS EQYIL+ + + C SGF ++ P+ + L
Sbjct: 274 VVDCKTISSLPIITFTINRVDYSLSGEQYILQLTQDGRKECFSGFAFYN--GPQNSVLTL 331
Query: 271 GDVFMGVYHTVFDSGKLRIGFAEA 294
GD+F+ Y+T FD G R+GFA+
Sbjct: 332 GDIFLHAYYTEFDMGLNRVGFAKT 355
>gi|313220508|emb|CBY31359.1| unnamed protein product [Oikopleura dioica]
gi|313229843|emb|CBY07548.1| unnamed protein product [Oikopleura dioica]
Length = 397
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 58/102 (56%)
Query: 191 KVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 250
++ IN+ ++ GE ++ C P MP + I D + ++P+ Y+L +
Sbjct: 290 EITDQINKAIGAIEFIAGEWLVICRNKPRMPTIDIYIDDVRYRMTPDDYVLTIEDQGQTQ 349
Query: 251 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 292
CIS FM D+P P GPLWILGD FMG+ +TVFD R+GFA
Sbjct: 350 CISAFMGLDIPEPAGPLWILGDAFMGMKYTVFDFDTNRVGFA 391
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 77/126 (61%), Gaps = 15/126 (11%)
Query: 1 MVEQGLVSEEVFSFWLNRDP----DAEEGGEIVFGGVDPKHFKG----KHTYVPVTKKGY 52
++E+G V+ VF+F+++R+ +++ GGEI +GGV+P+ F+G + V+++ Y
Sbjct: 188 LMEEGQVNG-VFAFFVHRNSSKPGESDIGGEIAWGGVNPERFEGTFPDSFIWHEVSRQAY 246
Query: 53 WQFELG------DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 106
WQ +G D + +Q +CEGGC IVDSGTSL+ GPT + +IN AIG ++
Sbjct: 247 WQVNMGTVTVNGDGFVSDQPIVMCEGGCQGIVDSGTSLITGPTEITDQINKAIGAIEFIA 306
Query: 107 AECKLV 112
E ++
Sbjct: 307 GEWLVI 312
>gi|397516556|ref|XP_003846117.1| PREDICTED: LOW QUALITY PROTEIN: pepsin A-5, partial [Pan paniscus]
Length = 274
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 131 NQGLVSQDLFSVYLSADD--KSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 188
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 189 -NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 227
>gi|12248414|dbj|BAB20092.1| pepsinogen A [Rana catesbeiana]
Length = 385
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 62/102 (60%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M QGL+ +++FS +L+ + G ++FGGVD ++ G +VP+T + YWQ + I
Sbjct: 199 MWNQGLIPQDLFSVYLSSQ--GQSGSFVLFGGVDTSYYTGNLNWVPLTAETYWQITVDSI 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
IG Q C G C+AIVD+GTSLLAGP+ + I + IG
Sbjct: 257 SIGGQVIA-CSGSCSAIVDTGTSLLAGPSTPIANIQYYIGAN 297
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 4/88 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +I+C+ I MP V FTI + L Y+ ++ + C SGF A +LP G L
Sbjct: 302 GQYVINCNNISNMPTVVFTINGVQYPLPASAYVRQSQQS----CTSGFQAMNLPTSSGDL 357
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+ Y+ VFD + A A
Sbjct: 358 WILGDVFIREYYVVFDRANNYVAMAPVA 385
>gi|195339269|ref|XP_002036242.1| GM12869 [Drosophila sechellia]
gi|194130122|gb|EDW52165.1| GM12869 [Drosophila sechellia]
Length = 372
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 56/88 (63%), Gaps = 3/88 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV VFSF+L RD + GGE++FGG D + G TYVP++++GYWQF +
Sbjct: 195 MVSQGLVDNSVFSFYLARDGTSTMGGELIFGGSDSSLYSGALTYVPISEQGYWQFTMAGS 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGP 88
I S +C+ C AI D+GTSL+ P
Sbjct: 255 SIDGYS--LCD-DCQAIADTGTSLIVAP 279
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 49/86 (56%), Gaps = 13/86 (15%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF--MAFDLPPPRGPLWI 269
+DC + ++P+V+F IG F L P YI+++ + C+S F M D WI
Sbjct: 298 LDCSSVSSLPDVTFNIGGTNFVLKPSAYIIQSDDS----CMSAFEYMGTDF-------WI 346
Query: 270 LGDVFMGVYHTVFDSGKLRIGFAEAA 295
LGDVF+G Y+T FD G RIGFA A
Sbjct: 347 LGDVFIGQYYTEFDLGNNRIGFAPVA 372
>gi|281183192|ref|NP_001162218.1| gastricsin precursor [Papio anubis]
gi|157939796|gb|ABW05535.1| progastricsin (predicted) [Papio anubis]
Length = 388
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 258 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYLSALLQATGAQ 299
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LS + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYLSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILNNNG---- 341
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G L PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 342 YCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFATAA 388
>gi|20129385|ref|NP_609235.1| beta-site APP-cleaving enzyme [Drosophila melanogaster]
gi|7297427|gb|AAF52686.1| beta-site APP-cleaving enzyme [Drosophila melanogaster]
gi|54650878|gb|AAV37018.1| GH11417p [Drosophila melanogaster]
gi|220951556|gb|ACL88321.1| CG13095-PA [synthetic construct]
gi|220959834|gb|ACL92460.1| CG13095-PA [synthetic construct]
Length = 372
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 56/88 (63%), Gaps = 3/88 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV QGLV VFSF+L RD + GGE++FGG D + G TYVP++++GYWQF +
Sbjct: 195 MVSQGLVDNSVFSFYLARDGTSTMGGELIFGGSDASLYSGALTYVPISEQGYWQFTMAGS 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGP 88
I S +C+ C AI D+GTSL+ P
Sbjct: 255 SIDGYS--LCD-DCQAIADTGTSLIVAP 279
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 48/86 (55%), Gaps = 13/86 (15%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF--MAFDLPPPRGPLWI 269
+DC + ++P+V+F IG F L P YI+++ C+S F M D WI
Sbjct: 298 LDCSSVSSLPDVTFNIGGTNFVLKPSAYIIQSDGN----CMSAFEYMGTDF-------WI 346
Query: 270 LGDVFMGVYHTVFDSGKLRIGFAEAA 295
LGDVF+G Y+T FD G RIGFA A
Sbjct: 347 LGDVFIGQYYTEFDLGNNRIGFAPVA 372
>gi|355561685|gb|EHH18317.1| hypothetical protein EGK_14890 [Macaca mulatta]
gi|355748551|gb|EHH53034.1| hypothetical protein EGM_13592 [Macaca fascicularis]
Length = 388
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 258 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILNNNG---- 341
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G L PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 342 YCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFATAA 388
>gi|129797|sp|P03955.2|PEPC_MACFU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|38073|emb|CAA42426.1| pepsinogen C [Macaca fuscata]
Length = 377
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 188 MVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 246
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 247 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 288
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 275 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILNNNG---- 330
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G L PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 331 YCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFATAA 377
>gi|410045159|ref|XP_001145764.3| PREDICTED: pepsin A-5 isoform 1 [Pan troglodytes]
Length = 434
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGF 291
G LWILGDVF+ Y TVFD ++G
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNKVGL 384
>gi|1246038|gb|AAB35842.1| pepsinogen A [turtles, Peptide, 361 aa]
Length = 361
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 64/102 (62%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GLVS+++FS +L+ D + G ++FGG D ++ G ++P++ + YW+ + I
Sbjct: 185 MMSEGLVSQDLFSVYLSSD--EQSGSFVMFGGNDTSYYSGSLNWIPLSAETYWEITMDSI 242
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ Q T C GGC AI+D+GTSLLAGP V+ IN IG
Sbjct: 243 TMNGQ-TIACSGGCQAIIDTGTSLLAGPPSDVSNINSYIGAS 283
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 9/84 (10%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
+ C + ++P + F I F + YI+ + S LWILG
Sbjct: 287 VSCSSMSSLPEIVFNINGIAFPVPASAYIINDSSSCSSSFESMDQG---------LWILG 337
Query: 272 DVFMGVYHTVFDSGKLRIGFAEAA 295
DVF+ +Y+ VFD ++G A A
Sbjct: 338 DVFIRLYYVVFDRANNQVGLASLA 361
>gi|222425186|dbj|BAH20542.1| pepsinogen A-35 [Pongo abelii]
Length = 388
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|8896140|gb|AAF81255.1| pregnancy-associated glycoprotein 6 [Sus scrofa]
Length = 389
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 64/102 (62%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ +Q +SE VF+F+L+ D EEG ++FGG+D K++KG +VP+T+ YWQ L I
Sbjct: 202 LKKQDQISEPVFAFYLS--SDKEEGSVVMFGGLDKKYYKGDLKWVPLTQTSYWQIALDRI 259
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ G C GC AIVD+GTS+L GP+ V +I+ I +
Sbjct: 260 TCRGRVIG-CPRGCQAIVDTGTSMLHGPSKAVAKIHSLINAK 300
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E ++ C+ +P++ FTI + + + + YI K C S F + +W
Sbjct: 305 EYVVPCNARKALPDIVFTINNVDYPVPAQAYIRKNANN--NRCYSTFEDIMDTLNQREIW 362
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEAA 295
ILGDVF+ +Y TV+D G+ RIG A+A
Sbjct: 363 ILGDVFLRLYFTVYDEGQNRIGLAQAT 389
>gi|24647683|ref|NP_650623.1| CG5863 [Drosophila melanogaster]
gi|7300255|gb|AAF55418.1| CG5863 [Drosophila melanogaster]
Length = 395
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 63/119 (52%), Gaps = 3/119 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +Q LV E VFSF+L R+ +GGE++FGGVD F G TYVP+T GYWQF L I
Sbjct: 209 MCDQQLVDECVFSFYLKRNGSERKGGELLFGGVDKTKFSGSLTYVPLTHAGYWQFPLDVI 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ AI D+GTSLLA P IN +GG + E L S+ L
Sbjct: 269 EVAGTRINQNR---QAIADTGTSLLAAPPREYLIINSLLGGLPTSNNEYLLNCSEIDSL 324
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 58/101 (57%), Gaps = 6/101 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 254
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGQRFGLQPRDYVMSATNDDGSSICLSA 359
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
F D WILGDVF+G Y+T FD+G+ RIGFA AA
Sbjct: 360 FTLMD-----AEFWILGDVFIGRYYTAFDAGQRRIGFAPAA 395
>gi|222425180|dbj|BAH20539.1| pepsinogen A-43 [Pongo abelii]
Length = 388
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|222425184|dbj|BAH20541.1| pepsinogen A-14 [Pongo abelii]
Length = 388
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSNGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|358057753|dbj|GAA96408.1| hypothetical protein E5Q_03075 [Mixia osmundae IAM 14324]
Length = 453
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 65/120 (54%), Gaps = 8/120 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAE--EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 58
M+ GLV + FSFWL D E GGE V GG DP H+KGK + PV +KGYW+ EL
Sbjct: 264 MIADGLVDKAEFSFWLGDTADGEGAPGGEFVMGGTDPAHYKGKIQWAPVRRKGYWEVELS 323
Query: 59 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
I G + G A +D+GTSL+A P+ + +N IG + G + +C + S
Sbjct: 324 KIKFGKDELELESTGAA--IDTGTSLIALPSDLAELLNKEIGAKKSWNGQYTVDCAAIPS 381
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 47/85 (55%), Gaps = 4/85 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +DC IP++P+++ + + ++ YIL+ CIS F D P GP+
Sbjct: 370 GQYTVDCAAIPSLPDLTMYFAGEPYTITGADYILQA----QGTCISAFTGLDFPESIGPI 425
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFA 292
WI+GDVF+ + TV+ K +GFA
Sbjct: 426 WIVGDVFLRRFFTVYSLEKDAVGFA 450
>gi|302696543|ref|XP_003037950.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
gi|300111647|gb|EFJ03048.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
Length = 406
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/119 (41%), Positives = 71/119 (59%), Gaps = 4/119 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ +GL+ E VFSF L + E+GGE +FGGVD +KG TYVPV +K YW+ EL I
Sbjct: 221 MINKGLLDEPVFSFRLGKS--EEDGGEAIFGGVDKSAYKGDLTYVPVRRKAYWEVELEKI 278
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
G++ + G A +D+GTSL+A PT + IN IG + + + ++ S+ DL
Sbjct: 279 SFGSEELELESTGAA--IDTGTSLIALPTDMAEMINAEIGAKKSWNGQYQVECSKVPDL 335
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ ++C ++P +P +S G K + L YIL+ + CIS F D+ P G L
Sbjct: 323 GQYQVECSKVPDLPELSLYFGGKPYTLKGTDYILE----VQGTCISSFTGLDINVPGGSL 378
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WI+GDVF+ Y+TV+D G+ +GFAEA
Sbjct: 379 WIIGDVFLRKYYTVYDLGRDAVGFAEA 405
>gi|47211933|emb|CAF92442.1| unnamed protein product [Tetraodon nigroviridis]
Length = 304
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 69/114 (60%), Gaps = 7/114 (6%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGLVS +FS +L+ + EG I+FGG DP+H+ T++P++ + YWQ + I
Sbjct: 126 MMSQGLVSMHLFSVYLSSN---LEGSVIIFGGTDPEHYHEPITWIPLSSQLYWQIPIDSI 182
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 114
I ++ G C GGC A+VD+GTSL+ GP V +I IG GV E V+S
Sbjct: 183 TINGKAVG-CIGGCQAMVDTGTSLIIGPMWSVRQIIFGIGARGV---EGDFVIS 232
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 45/87 (51%), Gaps = 9/87 (10%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ +I CD MPNV+F I + F L P Y ++ C GF P +
Sbjct: 227 GDFVISCD-FDVMPNVTFHIQGQEFPLPPSAYTIQFP---TYGCRPGFA-----PTYENM 277
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ Y+T+F + R+G A+A
Sbjct: 278 WILGDVFIRHYYTIFSRSENRLGLAKA 304
>gi|224458278|ref|NP_001138942.1| pepsinogen A precursor [Pongo abelii]
gi|222425178|dbj|BAH20538.1| pepsinogen A-75 [Pongo abelii]
Length = 388
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|23943854|ref|NP_055039.1| pepsin A-5 preproprotein [Homo sapiens]
gi|378522017|sp|P0DJD9.1|PEPA5_HUMAN RecName: Full=Pepsin A-5; AltName: Full=Pepsinogen-5; Flags:
Precursor
gi|20810074|gb|AAH29055.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|119594334|gb|EAW73928.1| pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|219520836|gb|AAI71889.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|223461673|gb|AAI47000.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|351707910|gb|EHB10829.1| Gastricsin [Heterocephalus glaber]
Length = 391
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 63/102 (61%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ +G +S+ +FS +L + +GG ++ GGVD + G+ ++ PVT++ YWQ + D+
Sbjct: 201 LIREGALSQPLFSVYLGSQQGSSDGGALILGGVDESLYNGQISWTPVTQELYWQIGIEDV 260
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ NQ+ G C GC IVD+GTSLL P +T + AIG +
Sbjct: 261 QLDNQALGWCSQGCQGIVDTGTSLLTLPQQYLTTLIQAIGAQ 302
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ L+ + + + N GE ++DC+ I ++P ++ + F L P YIL+ +
Sbjct: 289 QQYLTTLIQAIGAQENEFGEYVVDCNSIQSLPTLTVILSGVKFPLLPSAYILQEDQ---- 344
Query: 250 VCISGFMA-FDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+ G A + PLWILGDVF+ Y++VFD G R+GFA A
Sbjct: 345 YCMVGLSATYLYSESSQPLWILGDVFLRSYYSVFDLGNNRVGFAPA 390
>gi|219521036|gb|AAI71897.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 -NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRKYFTVFDRANNQVGLAPVA 388
>gi|301344561|gb|ADK74002.1| pepsinogen A1 precursor [Lateolabrax japonicus]
Length = 370
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 66/102 (64%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++GLV++++FS +L+ + +++G + FGG+DP H+ G T++P++ + YWQ + +
Sbjct: 197 MMKEGLVNQDLFSVYLSSN--SQQGSVVTFGGIDPNHYYGPITWIPLSNELYWQITVDSV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ Q C GC AIVD+GTSL+ GP ++ IN +G
Sbjct: 255 TVNGQVV-ACSDGCQAIVDTGTSLIVGPQGSISSINSGVGAS 295
>gi|194759276|ref|XP_001961875.1| GF15194 [Drosophila ananassae]
gi|190615572|gb|EDV31096.1| GF15194 [Drosophila ananassae]
Length = 411
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 75/130 (57%), Gaps = 11/130 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
+++QGL+ + VFS +LNR+ A GGE++ GG+D F+G TYVPV+ +GYWQF +G
Sbjct: 225 LIQQGLIKKPVFSIYLNRNGTSAITGGELILGGIDSGLFRGCLTYVPVSTQGYWQFTMGS 284
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI------GGEGVVSAECKLVV 113
I T C C AI+D GTSL+ P +T++N + GV +C +
Sbjct: 285 ADI--NGTNFCN-KCEAILDVGTSLMVIPQNALTKVNQILEVLNPKETNGVFLVDCA-KI 340
Query: 114 SQYGDLIWDL 123
+ D+I+++
Sbjct: 341 PNFPDIIFNI 350
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 66/129 (51%), Gaps = 15/129 (11%)
Query: 169 CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM---GESIIDCDRIPTMPNVSF 225
C+ CE A++ V L + L+ +N++ + L NP G ++DC +IP P++ F
Sbjct: 293 CNKCE-AILDVGTSLMV--IPQNALTKVNQILEVL-NPKETNGVFLVDCAKIPNFPDIIF 348
Query: 226 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 285
IG K F L Y+L+ G C+S F + + ILG+ FMG Y+TV+D G
Sbjct: 349 NIGRKDFPLKSSDYVLRYG----NTCVSSFTSLE----GLDFIILGEPFMGAYYTVYDLG 400
Query: 286 KLRIGFAEA 294
+G A A
Sbjct: 401 YNMLGLAPA 409
>gi|363734016|ref|XP_003641326.1| PREDICTED: pepsin A-like, partial [Gallus gallus]
Length = 265
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 70/117 (59%), Gaps = 7/117 (5%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ QGL+++++FS +L PD + G ++FGG+D HF G +++P+T + YWQ ++ I
Sbjct: 80 MMSQGLLAQDLFSIYLT--PDKQNGSFVLFGGIDDAHFTGNLSWIPLTAQTYWQIKVDRI 137
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVV 113
+ C GC AI+DSGTS+LAGP + I++ +G G+ + C V+
Sbjct: 138 TMHGLPIA-CTHGCQAILDSGTSMLAGPGISIRRIHYKMGATRSPSGLHTVRCGSVL 193
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 49/91 (53%), Gaps = 5/91 (5%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
+P G + C + +P++ F I F L + Y+L+ +G C+SGF A+ LP
Sbjct: 180 SPSGLHTVRCGSV--LPDIVFVIAGTPFPLPSQFYVLQMEDG---SCMSGFEAYALPTAA 234
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
LWILG +F+ Y++VFD +G A AA
Sbjct: 235 DELWILGHIFLRRYYSVFDRANSMVGLAPAA 265
>gi|296198131|ref|XP_002746573.1| PREDICTED: gastricsin [Callithrix jacchus]
gi|18203304|sp|Q9N2D3.1|PEPC_CALJA RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|7008023|dbj|BAA90872.1| pepsinogen C [Callithrix jacchus]
Length = 388
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 63/102 (61%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+++G ++ VFSF+L+ GG ++FGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MLQEGALTSPVFSFYLSNQ-QGSSGGAVIFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ A G +
Sbjct: 258 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSAFLEATGAQ 299
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S E + + G+ +++CD I +P ++F I F L P YIL
Sbjct: 286 QQYMSAFLEATGAQEDEYGQFLVNCDSIQNLPTLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G L PLWILGDVF+ Y++VFD G R+GFA AA
Sbjct: 342 YCTVGVEPTYLSSQNSQPLWILGDVFLRSYYSVFDLGNNRVGFATAA 388
>gi|354497176|ref|XP_003510697.1| PREDICTED: chymosin-like [Cricetulus griseus]
gi|344243543|gb|EGV99646.1| Chymosin [Cricetulus griseus]
Length = 379
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 14/135 (10%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFVGSLHWVPVTVQGYWQFTVDRI 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
I N C+GGC A++D+GT+LLAGP + I AIG V QYG
Sbjct: 257 TI-NDEVVACQGGCTAVLDTGTALLAGPGRDILNIQQAIGA----------VQGQYGQFK 305
Query: 121 WDLLVSGLLPEKVCQ 135
+ G++P V +
Sbjct: 306 INCWRLGIMPTIVFE 320
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 46/87 (52%), Gaps = 10/87 (11%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 267
G+ I+C R+ MP + F I + F L P Y T + + + C SGF +
Sbjct: 302 GQFKINCWRLGIMPTIVFEIHGRKFPLPPSAY---TNQEL-DSCSSGFKL------GSHI 351
Query: 268 WILGDVFMGVYHTVFDSGKLRIGFAEA 294
WILGDVF+ +++VFD R+G A+A
Sbjct: 352 WILGDVFIREFYSVFDRANNRVGLAKA 378
>gi|345778443|ref|XP_851336.2| PREDICTED: LOW QUALITY PROTEIN: gastricsin [Canis lupus familiaris]
Length = 399
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/122 (39%), Positives = 66/122 (54%), Gaps = 11/122 (9%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M++ GL+S VFSF+L R A G ++F G+D KG + PVT++ YWQ + +
Sbjct: 209 MLQAGLLSSXVFSFYLGRXETAR-GAVLIFRGMDHSLHKGPVYWAPVTQEHYWQIRIEEF 267
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 120
LIG +T C GC AIVD+GTSLL P ++ + AIG + QYG I
Sbjct: 268 LIGGCATSWCSQGCQAIVDTGTSLLTVPQQNLSALLQAIGAQA----------DQYGQFI 317
Query: 121 WD 122
D
Sbjct: 318 VD 319
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 4/107 (3%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LS + + + + G+ I+DC+ + +P ++F I ++ P L T GI
Sbjct: 296 QQNLSALLQAIGAQADQYGQFIVDCNNVQNLPTLTFLIRSRVRFSLPYHSFLFTENGI-- 353
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G A LP G PLWILG VF+ Y+++FD G R+G A AA
Sbjct: 354 -CALGVQATYLPSSSGQPLWILGVVFLRSYYSIFDIGNSRVGCATAA 399
>gi|410974069|ref|XP_003993470.1| PREDICTED: pepsin A-like [Felis catus]
Length = 387
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 63/102 (61%), Gaps = 3/102 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +GLVS+++FS +L+ + + G ++FGG+D ++ G ++PV+ +GYWQ + I
Sbjct: 201 MWNEGLVSQDLFSVYLSGN--DQSGSVVMFGGIDSSYYTGNLNWIPVSVEGYWQISVDSI 258
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+ QS C GGC AIVD+GTSLL GP+ + I IG
Sbjct: 259 TMNGQSI-ACNGGCQAIVDTGTSLLTGPSNAIANIQSDIGAS 299
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 49/91 (53%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ I C I +P++ FTI + L P YIL++ +G CISG +LP
Sbjct: 301 NSYGQMGISCSAINNLPDIVFTINGNEYPLPPSAYILQSQQG----CISGLQGMNLPTAS 356
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y VFD ++G A A
Sbjct: 357 GELWILGDVFIRQYFAVFDRANNQVGLAPVA 387
>gi|871442|emb|CAA25391.1| renin [Mus musculus]
Length = 387
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ EEVFS + NR GGE+V GG DP+H++G YV ++K WQ + +
Sbjct: 196 ILSQGVLKEEVFSVYYNRGSHLL-GGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGV 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GCA +VD+G+S ++ PT + I A+G +
Sbjct: 255 SVGS-STLLCEEGCAVVVDTGSSFISAPTSSLKLIMQALGAK 295
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 301 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPNRRDKLCTLALHAMDIPPPTGPVW 360
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
+LG F+ ++T FD RIGFA A
Sbjct: 361 VLGATFIRKFYTEFDRHNNRIGFALA 386
>gi|149025623|gb|EDL81866.1| prochymosin [Rattus norvegicus]
Length = 379
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFIGSLHWVPVTVQGYWQFTVDRI 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
I N C+GGC A++D+GT+LL GP + I HAIG
Sbjct: 257 TI-NDEVVACQGGCPAVLDTGTALLTGPGRDILNIQHAIGA 296
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 10/83 (12%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
IDC R+ MP V F I + F L P Y C SGF +WILG
Sbjct: 306 IDCWRLNFMPTVVFEINGREFPLPPSAYT----NQFQGSCSSGFRH------GSQMWILG 355
Query: 272 DVFMGVYHTVFDSGKLRIGFAEA 294
DVF+ +++VFD R+G A+A
Sbjct: 356 DVFIREFYSVFDRANNRVGLAKA 378
>gi|9910338|ref|NP_064476.1| embryonic pepsinogen precursor [Rattus norvegicus]
gi|7106000|emb|CAB75983.1| prochymosin [Rattus norvegicus]
Length = 379
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFIGSLHWVPVTVQGYWQFTVDRI 256
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 101
I N C+GGC A++D+GT+LL GP + I HAIG
Sbjct: 257 TI-NDEVVACQGGCPAVLDTGTALLTGPGRDILNIQHAIGA 296
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 10/83 (12%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
IDC R+ MP V F I + F L P Y C SGF +WILG
Sbjct: 306 IDCWRLNFMPTVVFEINGREFPLPPSAYT----NQFQGSCSSGFRH------GSQMWILG 355
Query: 272 DVFMGVYHTVFDSGKLRIGFAEA 294
DVF+ +++VFD R+G A+A
Sbjct: 356 DVFIREFYSVFDRANNRVGLAKA 378
>gi|200688|gb|AAA40043.1| renin (Ren-1-d) [Mus musculus]
gi|148669208|gb|EDL01155.1| mCG129412 [Mus musculus]
Length = 402
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ EEVFS + NR GGE+V GG DP+H++G YV ++K WQ + +
Sbjct: 211 ILSQGVLKEEVFSVYYNRGSHLL-GGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGV 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GCA +VD+G+S ++ PT + I A+G +
Sbjct: 270 SVGS-STLLCEEGCAVVVDTGSSFISAPTSSLKLIMQALGAK 310
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 316 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPYRRDKLCTLALHAMDIPPPTGPVW 375
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
+LG F+ ++T FD RIGFA A
Sbjct: 376 VLGATFIRKFYTEFDRHNNRIGFALA 401
>gi|149245862|ref|XP_001472682.1| PREDICTED: renin-1-like isoform 1 [Mus musculus]
Length = 425
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ EEVFS + NR GGE+V GG DP+H++G YV ++K WQ + +
Sbjct: 234 ILSQGVLKEEVFSVYYNRGSHLL-GGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGV 292
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GCA +VD+G+S ++ PT + I A+G +
Sbjct: 293 SVGS-STLLCEEGCAVVVDTGSSFISAPTSSLKLIMQALGAK 333
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 339 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPYRRDKLCTLALHAMDIPPPTGPVW 398
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
+LG F+ ++T FD RIGFA A
Sbjct: 399 VLGATFIRKFYTEFDRHNNRIGFALA 424
>gi|384498765|gb|EIE89256.1| endopeptidase [Rhizopus delemar RA 99-880]
Length = 401
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 62/103 (60%), Gaps = 3/103 (2%)
Query: 1 MVEQGLVSEEVFSFWLNR-DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
MV + LV E +FSFWLN + D + GGE++FGGVD HF+G + V +KGYW+ + +
Sbjct: 213 MVNRDLVDEPLFSFWLNDANKDQDNGGELIFGGVDEDHFEGDIHWSDVRRKGYWEITMEN 272
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
I G+ + G A +D+G+SLL PT V IN +G E
Sbjct: 273 IKFGDDYVDIDPVGAA--IDTGSSLLVAPTTVAALINKELGAE 313
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 74/137 (54%), Gaps = 6/137 (4%)
Query: 156 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 215
+ EN+ GD V A + + L T V + IN+ + N G+ ++DC+
Sbjct: 268 ITMENIKFGDDYVDIDPVGAAIDTGSSLLVAPTT--VAALINKELGAEKNWAGQYVVDCN 325
Query: 216 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 275
++P++P F K F L + Y+L+ + CISGFM D+P P GPLWI+GDVF+
Sbjct: 326 KVPSLPEFCFVFNGKDFCLEGKDYVLE----VQGQCISGFMGMDIPEPAGPLWIVGDVFL 381
Query: 276 GVYHTVFDSGKLRIGFA 292
+++V+D G R+G A
Sbjct: 382 RKFYSVYDLGNNRVGLA 398
>gi|351707611|gb|EHB10530.1| Renin [Heterocephalus glaber]
Length = 397
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 59/102 (57%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 252
L I E + + E +++C+R+PT+P++SF +GD+ + L+ Y+L+ ++C
Sbjct: 295 LRLIMEALGAKEHSTDEYVVNCNRVPTLPDISFHLGDRAYTLTSADYVLQDPYRNDDLCT 354
Query: 253 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
D+PPP GPLW LG F+ ++T FD RIGFA A
Sbjct: 355 LALHGLDIPPPTGPLWALGASFIRKFYTEFDRHNNRIGFALA 396
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 63/102 (61%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q ++ E+VFS + +RD G ++ GG DP+H++G YV ++K G WQ + +
Sbjct: 206 ILSQRVLKEDVFSVYYSRDSHLLGGELLL-GGSDPQHYQGNFHYVSISKSGSWQITMKGV 264
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G +T +CE GC A+VD+G S ++GPT + I A+G +
Sbjct: 265 SVGF-ATLLCEEGCMAVVDTGASYISGPTSSLRLIMEALGAK 305
>gi|194762106|ref|XP_001963199.1| GF19728 [Drosophila ananassae]
gi|190616896|gb|EDV32420.1| GF19728 [Drosophila ananassae]
Length = 390
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 61/103 (59%), Gaps = 5/103 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+EQ L+ E V SF+L A +GGE++ GG+D +KG T+VPVTK YWQF+L I
Sbjct: 203 MIEQKLLDEPVISFYLKLKGTAVQGGEMILGGIDSSLYKGSLTWVPVTKAAYWQFKLTAI 262
Query: 61 LIGNQSTGV-CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ GV AI D+GTSL+ P T IN+ IG E
Sbjct: 263 ----KTKGVFISRNTQAIADTGTSLIVLPKAAYTRINNLIGAE 301
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 56/100 (56%), Gaps = 4/100 (4%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
IN L + N GE+ + C R+ +PNV+ IGD+ F L+P YI++ E C+S F
Sbjct: 294 INNLIGAEDNGEGEAFVRCGRVSALPNVNLHIGDRFFTLTPSDYIIRITESGETYCMSVF 353
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
+ L ILGD F+G ++TVFD G RIGFA A
Sbjct: 354 TYME----GNTLTILGDAFIGKFYTVFDKGNNRIGFAPVA 389
>gi|344275645|ref|XP_003409622.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Loxodonta
africana]
Length = 384
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 55/85 (64%)
Query: 2 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 61
V QG +++ +FSF+ + P + GGE+ GGVD + + G+ + PVT + YWQ + +
Sbjct: 195 VMQGQLTQPIFSFYFSHQPTHQYGGELTLGGVDTQLYSGRIVWTPVTWELYWQIAIQEFA 254
Query: 62 IGNQSTGVCEGGCAAIVDSGTSLLA 86
GNQ G+C GC AIVD+GTSLLA
Sbjct: 255 TGNQVMGLCFQGCQAIVDTGTSLLA 279
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 11/92 (11%)
Query: 208 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTG----EGIAEVCISGFMAFDLPPP 263
G+ +++C+ + +MP ++F + F L P Y+ G C+S
Sbjct: 300 GDFVVNCNYVQSMPTITFIVSGFQFPLPPSAYVFNNNGYCRLGTEAACLSSLTV------ 353
Query: 264 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
PLWILGDVF+ Y++V+D R+GFA +A
Sbjct: 354 -XPLWILGDVFLKEYYSVYDMANGRVGFALSA 384
>gi|13676837|ref|NP_112469.1| renin-1 precursor [Mus musculus]
gi|132327|sp|P06281.1|RENI1_MOUSE RecName: Full=Renin-1; AltName: Full=Angiotensinogenase; AltName:
Full=Kidney renin; Flags: Precursor
gi|53931|emb|CAA34636.1| unnamed protein product [Mus musculus]
gi|26342875|dbj|BAC35094.1| unnamed protein product [Mus musculus]
gi|26351563|dbj|BAC39418.1| unnamed protein product [Mus musculus]
gi|38512029|gb|AAH61053.1| Renin 1 structural [Mus musculus]
gi|148707703|gb|EDL39650.1| mCG131545 [Mus musculus]
Length = 402
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ EEVFS + NR GGE+V GG DP+H++G YV ++K WQ + +
Sbjct: 211 ILSQGVLKEEVFSVYYNRGSHLL-GGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGV 269
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GCA +VD+G+S ++ PT + I A+G +
Sbjct: 270 SVGS-STLLCEEGCAVVVDTGSSFISAPTSSLKLIMQALGAK 310
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 268
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 316 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPNRRDKLCTLALHAMDIPPPTGPVW 375
Query: 269 ILGDVFMGVYHTVFDSGKLRIGFAEA 294
+LG F+ ++T FD RIGFA A
Sbjct: 376 VLGATFIRKFYTEFDRHNNRIGFALA 401
>gi|129776|sp|P03954.2|PEPA1_MACFU RecName: Full=Pepsin A-1; AltName: Full=Pepsin III-3; Flags:
Precursor
gi|38075|emb|CAA42424.1| prepropepsin a [Macaca fuscata]
Length = 388
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 67/110 (60%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ + I +
Sbjct: 204 DQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
++ C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVS 310
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 50/91 (54%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N GE ++ C I ++P++ FTI + + P YIL++ C SGF D+P
Sbjct: 302 NSDGEMVVSCSAISSLPDIVFTINGIQYPVPPSAYILQSQGS----CTSGFQGMDVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|189011689|ref|NP_001098064.1| pepsin A precursor [Macaca mulatta]
gi|129793|sp|P11489.1|PEPA_MACMU RecName: Full=Pepsin A; Flags: Precursor
gi|342275|gb|AAA36902.1| pepsinogen A precursor (EC 3.4.23.1) [Macaca mulatta]
Length = 388
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 67/110 (60%), Gaps = 9/110 (8%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
+QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ + I +
Sbjct: 204 DQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 106
++ C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVS 310
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 50/91 (54%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N GE ++ C I ++P++ FTI + L P YIL++ C SGF D+P
Sbjct: 302 NSDGEMVVSCSAISSLPDIVFTINGVQYPLPPSAYILQS----QGSCTSGFQGMDVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|21629629|gb|AAM61957.1| synthetic renin 2/1d [Mus musculus]
Length = 401
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 2/102 (1%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ QG++ EEVFS + NR P GGE+V GG DP+H++G YV ++K WQ + +
Sbjct: 210 ILSQGVLKEEVFSVYYNRGPHLL-GGEVVLGGSDPEHYQGDFHYVSLSKTDSWQITMKGV 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G+ ST +CE GC +VD+G+S ++ PT + I A+G +
Sbjct: 269 SVGS-STLLCEEGCEVVVDTGSSFISAPTSSLKLIMQALGAK 309
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%)
Query: 207 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 266
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 313 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTVALHAMDIPPPTGP 372
Query: 267 LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+W+LG F+ ++T FD RIGFA A
Sbjct: 373 VWVLGATFIRKFYTEFDRHNNRIGFALA 400
>gi|348560226|ref|XP_003465915.1| PREDICTED: pepsin F-like [Cavia porcellus]
Length = 574
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 72/130 (55%), Gaps = 13/130 (10%)
Query: 4 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 63
QG +S ++F+F+L+ AE G ++ GGVDP ++ G +VPV+K YWQ + I +G
Sbjct: 390 QGRLSRKLFAFYLSSK--AEAGSMLMLGGVDPAYYSGDLHWVPVSKPRYWQLAMDSISMG 447
Query: 64 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDL 123
+ C GGC AI+D+GTSL+ GP + I +G +G S+ G+L+ +
Sbjct: 448 GKDI-ACHGGCQAILDTGTSLVNGPRQAIAAIQKILGAKG----------SRNGELVVNC 496
Query: 124 LVSGLLPEKV 133
LPE +
Sbjct: 497 STISALPEVI 506
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 5/106 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
+ ++ I ++ + + GE +++C I +P V FTIG + + YI K
Sbjct: 472 RQAIAAIQKILGAKGSRNGELVVNCSTISALPEVIFTIGGLSYPVPASAYIQKGNN---- 527
Query: 250 VCISGFMAFDLPPPRGP-LWILGDVFMGVYHTVFDSGKLRIGFAEA 294
VC SGF + G +W+LGDVF+ +Y +VFD RIG A A
Sbjct: 528 VCYSGFEEEAVDQALGTEVWVLGDVFLRLYFSVFDRENDRIGLAPA 573
>gi|193499295|gb|ACF18590.1| pepsinogen C precursor [Siniperca scherzeri]
Length = 387
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 82/154 (53%), Gaps = 4/154 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L++ ++F+F+L+ ++G E+ FGGVD ++G+ + PVT + YWQ +
Sbjct: 195 MISQNLLNADIFAFYLSSG--EQQGSELSFGGVDSSMYQGQIYWTPVTSETYWQIGVQGF 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL- 119
I Q +G C GC +IVD+GTS+L P+ ++ I AIG + + SQ +L
Sbjct: 253 QINGQESGWCSQGCQSIVDTGTSMLTAPSQLLGYIMQAIGAQQSQYGMYMVDCSQVNNLP 312
Query: 120 IWDLLVSGL-LPEKVCQQIGLCAFNGAEYVSTGI 152
++SG+ P I NG +Y S GI
Sbjct: 313 TLTFVISGVSFPLPPSAYIIEQTQNGYQYCSVGI 346
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 57/107 (53%), Gaps = 1/107 (0%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++L YI + + + G ++DC ++ +P ++F I F L P YI++ + +
Sbjct: 281 SQLLGYIMQAIGAQQSQYGMYMVDCSQVNNLPTLTFVISGVSFPLPPSAYIIEQTQNGYQ 340
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP G PLWI GDVF+ Y++++D R+GFA AA
Sbjct: 341 YCSVGITPTYLPSRNGQPLWIFGDVFLREYYSIYDRTNNRVGFAAAA 387
>gi|51534964|dbj|BAD36915.1| pepsinogen C [Myocastor coypus]
Length = 393
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDA-EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 59
++ +G +S+ +FS +L DA EGG ++ GGVD + G ++ PVT++ YWQ + D
Sbjct: 202 LLREGALSQSLFSVYLGSQQDATNEGGALILGGVDESLYSGAISWTPVTQELYWQIGIED 261
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
L+ +++G C GC AIVD+GTSLL P ++ + AIG E
Sbjct: 262 FLLDGEASGWCSEGCQAIVDTGTSLLTVPQQYLSTLIEAIGAE 304
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 61/106 (57%), Gaps = 5/106 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ LS + E + + GE ++DC+ + +P ++F I F LSP YIL +G+ +
Sbjct: 291 QQYLSTLIEAIGAEEDEYGEYLVDCNNVQNLPTLTFVISGVQFPLSPSAYIL-SGD---Q 346
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
C+ G + L G P WILGDVF+ Y++VFD G R+GFA A
Sbjct: 347 YCMVGLESTYLSSDNGQPFWILGDVFLRSYYSVFDLGNNRVGFATA 392
>gi|219520803|gb|AAI71814.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 67/116 (57%), Gaps = 7/116 (6%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
++ C GC AIVD+GTSLL GPT + I IG +G + C + S
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|189066533|dbj|BAG35783.1| unnamed protein product [Homo sapiens]
gi|193785072|dbj|BAG54225.1| unnamed protein product [Homo sapiens]
gi|219521010|gb|AAI71815.1| Pepsinogen 3, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 67/116 (57%), Gaps = 7/116 (6%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
++ C GC AIVD+GTSLL GPT + I IG +G + C + S
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|222425200|dbj|BAH20549.1| pepsinogen A-50 [Pongo abelii]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|61742104|gb|AAX54898.1| pepsinogen A2 precursor [Hippoglossus hippoglossus]
Length = 127
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 61/98 (62%), Gaps = 5/98 (5%)
Query: 21 DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDS 80
+A++G E+VFGG D HF G+ +++P+T YWQ ++ + I Q T C GGC AI+D+
Sbjct: 3 NAQQGSEVVFGGYDANHFTGQISWIPLTSATYWQIKMDSVTINGQ-TVACSGGCQAIIDT 61
Query: 81 GTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 114
GTSL+AGP+ ++ +N +G G + C+ + S
Sbjct: 62 GTSLIAGPSSDISNMNSWVGASTNQYGEATVNCQNIQS 99
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/51 (29%), Positives = 29/51 (56%)
Query: 193 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT 243
+S +N + N GE+ ++C I +MP+V+FT+ K F + Y+ ++
Sbjct: 73 ISNMNSWVGASTNQYGEATVNCQNIQSMPDVTFTLDGKAFTIPASAYVSQS 123
>gi|219521691|gb|AAI71808.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 67/116 (57%), Gaps = 7/116 (6%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
++ C GC AIVD+GTSLL GPT + I IG +G + C + S
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRTNNQVGLAPVA 388
>gi|16974928|pdb|1FLH|A Chain A, Crystal Structure Of Human Uropepsin At 2.45 A Resolution
Length = 326
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 142 NQGLVSQDLFSVYLSADD--QSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 200 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 238
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 240 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 295
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 296 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 326
>gi|47223178|emb|CAG11313.1| unnamed protein product [Tetraodon nigroviridis]
Length = 372
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 54/83 (65%)
Query: 212 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 271
++CD + T+P+VSF +G + ++L+ E YIL + +VCI F D+PPP GPLWILG
Sbjct: 290 VNCDAVETLPSVSFHLGGQEYSLTQEDYILWQSQIEGDVCIVTFRGLDVPPPVGPLWILG 349
Query: 272 DVFMGVYHTVFDSGKLRIGFAEA 294
F+ Y+T FD RIGFA A
Sbjct: 350 ANFIARYYTEFDRRNNRIGFATA 372
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 59/102 (57%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
++ Q ++ EEVFS + +RDP GGE+V GG DP ++ G Y+ + G W+ + +
Sbjct: 182 IMSQHVLKEEVFSVYYSRDPKHSPGGELVLGGTDPNYYTGSFNYMGTRETGKWEVTMKGV 241
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
+G + C GC A++D+G+S + GP V+ + +IG +
Sbjct: 242 SVGMEMM-FCAEGCTAVIDTGSSYITGPASSVSLLMKSIGAQ 282
>gi|1585064|prf||2124254A pepsin:ISOTYPE=3a
gi|1585065|prf||2124254B pepsin:ISOTYPE=3b
Length = 326
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 142 NQGLVSQDLFSVYLSADD--QSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 200 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 238
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 240 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 295
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 296 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 326
>gi|1065259|pdb|1PSO|E Chain E, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|5542461|pdb|1QRP|E Chain E, Human Pepsin 3a In Complex With A Phosphonate Inhibitor
Iva-Val-Val- Leu(P)-(O)phe-Ala-Ala-Ome
gi|157833570|pdb|1PSN|A Chain A, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|361132440|pdb|3UTL|A Chain A, Human Pepsin 3b
Length = 326
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 142 NQGLVSQDLFSVYLSADD--QSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 200 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 238
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 240 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 295
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 296 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 326
>gi|119372298|ref|NP_001073275.1| pepsin A preproprotein [Homo sapiens]
gi|378521956|sp|P0DJD8.1|PEPA3_HUMAN RecName: Full=Pepsin A-3; AltName: Full=Pepsinogen-3; Flags:
Precursor
gi|182887917|gb|AAI60184.1| Pepsinogen 3, group I (pepsinogen A) [synthetic construct]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|195349117|ref|XP_002041093.1| GM15229 [Drosophila sechellia]
gi|194122698|gb|EDW44741.1| GM15229 [Drosophila sechellia]
Length = 395
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 63/119 (52%), Gaps = 3/119 (2%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M +Q LV + VFSF+L R+ +GGE++FGGVD F G TYVP+T GYWQF L I
Sbjct: 209 MCDQKLVDDCVFSFYLKRNGSDRKGGELLFGGVDKTKFSGSLTYVPLTHAGYWQFPLDAI 268
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 119
+ AI D+GTSLLA P IN +GG + E L S+ L
Sbjct: 269 EVAGTRISQHR---QAIADTGTSLLAAPPREYLIINSLLGGLPTSNNEYLLNCSEIDSL 324
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 57/101 (56%), Gaps = 6/101 (5%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 254
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGQRFGLQPRDYVMSATNDDGSSICLSA 359
Query: 255 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
F D WILGDVF+G Y+T FD+G RIGFA AA
Sbjct: 360 FTLMD-----AEFWILGDVFIGRYYTAFDAGHRRIGFAPAA 395
>gi|297688536|ref|XP_002821738.1| PREDICTED: pepsin A-4 [Pongo abelii]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|309387095|gb|ADO67801.1| pregnancy-associated glycoprotein 12 [Bubalus bubalis]
Length = 367
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 69/112 (61%), Gaps = 8/112 (7%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QG SE VF+F+LN+ EG ++FGGVD +++KG+ ++PV++ YWQ + I I
Sbjct: 191 SQGAFSEPVFAFYLNKC--KPEGSVVMFGGVDHRYYKGELNWMPVSQTRYWQISMNRISI 248
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI-----GGEGVVSAEC 109
N + C GC A++D+GTS++ GPT +VT I+ + G E VVS +
Sbjct: 249 -NGNVTACSRGCQALLDTGTSMIHGPTRLVTNIHKLMNARHQGSEYVVSCDA 299
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 7/82 (8%)
Query: 209 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM-AFDLPPPRGPL 267
E ++ CD + T+P V F I + L P+ YI+K C+S F + P
Sbjct: 292 EYVVSCDAVKTLPPVIFNINGIDYPLPPQAYIIKA----QNFCLSIFHGGTETSSPE--T 345
Query: 268 WILGDVFMGVYHTVFDSGKLRI 289
WILGD F+ Y VFD RI
Sbjct: 346 WILGDAFLRQYFAVFDRRNDRI 367
>gi|195046646|ref|XP_001992192.1| GH24624 [Drosophila grimshawi]
gi|193893033|gb|EDV91899.1| GH24624 [Drosophila grimshawi]
Length = 446
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 59/101 (58%), Gaps = 5/101 (4%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD- 59
M+ QGLVS+++FSF+L D GGE++FGG DP F G Y VT++GYWQF + +
Sbjct: 209 MISQGLVSKQIFSFYLASDGSNSYGGELIFGGSDPMWFIGNMVYTNVTREGYWQFMMDNA 268
Query: 60 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
L GN + C AIVD+GTSL+ P I +G
Sbjct: 269 TLNGN----ILCTNCPAIVDTGTSLIMAPAASYQIIMEVVG 305
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 7/83 (8%)
Query: 211 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 270
I+DC I MP ++F I +F + P Y++ +G+ C G + WIL
Sbjct: 311 IVDCSTISQMPVLTFAINGVMFGIPPSAYVIV--DGLNNECTLGIQSMS-----ENFWIL 363
Query: 271 GDVFMGVYHTVFDSGKLRIGFAE 293
GDVF+G Y+T FD + R+GFA
Sbjct: 364 GDVFIGQYYTEFDLAQNRLGFAS 386
>gi|119372302|ref|NP_001073276.1| pepsin A preproprotein [Homo sapiens]
gi|378521995|sp|P0DJD7.1|PEPA4_HUMAN RecName: Full=Pepsin A-4; AltName: Full=Pepsinogen-4; Flags:
Precursor
gi|387012|gb|AAA98529.1| pepsinogen [Homo sapiens]
gi|157170280|gb|AAI52845.1| Pepsinogen 4, group I (pepsinogen A) [synthetic construct]
gi|219520853|gb|AAI71920.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|219521176|gb|AAI71910.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|223462201|gb|AAI50660.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|261860840|dbj|BAI46942.1| pepsinogen 4, group I [synthetic construct]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|387013|gb|AAA60061.1| pepsinogen A [Homo sapiens]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVF+ ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFERANNQVGLAPVA 388
>gi|999902|pdb|1HTR|B Chain B, Crystal And Molecular Structures Of Human Progastricsin At
1.62 Angstroms Resolution
gi|2982065|pdb|1AVF|A Chain A, Activation Intermediate 2 Of Human Gastricsin From Human
Stomach
gi|2982067|pdb|1AVF|J Chain J, Activation Intermediate 2 Of Human Gastricsin From Human
Stomach
Length = 329
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 63/102 (61%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 140 MVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 198
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 199 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 240
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++C+ I +P+++F I F L P YIL
Sbjct: 227 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSSYILSNNG---- 282
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G L G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 283 YCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 329
>gi|229576947|ref|NP_001153272.1| pepsinogen A precursor [Pongo abelii]
gi|222425188|dbj|BAH20543.1| pepsinogen A-19 [Pongo abelii]
gi|222425190|dbj|BAH20544.1| pepsinogen A-13 [Pongo abelii]
gi|222425204|dbj|BAH20551.1| pepsinogen A-41 [Pongo abelii]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 67/116 (57%), Gaps = 7/116 (6%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
++ C GC AIVD+GTSLL GPT + I IG +G + C + S
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + L P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|193499297|gb|ACF18591.1| pepsinogen C precursor [Siniperca chuatsi]
gi|253762213|gb|ACT35558.1| pepsinogen C precursor [Siniperca scherzeri]
Length = 387
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 85/158 (53%), Gaps = 12/158 (7%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
M+ Q L++ ++F+F+L+ ++G E+ FGGVD ++G+ + PVT + YWQ +
Sbjct: 195 MISQNLLNADIFAFYLSSG--EQQGSELSFGGVDSSMYQGQIYWTPVTSETYWQIGVQGF 252
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVSQY 116
I Q +G C GC +IVD+GTS+L P+ ++ I AIG + G+ +C SQ
Sbjct: 253 QINGQESGWCSQGCQSIVDTGTSMLTAPSQLLGYIMQAIGAQQNQYGMYMVDC----SQV 308
Query: 117 GDL-IWDLLVSGL-LPEKVCQQIGLCAFNGAEYVSTGI 152
+L ++SG+ P I NG +Y S GI
Sbjct: 309 NNLPTLTFVISGVSFPLPPSAYIIEHTQNGYQYCSVGI 346
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 1/107 (0%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++L YI + + N G ++DC ++ +P ++F I F L P YI++ + +
Sbjct: 281 SQLLGYIMQAIGAQQNQYGMYMVDCSQVNNLPTLTFVISGVSFPLPPSAYIIEHTQNGYQ 340
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G LP G PLWI GDVF+ Y++++D R+GFA AA
Sbjct: 341 YCSVGITPTYLPSRNGQPLWIFGDVFLREYYSIYDRTNNRVGFAAAA 387
>gi|130484814|ref|NP_001076103.1| gastricsin precursor [Oryctolagus cuniculus]
gi|73621389|sp|Q9GMY2.1|PEPC_RABIT RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798668|dbj|BAB11756.1| pepsinogen C [Oryctolagus cuniculus]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 63/102 (61%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV+ G +S VFSF+L+ +GG +V GGVD + G + PVT++ YWQ + +
Sbjct: 199 MVQDGTISSSVFSFYLSSQ-QGTDGGALVLGGVDSSLYTGDIYWAPVTRELYWQIGIDEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LI ++++G C GC AIVD+GTSLL P ++++ A G +
Sbjct: 258 LISSEASGWCSQGCQAIVDTGTSLLTVPQEYMSDLLEATGAQ 299
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 59/108 (54%), Gaps = 7/108 (6%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIA 248
++ +S + E + N GE ++DCD ++P +F I F LSP YIL T G+
Sbjct: 286 QEYMSDLLEATGAQENEYGEFLVDCDSTESLPTFTFVINGVEFPLSPSAYILNTDGQ--- 342
Query: 249 EVCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C+ G A L G PLWILGDVF+ Y++VFD R+GFA A
Sbjct: 343 --CMVGVEATYLSSQDGEPLWILGDVFLRAYYSVFDMANNRVGFAALA 388
>gi|1585066|prf||2124254C pepsin:ISOTYPE=3c
Length = 326
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 62/100 (62%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 142 NQGLVSQDLFSVYLSADD--KSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 199
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + +I IG
Sbjct: 200 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIAKIQSDIGAS 238
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 240 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 295
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 296 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 326
>gi|222425198|dbj|BAH20548.1| pepsinogen A-36 [Pongo abelii]
Length = 388
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + L P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|4505757|ref|NP_002621.1| gastricsin isoform 1 preproprotein [Homo sapiens]
gi|129796|sp|P20142.1|PEPC_HUMAN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|387015|gb|AAA60063.1| pepsinogen C [Homo sapiens]
gi|551176|gb|AAA60074.1| pepsinogen [Homo sapiens]
gi|1658286|gb|AAB18273.1| gastricsin [Homo sapiens]
gi|49522219|gb|AAH73740.1| Progastricsin (pepsinogen C) [Homo sapiens]
gi|119624464|gb|EAX04059.1| progastricsin (pepsinogen C) [Homo sapiens]
Length = 388
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 63/102 (61%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 199 MVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 257
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 258 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++C+ I +P+++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G L G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 342 YCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|222425194|dbj|BAH20546.1| pepsinogen A-28 [Pongo abelii]
gi|222425196|dbj|BAH20547.1| pepsinogen A-17 [Pongo abelii]
gi|222425202|dbj|BAH20550.1| pepsinogen A-71 [Pongo abelii]
Length = 388
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 67/116 (57%), Gaps = 7/116 (6%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 114
++ C GC AIVD+GTSLL GPT + I IG +G + C + S
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + L P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|387014|gb|AAA60062.1| pepsinogen [Homo sapiens]
Length = 385
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 63/102 (61%), Gaps = 1/102 (0%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 196 MVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEF 254
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 255 LIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 296
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 190 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 249
++ +S + + + + G+ +++C+ I +P+++F I F L P YIL
Sbjct: 283 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSSYILSNNG---- 338
Query: 250 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
C G L G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 339 YCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 385
>gi|222425192|dbj|BAH20545.1| pepsinogen A-59 [Pongo abelii]
Length = 388
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ + I +
Sbjct: 204 NQGLVSQDLFSVYLSAD--DQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITM 261
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 102
++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 262 NGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 205 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 264
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 265 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|194759256|ref|XP_001961865.1| GF15184 [Drosophila ananassae]
gi|190615562|gb|EDV31086.1| GF15184 [Drosophila ananassae]
Length = 403
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 58/100 (58%), Gaps = 3/100 (3%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
+ EQGL+ + VF F+L R+ ++EGG++ GG D G TYVPVTK+GYWQF + I
Sbjct: 219 LYEQGLIDQPVFGFYLARNGTSDEGGQLTLGGTDYNLIDGDLTYVPVTKQGYWQFAVNQI 278
Query: 61 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 100
V G AI D+GTSL+ P T+IN IG
Sbjct: 279 TWNGT---VVSGPVQAIADTGTSLIVVPADAYTKINELIG 315
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 52/100 (52%), Gaps = 6/100 (6%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
INEL ++ GE + C + ++P ++F G F+L P YI EG + C+S F
Sbjct: 310 INELIGAI-YIQGEWYVPCSTVDSLPVITFNFGGTNFDLPPSVYIQTYNEGEYDSCVSTF 368
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 295
WILGDVF+G ++T FD G+ R+GF A
Sbjct: 369 SYIGTE-----FWILGDVFLGQFYTEFDFGQNRVGFGNLA 403
>gi|2689725|gb|AAB91421.1| pregnancy-associated glycoprotein [Equus zebra]
Length = 388
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 62/97 (63%), Gaps = 3/97 (3%)
Query: 3 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 62
QGL+S+++F+F+L+ ++G ++FGGVDP ++ G+ +VPV+K YWQ + I I
Sbjct: 202 NQGLLSQKLFAFYLSSK--GKKGSVVMFGGVDPSYYTGELHWVPVSKPLYWQISMDSISI 259
Query: 63 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 99
N C+GGC AIVD+GTSLL GP V I I
Sbjct: 260 -NGKVIACDGGCQAIVDTGTSLLLGPQDAVLNIQEII 295
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 196 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 255
I E+ + + G IDCD + T+P++ FTI + + YI K + +C S F
Sbjct: 291 IQEIIQARRSTSGGYFIDCDAVNTLPDILFTIDGIGYPVPANAYIQK--DAALGICFSSF 348
Query: 256 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 294
+ WILGDVF+ +Y TVFD RIG A A
Sbjct: 349 EGNEDILNNLEEWILGDVFLRLYFTVFDRENDRIGLATA 387
>gi|66270071|gb|AAY43365.1| aspartic protease [Phytophthora infestans]
gi|310704916|gb|ADP08191.1| aspartic protease 1 [Phytophthora infestans]
Length = 390
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 58/93 (62%), Gaps = 10/93 (10%)
Query: 207 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 266
MGE +I C P++SF + K + L+ E+Y LK+G +C+ FM D+P P GP
Sbjct: 304 MGEYLISC--TAAAPDISFVLNGKTYTLTKEEYTLKSGP----ICLFAFMGIDIPAPAGP 357
Query: 267 LWILGDVFMGVYHTVF----DSGKLRIGFAEAA 295
LWILGDVFM ++TVF DS K R+GFA AA
Sbjct: 358 LWILGDVFMRKHYTVFDWGTDSRKPRVGFALAA 390
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 63/109 (57%), Gaps = 9/109 (8%)
Query: 1 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 60
MV++GL+ E VF+F+L D GE+ FGGVD H+KG+ YV VT YW +LG +
Sbjct: 206 MVQEGLLDEPVFAFYLGDQKD----GELTFGGVDKAHYKGEIEYVDVTSATYWSVKLGAV 261
Query: 61 -LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 108
G + T V + AIVDSGTSL+AGP V ++ +G + E
Sbjct: 262 ETKGEKLTDVDK----AIVDSGTSLIAGPKDQVAKLAALVGAHKFIMGE 306
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.139 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,182,471,179
Number of Sequences: 23463169
Number of extensions: 227774816
Number of successful extensions: 480946
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2066
Number of HSP's successfully gapped in prelim test: 1449
Number of HSP's that attempted gapping in prelim test: 470698
Number of HSP's gapped (non-prelim): 7230
length of query: 295
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 154
effective length of database: 9,050,888,538
effective search space: 1393836834852
effective search space used: 1393836834852
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)